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PSAFE (Protein Structure and Function 


Exploration) 


A Resource for Biochemistry Students and Instructors 


The Protein Structure and Function Exploration (PSAFE) project* http://psafeonline. 
net at the University of Virginia was created as a semester-long project for biochemistry 
students to explore structure—function relationships in macromolecules and to research 
and write about their findings. Students doing the PSAFE project use the molecular- 
document capability of the ICM Browser (from Molsoft LLC) to accomplish their goals. 
In this exercise, each student progresses through tutorials about the basics of biomolecular 
structure to individual in-depth analysis of a chosen macromolecule and how it achieves its 
function. The project culminates in each student creating an interactive graphical display 
and accompanying narrative describing the chosen macromolecule’s structure-function 
relationships, together with relevant references. These student creations are reviewed and 
edited before posting to an archival website (http://psafeonline.net). The PSAFE archive 
contains descriptions of well over 1300 different proteins and nucleic acids and is available 
to anyone with an interest in exploring structure-function relationships in macromolecules. 
This semester-long research and writing project has been added to the Biochemistry 7e, 
online course to provide students with the opportunity to explore the structure and func- 
tion of an assigned protein. The project culminates in the composition of an interactive 
narrative describing the macromolecule’s structure and function. 


*Magnotti, E., Moy, J., Sleppy, R., Carey, A., Firdyiwek, Y., Garrett, R. H., and Grisham, 
C., 2019. “Developing and Implementing a Free Online Protein Structure and Function 
Exploration Project to Teach Undergraduate Students Macromolecular Structure—Function 
Relationships.” Journal of Chemical Education 96:729-733. 
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Laboratory Techniques in Biochemistry 


All of our knowledge of biochemistry is the outcome of experiments. For the most part, 
this text presents biochemical knowledge as established fact, but students should never lose 
sight of the obligatory connection between scientific knowledge and its validation by obser- 
vation and analysis. The path of discovery by experimental research is often indirect, tortu- 
ous, and confounding before the truth is realized. Laboratory techniques lie at the heart of 
scientific inquiry, and many techniques of biochemistry are presented within these pages to 
foster a deeper understanding of the biochemical principles and concepts that they reveal. 


Recombinant DNA Techniques 


Restriction endonuclease digestion of DNA Section 10.6d 
Restriction mapping Section 10. 6e-f 

DNA sequencing Section 11.1 

Nucleic acid hybridization Section 11.3 

Chemical synthesis of nucleic acids Section 11.6 
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Construction of genomic DNA libraries Section 12.2 

Combinatorial libraries of synthetic oligomers Section 12.2 
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PCR (polymerase chain reaction) Section 12.2d 

mRNA isolation Section 12.2€ 

Construction of cDNA libraries Section 12.2e 

Expressed sequence tags (ESTs) Section 12.2e 

Southern blotting Section 12.2d 

Gene chips (DNA microarrays) Section 12.2f 

Protein expression from cDNA inserts Section 12.3 

Screening protein expression libraries with antibodies Section 12.3a 
Two-hybrid systems to identify protein-protein interactions Section 12.3¢ 
Reporter gene constructs Section 12.3b 

RT-qPCR (real-time quantitative PCR) Section 12.3a 

In vitro mutagenesis Section 12.3d 

ChIP-Seq (chromatin immunoprecipitation-DNA sequencing) Section 12.3¢ 
RNAi Section 12.4 

Genome editing with CRISPR-Casg Section 12.5d 

Base editing with CRISPR-Casg Section 12.5¢ 

Prime editing with CRISPR-Casg Section 12.5f 

Gene silencing with CRISPR-Casg Section 12.5 

High-throughput screening Section 12.6 

BioBricks Section 12.7a 

CRISPR/Casg Section 12.5 


Probing the Function of Biomolecules 


Green fluorescent protein Section 4.4 

Plotting enzyme kinetic data Section 13.3/ 

Enzyme inhibition Section 13.4 

Isotopic tracers as molecular probes Section 17.4¢ 
RNAi Section 12.4 

NMR spectroscopy Section 17.4d 

Transgenic animals Section 28.8 

DNA footprinting Section 29.1 


Techniques Relevant to Clinical Biochemistry 


Human gene therapy Section 12.5 
Metabolomic analysis Section 17.5 


Fluxomics Section 17.5 

Tumor diagnosis with positron emission tomography (PET) Section 18.7 
Glucose monitoring devices Section 22.1 

Fluoro-substituted analogs as therapeutic agents Section 26.8 
“Knockout” mice Section 28.7 


Isolation/Purification of Macromolecules 


lon exchange chromatography Section 5.2 

Protein purification protocols Section 5.2 

Dialysis and ultrafiltration Section 5.2 

Size exclusion chromatography Section 5.2 
SDS-polyacrylamide gel electrophoresis Section 5.2 
Isoelectric focusing Section 5.2 

Two-dimensional gel electrophoresis Section 5.2 
Hydrophobic interaction chromatography Section 5.2 
High-performance liquid chromatography Section 5.2 
Affinity chromatography Section 5.2 

Ultracentrifugation Section 5.2 

Fractionation of cell extracts by centrifugation Section 5.2 
Proteomic analysis by multidimensional protein identification 


technology Section 5.8b 


Analyzing the Physical and Chemical Properties 
of Biomolecules 


Titration of weak acids Section 2.2b 

Preparation of buffers Section 2.3 

Measurement of standard reduction potentials Section 3.6 
Edman degradation Section 4.3 

Nuclear magnetic resonance (NMR) Sections 4.5 and 6.4 
Estimation of protein concentration Section 5.2 

Amino acid analysis Section 4.6 

Amino acid sequence determination Section 5.3 

Mass spectrometry of proteins Section 5.3i 

Solid-phase peptide synthesis Section 5.5 

Cryo-Electron Microscopy Section 6.4 

Membrane lipid phase transitions Section 9.4b 

Nucleic acid hydrolysis Section 10.6 

DNA sequencing Section 11.1 

Single-molecule DNA sequencing Section 11.1e,f 

Density gradient (isopycnic) centrifugation Section 11.3 
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xxxii 


Biochemistry 


Scientific understanding of the molecular nature of life is growing at an astounding rate. 
Significantly, society is the prime beneficiary of this increased understanding. Cures 
for diseases, better public health, remedies for environmental pollution, and the devel- 
opment of cheaper and safer natural products are just a few practical benefits of this 
knowledge. In addition, this expansion of information fuels, in the words of Thomas 
Jefferson, “the illimitable freedom of the human mind.” Scientists can use the tools of 
biochemistry and molecular biology to explore all aspects of an organism— from basic 
questions about its chemical composition, through inquiries into the complexities of its 
metabolism, differentiation, and development, to analysis of its evolution, and even its 
behavior. New procedures based on the results of these explorations lie at the heart of 
the many modern medical miracles. Biochemistry is a science whose boundaries now 
encompass all aspects of biology, from molecules to cells, to organisms, to ecology, and 
to all aspects of health care. This seventh edition of Biochemistry embodies and reflects 
the expanse of this knowledge. We hope that this new edition will encourage students to 
ask questions of their own and to push the boundaries of their curiosity about science. 


Making Connections 


As the explication of natural phenomena rests more and more on biochemistry, its inclu- 
sion in undergraduate and graduate curricula in biology, chemistry, and the health sci- 
ences becomes imperative. The challenge to authors and instructors is a formidable one: 
how to familiarize students with the essential features of modern biochemistry in an 
introductory course or textbook. Fortunately, the increased scope of knowledge allows 
scientists to make generalizations connecting the biochemical properties of living sys- 
tems with the character of their constituent molecules. As a consequence, these general- 
izations, validated by numerous examples, emerge in time as principles of biochemistry, 
principles that are useful in discerning and describing new relationships between diverse 
biomolecular functions and in predicting the mechanisms underlying newly discovered 
biomolecular processes. Nevertheless, it is increasingly apparent that students must 
develop skills in inquiry-based learning, so that beyond this first encounter with bio- 
chemical principles and concepts students are equipped to explore science on their own. 
Much of the design of this new edition is meant to foster the development of such skills. 

We are both biochemists, but one of us spent his career in a biology department, and 
the other in a chemistry department. Undoubtedly, we each view biochemistry through 
the lens of our respective disciplines. We believe, however, that our collaboration on this 
textbook represents a melding of our perspectives that will provide new dimensions of 
appreciation and understanding for all students. 
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Our Audience 


This biochemistry textbook is designed to communicate the fundamental principles 
upon which all life is based to students encountering biochemistry for the first time. We 
aim to bring an appreciation of biochemistry to a broad audience that includes under- 
graduates majoring in the life sciences, physical sciences, or premedical programs as well 
as medical students and graduate students in the various health sciences for whom bio- 
chemistry is an important route to understanding human physiology. To make this sub- 
ject matter more relevant and interesting to all readers, we emphasize, where appropriate, 
the biochemistry of humans. 


Objectives and Building on Previous Editions 


We carry forward the clarity of purpose found in previous editions; namely, to 
illuminate for students the principles governing the structure, function, and interactions 
of biological molecules. At the same time, this new edition has been revised to reflect 
tremendous developments in biochemistry. Significantly, emphasis is placed on the 
interrelationships of ideas so that students can begin to appreciate the overarching 
questions of biochemistry. 


New Features 


New Textbook Features 


¢ Think-Pair-Share Problems in every chapter encourage students to work collabora- 
tively to further their understanding of biochemistry concepts. 


e Careers in Chemistry showcase various career paths that students can take after study- 
ing biochemistry. By seeing a diverse, inclusive, and equitable chemical community, all 
students know that they have a future in chemistry-related careers. 

e Over 160 New and Revised Exercises were added to the book and online course to 
keep the content current. 

e Step-by-Step Solutions were added to examples to help students master the approach 
to complicated problems. 

e Recent Developments in Biochemistry: A number of Deeper Look and Critical Develop- 
ments in Biochemistry boxes were added to address the latest developments in biochem- 
istry such as the human proteome, NMR and Cryo-EM, AlfaFold, and CRISPR-Cas9. 


New to the Online Course and Resources 


e MCAT Essays and Practice Problems accompanied by detailed answer explanations, 
written by members of medical school admissions and advisors, have been added to 
the online course in OWLv? to prepare students for the biochemistry section of the 
MCAT exam. 


e Hints and Targeted Feedback. Over fifty percent of problems in OWLv2 now come 
with targeted feedback on common errors that students make. The targeted feed- 
back explains why the student's answer is incorrect and guides them toward a correct 
solution. 

e Laddered Assessments. Conceptual mastery modules are combined with more tradi- 
tional homework questions into one structured learning path, organized by topic at 
the chapter level. 

e Over 300 New Problems were added to the online course. 

e PSAFE: Protein Structure and Function Exploration Project. Students research a protein 
and use protein-modeling software to practice biochemistry actively in this research 
and writing semester-long project. 

e Protein Structure and Function Exploration Library of Proteins and Nucleic Acids is an 
online resource (http://www.psafeonline.net/) of over 1300 protein and nucleic acid 
animations and interactive views, sorted by proteins and chapters in this textbook. 
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New to this Edition 


Chapter 3 A new Deeper Look box called “Why Nature Chose Phosphates.” 


Chapter 4 A new Deeper Look box called “Why Nature Chose Selenium,” and a revised 
Critical Developments in Biochemistry box on the incorporation of unnatural amino 
acids into proteins. 


Chapter 5 A new exercise on determining amino-acid sequence from mass spectrometry 
data. Efforts to describe the human proteome are introduced in a new Deeper Look box: 
“The Human Proteome Project and the Human Protein Atlas.” 


Chapter 6 A new Critical Developments in Biochemistry box on the X-ray crystallography 
of proteins and a revised Deeper Look feature on metamorphic proteins, which exist as 
an ensemble of structures of similar energies and stabilities. New Critical Developments 
in Biochemistry boxes on the Protein Data Bank (PDB) and “NMR and Cryo-EM — 
Revolutionary Methods that Probe Protein Structure and Dynamics” (including 
cryo-EM structure of the 2P spike protein mutant related to development of COVID- 
19 vaccines). New coverage of AlphaFold, artificial intelligence software that accurately 
predicts the three-dimensional structures of proteins based solely on their amino acid 
sequences. 


Chapter 8 A new Human Biochemistry box on the development of vaccines for 
COVID-19 that employ lipid nanoparticles and make RNA vaccines practical and 
effective. 


Chapter 9 New coverage of monotopic, bitopic, and polytopic membrane proteins and 
a section called “Dynamic Exchange of Lipids and Proteins with Membrane Domains.” 


Chapter 10 An updated introduction to the many roles of small RNAs in the regulation 
of gene expression: miRNAs and the long, noncoding RNAs (lincR NAs). 


Chapter 11 Completion of the complete nucleotide sequence determination for the human 
genome in 2022. Focus on the rapidly growing science of bioinformatics: the study of the 
nature and organization of biological information, including functional genomics and 
proteomics. New principles emerging about the higher order structural organization of 
chromosomes, chromosome territories, and chromosome dynamics. Synthetic chromo- 
somes, DNA as a data storage medium. 


Chapter 12 Genome engineering with CRISPR-Cas9, CRISPR-Cas9 in gene-editing and 
base-editing. A new section on human gene therapy, rewriting the human genome. 


Chapter 14 A new Critical Developments in Biochemistry box on Nobel laureate Frances 
Arnold and her pioneering work on the directed evolution of enzymes. 


Chapter 17 Metabolites at the center of life, metabolomics as the driver of all the 
other —omics. 


Chapter 18 The newly-discovered protein kinase activity of protein kinase M2 (PK M2), 
its stimulation by SAICAR (an intermediate in the purine biosynthetic pathway), and its 
role in tumor proliferation. 


Chapter 20 The new molecular structures of the supercomplexes that carry out electron 
transport and oxidative phosphorylation, including the new structure of Complex I and 
insights regarding its mechanism of action. 

Chapter 22 A new Human Biochemistry box called “Metformin — A Diabetes Drug 
with Multiple Actions,” and a Critical Developments in Biochemistry feature describing 
how the consumption of ATP promotes and supports the metabolism of cancer cells. 
Chapter 25 Synthesis of cysteine in humans by a reverse transsulfurylation pathway. 
A new Human Biochemistry box on amino acid metabolism and cancer. 


Chapter 26 Figure describing the purinosome metabolon that synthesizes purines on a 
PRPP platform. 
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Chapter 27 New molecular graphic of mTORC1, the master integrator of information 
about nutrient status and a global regulator of biosynthesis. A new section on AMPK 
inhibition of mTORC1 through reversible phosphorylation. A new section on SIRT1, 
mTORC1, AMPK, caloric restriction, and metabolic syndrome. 


Chapter 28 New discussion and figure illustrating that DNA polymerases are immo- 
bilized within replication factories. New figure showing the structural organization of 
eukaryotic DNA replicons. 


Chapter 29 Updated discussion of eukaryotic gene regulatory sequences such as promot- 
ers, enhancers, insulators, and silencers. 


Chapter 30 Pyrrolysine as the twenty-second amino acid. New art illustrating the richly 
detailed events in eukaryotic translation initiation. 


Chapter 31 New structures of the SecY channel and the SecA ATPase, and cryo-EM 
structures of co-translational and post-translational states of the Sec61 translocon com- 
plex. New cryo-EM structures of a cullin-RING-UBE2D ubiquitin ligation assembly. 


Chapter 32 New cryo-EM structure of the neurotensin receptor-arrestin2 complex. New 
section on ion channels that respond to physical stimuli (including temperature, voltage, 
pH, redox status, and mechanical phenomena) rather than chemical signals. 


Features 


e Clarity of Instruction This edition was re-organized for increased clarity and readability. 
Many of the lengthier figure legends were shortened and more information was 
included directly within illustrations. These changes will help the more visual reader. 


e Visual Instruction The richness of the Protein Data Bank (www.pdb.org) and avail- 
ability of molecular graphics software has been exploited to enliven this text. Over 
1100 images of prominent proteins and nucleic acids involved with essential biologi- 
cal functions illustrate and inform the subject matter and were prepared especially for 
this book. 


e Essential Questions Organization Each chapter in this book is framed around an 
Essential Question that invites students to become actively engaged in their learning 
and encourages curiosity and imagination about the subject matter. For example, the 
Essential Question of Chapter 3 asks, “What are the laws and principles of thermody- 
namics that allow us to describe the flows and interchanges of heat, energy, and mat- 
ter in biochemical systems?” The section heads then pose Key Questions that serve as 
organizing principles for a lecture such as, “What is the daily human requirement for 
ATP?” The subheadings are designed to be concept statements that respond to the 
section headings. 


e Foundational Biochemistry At the end of each chapter, this feature provides a com- 
prehensive list of the principal facts and concepts that a student should understand 
after reading each chapter. Presented as short statements or descriptive phrases, the 
items in the Foundational Biochemistry list serve as guides to students of the knowl- 
edge they have acquired from the chapter and as checklists the students can review in 
assessing their learning. 


e End-of-Chapter Problems More than 700 end-of-chapter problems are provided. 
They serve as meaningful exercises that help students develop problem-solving 
skills useful in achieving their learning goals. Some problems require students to 
employ calculations to find mathematical answers to relevant structural or functional 
questions. Other questions address conceptual problems whose answers require appli- 
cation and integration of ideas and concepts introduced in the chapter. 


e Think-Pair-Share problems encourage students to work collaboratively to further their 
understanding. 


e Further Readings sections at the end of each chapter make it easy for students to find 
up-to-date additional information about each topic. 


Preface 
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e Critical Developments in Biochemistry essays emphasize recent and historical advances 
in the field. 


e Human Biochemistry essays emphasize the central role of basic biochemistry in med- 
icine and the health sciences. These essays often present clinically important issues 
such as diet, diabetes, and cardiovascular health. 


e A Deeper Look essays expand on the text, highlighting selected topics or experimental 
observations. 


e Laboratory Techniques The experimental nature of biochemistry is highlighted, and a 
list of laboratory techniques found in this book can be seen on page xxxi. 


Instructor and Student Resources 


Instructor and student resources are available online. 

Instructor resources include: 

e Solution and Answer Guide 

e Test Bank 

e Transition Guide from the Sixth Edition to the Seventh Edition 
e Lecture Note PowerPoint slides 

e Image Library slides 

e Guide to Teaching Online 

e Educator’s Guide 
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Marmaduke St. John/Alamy 


The Facts of Life: Chemistry 
Is the Logic of Biological 
Phenomena 


Molecules are lifeless. Yet the properties of living things derive from the properties of 
molecules. Despite the spectacular diversity of life, the elaborate structure of biological 
molecules, and the complexity of vital mechanisms, are life functions ultimately 
interpretable in chemical terms? 


Mos are lifeless. Yet, in appropriate complexity and number, molecules 
compose living things. These living systems are distinct from the inanimate 
world because they have certain extraordinary properties. They can grow, move, 
perform the incredible chemistry of metabolism, respond to stimuli from the 
environment, and, most significantly, replicate themselves with exceptional fidelity. 
The chemistry of the living cell resembles the chemistry of organic reactions. Indeed, 
cellular constituents, or biomolecules, must conform to the chemical and physical 
principles that govern all matter. Despite the spectacular diversity of life, the intricacy 
of biological structures, and the complexity of vital mechanisms, life functions are 
interpretable in chemical terms. Chemistry is the logic of biological phenomena. Living 
organisms are self-sustaining systems of chemical reactions. 


1.1 | What Are the Distinctive Properties 
of Living Systems? 


The most obvious quality of living organisms is that they are complicated and highly 
organized (Figure 1.1). For example, organisms large enough to be seen with the naked 
eye are composed of many cells, typically of many types. In turn, these cells possess 
subcellular structures, called organelles, which are complex assemblies of very large 
polymeric molecules, called macromolecules. These macromolecules show an exquisite 


Part | 
Molecular 


Components 
of Cells 


“.. everything that living things do can be 
understood in terms of the jigglings and 
wigglings of atoms.” 


Richard P. Feynman. Lectures on Physics, 
Addison-Wesley, 1963 


< A sperm fertilizing an egg. 


1.5 


1.6 


What Are the Distinctive Properties 
of Living Systems? 

What Kinds of Molecules Are 
Biomolecules? 


What Is the Structural Organization 
of Complex Biomolecules? 

How Do the Properties of 
Biomolecules Reflect Their Fitness to 
the Living Condition? 

What Are the Organization and 
Structure of Cells? 


What Are Viruses? 
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2 Chapter 1 The Facts of Life: Chemistry Is the Logic of Biological Phenomena 


Entropy A thermodynamic term used to designate that 
amount of energy in a system that is unavailable to do 
work. 


Herbert Kehrer/Corbis 


(b) 


Figure 1.1 (a) Gelada (Theropithecus gelada), a baboon native to the Ethiopian highlands. 
(b) Tropical orchid (Masdevallia norops), Ecuador. 


degree of organization in their intricate three-dimensional architecture, even though 
they are composed of simple sets of chemical building blocks, such as sugars and amino 
acids. Indeed, the complex three-dimensional structure of a macromolecule, known as 
its conformation, is a consequence of interactions between the monomeric units, 
according to their individual chemical properties. 

Biological structures serve functional purposes. That is, biological structures play a 
role in the organism’s existence. From parts of organisms, such as limbs and organs, 
down to the chemical agents of metabolism, such as enzymes and metabolic intermedi- 
ates, a biological purpose can be given for each component. In biology, it is always 
meaningful to seek the purpose of observed structures, organizations, or patterns; that 
is, to ask what functional role they serve within the organism. 

Living systems are actively engaged in energy transformations. Maintenance of the 
highly organized structure and activity of living systems depends on their ability to 
extract energy from the environment. The ultimate source of energy is the sun. Solar 
energy flows from photosynthetic organisms (organisms able to capture light energy by 
the process of photosynthesis) through food chains to herbivores and ultimately to car- 
nivorous predators at the apex of the food pyramid. The biosphere is thus a system 
through which energy flows. Organisms capture some of this energy, be it from photo- 
synthesis or the metabolism of food, by forming special energized biomolecules. ATP 
and NADPH are the two most prominent examples (Figure 1.2). (Commonly used 
abbreviations such as ATP and NADPH are defined on the inside back cover of this 
book.) ATP and NADPH are energized biomolecules because they represent chemically 
useful forms of stored energy. When these molecules react with other molecules in the 
cell, the energy released can be used to drive energetically unfavorable processes. That is, 
ATP, NADPH, and related compounds are the power sources that drive the energy- 
requiring activities of the cell, including biosynthesis, movement, osmotic work against 
concentration gradients, and, in special instances, light emission (bioluminescence). The 
living state is characterized by the flow of energy through the organism. Only upon death 
does an organism reach equilibrium with its inanimate environment. At the expense of 
energy flow, the organism can maintain its intricate order and activity far removed from 
equilibrium with its surroundings, yet exist in a state of apparent constancy over time. 
This state of apparent constancy, or so-called steady state, is actually a very dynamic 
condition: Energy and material are consumed by the organism to maintain its stability 
and order. In contrast, inanimate matter, as exemplified by the universe in totality, is 
moving to a condition of increasing disorder or, in thermodynamic terms, maximum 
entropy. 
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1.1 What Are the Distinctive Properties of Living Systems? 


NH, 
S 
N 
On On | 
| | 2 
-O—P—O—P—O N 
| | O——CH 
2 
© | O 
-O—P=O 
O 
| OH OH NHo 
-O —P=0 


OH O 
| 
"0 =P=07 
Il 
Figure 1.2 ATP and NADPH, two biochemically important Q 
energy-rich compounds. NADPH 


Living systems have a remarkable capacity for self-replication. Generation after 
generation, organisms reproduce virtually identical copies of themselves. This self- 
replication can proceed by a variety of mechanisms, ranging from simple division in 
bacteria to sexual reproduction in plants and animals; but in every case, it is characterized 
by an astounding degree of fidelity (Figure 1.3). Indeed, if the accuracy of self-replication 


Randal H. Garrett 


Figure 1.3 Organisms resemble their parents. Orangutan with infant. 
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In biochemistry and molecular biology, structurally 
complementary means that two structures align and fit 
together like pieces of a puzzle. 
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Figure 1.4 The DNA double helix. Two complementary polynucleotide chains run in opposite direc- 
tions and pair through hydrogen bonding between their nitrogenous bases, A with T and C with G. 
Note that everywhere there is an A in one strand, the other has a T; everywhere there is a C in one 
strand, there is a G in the other. These complementary nucleotide sequences give rise to structural 
complementarity. 


were significantly greater, the evolution of organisms would be hampered. This is so 
because evolution depends upon natural selection operating on individual organisms 
that vary slightly in their fitness for the environment. The fidelity of self-replication 
resides ultimately in the chemical nature of the genetic material. This substance is 
deoxyribonucleic acid, abbreviated as DNA. It consists of a pair of polymeric chains 
built using four different monomers known as deoxynucleotides. These four deoxynu- 
cleotide building blocks are symbolized by the letters A, C, G, and T. Information is 
encoded in each polynucleotide strand in the form of the precise sequence of A, C, G, 
and T deoxynucleotides along its length, much as this sentence contains information as 
encoded in the letters of the words that compose it. The two deoxynucleotide chains are 
structurally complementary to one another (Figure 1.4) in that everywhere there is an A 
in one strand, the other has a T, and everywhere there is a C in one strand, there isa G 
in the other. DNA can generate two identical copies of itself in a rigorously executed 
polymerization process whereby each chain is copied precisely, using the information 
provided by its complementary strand. This process ensures a faithful reproduction of 
the information written by the original polynucleotide strands. In contrast, the mole- 
cules of the inanimate world lack this capacity to replicate. A crude mechanism of rep- 
lication must have existed at life’s origin. 


1.2 | What Kinds of Molecules Are Biomolecules? 


The elemental composition of living matter differs markedly from the relative 
abundance of elements in the earth’s crust. Hydrogen (H), oxygen (O), carbon (C), 
and nitrogen (N) constitute more than 99% of the atoms in the human body, with 
most of the H and O occurring as water, HO. Oxygen, silicon (Si), aluminum (Al), 
and iron (Fe) are the most abundant atoms in the earth’s crust, with hydrogen, car- 
bon, and nitrogen being relatively rare (less than 0.2% each). Nitrogen as dinitrogen 
(N;) is the predominant gas in the atmosphere, and carbon dioxide (CO) is present 
at a level of 0.04%, a small but critical amount. What property unites hydrogen, 
oxygen, carbon, and nitrogen and renders these atoms so suitable to the chemistry 
of life? It is their ability to form covalent bonds by electron-pair sharing. Further- 
more, hydrogen, carbon, nitrogen, and oxygen are among the lightest elements of 
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the periodic table capable of forming such bonds (Figure 1.5). Because the strength Bond 

of covalent bonds is inversely proportional to the atomic weights of the atoms _ „7 Covalent energy 
A : Atoms € pairing bond (kJ/mol) 
involved, hydrogen, carbon, nitrogen, and oxygen form the strongest covalent = : 

bonds. Two other covalent bond-forming elements, phosphorus (as phosphate Mios Me => Miei HY 


[-OPO;?%] derivatives) and sulfur, also play important roles in biomolecules. : . I 
freee —C—H 


1.2a Biomolecules Are Carbon Compounds : : cee » 


All biomolecules contain carbon. The prevalence of carbon is due to its unparalleled 5 : S i 0 
versatility in forming stable covalent bonds through electron-pair sharing. Carbon 

can form up to four such bonds by sharing each of the four electrons in its outer shell 16.4.4: —+-G:N: -C—N’ 
with electrons contributed by other atoms. Atoms commonly found in covalent link- É i ae | k 
age to carbon are carbon itself, hydrogen, oxygen, and nitrogen. Hydrogen can form 

one such bond by contributing its single electron to the formation of an electron pair. -C-+-0: — :C:0: -C—O- 
Oxygen, with two unpaired electrons in its outer shell, can participate in two covalent ` ` oe 
bonds, and nitrogen, which has three unshared electrons, can form three such cova- x : 5 E 7 
lent bonds. Furthermore, carbon, nitrogen, and oxygen can share two electron pairs `Ẹ aCe —> Cst CC 
to form double bonds with one another within biomolecules, a property that enhances 

their chemical versatility. Carbon and nitrogen can even share three electron pairs to . @ adbc N: => ce “N: Ke =N- 
form triple bonds. 

Two properties of carbon covalent bonds merit particular attention. One is the ability 
of carbon to form covalent bonds with itself. The other is the tetrahedral nature of the 
four covalent bonds when carbon atoms form only single bonds. Together these proper- | 6:+-0: 
ties hold the potential for an incredible variety of linear, branched, and cyclic carbon gk oh 
compounds. This diversity is multiplied further by the possibilities for including nitro- 20810; => © zO . O=0O 
gen, oxygen, and hydrogen atoms in these compounds (Figure 1.6). We can therefore A $ 
envision the ability of carbon to generate complex structures in three dimensions. These ‘Nit:N: — :NiN: N=N 
structures, by virtue of appropriately included nitrogen, oxygen, and hydrogen atoms, ; ; y 
can display unique chemistries suitable to the living state. Thus, we may ask, is there any ‘Nit H: — :N:iH 9N—H 


attern or underlying organization that brings order to this astounding potentiality? a oo 
Pp a £ EP y -O:+ H:-—-O:H -O—H 


Figure 1.5 Covalent bond formation by e` pair 
sharing. The energy necessary to break a bond is given 


1.3 | What Is the Structural Organization of Complex ital: 
Biomolecules? 


Examination of the chemical composition of cells reveals a dazzling variety of 
organic compounds covering chemical dimensions such as length and mass 
(Table 1.1). When biomolecules are classified based on size and chemical properties, 
an organizational pattern emerges. The biomolecules are built according to a struc- 
tural hierarchy: Simple molecules are the units for building complex structures. 

The molecular constituents of living matter do not reflect randomly the infinite 
possibilities for combining carbon, hydrogen, oxygen, and nitrogen atoms. Instead, only 
a limited set of the many possibilities is found, and these collections share certain 
properties essential to the living state. The most prominent aspect of biomolecular 
organization is that simple molecules are the building blocks for polymers that contain 
thousands or even millions of atoms, structures so large that we refer to them as 
macromolecules. What properties do these biomolecules possess that make them so Macro - prefix from the Greek “makros” meaning large 
appropriate for the condition of life? or long. 


1.3a Metabolites Are Used to Form the Building Blocks 
of Macromolecules 
The major precursors for the formation of biomolecules are water, carbon dioxide, and 


three inorganic nitrogen compounds—ammonium (NH,"*), nitrate (NO;7), and dini- 
trogen (N>). Metabolic processes transform these inorganic precursors through ever 
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Figure 1.6 Examples of the versatility of C—C bonds to build complex structures: linear, cyclic, branched, and planar. 
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| Table11 | Biomolecular Dimensions 


The dimensions of mass* and length for biomolecules are given typically in daltons and nanometers,* respectively. One dalton (Da) is approximately 
equal to the mass of one hydrogen atom, 1.66 X 107% g. One nanometer (nm) is 10~? m, or 10 A (angstroms). 


Length 
Biomolecule (long dimension, nm) 
Water 0.3 
Alanine 0.5 
Glucose 0.7 
Phospholipid 3.5 
Ribonuclease (a small protein) 4 
Immunoglobulin G (IgG) 14 
Myosin (a large muscle protein) 160 
Ribosome (bacterial) 18 
Bacteriophage ¢X174 (a very small bacterial virus) 25 
Pyruvate dehydrogenase complex (a multienzyme complex) 60 
Tobacco mosaic virus (a plant virus) 300 
Mitochondrion (liver) 1500 
Escherichia coli cell 2000 
Chloroplast (spinach leaf) 8000 
Liver cell 20,000 


Daltons 
18 
89 
180 
750 
12,600 
150,000 
470,000 
2,520,000 
4,700,000 
7,000,000 
40,000,000 


Mass 


Picograms 


6.68 X 1075 
15 
2 
60 
8000 


*Molecular mass is expressed in units of daltons (Da) or kilodaltons (kDa) in this book; alternatively, the dimensionless term molecular weight, symbolized by M,, and defined as the 


ratio of the mass of a molecule to 1 dalton of mass, is used. 
Prefixes used for powers of 10 are as follows: 


106 mega M 10° milli m 
10° kilo k 1076 micro pu 
107) deci d 1073 nano n 
107? centi c 10°" | pico p 
1075 femto f 


more complex levels of biomolecular order (Figure 1.7). In the first step, precursors are 
converted to metabolites, simple organic compounds that are intermediates in cellular 
energy transformation and the biosynthesis of various sets of building blocks: amino 
acids, sugars, nucleotides, fatty acids, and glycerol. Through the covalent linkage of 
these building blocks, macromolecules are constructed: proteins, polysaccharides, 
polynucleotides (DNA and RNA), and lipids. (Strictly speaking, lipids contain 
relatively few building blocks and are therefore not really polymeric like other macro- 
molecules; however, lipids are important contributors to higher levels of complexity.) 
Interactions among macromolecules lead to the next level of structural organization, 
supramolecular complexes. Here, various members of one or more of the classes of 
macromolecules come together to form specific assemblies that serve important sub- 
cellular functions. Examples of these supramolecular assemblies are multifunctional 
enzyme complexes, ribosomes, chromosomes, and cytoskeletal elements. For example, 
a eukaryotic ribosome contains four different RNA molecules and at least 70 unique 
proteins. Ribosomes are the supramolecular complexes where proteins are synthe- 
sized. These supramolecular assemblies are an interesting contrast to their compo- 
nents because their structural integrity is maintained by noncovalent forces, not 
covalent bonds. These noncovalent forces include hydrogen bonds, ionic attractions, 
van der Waals forces, and hydrophobic interactions between macromolecules. Such 
forces maintain these supramolecular assemblies in a highly ordered functional state. 
Although noncovalent forces are weak (less than 40 kJ/mol), they are numerous in 
these assemblies and thus can collectively maintain the essential architecture of the 
supramolecular complex under conditions of temperature, pH, and ionic strength that 
are consistent with cell life. 
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Carbon dioxide 


The inorganic precursors: 

(18-64 daltons) 

Carbon dioxide, Water, Ammonia, 
Nitrogen (No), Nitrate(NO;_) 


Metabolites: 

(50-250 daltons) 

Glucose, Fructose-1,6- 
bisphosphate, 
Glyceraldehyde-3-phosphate, 
3-Phosphoglyceric acid, 
Pyruvate, Citrate, Succinate 


Building blocks: 
(100-350 daltons) 
9 Amino acids, Nucleotides, 
= Monosaccharides, Fatty acids, 
O Glycerol 


Supramolecular complexes: 
(10°-10° daltons) 
Ribosomes, Cytoskeleton, 
Multienzyme complexes 


Figure 1.7 Molecular organization in the cell is a hierarchy. 


Macromolecules: 
(10°-10° daltons) 
Proteins, Nucleic acids, 
Polysaccharides, Lipids 


Nucleus, Mitochondria, 
Chloroplasts, Endoplasmic 
reticulum, Golgi apparatus, 
Vacuole 


1.3b Organelles Represent a Higher Order in Biomolecular Organization 


The next higher rung in the hierarchical ladder is occupied by the organelles. 
Organelles are found only in eukaryotic cells, that is, the cells of higher organisms 
(eukaryotic cells are described in Section 1.5). Organelles are subcellular structures 
dedicated to a specific purpose. Several kinds, such as mitochondria and chloro- 
plasts, evolved from bacteria that entered the cytoplasm of early eukaryotic cells. 
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Other organelles include the nucleus, endoplasmic reticulum, Golgi apparatus, and 
vacuoles as well as other relatively small cellular inclusions, such as peroxisomes, 
lysosomes, and chromoplasts. The nucleus is the repository of genetic information 
contained within the linear sequences of nucleotides in the DNA of chromosomes. 
Mitochondria capture the energy released during aerobic metabolism and use it to 
produce ATP. Chloroplasts endow cells with the ability to carry out photosynthesis, 
capturing light energy and transforming it into metabolically useful chemical forms. 


1.3¢ Membranes Are Supramolecular Assemblies That Define 
the Boundaries of Cells 


Membranes define the boundaries of cells and organelles. As such, they are not easily 
classified as supramolecular assemblies or organelles, although they share the proper- 
ties of both. Membranes resemble supramolecular complexes in their construction 
because they are complexes of proteins and lipids maintained by noncovalent forces. 
Hydrophobic interactions are particularly important in maintaining membrane struc- 
ture. Hydrophobic interactions arise because water molecules prefer to interact with 
each other rather than nonpolar substances. The presence of nonpolar molecules lessens 
the range of opportunities for water—water interaction by forcing water molecules into 
ordered arrays around the nonpolar groups. Such ordering can be minimized if the indi- 
vidual nonpolar molecules redistribute from a dispersed state in the water into an aggre- 
gated organic phase surrounded by water. The spontaneous assembly of membranes in 
the aqueous environment, where life arose and exists, is the natural result of the hydro- 
phobic (“water-fearing”) character of lipids and proteins. Hydrophobic interactions are 
the creative means of membrane formation and the driving force that presumably estab- 
lished the boundary of the first cell. The membranes of organelles, such as nuclei, mito- 
chondria, and chloroplasts, differ from one another, with each having a characteristic 
protein and lipid composition tailored to the organelle’s function. The formation of these 
discrete compartments in the cell allows these cells to carry out unique functions. 


1.3d The Unit of Life Is the Cell 


The cell is characterized as the unit of life, the smallest entity capable of displaying the 
attributes associated uniquely with the living state: growth, metabolism, stimulus 
response, and replication. In previous discussions, we explicitly narrowed the infinity 
of chemical complexity potentially available to organic life and we previewed an orga- 
nizational arrangement, moving from simple to complex, that provides insights into the 
functional and structural plan of the cell. Nevertheless, these features do not explain 
the living characteristics of cells. Can we find other themes represented within biomol- 
ecules that are explicitly chemical yet anticipate or illuminate the living condition? 
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Figure 1.8 (a) Amino acids build proteins. (b) Sugars build polysaccharides. (c) Nucleotides 
build nucleic acids. All these polymerization processes involve bond formations that are 
accompanied by the elimination of water (dehydration synthesis reactions). 


Chapter 1 The Facts of Life: Chemistry Is the Logic of Biological Phenomena 


1.4 | How Do the Properties of Biomolecules Reflect Their 
Fitness to the Living Condition? 


If we consider what attributes of biomolecules render them so fit as components of 
growing, replicating systems, several biologically relevant themes of structure and 
organization emerge. Furthermore, as we study biochemistry, we will see that these 
themes serve as principles of biochemistry. Prominent among them is the necessity for 
information and energy in the maintenance of the living state. Some biomolecules must 
have the capacity to contain the information, or “recipe,” of life. Other biomolecules 
must be able to translate this information so that the organized molecules essential to 
life are synthesized. Interactions between these structures are the processes of life. An 
orderly mechanism for capturing energy from the environment must also exist to 
obtain the energy needed to drive these processes. What properties of biomolecules 
endow them with the potential for such remarkable qualities? 


1.4a Biological Macromolecules and Their Building Blocks 
Have a “Sense” or Directionality 


The macromolecules of cells are polymers built of monomeric units—amino acids in 
proteins, nucleotides in nucleic acids, and carbohydrates in polysaccharides—that 
have structural polarity. That is, these building blocks are not symmetrical, and so 
they can be thought of as having a “head” and a “tail.” Polymerization of these 
units to form macromolecules occurs by head-to-tail linear connections. Because of 
this, the polymer also has a head and a tail, and hence, the macromolecule has a 
“sense” or direction to its structure (Figure 1.8). 
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1.4b Biological Macromolecules Are Informational 


Because biological macromolecules have a sense to their structure, the sequential 
order of their component building blocks, when read along the length of the 
molecule, has the capacity to specify information in the same manner that the letters 
of the alphabet can form words when arranged in a linear sequence (Figure 1.9). Not 
all biological macromolecules are rich in information. Polysaccharides are often 
composed of the same sugar unit repeated over and over, as in cellulose or starch, 
which are homopolymers of many glucose units. On the other hand, proteins 
and polynucleotides are typically composed of building blocks arranged without 
an obvious pattern; that is, their sequences are unique, akin to the letters 
and punctuation that form this descriptive sentence. In these unique sequences 
lies meaning. 

In 1957, Francis Crick laid the foundation for molecular biology when he postu- 
lated a pathway for the flow of information in biological systems. This information is 
encoded in the sequence of building blocks arrayed along informational macromole- 
cules. Crick reasoned that information flows from DNA, the hereditary material and 
the repository of that information, through an RNA intermediate that serves as a 
message to specify the sequence of amino acids in a protein, as summarized in the 
scheme below: 


DNA —> RNA > protein 


RNA resembles DNA. In it, U replaces T as one of the four nucleotide building 
blocks. 

As we will see, a gene is a sequence of nucleotides in DNA that encodes the informa- 
tion to build a protein with a particular amino acid sequence. Significantly, proteins are 
the agents of biological function—they underlie virtually all of the activities that a cell 
or organism displays: 


gene — message — function 


In macromolecular terms, 
-TTCAGCAATAAGGG- > -UUCAGCAAUAAGGG- —> -Phe-Ser-Asn-Lys-Pro- 
(nucleotide sequences) (amino acid sequence) 
In informational terms, 
(DNA/information) + (RNA intermediate) — (biological form/function) 


A nucleotide sequence in DNA is transcribed and written as an RNA sequence, and 
that RNA sequence is translated as a sequence of amino acids in a protein. Crick’s path- 
way of information transfer has become the central dogma of molecular biology. 


A strand of DNA 


A polypeptide segment 


A polysaccharide chain 


Figure 1.9 The sequence of monomeric units in a biological polymer can contain information if the diver- 
sity and order of the units are not overly simple or repetitive. Nucleic acids and proteins are information- 
rich molecules; polysaccharides are not. 
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1.4c Biomolecules Have Characteristic 
Three-Dimensional Architecture 


The structure of any molecule is a unique and specific aspect of its identity. Molecular 
structure reaches its pinnacle in the intricate complexity of biological macromole- 
cules, particularly proteins. Although proteins are linear sequences of covalently 
linked amino acids, the course of the protein chain can turn, fold, and coil in the three 
dimensions of space to establish a specific, highly ordered architecture that is an 
identifying characteristic of the given protein molecule (Figure 1.10). 


1.4d Weak Forces Maintain Biological Structure and 
Determine Biomolecular Interactions 


Covalent bonds hold atoms together so that molecules are formed. In contrast, weak 
chemical forces or noncovalent bonds (hydrogen bonds, van der Waals forces, ionic 
interactions, and hydrophobic interactions) are intramolecular or intermolecular 
attractions between atoms. None of these forces, which typically range from 4 to 30 
kJ/mol, are strong enough to bind free atoms together (Table 1.2). The average kinetic 
energy of molecules at 25°C is 2.5 kJ/mol, so the energy of weak forces is only several 
times greater than the dissociating tendency due to thermal motion of molecules. 
Thus, these weak forces establish interactions that are constantly forming and break- 
ing at physiological temperatures. However, through collective action, large numbers 
of weak forces can create, organize, and stabilize complex structures. These weak 
Figure 1.10 Antigen-binding domain of immunoglobulin forces merit further discussion because their properties profoundly influence the 


G (IgG). (a) Space-filling model showing each atomas nature of the biological structures they build. 
a sphere. (b) Ribbon diagram representation. Each flat 
arrow represents a region of regular 3-dimensional struc- 


ture containing 5 to 15 amino acids. PDBID=1FDL. 1.4e Van der Waals Forces Are Short-Range Interactions 
between Atoms in Nearby Molecules 


Van der Waals forces result from the induced electrical interactions between closely 
approaching atoms or molecules as their negatively charged electron clouds fluctu- 
ate instantaneously in time. These fluctuations allow attraction to occur between 
the positively charged nuclei and the electrons of nearby atoms. Van der Waals 
attractions operate over a very limited interatomic distance (0.3 to 0.6 nm) and are 
an effective bonding interaction at physiological temperatures only when a number 
of atoms in a molecule can interact with several atoms in a neighboring molecule. 
For this to occur, the atoms on interacting molecules must pack together neatly. 
That is, their molecular surfaces must possess a degree of structural complementar- 
ity (Figure 1.11). 

At best, van der Waals interactions are weak and individually contribute 0.4 to 
4.0 kJ/mol of stabilization energy. However, the sum of many such interactions within a 


Table 1.2 Weak Chemical Forces and Their Relative Strengths and Distances 
Strength Distance 

Force (kJ/mol) (nm) Description 

Van der Waals interactions 0.44.0 0.3-0.6 Strength depends on the relative size of the atoms or molecules and the 
distance between them. The size factor determines the area of contact between 
two molecules: The greater the area, the stronger the interaction. 

Hydrogen bonds 12-30 0.3 Relative strength is proportional to the polarity of the H bond donor and H bond 
acceptor. More polar atoms form stronger H bonds. 

Ionic interactions 20 0.25 Strength also depends on the relative polarity of the interacting charged species. 
Some ionic interactions are also H bonds: —NH;*: --~OOC—. 

Hydrophobic interactions <40 — Force is a complex phenomenon determined by the degree to which the structure of 


water is disordered as discrete hydrophobic molecules or molecular regions coalesce. 
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(a) 


yO? 
` 


Gln 121 


Figure 1.11 Van der Waals packing is enhanced in molecules that are structurally complementary. 
Gln!?!, a surface protuberance on lysozyme, is recognized by the antigen-binding site of an antibody 
against lysozyme. Gln!! (pink) fits nicely in a pocket formed by Tyr? (orange), Phe”! (light green), Trp” 
(dark green), and Tyr!” (blue) components of the antibody. (See also Figure 1.15.) (a) Ball-and-stick 
model. (b) Space-filling representation. (From Amit, A. G., et al., 1986. Three-dimensional structure 
of an antigen-antibody complex at 2.8 A resolution. Science 233:747-753, figure 5.) 


macromolecule or between macromolecules can be substantial. Calculations indicate 
that the attractive van der Waals energy between the enzyme lysozyme and a sugar sub- 
strate that it binds is about 60 kJ/mol. 

When two atoms approach each other so closely that their electron clouds begin to 
overlap, strong repulsive van der Waals forces occur, as shown in Figure 1.12. Between 
the repulsive and attractive domains lies a low point in the potential curve. This low 
point defines the distance known as the van der Waals contact distance, which is the 
interatomic distance that results if only van der Waals forces hold two atoms together. 
The limit of approach of two atoms is determined by the sum of their van der Waals 
radii (Table 1.3). 


| Table 1.3 | Radii of Common Biomolecular Atoms 


Atom 
Van der Waals Covalent Represented 

Atom Radius (nm) Radius (nm) to Scale 
H 0.1 0.037 O 
C 0.17 0.077 Q 
N 0.15 0.070 ri) 
O 0.14 0.066 Q 
P 0.19 0.096 Q 
S 0.185 0.104 Q 
Half-thickness 0.17 — | | 
of an aromatic 
ring 


Phe 91 


Trp 92 


Gln 121 


van der Waals 


2.0 radii 


1.0 


Energy (kJ/mol) 


van der Waals 
-1.0 contact distance 
| a 
0 0.2 0.4 0.6 0.8 
r (nm) 
Figure 1.12 The van der Waals interaction energy 


profile as a function of the distance, r, between the 
centers of two atoms. 
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H bonds 
Bonded atoms 
O—H---O 
o—H---07 
O—H---N 
N—H---o 
*N—H---O 
N—H---N 


Donors 


Approximate 
bond length* 


0.27 nm 
0.26 nm 
0.29 nm 
0.30 nm 
0.29 nm 
0.31 nm 


*Lengths given are distances from the atom 
covalently linked to the H to the atom 
H bonded to the hydrogen: 


O—H---O 


Functional groups that are important 
H-bond donors and acceptors: 


Acceptors 


oR 


| 


i 


Figure 1.13 Some biologically important 


H bonds. 


The Facts of Life: Chemistry Is the Logic of Biological Phenomena 


1.4f Hydrogen Bonds Form between H Atoms and Electronegative 
Atoms Like N and O 


Hydrogen bonds form between a hydrogen atom covalently bonded to an electronega- 
tive atom (such as oxygen or nitrogen) and a second electronegative atom that serves 
as the hydrogen bond acceptor. Several important biological examples are given in 
Figure 1.13. Hydrogen bonds, at a strength of 12 to 30 kJ/mol, are stronger than van 
der Waals forces and have an additional property: H bonds are cylindrically sym- 
metrical and tend to be highly directional, forming straight bonds between donor, 
hydrogen, and acceptor atoms. Hydrogen bonds are also more specific than van der 
Waals interactions because they require the presence of complementary hydrogen 
donor and acceptor groups. 


1.4g lonic Interactions Form between Oppositely Charged Groups or 
Molecules 


Ionic interactions, also called electrostatic interactions, are the result of forces between 
electrically charged structures, such as negative carboxyl groups and positive amino 
groups (Figure 1.14). These electrostatic forces average about 20 kJ/mol in aqueous 
solutions. Typically, the electrical charge is radially distributed, so these interactions 
may lack the directionality of hydrogen bonds or the precise fit of van der Waals inter- 
actions. Nevertheless, because the opposite charges are restricted to sterically defined 
positions, ionic interactions can impart a high degree of structural specificity. 

The strength of electrostatic interactions is highly dependent on the nature of the 
interacting species and the distance, r, between them. Electrostatic interactions may 
involve ions (species possessing discrete charges), permanent dipoles (a permanent sepa- 
ration of positive and negative charge), or induced dipoles (a temporary separation of 
positive and negative charge induced by the environment). 


1.4h Hydrophobic Interactions Are the Result of Favorable Interactions 
between Water Molecules 


Hydrophobic interactions result from the strong tendency of water to exclude nonpo- 
lar groups or molecules (see Chapter 2). Hydrophobic interactions arise not because 
of any intrinsic affinity of nonpolar substances for one another (although van der 
Waals forces do promote the weak bonding of nonpolar substances), but because 
water molecules prefer the stronger interactions that they share with one another, 
compared to their interactions with nonpolar molecules. Hydrogen-bonding interac- 
tions between polar water molecules can be more varied and numerous if nonpolar 
molecules merge into a distinct organic phase. This phase separation raises the 
entropy of water because individual nonpolar molecules are no longer dispersed in the 
water, and thus, water molecules are no longer arranged in orderly arrays around 
them. It is these preferential interactions between water molecules that “exclude” 
hydrophobic substances from aqueous solution and drive the tendency of nonpolar 
molecules to cluster together. Thus, nonpolar regions of biological macromolecules 
are often buried in the molecule’s interior to exclude them from the aqueous milieu. 
The formation of oil droplets as hydrophobic nonpolar lipid molecules that coalesce 
in the presence of water is an approximation of this phenomenon. These tendencies 
have important consequences in the creation and maintenance of the macromolecular 
structures and supramolecular assemblies of living cells. 


1.4i The Defining Concept of Biochemistry Is “Molecular Recognition 
through Structural Complementarity” 
The complicated and highly organized patterns of life depend on the ability of bio- 


molecules to recognize and interact with one another in very specific ways. Such 
interactions are fundamental to metabolism, growth, replication, and other vital 
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processes. The interaction of one molecule with another, a protein witha metabolite, (a) NH, 
for example, is most precise if the structure of one is complementary to the structure N Ky 
of the other, as in two connecting pieces of a puzzle or, in the more popular analogy Mgt.. ¢ | N 
for macromolecules and their ligands, a lock and its key (Figure 1.15). Structural com- oO OO N x 


plementarity is the means of recognition in biomolecular interactions. Indeed, this _ | | | 
as fe : a O-P-—O-P-O-P-O-CH, 

principle of structural complementarity is the very essence of biomolecular recognition. Il i i 

Structural complementarity is the clue to understanding the functional properties of O O (0) o 
biological systems. Biological systems, from the macromolecular level to the cellular 
level, operate via specific molecular recognition mechanisms based on structural 
complementarity: A protein recognizes its specific metabolite, an antibody recog- 
nizes its antigen, a strand of DNA recognizes its complementary strand, sperm recog- 
nize an egg. All these interactions involve structural complementarity between 
molecules. 


HO OH 


1.4j Biomolecular Recognition Is Mediated by Weak Chemical Forces 


The weak chemical forces described earlier underlie the interactions that are the basis of 
biomolecular recognition. It is important to realize that because these interactions are 
sufficiently weak, they are readily reversible. Consequently, biomolecular interactions 
tend to be transient; rigid, static lattices of biomolecules that might paralyze cellular activ- 
ities are not formed. Instead, a dynamic interplay occurs between metabolites and macro- 
molecules, hormones and receptors, and all the other participants instrumental to life 
processes. This interplay is initiated upon specific recognition between complementary 
molecules and ultimately results in unique physiological activities. Biological function is 
achieved through mechanisms based on structural complementarity and weak chemical 
interactions. 

This principle of structural complementarity extends to higher interactions essential 
to the establishment of the living condition. For example, the formation of supramo- 
lecular complexes occurs because of recognition and interaction between their various Figure 1.14 Tonic bonds in biological molecules. 
macromolecular components, as governed by the weak forces formed between them. If _ () Magnesium ATP. (b) Intramolecular ionic 

ae bonds between oppositely charged groups on 
a sufficient number of weak bonds can be formed, as between macromolecules comple- mimo acid'residues ina protein. 
mentary in structure to one another, larger structures assemble spontaneously. The 
tendency for nonpolar molecules and parts of molecules to come together through 
hydrophobic interactions also promotes the formation of supramolecular assemblies. 
Very complex subcellular structures are actually spontaneously formed in an assembly Ligand a molecule (or atom) that binds specifically to 
process that is driven by weak forces accumulating through structural complementarity. another molecule (from Latin ligare, to bind). 


Protein strand 


Puzzle Lock and key 


(a) b “oa” (c) 


Figure 1.15 Structural complementarity: (a) the pieces of a puzzle, (b) the lock and its key, (c) a bio- 
logical macromolecule and its ligand—an antigen-antibody complex. The antigen on the right (gold) is 
a small protein, lysozyme, from hen egg white. The antibody molecule (IgG) (/eft) has a pocket that is 
structurally complementary to a surface feature (red) on the antigen. (See also Figure 1.11.) 
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1.4k Weak Forces Restrict Organisms to a Narrow Range 
of Environmental Conditions 


Because biomolecular interactions are governed by weak forces, living systems are 
restricted to a narrow range of physical conditions. Biological macromolecules are 
functionally active only within a narrow range of environmental conditions, such as 
temperature, ionic strength, and relative acidity. Extremes of these conditions disrupt 
the weak forces essential to maintaining the intricate structure of macromolecules. 
The loss of structural order in these complex macromolecules, so-called denaturation, 
is accompanied by loss of function (Figure 1.16). As a consequence, cells cannot toler- 
ate reactions in which large amounts of energy are released, nor can they generate a 
large energy burst to drive energy-requiring processes. Instead, such transformations 
take place via sequential series of chemical reactions whose overall effect achieves 
dramatic energy changes, even though any given reaction in the series proceeds with 
only a modest input or release of energy (Figure 1.17). These sequences of reactions 


ch e ae os 
Figure 1.16 Denaturation and renaturation of an Native Denatured 
intricate protein structure. protein protein 


e 
© 


The combustion of glucose: C;H,,0, + 6 Op ———> 6 COs + 6H,O + 2870 kJ energy 


(a) In an aerobic cell (b) In a bomb calorimeter 


Glycolysis 
LS) 
© 
2 Pyruvate pid) 
as heat 


6 CO, + 6 HyO 30-38 ATP 6 CO, +6H,O 


Figure 1.17 Metabolism is the organized release or capture of small amounts of energy when the over- 
all energy change is large. (a) Cells can release energy from glucose in a stepwise fashion, and the small 
“packets” of energy are used as ATP. (b) Combustion of glucose in a bomb calorimeter results in an 
uncontrolled, explosive release of energy in its least useful form, heat. 
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are organized to release useful energy to the cell from the breakdown of food or to 
take such energy and use it to drive the synthesis of biomolecules essential to the liv- 
ing state. Collectively, these reaction sequences constitute cellular metabolism—the 
ordered reaction pathways by which cellular chemistry proceeds and biological 
energy transformations are accomplished. 


1.41 Enzymes Catalyze Metabolic Reactions 


The sensitivity of cellular constituents to environmental extremes places another con- 
straint on the reactions of metabolism. The rate at which cellular reactions proceed is a 
very important factor in maintaining the living state. However, the common ways that 
chemists accelerate reactions are not available to cells; the temperature cannot be raised, 
acid or base cannot be added, the pressure cannot be elevated, and concentrations 
cannot be dramatically increased. Instead, biomolecular catalysts mediate cellular reac- 
tions. These catalysts, called enzymes, accelerate the reaction rates by many orders of 
magnitude and, by selecting the substances undergoing reaction, determine the specific 
reaction that takes place. Virtually every metabolic reaction is catalyzed by an enzyme 
(Figure 1.18). 


Metabolic Regulation Is Achieved by Controlling the Activity of Enzymes Thousands of 
reactions mediated by an equal number of enzymes are occurring at any given instant 
within the cell. Collectively, these reactions constitute cellular metabolism. Metabo- 
lism has many branch points, cycles, and interconnections, as subsequent chapters 
reveal. All these reactions, many of which are at apparent cross-purposes in the cell, 
must be fine-tuned and integrated so that metabolism and life proceed harmoniously. 
This metabolic regulation is achieved through controls on enzyme activity so that the 
rates of cellular reactions are appropriate to cellular requirements. 

Despite the organized pattern of metabolism and the thousands of enzymes 
required, cellular reactions conform to the same thermodynamic principles that govern 
any chemical reaction. Enzymes have no influence over energy changes (the thermody- 
namic component) in their reactions. Enzymes only influence reaction rates. Thus, cells 
rely on enzyme-based metabolic systems that take in food, release waste, and carry out 
complex degradative and biosynthetic reactions essential to their survival while main- 
taining a constant internal environment (homeostasis) with no outwardly apparent 
changes. Cells are open thermodynamic systems that exchange matter and energy with 
their environment and function as highly regulated isothermal chemical engines. 


1.4m Biomolecular Phenomena across the Time Scale of Life 


Individual organisms have life spans ranging from a day or less to a century or more, 
but the phenomena that characterize and define living systems have durations rang- 
ing over 33 orders of magnitude, from 107! sec (electron transfer reactions, photo 
excitation in photosynthesis) to 10! sec (the period of evolution, spanning from the 
first appearance of organisms on the earth more than 3 billion years ago to today) 
(Table 1.4). Because proteins are the agents of biological function, phenomena involv- 
ing weak interactions and proteins dominate the shorter times. As time increases, 
more stable interactions (covalent bonds) and phenomena involving the agents of 
genetic information (the nucleic acids) come into play. 


1.5 | What Are the Organization and Structure of Cells? 


All living cells fall into one of three broad categories—Archaea, Bacteria, and 
Eukarya. Archaea and Bacteria are referred to collectively as prokaryotes. As a 
group, prokaryotes are single-celled organisms that lack nuclei and other organelles; 
the word is derived from pro meaning “prior to” and karyot meaning “nucleus.” In 


Figure 1.18 Carbonic anhydrase, a representa- 
tive enzyme. This macromolecule is a polymer 
of 260 amino acids and is composed of more 
than 2200 atoms. Its structure is represented as 
a ribbon diagram. Ribbon diagrams are useful 
summaries of a macromolecule’s 3D structural 
features. 
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Table 1.4 
Time (sec) 
19745 
10713 


1071 


107!? to 103 


10~6 to 10° 


10° 


10! to 10° 
10* to 10° 


107 to 108 
10° to 10° 
10!8 


Life Times 
Process 


Electron transfer 


Transition states 


H-bond lifetimes 


Motion in proteins 


Enzyme catalysis 


Diffusion in membranes 


Protein synthesis 


Cell division 


Embryonic development 
Life span 


Evolution 


Example 


The light reactions in photosynthesis. 


Transition states in chemical reactions have lifetimes of 10~!! to 107" sec (the reciprocal of the 
frequency of bond vibrations). 


H bonds are exchanged between H2O molecules due to the rotation of the water molecules 
themselves. 


Fast: tyrosine ring flips, methyl group rotations 
Slow: bending motions between protein domains 


10~ sec: fast enzyme reactions 
1073 sec: typical enzyme reactions 
10° sec: slow enzyme reactions 


A typical membrane lipid molecule can diffuse from one end of a bacterial cell to the other in 
1 sec; a small protein would go half as far. 


Some ribosomes synthesize proteins at a rate of 20 amino acids added per second. 


Prokaryotic cells can divide as rapidly as every hour or so; eukaryotic cell division varies greatly 
(from hours to years). 


Human embryonic development takes 9 months (2.4 X 107 sec). 
Human life expectancy is about 80 years in developed countries (2.5 X 10° sec). 


The first organisms appeared 3.8 X 10° years ago and evolution has continued since then. 


traditional biological classification schemes, prokaryotes were grouped together as 
members of the kingdom Monera. The other four living kingdoms were all Eukarya— 
the single-celled Protists, such as amoebae, and all multicellular life forms, includ- 
ing the Fungi, Plant, and Animal kingdoms. Eukaryotic cells have true nuclei and 
other organelles such as mitochondria, with the prefix ew meaning “true.” Group- 
ings of organisms into kingdoms is useful, but Eukarya are a more complex domain 
of life than earlier classification schemes suggest. 


1.5a The Eukaryotic Cell Likely Emerged from an Archaeal Lineage 


In the three-domain tree of life, Archaea, Bacteria, and Eukarya share a common 
ancestry, arising from primordial cells that exchanged genetic information among 
themselves. Current evidence suggests that, early on, Archaea and Bacteria were 
the two principal domains of life, and eukaryotic cells emerged when an archaeal 
cell hosted one or more bacterial partners. That is, contemporary eukaryotic cells 
are composite cells that harbor archaeal and bacterial contributions. 

Despite great diversity in form and function, cells and organisms share much 
biochemistry in common. This commonality was substantiated by the results of 
whole genome sequencing, the determination of the complete nucleotide sequence 
within the DNA of an organism. For example, the genome of the metabolically 
divergent archaea Methanococcus jannaschii shows 44% similarity to known genes 
in eubacteria and eukaryotes. 


1.5b How Many Genes Does a Cell Need? 


The Mycoplasma genitalium genome consists of 523 genes, encoding 484 proteins, 
in just 580,074 base pairs (Table 1.5). Note that some flowering plants (Arabidopsis 
thaliana) and animals (pufferfish) have more genes than humans. Thus, the num- 
ber of genes does not correlate directly with biological complexity. This informa- 
tion sparks an interesting question: How many genes are needed for cellular life? 
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Organism in Adult* 
Mycobacterium genitalium 1 
Pathogenic bacterium 

Methanococcus jannaschii 1 
Archaeal methanogen 

Escherichia coli K12 1 
Intestinal bacterium 

Saccharomyces cerevisiae 1 
Baker’s yeast (eukaryote) 

Caenorhabditis elegans 959 
Nematode worm 

Drosophila melanogaster 104 
Fruit fly 

Arabidopsis thaliana 107 
Flowering plant 

Fugu rubripes 10” 
Pufferfish 

Homo sapiens 10" 
Human 


Genes Does It Take to Make An Organism? 
Number of Cells 


Number of Genes 


523 


1800 


4400 


6000 


19,000 


13,500 


27,000 


26,700 (est.) 


20,500 (est.) 


The first four of the nine organisms in the table are single-celled microbes; the last six are eukaryotes; the last five are 
multicellular, four of which are animals; the final two are vertebrates. Although pufferfish and humans have roughly 
the same number of genes, the pufferfish genome, at 0.365 billion nucleotide pairs, is only one-eighth the size of the 


human genome. 


*Numbers for Arabidopsis thaliana, the pufferfish, and human are “order-of-magnitude” rough estimates. 


Critical Developments in Biochemistry 


Synthetic Life 


If life is chemistry, can we use chemistry to create life? J. Craig 
Venter and his colleagues at the J. Craig Venter Institute (JCVI) 
claim to have created the first synthetic life. They devised a synthetic 
version of the 1.08 x 10°-base pair genome of Mycobacterium 
mycoides by designing a thousand segments of DNA, each about 
1000 base pairs long, which were then chemically synthesized by the 
Blue Heron Biotechnology Co., a DNA synthesizing service. The 
JCVI scientists then assembled these pieces to form a complete syn- 
thetic genome that was transferred into the cytoplasm of a related 
bacterial species, M. capricolum. The self-replicating cells that grew 
following this genome transplantation were under the direction of 
the synthetic genome and produced proteins representative of M. 
mycoides, not M. capricolum. Further, the only DNA in these cell 


Any minimum gene set must encode all the information necessary for cellular 
metabolism, including the vital functions essential to reproduction. The simplest 
cell must show at least some degree of metabolism and energy production; genetic 
replication based on a template molecule that encodes information (DNA or 


cultures was the synthetic DNA created by Venter and associates, 
as evidenced by certain “watermarks” they designed into their syn- 
thetic genome to establish its uniqueness. These cells represent the 
first artificial living organisms, artificial in the sense that their entire 
genome was chemically synthesized and not the result of biological 
evolution. Their significance lies in the possibilities they open for 
the creation of life forms for specific purposes, such as oil-eating 
bacteria for environmental remediation or bacteria that synthesize 
desired products such as drugs. 


Gibson, D. G., 2010. Creation of a bacterial cell controlled by a chemically 
synthesized genome. Science 329:52-56. 


Gene A unit of hereditary information, physically 
defined by a specific sequence of nucleotides in DNA; 
in molecular terms, a gene is a nucleotide sequence 
that encodes a protein or an RNA product. 


RNA?); and formation and maintenance of a cell boundary (membrane). Top- 
down studies aim to discover from existing cells what a minimum gene set might 
be. These studies have focused on simple parasitic bacteria because parasites often 
obtain many substances from their hosts and do not have to synthesize them from 
scratch; thus, they require fewer genes. One study concluded that 206 genes are 
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(a) 


E. coli bacteria 


sufficient to form a minimum gene set. The set included genes for DNA replica- 
tion and repair, transcription, translation, protein processing, cell division, mem- 
brane structure, nutrient transport, metabolic pathways for ATP synthesis, and 
enzymes to make a small number of metabolites that might not be available, such 
as pentoses for nucleotides. Another study based on computer modeling decided 
that a minimum gene set might have only 105 protein-coding genes. Bottom-up 
studies aim to create a minimal cell by reconstruction based on known cellular 
components. The simplest, functional artificial cell capable of replication would 
contain an informational macromolecule (presumably a nucleic acid) and enough 
metabolic apparatus to maintain a basic set of cellular components within a 
membrane-like boundary. 


1.5¢ Archaea and Bacteria Have a Relatively Simple 
Structural Organization 


The bacteria form a widely distributed group. Certain of them are pathogenic to 
humans. The archaea, about which less is known, were first discovered growing in 
unusual environments where other cells cannot survive. Archaea include the 
thermoacidophiles (heat- and acid-loving bacteria) of hot springs, the halophiles (salt- 
loving bacteria) of salt lakes and ponds, and the methanogens (bacteria that generate 
methane from CO, and H;). Archaea are also common in typical microbial habitats, 
such as soils, seas, and the guts of animals. Bacteria and Archaea, collectively known 
as Prokaryota, are typically very small, on the order of several microns in length, 
and are usually surrounded by a rigid cell wall that protects the cell and gives it its 
shape. The characteristic structural organization of one of these cells is depicted in 
Figure 1.19. 

Prokaryotic cells have only a single membrane, the plasma membrane or cell mem- 
brane. Because they have no other membranes, prokaryotic cells contain no nucleus 
or organelles. Nevertheless, they possess a distinct nuclear area called the nucleoid 


Ribosomes 


Nucleoid (DNA) 


as Flagella 
a 7 


Figure 1.19 This bacterium is Escherichia coli, a member of the coliform group of bacteria that 
colonize the human intestinal tract. (a) Micrograph of a cross section of E. coli cells, (b) a micrograph 
of a cluster of E. coli cells, and (c) a schematic drawing of an E. coli cell. (Photo, Martin Rotker/ 
Phototake, Inc.; inset photo, David M. Phillips/Science Source/Photo Researchers, Inc.) 
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| Table 1.6 e| 1.6 Some Human Diseases Caused by Bacteria 

Disease Causative Organism 

Tetanus (lockjaw) Clostridium tetani 
Botulism Clostridium botulinum 
Whooping cough Bordetella pertussis 
Cholera Vibrio cholerae 

Bubonic plague Yersinia pestis 

Diphtheria Corynebacterium diphtheriae 
Tuberculosis Mycobacterium tuberculosis 
Leprosy Mycobacterium leprae 
Gonorrhea Neisseria gonorrhoeae 


where a single circular chromosome is localized. Some have internal membranous 
structures derived from and continuous with the cell membrane. Reactions of cellular 
energy production are localized mainly on these membranes. In cyanobacteria, flat, 
sheet-like membranous structures called lamellae are formed from cell membrane 
infoldings. These lamellae are the sites of photosynthesis. Some bacteria have flagella, 
single, long filaments used for motility. Prokaryotes largely reproduce by asexual 
division, although sexual exchanges can occur. Table 1.6 lists a few of the many 
diseases caused by bacteria. 


1.5d The Structural Organization of Eukaryotic Cells Is More Complex 
Than That of Prokaryotic Cells 


Compared with prokaryotic cells, eukaryotic cells are much greater in size, typically 
having cell volumes 10° to 104 times larger. They are also much more complex. These 
two features require that eukaryotic cells partition their diverse metabolic processes 
into organized compartments, with each compartment dedicated to a particular 
function. A system of internal membranes accomplishes this partitioning. A typical 
animal cell is shown in Figure 1.20 and a typical plant cell in Figure 1.21. 

Eukaryotic cells possess a discrete, membrane-bounded nucleus, the repository 
of the cell’s genetic material, which is distributed among a few or many chromo- 
somes. During cell division, equivalent copies of this genetic material must be 
passed to both daughter cells through duplication and orderly partitioning of the 
chromosomes by the process known as mitosis. Like prokaryotic cells, eukaryotic 
cells are surrounded by a plasma membrane. Unlike prokaryotic cells, eukaryotic 
cells are rich in internal membranes that are differentiated into specialized struc- 
tures such as the endoplasmic reticulum (ER) and the Golgi apparatus. Ribosomes 
associated with the rough ER are the site of membrane protein synthesis. Thirty 
percent of cellular proteins are inserted into the ER membrane. The Golgi appara- 
tus is involved in the packaging and processing of proteins destined for secretion 
from the cell. All of these internal membrane systems connect to one another and 
ultimately to the plasma membrane. Membranes also surround certain organelles 
(mitochondria and chloroplasts, for example) and various vesicles, including vacu- 
oles, lysosomes, and peroxisomes. The common purpose of these membranous 
partitionings is the creation of cellular compartments that have specific, organized 
metabolic functions, such as the mitochondrion’s role as the principal site of cel- 
lular energy production. Eukaryotic cells also have a cytoskeleton composed of 
arrays of filaments that give the cell its shape and its capacity to move. Some 
eukaryotic cells also have long projections on their surface—cilia or flagella— 
which provide propulsion. 
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Figure 1.20 This figure diagrams a typical animal cell. 


1.6 | What Are Viruses? 


Viruses are supramolecular complexes of nucleic acid (either DNA or RNA) 
encapsulated in a protein coat and, in some instances, surrounded by a membrane 
envelope (Figure 1.22). Viruses are acellular, but they act as cellular parasites to 
reproduce. The bits of nucleic acid in viruses are mobile elements of genetic infor- 
mation. The protein coat serves to protect the nucleic acid and allows it to gain entry 
to the cells that are its specific hosts. No cell type is free from virus attack. Viruses 
infecting bacteria are called bacteriophages (“bacteria eaters”); different viruses 
infect animal cells and plant cells. Once the nucleic acid of a virus gains access to its 
host, it typically takes over the metabolic machinery of the host cell, diverting it to 
the production of virus particles. The host metabolic functions are subjugated to the 
synthesis of viral nucleic acid and proteins. Mature virus particles arise by encapsu- 
lating the newly synthesized viral nucleic acid within a protein coat called the capsid. 
Thus, viruses are supramolecular assemblies that act as parasites of cells (Figure 1.23). 
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Figure 1.21 This figure diagrams a cell in the leaf of a higher plant. The cell wall, membrane, nucleus, chloro- 
plasts, mitochondria, vacuole, endoplasmic reticulum (ER), and other characteristic features are shown. 
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(b) 


Figure 1.22 Viruses are genetic elements enclosed in a protein coat. Viruses are not free-living organ- 
isms and can reproduce only within cells. Viruses have an almost absolute specificity for their host cells, 
infecting and multiplying only within those cells. Viruses are known that infect virtually every kind 

of cell. Shown here are examples of (a) SARS2-CoV-2 coronavirus, the virus that causes COVID-19, 
(b) bacteriophage T4 on its host, E. coli; and (c) a plant virus, the tobacco mosaic virus. 
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Figure 1.23 The virus life cycle. Viruses are mobile bits 

of genetic information encapsulated in a protein coat. The 
genetic material can be either DNA or RNA. Once this 
genetic material enters its host cell, it takes over the host 
machinery for macromolecular synthesis and subverts it to 
synthesize virus-specific nucleic acids and proteins. These viral 
components are then assembled into virus particles that are 
released from the cell. Often, this parasitic cycle of virus infec- 
tion leads to cell death and disease. 
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Often, viruses cause disintegration of the cells that they have infected, a process 
referred to as cell lysis. It is their cytolytic properties that are the basis of viral disease. 
In certain circumstances, the viral genetic elements may integrate into the host chromo- 
some and become quiescent. Such a state is termed lysogeny. Typically, damage to the 
host cell activates the replicative capacities of the quiescent viral nucleic acid, leading to 
viral propagation and release. Some viruses are implicated in transforming cells into a 
cancerous state, that is, in converting their hosts to an unregulated state of cell division 
and proliferation. Because all viruses are heavily dependent on their host for the produc- 
tion of viral progeny, viruses must have evolved after cells were established. Presumably, 
the first viruses were fragments of nucleic acid that developed the ability to replicate 
independently of the chromosome and then acquired the necessary genes enabling pro- 
tection, autonomy, and transfer between cells. Surprisingly, virus-related DNA makes 
up almost half of the human genome. 


Some Human Diseases Caused by Viruses 


Disease Causative Agent 

Caused by DNA Viruses 

Measles Measles morbillivirus 
Mumps Mumps paramyxovirus 
Hepatitis B Hepadnavirus 
Influenza (flu) Orthomyxovirus 


Chickenpox Varicella-zoster virus 
Smallpox Orthopoxvirus variola virus 
Dengue Flavivirus dengue virus 
Caused by RNA Viruses 

Hepatitis A Hepatovirus A 

Hepatitis C Hepacivirus C 


Common cold 


Rhinoviruses 


Polio Enterovirus C poliovirus 

Shingles Varicella-zoster virus 

Rabies Rabies lyssavirus 

AIDS Human immunodeficiency virus (HIV) 
COVID-19 SARS-CoV-2 coronavirus 
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Problems 


Foundational Biochemistry Things You Should Know after Reading Chapter 1. 


Chemistry is the logic of biological phenomena. ° 


Biological molecules serve functional purposes. 


The living state is characterized by the flow of energy through the 
organism. 


Biomolecules are compounds of carbon. 


Cellular macromolecules and structures are formed from simple 
molecules. 


Metabolites are used to form the building blocks of macromolecules. 


Membranes are supramolecular assemblies that define the boundar- 
ies of cells. bd 


Biological macromolecules and their building blocks have a “sense,” 
or directionality. 


Biological macromolecules are informational. 


Macromolecules have a defining three-dimensional architecture. 


Weak forces important in biochemistry include van der Waals inter- 


actions, hydrogen bonds, electrostatic (ionic) interactions, and hydro- 


phobic interactions. 


Problems 


Weak forces maintain biological structure and determine biomolecu- 
lar interactions. 


Molecular recognition through structural complementarity is the defin- 
ing concept in biochemistry. 


Biomolecular recognition is mediated by weak chemical forces. 


The importance of weak forces restricts organisms to a narrow range 
of environmental conditions. 


Enzymes catalyze metabolic reactions. 
Metabolic regulation is achieved by controlling the activity of enzymes. 


The time scale of life ranges from 10~!5 sec (the speed of electron 
transfer processes) to 10!8 sec, the time span that life has been evolv- 
ing on earth. 


Prokaryotic cells have a simple structural and functional organization. 


Plants and animals are composed of eukaryotic cells, which 
have organelles and a more complex structural and functional 
organization. 


Viruses are lifeless complexes of nucleic acid and protein that act as 
cellular parasites to reproduce. 


Answers to all problems are at the end of this book. Detailed solutions are 7. 


available in the Student Solutions Manual. 


1. 


The Principle of Molecular Recognition through Structural Comple- 
mentarity Biomolecules interact with one another through molecular 
surfaces that are structurally complementary. 


a. 


What properties must proteins possess to interact with mol- 


ecules as different as simple ions, hydrophobic lipids, polar but 8. 


uncharged carbohydrates, and nucleic acids? 


Consult Table 1.1 to compare the size of alanine (an amino acid) 
and immunoglobulin (a protein). Assume this protein can inter- 
act with alanine. What conclusions can you draw about the rela- 
tive surface area of the protein and the parts of it that interact 
with alanine. (Section 1.4) 


The Properties of Informational Macromolecules What organizational 
features allow biological polymers to be informational macromolecules? 


Is it possible for polysaccharides to be informational macromolecules? 
(Section 1.4) 


. Proteins and Nucleic Acids Are Informational Macromolecules What 


are the two essential criteria for a linear polymer to be an informa- 
tional macromolecule? 


Structure and Function DNA molecules adopt the famous double- 
helix shape (see Figure 1.4). Proteins adopt a variety of shapes (see 
Figures 1.10 and 1.16). Based on your understanding of the central 
dogma of molecular biology (see section 1.4b), provide a possible 
explanation for why DNA molecules adopt one shape whereas pro- 
teins have multiple different shapes? 


The Importance of Weak Forces in Biomolecular Recognition Why is it 
important that weak forces, not strong forces, mediate biomolecular 
recognition? (Section 1.4) 


Biological Molecules Often Interact via Weak Forces. 


a. 


b. 


How can you change the kinetic energy in a system of interacting 
molecules? 10. 


Predict the effect on such interactions of 
1) an increase in kinetic energy. 
2) a decrease in kinetic energy. 


Building Macromolecules Biomacromolecules are often built by 
linking monomeric units through a process that removes a water 
molecule from between the monomers (see Figure 1.9). If one ton 
(1000 kg) of a monomer with a molecular mass of 200 g/mol is 
linked into a macromolecular structure, what volume (in L) of water 
would be released (1 mL water = 1 g)? 


Monomers and Macromolecules Macromolecules are often formed 

by linking simple monomeric units through repeated chemical bonds 
between such monomers (see Figure 1.8). Consider the structure shown 
below and draw the structure of a macromolecule that could be created 
from this monomer based on the chemistry shown in Figure 1.8. 


H,N-CH,-O-CH,-COOH 


Solubility in Water Calcium stearate, the calcium salt of stearic acid, is 
poorly soluble in water despite being an ionic compound. Explain why 
using your understanding of the weak forces described in Section 1.4. 


(0) ‘or 
Ca" 
Sie 
(0) 


Structure of calcium stearate 


. Interatomic Distances in Weak Forces versus Chemical Bonds What is 


the distance between the centers of two carbon atoms (their limit of 
approach) that are interacting through van der Waals forces? What 
is the distance between the centers of two carbon atoms joined in a 
covalent bond? (See Table 1.4.) (Section 1.4) 
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11. 


12. 


13. 


14. 


15. 


16. 


Chapter1 The Facts of Life: Chemistry Is the Logic of Biological Phenomena 


The Strength of Weak Forces Determines the Environmental Sensitivity 
of Living Cells How does the central role of weak forces in biomolecu- 
lar interactions restrict living systems to a narrow range of environ- 
mental conditions? (Section 1.4) 


Cells as Steady-State Systems Describe what is meant by the phrase 
“cells are steady-state systems.” (Section 1.4) 


The Biosynthetic Capacity of Cells The nutritional requirements 

of Escherichia coli cells are far simpler than those of humans, yet 

the macromolecules found in bacteria are as complex as those of 
animals. Because bacteria can make all their essential biomolecules 
while subsisting on a simpler diet, do you think bacteria may have 
more biosynthetic capacity and thus more metabolic complexity than 
animals? Organize your thoughts on this question, pro and con, into 
a rational argument. (Section 1.5) 


Cell Structure Without consulting the figures in this chapter, sketch 
the characteristic prokaryotic and eukaryotic cell types and label their 
organelles and membranes. (Section 1.5) 


The Dimensions of Prokaryotic Cells and Their Constituents Escherichia 

coli cells are about 2 um long and 0.8 um in diameter. (Section 1.5) 

a. How many E. coli cells laid end to end would fit across the diam- 
eter of a pinhead? (Assume a pinhead diameter of 0.5 mm.) 

b. What is the volume of an E. coli cell? (Assume it is a cylinder, with 
the volume of a cylinder given by V = rh, where m = 3.14.) 

c. What is the surface area of an £E. coli cell? What is the surface-to- 
volume ratio of an £E. coli cell? 

d. Glucose, a major energy-yielding nutrient, is present in bacterial 
cells at a concentration of about 1 mM. 


What is the concentration of glucose, expressed as mg/mL? 


How many glucose molecules are contained in a typical E. coli 
cell? (Recall that Avogadro’s number = 6.023 x 10%.) 

e. A number of regulatory proteins are present in E. coli at only 
one or two molecules per cell. If we assume that an E. coli cell 
contains just one molecule of a particular protein, what is the 
molar concentration of this protein in the cell? 

If the molecular weight of this protein is 40 kDa, what is its con- 
centration, expressed as mg/mL? 

f. An E. coli cell contains about 15,000 ribosomes, supramolecular 
complexes that carry out protein synthesis. Assuming ribosomes 
are spherical and have a diameter of 20 nm (nanometers), what 
fraction of the E. coli cell volume is occupied by ribosomes? 


g. The Ł. coli chromosome is a single DNA molecule whose mass is 
about 3 x 10° Da. This macromolecule is actually a linear array of 
nucleotide pairs. The average molecular weight of a nucleotide pair 
is 660, and each pair imparts 0.34 nm to the length of the DNA 
molecule. What is the total length of the E. coli chromosome? 
How does this length compare with the overall dimensions of an 
E. coli cell? 


How many nucleotide pairs does this DNA contain? 

h. The average E. coli protein is a linear chain of 360 amino acids. 
If three nucleotide pairs in a gene encode one amino acid in a protein, 
how many different proteins can the E. coli chromosome encode? 
(The answer to this question is a reasonable approximation of 
the maximum number of different kinds of proteins that can be 
expected in bacteria.) 


The Dimensions of Mitochondria and Their Constituents Assume 

that mitochondria are cylinders 1.5 mm in length and 0.6 mm in 

diameter. (Section 1.5) 

a. What is the volume of a single mitochondrion? 

b. Oxaloacetate is an intermediate in the citric acid cycle, an important 
metabolic pathway localized in the mitochondria of eukaryotic cells. 
The concentration of oxaloacetate in mitochondria is about 0.03 mM. 


How many molecules of oxaloacetate are in a single mitochondrion? 


17. 


18. 


19. 


20. 


The Dimensions of Eukaryotic Cells and Their Constituents Assume 
that liver cells are cuboidal in shape, 20 mm on a side. (Section 1.5) 


a. How many liver cells laid end to end would fit across the diam- 
eter of a pinhead? (Assume a pinhead diameter of 0.5 mm.) 


b. What is the volume of a liver cell? 

c. What is the surface area of a liver cell? 

d. What is the surface-to-volume ratio of a liver cell? 
How does this compare to the surface-to-volume ratio of an 
E. coli cell? (Compare this answer with that of problem 3c.) 
Cells exchange matter and communicate with the environment 
through their surface membranes. 


What problems do cells with low surface-to-volume ratios face 
that cells with high surface-to-volume ratios don’t face? 

e. A human liver cell contains two sets of 23 chromosomes, each set 
being roughly equivalent in information content. The total mass 
of DNA contained in these 46 enormous DNA molecules is 4 X 
10!? Da. If each nucleotide pair has a mass of 660 Da, what is 
the total number of nucleotide pairs in these 46 DNA molecules? 

f. If each nucleotide pair contributes 0.34 nm to the length of 
DNA, what would be the total length of the 46 liver-cell DNA 
molecules if laid end-to-end? 

How does this length compare with the overall dimensions of a 
liver cell? 

g. The maximal information in each set of liver cell chromosomes 
should be related to the number of nucleotide pairs in the chro- 
mosome set’s DNA. This number can be obtained by dividing 
the total number of nucleotide pairs just calculated by 2. What is 
this value? 

If this information is expressed in proteins that average 400 
amino acids in length and three nucleotide pairs encode one 
amino acid in a protein, how many different kinds of proteins 
might a liver cell be able to produce? (In reality, liver cell DNA 
encodes approximately 20,000 different proteins.) 


A Simple Genome and Its Protein-Encoding Capacity The Mycoplasma 
genitalium genome consists of 523 genes, encoding 484 proteins, in 
just 580,074 base pairs (see Table 1.5). 

a. What fraction of the M. genitalium genes encode proteins? 

b. What do you think the other genes encode? 

c. If the fraction of base pairs devoted to protein-coding genes is 
the same as the fraction of the total genes that they represent, 
what is the average number of base pairs per protein-coding gene? 

d. If it takes three base pairs to specify an amino acid in a protein, how 
many amino acids are found in the average M. genitalium protein? 

e. If each amino acid contributes, on average, 120 Da to the mass 
of a protein, what is the mass of an average M. genitalium 
protein? (Section 1.5) 


An Estimation of Minimal Genome Size for a Living Cell One prominent 
study of existing cells to determine the minimum number of genes 
needed by a living cell has suggested that 206 genes are sufficient. 


If the ratio of protein-coding genes to non—protein-coding genes 
is the same in this minimal organism as the genes of Mycoplasma 
genitalium, how many proteins are represented in these 206 genes? 


How many base pairs would be required to form the genome of this 
minimal organism if the genes are the same size as M. genitalium 
genes? (Section 1.5) 


An Estimation of the Number of Genes in a Virus Virus genomes 
range in size from approximately 3500 nucleotides to 280,000 base 
pairs. The genome of SARS-CoV-2, the virus responsible for the 
COVID-19 pandemic, consists of 29,811 nucleotides. 


If viral genes are about the same size as M. genitalium genes, 
what is the minimum and maximum number of genes in viruses? 
(Section 1.6) 
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21. Intracellular Protein Transport Proteins made by ribosomes associated 
with the endoplasmic reticulum (ER) may pass into the ER mem- 
brane or enter the lumen of the ER. Devise pathways by which: 


a. aplasma membrane protein may reach the plasma membrane. 


b. a protein destined for secretion is deposited outside the cell. 
(Section 1.5) 


Think-Pair-Share Question 


Create YourOwn Monomer Many biopolymers are created through 

the repeated linking of monomeric units into a larger structure (see 
Figure 1.8). Such monomers have at least two functional groups that 
allow them to link with two other monomers. Create your own monomer 
with the following requirements: 


Further Reading 27 


1 The general formula should be Cj,H,7NO3. 

2 The structure should contain one amine and one carboxylic acid 
functional group (see structures below). 

3 Every carbon should have four bonds, every oxygen two bonds, 
every nitrogen three bonds and every hydrogen one bond. 

4 Double bonds and cyclic (including aromatic) features can be 
present, but no triple bonds. 


Amine Carboxylic acid 
—NH, —C—OH 

Il 

O 
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Water: The Medium of Life 2 


If there is magic on this planet, it is contained 
in water. 


Loren Eisley (inscribed on the wall of the National 
Aquarium in Baltimore, Maryland) 


Smit/Shutterstock.com 


< Where there's water, there's life. 


Water provided conditions for the origin, evolution, and flourishing of life; water is the 2.1 What Are the Properties of Water? 
Haag a oe are the properties of water that render it so suited to its role as 2.2 What Is pH? 
Cae SEARS 2.3 What Are Buffers, and What Do 
They Do? 
2.4 What Properties of Water Give It a 
ater is a major chemical component of Earth’s surface. It is indispensable to Unique Role in the Environment? 


life. Indeed, it is the only liquid that most organisms ever encounter. Life origi- 
nated, evolved, and thrives in the seas. Organisms invaded and occupied terrestrial 
and aerial niches, but none gained true independence from water. Typically, organ- 
isms are 70% to 90% water. Indeed, normal metabolic activity can occur only when 
cells are at least 65% H,O. This dependency of life on water can be understood by 
considering the unusual chemical and physical properties of HO. Subsequent chap- 
ters establish that water and its ionization products, hydrogen ions and hydroxide 
ions, are critical determinants of the structure and function of many biomolecules, 
including amino acids and proteins, nucleotides and nucleic acids, and even phospho- 
lipids and membranes. In yet another essential role, water is an indirect participant— 
a difference in the concentration of hydrogen ions in two solutions separated by a 
membrane represents an energized condition essential to the biological mechanisms 
of energy transformation. First, let’s review the remarkable properties of water. 


2.1 | What Are the Properties of Water? 


Water is everywhere around us, and we are prone to take it for granted because of our 
familiarity with it and its bland nature. Yet we also marvel at its many unusual and 
fascinating properties. At the center of this fascination is the role of water as the 
medium of life. What are the properties of water that make it so unique to our 
existence? 
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Dipole moment 


Covalent bond 
length = 0.095 nm 


van der Waals radius 
of oxygen = 0.14 nm 


van der Waals radius 
of hydrogen = 0.12 nm 


Figure 2.1 The structure of water. Two lobes of 
negative charge formed by the lone-pair electrons 
of the oxygen atom lie above and below the plane 
of the diagram. This electron density contributes 
substantially to the large dipole moment of the water 
molecule. Note that the H—O—H angle is 104.3°, 
not 109°, the angular value found in molecules 
with tetrahedral symmetry, such as CH4. Many of 
the important properties of water derive from this 
angular value, such as the decreased density of its 
crystalline state, ice. 


2.1a Water Has Unusual Properties 


Compared with chemical compounds of similar atomic organization and molecular 
size, water displays unexpected properties. For example, compare water, the hydride 
of oxygen, with hydrides of oxygen’s nearest neighbors in the periodic table, namely, 
ammonia (NH;) and hydrogen fluoride (HF), or with HS, the hydride of sulfur (the 
next element down in Group 16 of the periodic table). Water has a substantially 
higher boiling point, melting point, heat of vaporization, and surface tension. In- 
deed, all of these physical properties are anomalously high for a substance of this 
molecular weight that is neither metallic nor ionic. These properties suggest that 
intermolecular forces of attraction between H20 molecules are high. Thus, the inter- 
nal cohesion of this substance is high. Furthermore, water has an unusually high 
dielectric constant, its maximum density is found in the liquid (not the solid) state, 
and it has a negative volume of melting (that is, the solid form, ice, occupies more 
space than the liquid form, water). It is truly remarkable that so many eccentric 
properties occur together in this single substance. As chemists, we expect to find an 
explanation for these apparent eccentricities in the structure of water. The key to its 
intermolecular attractions must lie in its atomic constitution. Indeed, the unrivaled 
ability of water to form hydrogen bonds is the crucial fact to understanding its 
properties. 


2.1b Hydrogen Bonding in Water Is Key to Its Properties 


The two hydrogen atoms of water are linked covalently to oxygen, each sharing an 
electron pair, to give a nonlinear arrangement (Figure 2.1). This “bent” structure of 
the H2O molecule has enormous influence on its properties. If H2O molecules were 
linear, it would be a nonpolar substance. In the bent configuration, however, the elec- 
tronegative O atom and the two H atoms form a dipole that renders the molecule 
distinctly polar. Furthermore, this structure is ideally suited to H-bond formation, 
where water can serve as both an H donor and an H acceptor. The potential to form 
four H bonds per water molecule is the source of the strong intermolecular attractions 
that endow this substance with its anomalously high boiling point, melting point, heat 
of vaporization, and surface tension. In ordinary ice, the common crystalline form of 
water, each H,O molecule has four nearest neighbors to which it is hydrogen bonded: 
Each H atom donates an H bond to the O of a neighbor, and the O atom serves as an 
H-bond acceptor from H atoms bound to two different water molecules (Figure 2.2). 
A local tetrahedral symmetry results. 

Hydrogen bonding in water is cooperative. That is, an H-bonded water molecule 
serving as an acceptor is a better H-bond donor than an unbonded molecule (and an 
H2O molecule serving as an H-bond donor becomes a better H-bond acceptor). Thus, 
participation in H bonding by H2O molecules is a phenomenon of mutual reinforcement. 
The H bonds between neighboring molecules are weak (23 kJ/mol each) relative to the 
H—O covalent bonds (420 kJ/mol). As a consequence, the hydrogen atoms are situated 
asymmetrically between the two oxygen atoms along the O-O axis. There is never any 
ambiguity about which O atom the H atom is chemically bound to, nor to which O it is 
H bonded. 


2.1c The Structure of Ice Is Based on H-Bond Formation 


In ice, the hydrogen bonds form a space-filling, three-dimensional network. These 
bonds are directional and straight; that is, the H atom lies on a direct line between 
the two O atoms. This linearity and directionality mean that the H bonds in ice are 
strong. In addition, the directional preference of the H bonds leads to an open lattice 
structure. For example, if the water molecules are approximated as rigid spheres cen- 
tered at the positions of the O atoms in the lattice, then the observed density of ice 
is actually only 57% of that expected for a tightly packed arrangement of such spheres 
(Figure 2.2). The H bonds in ice hold the water molecules apart. Melting involves 
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Figure 2.2 The structure of ice. The smallest number of H2O molecules in any closed circuit of 
H-bonded molecules is six, so this structure bears the name hexagonal ice. 


breaking some of the H bonds that maintain the crystal structure of ice so that the 
molecules of water (now liquid) can pack closer together. Thus, the density of liquid 
water is slightly greater than that of ice. Ice floats, a property of great importance to 
aquatic organisms in cold climates. 

In liquid water, the rigidity of ice is replaced by fluidity and the crystalline period- 
icity of ice gives way to spatial homogeneity. The H2O molecules in liquid water form 
a disordered H-bonded network, with each molecule having an average of 4.4 close 
neighbors situated within a center-to-center distance of 0.284 nm (2.84 A). At least 
half of the H bonds have nonideal orientations (that is, they are not perfectly straight); 
consequently, liquid H2O lacks the regular lattice-like structure of ice. The space 
about an O atom is not defined by the presence of four H atoms but can be occupied 
by other H,O molecules randomly oriented so that the local environment, over time, 
is essentially uniform. Nevertheless, the heat of melting for ice is but a small fraction 
(13%) of the heat of sublimation for ice (the energy needed to go from the solid to the 
vapor state). This fact indicates that the majority of H bonds between H2O molecules 
survive the transition from solid to liquid. At 10°C, 2.3 H bonds per H2O molecule 
remain and the tetrahedral bond order persists, even though substantial disorder is 
now present. 


2.1d Molecular Interactions in Liquid Water Are Based on H Bonds 


Water structure at the molecular level is, at any instant, inhomogeneous, consisting 
of local fluctuations between patches of near-tetrahedral, ordered arrays of water 
molecules and more asymmetrical ensembles of H2O molecules linked together 
through distorted H bonds. The participation of each H2O molecule in an average 
state of H bonding to its neighbors means that each molecule is connected to every 
other in a fluid network of H bonds. The average lifetime of an H-bonded connection 
between two H2O molecules in water is 9.5 psec (picoseconds, where 1 psec = 107! sec). 
Thus, about every 10 psec, the average H.O molecule moves, reorients, and inter- 
acts with new neighbors, as illustrated in Figure 2.3. 

In summary, pure liquid water consists of HO molecules held in a disordered, 
three-dimensional network that has a local preference for tetrahedral geometry, yet 
contains a large number of strained or broken hydrogen bonds. The presence of strain 
creates a kinetic situation in which H,O molecules can switch H-bond allegiances; 
fluidity ensues. 


2.1 What Are the Properties of Water? 31 


Figure 2.3 The fluid network of H bonds linking water 
molecules in the liquid state. Yellow flashes indicate 

H bonds breaking. It is revealing to note that, in 10 psec, 
a photon of light (which travels at 3 xX 108 m/sec) 
would move a distance of only 0.003 m. 
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Kilan C. Ashad-Bishop 


Kilan C. Ashad-Bishop is a biomedical scientist 
and advocate at Sylvester Comprehensive Cancer 
Center based at the University of Miami where 
she investigates the Social Determinants of Health, 
known as SDOH. The study of SDOH includes 
complex social factors like the effects that income, 
housing, education, and systemic racism have on an 
individual’s overall health risks and their outcomes. 
Specifically, she uses her expertise in cancer biology 
to study the relationship between SDOH and the 
development and prognosis of chronic conditions 
such as cancer. Her research includes studies of 
molecules such as cortisol that create biochemical 
responses to chronic stress. 


Dr. Ashad-Bishop started her journey into 
the field through a BS in Biology at Morgan 
State University and a PhD in Cancer Biology at 
the University of Miami Miller School of Medicine. 
After her graduate studies she worked as a Medical 
Science Consultant to translate complicated drug 
mechanisms of action into easily understandable 
narratives. 

Her unique journey gives her the insight she 
needs to study important connections between 
biological health and social stress factors. Most 
recently she became director of National STEAM 
Academy to develop programs for foster youth in 
STEAM subjects with a social justice lens. 


Figure 2.4 Hydration shells surrounding ions in 
solution. Water molecules orient so that the electrical 
charge on the ion is sequestered by the water dipole. 


2.le The Solvent Properties of Water Derive from Its Polar Nature 


Because of its highly polar nature, water is an excellent solvent for ionic substances 
such as salts; nonionic but polar substances such as sugars, simple alcohols, and amines; 
and carbonyl-containing molecules such as aldehydes and ketones. Although the elec- 
trostatic attractions between the positive and negative ions in the crystal lattice of a salt 
are very strong, water readily dissolves salts. For example, sodium chloride (NaCl) is 
dissolved because dipolar water molecules participate in strong electrostatic interac- 
tions with the Na* and Cl- ions, leading to the formation of hydration shells surround- 
ing these ions (Figure 2.4). Although hydration shells are stable structures, they are also 
dynamic. Each H,O molecule in the inner hydration shell around a Na‘ ion is replaced, 
on average, every 2 to 4 nsec (nanoseconds, where | nsec = 10~° sec) by another H,O. 
Consequently, a water molecule is trapped only several hundred times longer by the 
electrostatic force field of an ion than it is by the H-bonded network of water. (Recall 
that the average lifetime of H bonds between water molecules is about 10 psec.) 


Water Has a High Dielectric Constant The attractions between the water molecules inter- 
acting with, or hydrating, ions are much greater than the tendency of oppositely charged 
ions to attract one another. Water’s ability to surround ions in dipole interactions and 
diminish their attraction for each other is a measure of its dielectric constant, D. Indeed, 
ionization in solution depends on the dielectric constant of the solvent; otherwise, the 
strongly attracted positive and negative ions would unite to form neutral molecules. 
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The strength of the dielectric constant is related to the force, F, experienced between two ECE Dielectric Constants* of Blgamiaiconctaniecak 


ions of opposite charge separated by a distance, r, as given in the relationship Some Gomimonsolvents a 25E 


F= ee Dr Solvent Dielectric Constant (D) 

where e; and e, are the charges on the two ions. Table 2.1 lists the dielectric constants oid. 109 

of some common liquids. Note that the dielectric constant for water is more than 

twice that of methanol and more than 40 times that of hexane. Water 78.5 
Methyl alcohol 32.6 

Water Forms H Bonds with Polar Solutes In the case of nonionic but polar compounds Ethyl alcohol 243 

such as sugars, the excellent solvent properties of water stem from its ability to read- 

; : : Acetone 20.7 

ily form hydrogen bonds with the polar functional groups on these compounds, such — 

as hydroxyls, amines, and carbonyls (see Figure 1.13). These polar interactions be- _A*etic acid 6.2 

tween solvent and solute are stronger than the intermolecular attractions between Chloroform 5.0 

solute molecules caused by van der Waals forces and weaker H bonding. Thus, the Benzene 2.3 

solute molecules readily dissolve in water. Hexane 1.9 


Hydrophobic Interactions The behavior of water toward nonpolar solutes is different from *The dielectric constant is also referred to as relative 
the interactions just discussed. Nonpolar solutes (or nonpolar groups on biological macro- —?2”"""/”/#7 by Physical chemists. 
molecules) do not readily H bond to H,O, and as a result, such compounds tend to be only 
sparingly soluble in water. The process of dissolving such substances is accompanied by sig- 
nificant reorganization of the water surrounding the solute so that the response of the 
solvent water to such solutes can be equated to “structure making.” Because nonpolar sol- 
utes must occupy space, the random H-bonded network of water must reorganize to 
accommodate them. At the same time, the H,O molecules participate in as many H-bonded 
interactions with one another as the temperature permits. Consequently, the H-bonded 
water network rearranges toward formation of a local cage-like (clathrate) structure sur- 
rounding each solute molecule, as shown for a long-chain fatty acid in Figure 2.5. This fixed 
orientation of water molecules around a hydrophobic molecule results in a hydration shell. 
A major consequence of this rearrangement is that the molecules of H,O participating in 
the cage layer have markedly reduced options for orientation in three-dimensional space. 
Water molecules tend to straddle the nonpolar molecule such that two or three tetrahedral 
directions (H-bonding vectors) are tangential to the space occupied by the inert solute. 
“Straddling” allows the water molecules to retain their H-bonding possibilities because no 
H-bond donor or acceptor of the H,O is directed toward the caged solute. The H,O mole- 
cules forming these clathrates are involved in highly ordered structures. That is, clathrate 
formation is accompanied by significant ordering of structure, or negative entropy. 

As discussed in Section 1.4h, multiple nonpolar molecules tend to cluster together, 
because their joint solvation cage involves less total surface area and thus fewer ordered H,O 


Figure 2.5 Nonpolar molecules create structural order in water. (/eft) Disordered network of H-bonded 
water molecules. (right) Clathrate cage of ordered, H-bonded water molecules around a nonpolar molecule. 
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Figure 2.6 Hydrophobic interactions between nonpolar molecules (or nonpolar regions of molecules) 
are caused by the increase in entropy of solvent water molecules. Note that many water molecules are 
freed from ordered interactions around nonpolar molecules when the nonpolar molecules coalesce. The 
entropy of the water has increased. 


molecules than in their separate cages. It is as if the nonpolar molecules had some net attrac- 
tion for one another. This apparent affinity of nonpolar structures for one another is called 
hydrophobic interactions (Figure 2.6). In actuality, the “attraction” between nonpolar sol- 
utes is an entropy-driven process due to a net decrease in order among the H,O molecules. 
To be specific, hydrophobic interactions between nonpolar molecules are maintained not so 
much by direct interactions between the inert solutes themselves as by the increase in en- 
tropy when the water cages coalesce and reorganize. Because interactions between nonpolar 
solute molecules and the water surrounding them are of uncertain stoichiometry and do not 
share the equality of atom-to-atom participation implicit in chemical bonding, the term 
hydrophobic interaction is more correct than the misleading expression hydrophobic bond. 


Amphiphilic Molecules Compounds containing both strongly polar and strongly non- 
polar groups are called amphiphilic molecules (from the Greek amphi meaning “both” 
and philos meaning “loving”). Such compounds are also referred to as amphipathic 
molecules (from the Greek pathos meaning “passion’”). Salts of fatty acids are a typi- 
cal example that has biological relevance. They have a long, nonpolar hydrocarbon 
tail and a strongly polar carboxyl head group, as in the sodium salt of palmitic acid 
(Figure 2.7). Their behavior in aqueous solution reflects the combination of the con- 
trasting polar and nonpolar nature of these substances. The ionic carboxylate func- 
tion hydrates readily, whereas the long hydrophobic tail is intrinsically insoluble. 
Nevertheless, sodium palmitate and other amphiphilic molecules readily disperse in 
water because the hydrocarbon tails of these substances are joined together in hydro- 
phobic interactions as their polar carboxylate functions are hydrated in a typical 
hydrophilic fashion. Such clusters of amphipathic molecules are termed micelles; 
Figure 2.7b is a two-dimensional representation of a spherical micelle. 
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Solutes Impose Order on Water Molecules The presence of dissolved substances disturbs 
the structure of liquid water, thereby changing its properties. The dynamic H-bonding 
interactions of HO must now accommodate the intruding substance. The net effect is 
that solutes, regardless of whether they are polar or nonpolar, fix nearby water mole- 
cules in a more ordered array. Ions, by establishing hydration shells through interac- 
tions with the H,O dipoles, create local order. Hydrophobic substances, for different 
reasons, make structures within water. To put it another way, by limiting the orienta- 
tions that neighboring water molecules can assume, solutes give order to the solvent and 
diminish the dynamic interplay among H,O molecules that occurs in pure water. 


Colligative Properties This influence of the solute on water is reflected in a set of 
characteristic changes in behavior termed colligative properties (properties related by 
acommon principle). These alterations in solvent properties are related in that they all 
depend only on the number of solute particles per unit volume of solvent and not on 
the chemical nature of the solute. These effects include freezing point depression, 
boiling point elevation, vapor pressure lowering, and osmotic pressure effects. For 
example, | mol of an ideal solute dissolved in 1000 g of water (a 1 m, or molal, solution) 
at 1 atm depresses the freezing point by 1.86°C, raises the boiling point by 0.543°C, 
lowers the vapor pressure in a temperature-dependent manner, and yields a solution 
whose osmotic pressure relative to pure water is 22.4 atm (at 25°C). In effect, by impos- 
ing local order on the HO molecules, solutes make it more difficult for water to 
assume its crystalline lattice (freeze) or escape into the atmosphere (boil or vaporize). 

Furthermore, the fourth colligative property, osmotic pressure, is of vital importance 
to biological systems. When a solution (such as the 1 m solution discussed here) is sepa- 
rated from a volume of pure water by a semipermeable membrane, the solution draws 
water molecules across this barrier. The HO molecules are moving from a region of 
higher effective concentration (pure H20) to a region of lower effective concentration 
(the solution). This movement of water into the solution dilutes the effects of the solute 
that is present. The osmotic force exerted by each mole of solute is so strong that it re- 
quires the imposition of 22.4 atm of pressure to be negated (Figure 2.8). Osmotic pres- 
sure from high concentrations of dissolved solutes is a challenge for cells. Bacterial and 
plant cells have strong, rigid cell walls to contain these pressures. In contrast, animal cells 
are bathed in extracellular fluids of comparable osmolarity, so that no net osmotic gradi- 
ent exists. Also, to minimize the osmotic pressure created by the contents of their cytosol, 
cells tend to store substances such as amino acids and sugars in polymeric form. For 
example, a molecule of glycogen or starch containing 1000 glucose units exerts only 
1/1000 the osmotic\ pressure that 1000 free glucose molecules would. 


(a) (b) (c) 22.4 atm 
ry 
S 7 S 7 5 
Y 
Nonpermeant 
solute 
Semipermeable ————* ==" 
membrane 
HO Ñ 4 A 4 


Figure 2.8 The osmotic pressure of a 1 molal (m) solution is equal to 22.4 atmospheres of pressure. (a) If 
a nonpermeant solute is separated from pure water by a semipermeable membrane through which H,O 
passes through freely, (b) water molecules enter the solution (osmosis) and the height of the solution col- 
umn in the tube rises. The pressure necessary to push water back through the membrane at a rate exactly 
the water influx is the osmotic pressure of the solution. (c) For a 1 m solution, this force is equal to 22.4 
atm of pressure. Osmotic pressure is directly proportional to the concentration of the nonpermeant solute. 
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(a) The sodium salt of palmitic acid: Sodium 
palmitate (Nat TOOC(CHə)14CH3) 


(b) 


Figure 2.7 Amphiphilic molecules can form micelles 
in water. (a) An amphiphilic molecule: sodium 
palmitate. (b) Micelle formation by amphiphilic 
molecules in an aqueous solution. Because of their 
negatively charged surfaces, neighboring micelles 
repel one another and thereby maintain relative sta- 
bility in solution. 
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Figure 2.9 The ionization of water. 
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Figure 2.10 The hydration of H;O*. 


2.1f Water Can lonize to Form H+ and OH- 


Water shows a small but finite tendency to form ions. This tendency is demonstrated 
by the electrical conductivity of pure water, a property that clearly establishes the 
presence of charged species (ions). Water ionizes because the larger, strongly electro- 
negative O atom strips the electron from one of its hydrogen atoms, leaving the proton 
to dissociate (Figure 2.9): 


H—O—H —> H* + OH™ 
Two ions are thus formed: protons or hydrogen ions, H*; and hydroxyl ions, OH. Free 
protons are immediately hydrated to form hydronium ions, H;0*: 


H* + HO — H;0* 


Indeed, because most hydrogen atoms in liquid water are H bonded to a neighboring 
H,O molecule, this protonic hydration is an instantaneous process, and the ion prod- 
ucts of water are H;O* and OH’: 


H H H 
\ / \ 
203-0 —> O—H'+OH- 
H £ 


The amount of H;0* or OH- in 1 L (liter) of pure water at 25°C is 1 X 1077 mol; the 
concentrations are equal because the dissociation is stoichiometric. 

Although it is important to keep in mind that the hydronium ion, a hydrated 
hydrogen ion, represents the true state in solution, the convention is to speak of hy- 
drogen ion concentrations in aqueous solution, even though “naked” protons are 
virtually nonexistent. Indeed, H;O* attracts a hydration shell by H bonding to adja- 
cent water molecules to form an H,O,* species (Figure 2.10) and even more highly 
hydrated forms. Similarly, the hydroxyl ion, like all other highly charged species, is 
also hydrated. 


K,, is the lon Product of Water The dissociation of water into hydrogen ions and hy- 
droxyl ions occurs to the extent that 107 mol of H* and 1077 mol of OH- are present 
at equilibrium in | L of water at 25°C. 


HO == H* + OH- 
The equilibrium constant for this process is 
[H*][OH ] 
[H-0] 


where brackets denote concentrations in moles per liter. Because the concentration of 
HO in | L of pure water is equal to the number of grams in a liter divided by the gram 
molecular weight of HO, or 1000/18, the molar concentration of H20 in pure water is 
55.5 M (molar). The decrease in H2O concentration as a result of ion formation ([H*], 
[OH ] = 107 M)is negligible in comparison; thus, its influence on the overall concen- 
tration of H,O can be ignored. Thus, 


Keg = 


(10-7)(1077) 
= ——— = 1.8 x107! M 
= 55.5 
Because the concentration of HO in pure water is essentially constant, a new con- 


stant, K,, the ion product of water, can be written as 
Ky = 55.5 Keg = 10714 M? = [H*][(OH] 


This equation has the virtue of revealing the reciprocal relationship between H* 
and OH” concentrations in aqueous solutions. If a solution is acidic (that is, it has 
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a significant [H*]), then the ion product of water dictates that the OH” concentration 
is correspondingly less. For example, if [HĦ] is 10°? M, [OH] must be 10° M 
(Ky = 107 M? = [10-7][OH J; [OH] = 107? M). Similarly, in an alkaline, or basic, 
solution in which [OH ] is great, [H7] is low. 


2.2 | What Is pH? 


To avoid the cumbersome use of negative exponents to express concentrations that 
range over 14 orders of magnitude, Soren Sorensen, a Danish biochemist, devised the 
pH scale by defining pH as the negative logarithm of the hydrogen ion concentration': 


pH = —logio [H*] 


Table 2.2 gives the pH scale. Note again the reciprocal relationship between [H*] and 
[OH ]. Also, because the pH scale is based on negative logarithms, low pH values 
represent the highest H* concentrations (and the lowest OH™ concentrations, as Kw 
specifies). Note also that 


pk, = pH + pOH = 14 


The pH scale is widely used in biological applications because hydrogen ion concentra- 
tions in biological fluids are very low, about 1077 M or 0.0000001 M, a value more easily 
represented as pH 7. The pH of blood plasma, for example, is 7.4, or 0.00000004 M H*. 
Certain disease conditions may lower the plasma pH level to 6.8 or less, a situation that 
may result in death. 


| Table 2.2 | pH Scale 


The hydrogen ion and hydroxyl ion concentrations are given in moles per liter at 25°C. 


pH l [H*] m~~ oH] ~~~ 
0 (10°) 1.0 0.00000000000001 1074 
1 (107!) 0.1 0.0000000000001 (107°) 
2 (10-2) 0.01 0.000000000001 (10-!?) 
3 (10-3) 0.001 0.00000000001 (1071) 
4 (10-4) 0.0001 0.0000000001 (10719) 
5 (1075) 0.00001 0.000000001 (10%?) 
6 (1076) 0.000001 0.00000001 (1078) 
7 (10-7) 0.0000001 0.0000001 (1077) 
8 (10-8) 0.00000001 0.000001 (1076) 
9 (1079) 0.000000001 0.00001 (1075) 
10 (10719) 0.0000000001 0.0001 (10-4) 
ll (10-1!) 0.00000000001 0.001 (107°) 
12 (1073 0.000000000001 0.01 (10°) 
13 (10-3) 0.0000000000001 0.1 (107) 
14 (10714 0.00000000000001 1.0 (10°) 


'To be precise in physical chemical terms, the activities of the various components, not their molar 
concentrations, should be used in these equations. The activity (a) of a solute component is defined as 
the product of its molar concentration, c, and an activity coefficient, y: a = cy. Most biochemical work 
involves dilute solutions, and the use of activities instead of molar concentrations is usually neglected. 
However, the concentration of certain solutes may be very high in living cells. 


2.2 What Is pH? 
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| Example | Calculating [H+] from pH 


a. What is [H*] in a solution with a pH = 6.8? 
b. How much greater is [H*] at pH = 6.8 versus pH = 7.4? 


That is, what is the ratio of the H* concentration in a solution of pH = 6.8 versus a 
solution of pH = 7.4? 

Answer 

Step 1: Recall that the formula that relates pH and [H*] is [H*] = —log,)»pH 

a. At pH = 6.8, [H*] = —logıopH = 10768 = 0.00000016 M 

Step 2: 

b. At pH 7.4, [H*] = —logjpH = 10774 = 0.00000004 M 

Step 3: The ratio of the H* concentration in a solution of pH = 6.8 versus a solution 
of pH = 7.4 is given by the answer in Step 1 divided by the answer in Step 2: 


0.000000016/0.000000004 = 4 
The [H*] is 4 times greater at pH = 6.8 versus pH = 7.4. 


At pH 7, [H*] = [OH]; that is, there is no excess acidity or basicity. The point of 
neutrality is at pH 7, and solutions with a pH of 7 are said to be at neutral pH. The pH 
values of various fluids of biological origin or relevance are given in Table 2.3. Because 
the pH scale is a logarithmic scale, two solutions whose pH values differ by 1 pH unit 
have a tenfold difference in [H*]. For example, grapefruit juice, at pH 3.2, contains more 
than 12 times as much H* as orange juice, at pH 4.3. 


2.2a Strong Electrolytes Dissociate Completely in Water 


Substances that are almost completely dissociated to form ions in solution are called 


- strong electrolytes. The term electrolyte describes substances capable of generating ions 
|Table 2.3 EZ pH of Various in solution and thereby causing an increase in the electrical conductivity of the solu- 
Common Fluids tion. Many salts (such as NaCl and K,SO,) fit this category, as do strong acids (such as 
Fluid pH HCl) and strong bases (such as NaOH). Recall from general chemistry that acids are 
—— _ proton donors and bases are proton acceptors. In effect, the dissociation of a strong 
Household lye 13.6 acid such as HCl in water can be treated as a proton transfer reaction between the acid 
Bleach 12.6 HCl and the base H20 to give the conjugate acid H;O* and the conjugate base Cl": 
Household ammonia 11.4 HCl + H,O H,O* + CI 
Milk of magnesia 10.3 Senet : nee 
—_---- === The equilibrium constant for this reaction is 
Baking soda 8.4 
Seawater 7.5-8.4 = [H:O ][C1 ] 
Pancreatic fluid 7.8-8.0 [HO]HCI] 
Blood plasma 7.4 Customarily, because the term [H20] is essentially constant in dilute aqueous solu- 
Intracellular fluids tions, it is incorporated into the equilibrium constant K to give a new term, Ka, the 
Liye œo  “ciddissociation constant, where K, = K [H30]. Also, the term [H3O0*] is often replaced 
+ 
Muscle 61 by H*, such that 
Saliva 6.6 t= [H*][Cl] 
Urine 5-8 i [HCI] 
Boric acid 5.0 For HCI, the value of K, is exceedingly large because the concentration of HCI in 
Beer 4.5 aqueous solution is vanishingly small. Because this is so, the pH of HCI! solutions is 
Orange juice 43 readily calculated from the amount of HCl used to make the solution: 
Grapefruit juice 3.2 [H*] in solution = [HCI] added to solution 
Vinegar 2.9 Thus, a 1 M solution of HCI has a pH of 0; a 1 mM HCI solution has a pH of 3. Simi- 
Soft drinks 2.8 larly, a 0.1 M NaOH solution has a pH of 13. (Because [OH] = 0.1 M, [H*] must be 
Lemon juice 2.3 107 M.) Viewing the dissociation of strong electrolytes another way, we see that the 
Gastric juice 1.2-3.0 ions formed show little aint for each other. For example, in HCl in water, Cl” has 
Battery acid 0.35 very little affinity for H*: 


HC] —> H*+Cl- 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


2.2 What Is pH? 


and in NaOH solutions, Na* has little affinity for OH~. The dissociation of these 
substances in water is effectively complete. 


2.2b Weak Electrolytes Are Substances That Dissociate 
Only Slightly in Water 


Substances with only a slight tendency to dissociate and form ions in solution are 
called weak electrolytes. Acetic acid, CH;COOH, is a good example: 


CH;COOH + H,O == CH;COO™ + H;0* 


The acid dissociation constant K, for acetic acid is 1.74 X 1075 M: 


[H*][CH;COO-] 
[CH;COOH] 


K, = = 1.74 X 1075 M 


K, is also termed an ionization constant because it states the extent to which a substance 
forms ions in water. The relatively low value of K, for acetic acid reveals that the 
unionized form, CH;COOH, predominates over H* and CH;COO™ in aqueous solutions 
of acetic acid. Viewed another way, CH;COO_, the acetate ion, has a high affinity for H*. 


| Example | What is the pH of a 0.1 M solution of acetic acid? In other words, what 


is the final pH when 0.1 mol of acetic acid (HAc) is added to water and the volume 
of the solution is adjusted to equal 1 L? 

Solution 

Step 1: The dissociation of HAc in water can be written simply as 


HAc == H* + Ac 

where Ac” represents the acetate ion, CH;COO-. 
And the acid dissociation constant K, for this equilibrium is: 
K, = [H*][Ac-VHAc = 1.74 x 1075 M 


In solution, some amount, x, of HAc dissociates, generating x amount of Ac” 
and an equal amount x of H*. Ionic equilibria characteristically are established very 
rapidly. At equilibrium, the concentration of HAc + Ac” must equal 0.1 M. 


Step 2: So, [HAc] can be represented in the dissociation as (0.1 — x) M, and [Ac] 
and [H+] then both equal x molar. 
K, = 74x 107 M = 37/(0.1 — x]. 

This is a quadratic equation of the form (ax? + bx + c = 0). 

Step 3: The solution to such quadratic equations) is 
x =(-b + VB — 4ac)/2a. 

For x? + (1.74 x 10->)x — (1.74 x 1075) = 0, x = 1.319 x 10-3 M = [H*] 
Step 4: so pH = —log)o[H*] = 2.88. 

(Note that the calculation of x can be simplified here: Because K, is quite small, 


x << 0.1 M. Therefore, K, is essentially equal to x?/0.1. Thus, x? = 1.74 X 1076 MP, 
So, x = 1.32 x 1073 M, and pH = 2.88.) 


2.2c The Henderson-Hasselbalch Equation Describes the Dissociation 
of a Weak Acid in the Presence of Its Conjugate Base 


Consider the ionization of a weak acid, HA, occurring with an acid dissociation con- 
stant, K,. Then, 
HA m—Ht + A` 
and 
WA] 
“ [HA] 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


39 


40 Chapter 2 Water: The Medium of Life 


Rearranging this expression in terms of the parameter of interest, [H*], we have 


K,||HA 
pry- EMA] 
[A7] 
Taking the logarithm of both sides gives 
log [H*] = log K, + lo [HA] 
g 8 Ka 810 [A7] 


If we change the signs and define pK, = —log K,, we have 


Teor ee N 
p = Pa Oe TA] 


or 
pH = pK, + logio Ea) 
[HA] 
This relationship is known as the Henderson—Hasselbalch equation. Thus, the pH of a 
solution can be calculated, provided K, and the concentrations of the weak acid HA and 
its conjugate base A~ are known. Note particularly that when [HA] = [A7], pH = pKa. 
For example, if equal volumes of 0.1 M HAc and 0.1 M sodium acetate are mixed, then 

pH = pK, = 4.76 


pK, = —log K, = —logjo(1.74 X 1075) = 4.76 


(Sodium acetate, the sodium salt of acetic acid, is a strong electrolyte and dissociates 
completely in water to yield Na* and Ac’.) 

The Henderson—Hasselbalch equation provides a general solution to the quantitative 
treatment of acid—base equilibria in biological systems. Table 2.4 gives the acid dissocia- 
tion constants and pK, values for some weak electrolytes of biochemical interest. 


Table 2.4 Acid Dissociation Constants and pK, Values for Some Weak 
Electrolytes (at 25°C) 


Acid K, (M) pK, 
“HCOOH (formic acid) O LXI 8S 
CHCOOH (acetic acid) 1.74 x 1075 4.76 
CH;CH,COOH (propionic acid) 1.35 X 10-5 4.87 
-CH;CHOHCOOH (lactic acid) 0 O LB8X10 8G 
HOOCCH:CH COOH (succinic acid) pk\* 6.16 X 1075 4.21 
HOOCCH;CH2COO™ (succinic acid) pKy 2.34 x 10% 5.63 
H;PO, (phosphoric acid) pK; 7.08 x 10-3 2.15 
HPO, (phosphoric acid) pK» 6.31 x 10-8 7.20 
HPO,” (phosphoric acid) pK; 3.98 x 107! 12.40 
C;N>H;* (imidazole) 1.02 x 1077 6.99 
CsO2N3H);* (histidine-imidazole group) pKp* 9.12 x 1077 6.04 
“HCO; (carbonic acid) pK; O LOX 37. 
HCO,- (bicarbonate) pK, 5.75 x 107! 10.24 
(HOCH,)3;CNH;>¢ (tris-hydroxymethyl aminomethane) 8.32 x 107° 8.07 
NH,* (ammonium) 5.62 x 10710 9.25 
CH;NH;* (methylammonium) 2.46 x 107"! 10.62 


[Source: CRC Handbook of Biochemistry, Cleveland, OH: The Chemical Rubber Co., 1968.] 

*The pK values listed as pK,, pK, or pK; are, in actuality, pK, values for the respective dissociations. 
This simplification in notation is used throughout this book. 

pKr refers to the imidazole ionization of histidine. 
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| Example | What is the pH when 100 mL of 0.1 N NaOH is added to 150 mL of 


0.2 M HAc? The pK, for acetic acid is 4.76. 


Solution 
Step 1: Use the Henderson—Hasselbalch equation to solve this problem. 


pH = pK, + logiof[Ac |/[HAc] 
Step 2: The pK, value for acetic acid is 1.74 x 1075 M. The total concentration of 
HAc and AC% is given by 0.15 L(0.2 M) = 0.03 mol. 
Step 3: The concentration of Ac™ will be equal to the amount of OH™ added and the 
concentration of HAc is equal to 0.03 mol minus the concentration of OH™ added. 
Step 4: 100 mL 0.1 N NaOH = 0.01 mol OH™ is added, which neutralizes 0.01 mol 
of HAc and yields an equivalent amount of Ac”. 


OH- + HAc —> Ac” + H,O 


So, the amount of Ac™ = 0.01 mol and the amount of HAc is 0.03 mol — 0.01 mol 
or 0.02 mol. [Ac7] = 0.01 mol and [HAc] = 0.02 mol 


Step 5: Using these values and the Henderson—Hasselbalch equation gives 
pH = pK, + logyo[Ac [HAc] = 4.76 + log;90.01 mol/0.02 mol) 


pH = 4.76 — log102 = 4.46 


2.2d Titration Curves Illustrate the Progressive 
Dissociation of a Weak Acid 


Titration is the analytical method used to determine the amount of acid in a solution. 
A measured volume of the acid solution is titrated by slowly adding a solution of base, 
typically NaOH, of known concentration. As incremental amounts of NaOH are 
added, the pH of the solution is determined, and a plot of the pH of the solution versus 
the amount of OH- added yields a titration curve. The titration curve for acetic acid is 
shown in Figure 2.11. In considering the progress of this titration, keep in mind two 
important equilibria: 


1. HAc == H+ + Ac™ K, = 1.74 x 1075 


_ [H0] 
[K,] 


As the titration begins, mostly HAc is present, plus some H* and Ac” in amounts that 
can be calculated (see the Example in Section 2.2b). Addition of a solution of NaOH 
allows hydroxide ions to neutralize any H* present. Note that reaction (2) as written 
is strongly favored; its apparent equilibrium constant is greater than 10! As H* is 
neutralized, more HAc dissociates to Ht and Ac’. The stoichiometry of the titration 
is 1:1—for each increment of OH™ added, an equal amount of the weak acid HAc is 
titrated. As additional NaOH is added, the pH gradually increases as Ac” accumu- 
lates at the expense of diminishing HAc and the neutralization of H*. At the point 
where half of the HAc has been neutralized (that is, where 0.5 equivalent of OH~ has 
been added), the concentrations of HAc and Ac” are equal and pH = pK, for HAc. 
Thus, we have an experimental method for determining the pK, values of weak elec- 
trolytes. These pK, values lie at the midpoint of their respective titration curves. After 
all of the acid has been neutralized (that is, when one equivalent of base has been 
added), the pH rises exponentially. 

The shapes of the titration curves of weak electrolytes are identical, as Figure 2.12 
reveals. Note, however, that the midpoints of the different curves vary in a way that 
characterizes the particular electrolytes. The pK, for acetic acid is 4.76, the pK, for im- 
idazole is 6.99, and that for ammonium is 9.25. These pK, values are directly related to 
the dissociation constants of these substances, or, viewed the other way, to the relative 
affinities of the conjugate bases for protons. Ammonia (NH;) has a high affinity for 
protons compared to Ac”; NH4* is a poor acid compared to HAc. 


2. Ht + OH” ==> H,O K = 5.55 X 105 


2.2 What Is pH? 4 


Low pH High pH 
100 

CHCOOH CH¿COOT 
8 
= 
S 
© 
g 
5 
2 50 pH 4.76 
3 
U 
24 


0 0.5 1.0 
Equivalents of OH” added 


CH COOR 


pH 
OL 


l 
0.5 1.0 
Equivalents of OH” added 


Figure 2.11 The titration curve for acetic acid. 
CH;COOH predominates at low pH; CH;COO- 
predominates at high pH. Note that the titration 
curve is relatively flat at pH values near the pKa. In 
other words, the pH changes relatively little as OH™ 
is added in this region of the titration curve. 
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Titration midpoint 


[imid.H*] = [imid] 
pK, = 6.99 


[HA] = [A7] 
pH= pk, 
12 i 
NH; 
[ pK, =9.25 = 
10 j [NH}] = [NH3] ig oF 
NH YY 
| —_ Imidazole 
E 


pH 

H-N N-H 
AAVA 6 
Imidazole Ht ————* 


pK, = 4.76 
CH,COOH GH,coo- 


0.5 1.0 
Equivalents of OH” 


Figure 2.12 The titration curves for several weak electrolytes: acetic acid, imidazole, and ammonium. 


2.2e Phosphoric Acid Has Three Dissociable H* 


Figure 2.13 shows the titration curve for phosphoric acid, H;PO4. This substance is a 
polyprotic acid, meaning it has more than one dissociable proton. Indeed, it has three, 
and thus, three equivalents of OH” are required to neutralize it, as Figure 2.13 shows. 
Note that the three dissociable H* are lost in discrete steps, with each dissociation 
showing a characteristic pK,. Note that pK; occurs at pH = 2.15, and the concentra- 
tions of the acid H3PO, and the conjugate base H,PO,4 are equal. As the next disso- 
ciation is approached, HPO% is treated as the acid and HPO,’ is its conjugate base. 
Their concentrations are equal at pH 7.20, so pK> = 7.20. (Note that at this point, 1.5 
equivalents of OH” have been added.) As more OH™ is added, the last dissociable 
hydrogen is titrated, and pK; occurs at pH = 12.4, where [HPO,7 ] = [PO,° ]. 

The shape of the titration curves for weak electrolytes has a biologically relevant 
property: In the region of the pKa, pH remains relatively unaffected as increments of 
OH- (or H*) are added. The weak acid and its conjugate base are acting as a buffer. 


ull [HPO,2-] = 
3— 3- 

| EOL] PO, 
125 | (ye 

| | 
10 3 HPO, 

[HPO,7-] = 
8 [HoPO,7] 
pH i po a a i pK = 72 
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Figure 2.13 The titration curve for phosphoric acid. 
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2.3 | What Are Buffers, and What Do They Do? 


Buffers are solutions that tend to resist changes in their pH as acid or base is added. 
Typically, a buffer system is composed of a weak acid and its conjugate base. A solu- 
tion of a weak acid that has a pH nearly equal to its pK,, by definition, contains an 
amount of the conjugate base nearly equivalent to the weak acid. Note that in this 
region, the titration curve is relatively flat (Figure 2.14). Addition of H*, then, has 
little effect because it is absorbed by the following reaction: 


H+ + A7 — HA 
Similarly, any increase in [OH ] is offset by the reaction 
OH- + HA —> A` + H,O 


Thus, the pH remains relatively constant. The components of a buffer system are cho- 
sen such that the pK, of the weak acid is close to the pH of interest. It is at the pK, that 
the buffer system shows its greatest buffering capacity. At pH values more than 1 pH 
unit from the pK,, buffer systems become ineffective because the concentration of one 
of the components is too low to absorb the influx of H* or OH”. The molarity of a buf- 
fer is defined as the sum of the concentrations of the acid and conjugate base forms. 

Maintenance of pH is vital to all cells. The structure, and hence the function, of 
proteins, nucleic acids, and many other cellular molecules depends on weak forces such 
as H bonds and ionic interactions, both of which can be affected by pH. Also, processes 
such as metabolism are dependent on the activities of enzymes; in turn, enzyme activity 
is markedly influenced by pH, as the graphs in Figure 2.15 show. Consequently, changes 
in pH would be disruptive to metabolism for reasons that become apparent in later 
chapters. Organisms have a variety of mechanisms to keep the pH of their intracellular 
and extracellular fluids essentially constant, but the primary protection against harm- 
ful pH changes is provided by buffer systems. The buffer systems employed reflect both 
the need for a pK, value near pH 7 and the compatibility of the buffer components with 
the metabolic machinery of cells. Two buffer systems act to maintain a constant intra- 
cellular pH—the phosphate (HPO,?~/H;PO,_) system and the histidine system. On the 
other hand, the pH of the extracellular fluid that bathes the cells and tissues of animals 
is maintained by the bicarbonate/carbonic acid (HCO; /H,CO3) system. 


2.3a The Phosphate Buffer System Is a Major Intracellular Buffering System 


The phosphate system serves to buffer the intracellular fluid of cells at physiological 
pH because its pK; lies near this pH value. The intracellular pH of most cells is main- 
tained in the range between 6.9 and 7.4. Phosphate is an abundant anion in cells, both 
in its inorganic form and as an important functional group on organic molecules that 
serve as metabolites or macromolecular precursors. In both organic and inorganic 
forms, its characteristic pK» means that the ionic species present at physiological pH 
are sufficient to donate or accept hydrogen ions to buffer any changes in pH, as the 
titration curve for HPO; in Figure 2.13 reveals. For example, if the total cellular con- 
centration of phosphate is 20 mM (millimolar) and the pH is 7.4, the distribution of 
the major phosphate species is given by 


[HPO] 


H = pK + logio = 
M R TO] 


[HPO] 


7.4 =7.20 +1 a 
°F TPO, |] 


[HPO] 


= 1.58 
[H,PO,] 


Thus, if [HPO] + [H:PO;] = 20 mM, then 
[HPO,27] = 12.25mM and [H,PO,7] = 7.75 mM 


il 
l m 
a! : 
[HA] = [A™] 
pH 6 
4 pie pH= pk, 


0.5 1.0 
Equivalents of OH” added —> 


Buffer action: 


HA AT 


=< 


Figure 2.14 A buffer system consists of a weak acid, 
HA, and its conjugate base, A7. 


+ 
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Figure 2.15 pH versus enzymatic activity. Note that 
the x-axis (pH range) is different in each graph. 

(a) Pepsin is a protein-digesting enzyme active in the 
gastric fluid. (b) Fumarase is a metabolic enzyme 
found in mitochondria. (c) Lysozyme digests bacterial 
cell walls; it is found in cells and bodily fluids. 


2.3b The Imidazole Group of Histidine Also Serves 

as an Intracellular Buffering System 
Histidine is one of the 20 naturally occurring amino acids commonly found in pro- 
teins. It possesses an imidazole group, a five-membered heterocyclic ring with two 
nitrogen atoms, one of whose protons easily dissociates. The pK, for dissociation of 
the imidazole hydrogen of histidine is 6.04. 


(0010 COO- 


| 7 | 
H;'N—C—C 7 he fi + H; N=C—CH; =>\ 
H H 


NH HN san 


Nop 


In cells, histidine occurs as a free amino acid, as a constituent of proteins, and as part 
of dipeptides in combination with other amino acids. Because the concentration of 
free histidine is low and its imidazole pK, is more than 1 pH unit removed from pre- 
vailing intracellular pH, its role in intracellular buffering is minor. However, protein- 
bound and dipeptide histidine may be the dominant buffering system in some cells. 
In combination with other amino acids, as in proteins or dipeptides, the imidazole 
pKa may increase substantially. For example, the imidazole pK, is 7.04 in anserine, a 
dipeptide containing -alanine and histidine (Figure 2.16). Thus, this pK, is near 
physiological pH, and some histidine peptides are well suited for buffering at physi- 
ological pH. 


2.3c The Bicarbonate Buffer System of Blood Plasma 


The important buffer system of blood plasma is the bicarbonate/carbonic acid 
couple: 


HCO, — H* + HCO,- 
The relevant pK,, pK; for carbonic acid, is 3.77 at 25°C (Table 2.4), but at 37°C, pK; is 


3.57.) At pH 7.4, the concentration of HCO; is a minuscule fraction of the HCO;~ 
concentration, as shown by. 


pH = 7.4 = 3.57 + logio([HCO3-V/[H»CO3]) 
[HCO3-/[HCO;] = 6761/1 


Thus, the plasma appears to be poorly protected against an influx of OH” ions. How- 
ever, the bicarbonate buffer system works well because the critical concentration of 
HCO; is maintained relatively constant through equilibrium with CO, produced in 
the tissues as a major end product of metabolism as well as the availability of CO, in 
the lungs. 


CO, + H20 ==> H,CO; 


The hydration of CO; is mediated by an enzyme, carbonic anhydrase, which facilitates 
equilibrium by rapidly catalyzing the hydration reaction. Thus, CO, serves to buffer any 
loss of H»CO;, and blood pH is effectively maintained. 


Q O 

SF 
o 
| 


+ 
HN — CHy— CHy— C —N— CH— CH, = 
| N NH 


Figure 2.16 Anserine (N-6-alanyl-3-methyl-L-histidine) is an important dipeptide buffer for maintain- 
ing the intracellular pH in some tissues. The structure shown is the predominant ionic species at pH 7. 
pK, (COOH) = 2.64; pK, (imidazole-N+H) = 7.04; pK; (N+H3) = 9.49. 
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a 
HO—CH,—CH,=NH N—CH,—CH,—SO3H 
NES 


HEPES 


Figure 2.17 The structure of HEPES, 4-(2-hydroxy)-1-piperazine ethane sulfonic acid, in its fully proton- 
ated form. The pK, of the sulfonic acid group is about 3; the pK, of the piperazine-N*H is 7.55 at 20°C. 


2.3d Buffers Useful within Physiological pH Ranges 

Not many common substances have pK, values in the range from 6 to 8. Conse- 

quently, biochemists conducting in vitro experiments were once limited in their In vitro Latin for “in glass”. Originally, a phrase 
choice of buffers effective at or near physiological pH. To remedy this problem, Meant to describe experiments conducted in glass 
biochemists synthesized various organic compounds with pK, values in the desir- pee o APENAS 
able pH = 6-8 range, and now a large selection is available. One such compound ts counterpart is “in vivo,” that is, an experiment 


is HEPES (Figure 2.17). conducted in living cells or organisms. 


| Example | Sample Buffer Preparation Problem 


Given a solution of 0.1 M HEPES in its fully protonated form and ready access to 
0.1 M HCl, 0.1 M NaOH, and distilled water, describe the preparation of 1 L of 
0.025 M HEPES buffer solution, pH 7.8. 


Solution 

Step 1: Since the desired pH is 7.8, the relevant pK, here is 7.55, the pK, of the 
piperazine-NH*. But the solution contains HEPES in its fully protonated form. So, 
the first step is to mix equal quantities of 0.1 M HEPES and 0.1 M NaOH to neu- 
tralize the sulfonic acid group and yield a solution of 0.05 M HEPES sulfonate. 


Step 2: Now the Henderson—Hasselbalch equation can be used to determine the 
proper ratio of piperazine-N: to piperazine-NH* 


pH = pK, + log([piperazine-N:]/[piperazine-NH*]) 
7.8 = 7.55 + log([piperazine-N:]/[piperazine-NH*]) 
0.25 = log([piperazine-N:]/[piperazine-NH*]) 
1.78 = ([{piperazine-N:]/[piperazine-NH*]) 


Step 3: The other equation needed to solve this equation is the conservation of mass. 
Titration does not change the total amount of piperazine species (piperazine-N: 
plus piperazine-N*H), which equals 0.025 moles in 1 L. 

So, in the desired 1 L of 0.025 M buffer solution, 


[piperazine-N:] + [piperazine-NH*] = 0.025 M 


And the amount of each can be calculated from the ratio of their concentrations 
(determined in Step 2 above): 


[piperazine-NH*] = 0.025 M — [piperazine-N:] 
1.78 = [piperazine-N:]/(0.025 M — [piperazine-N:]) 
0.0445 M — 1.78([piperazine-N:]) = [piperazine-N:] 

0.0445 M = 2.78([piperazine-N:]) 
[piperazine-N:] = 0.016 M 
[piperazine-NH*] = 0.009 M 


Step 4: Mix 500 mL of 0.05 M HEPES sulfonate with 160 mL of 0.1 M NaOH and 
add sufficient water (340 mL) to give a final volume of 1 L. Final solution is 0.025 M 
HEPES buffer, pH 7.8. Or, more directly, combine 250 mL of 0.1 M HEPES in its 
fully protonated form with 410 mL of 0.1 M NaOH and 340 mL H:O to give a final 
solution that is 0.025 M HEPES buffer, pH 7.8. 
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Human Biochemistry 


Blood pH and Respiration 


Hyperventilation, defined as a breathing rate more rapid than necessary 
for normal CO, elimination from the body, can result in an inappropri- 
ately low [CO,(g)] in the blood. Central nervous system disorders such 
as meningitis, encephalitis, or cerebral hemorrhage as well as a number 
of drug- or hormone-induced physiological changes, can lead to hyper- 
ventilation. As [CO,(g)] drops due to excessive exhalation, [HCO3] in 
the blood plasma falls, followed by a decline in [H+] and [HCO ] in 
the blood plasma. Blood pH rises within 20 sec of the onset of hyper- 
ventilation, becoming maximal within 15 min. [H*] can change from 


2.4 


its normal value of 40 nM (pH = 7.4) to 18 nM (pH = 7.74). This rise 
in plasma pH (increase in alkalinity) is termed respiratory alkalosis. 

Hypoventilation is the opposite of hyperventilation and is char- 
acterized by an inability to excrete CO, rapidly enough to meet 
physiological needs. Hypoventilation can be caused by narcotics, 
sedatives, anesthetics, and depressant drugs; diseases of the lung 
also lead to hypoventilation. Hypoventilation results in respiratory 
acidosis, as CO,(g) accumulates, giving rise to HCO}, which dis- 
sociates to form H* and HCO; -. 


What Properties of Water Give It a Unique 
Role in the Environment? 


The remarkable properties of water render it particularly suitable to its unique role in 
living processes and the environment, and its presence in abundance favors the exis- 
tence of life. Let’s examine water’s physical and chemical properties to see the extent 
to which they provide conditions that are advantageous to organisms. 

As a solvent, water is powerful yet innocuous. No other chemically inert solvent 
compares with water for the substances it can dissolve. Also, it is very important to life, 
in that water is a “poor” solvent for nonpolar substances. Thus, through hydrophobic 
interactions, lipids coalesce, membranes form, boundaries are created defining com- 
partments, and the cellular nature of life is established. Because of its very high dielec- 
tric constant, water is a medium for ionization. Ions enrich the living environment- 
they enhance the variety of chemical species and they introduce an important class of 
chemical reactions. They provide electrical properties to solutions and therefore to 
organisms. Aqueous solutions are the prime source of ions. 

The thermal properties of water are especially relevant to its environmental fitness. It 
has great power as a buffer resisting thermal (temperature) change. Its heat capacity, or 
specific heat (4.1840 J/g°C), is remarkably high; it is ten times greater than iron, five 
times greater than quartz or salt, and twice as great as hexane. Its heat of fusion is 
335 J/g. Thus, at 0°C, it takes a loss of 335 J to change the state of 1 g of H,O from liquid 
to solid. Its heat of vaporization (2.24 kJ/g) is exceptionally high. These thermal proper- 
ties mean that it takes substantial changes in heat content to alter the temperature and 
especially the state of water. Water’s thermal properties allow it to buffer the climate 
through such processes as condensation, evaporation, melting, and freezing. Further- 
more, these properties allow effective temperature regulation in living organisms. For 
example, heat generated within an organism as a result of metabolism can be efficiently 
eliminated through evaporation or conduction. The thermal conductivity of water is 
very high compared to other liquids. The anomalous expansion of water as it cools to 
temperatures near its freezing point is a unique attribute of great significance to its natu- 
ral fitness. As water cools, H bonding increases because the thermal motions of the 
molecules are lessened. H bonding tends to separate the water molecules (Figure 2.2), 
thus decreasing the density of water. These changes in density mean that, at tempera- 
tures below 4°C, cool water rises and, most important, ice freezes on the surface of bod- 
ies of water, forming an insulating layer that protects the liquid water underneath. 

Water has the highest surface tension (75 dyne/cm) of all common liquids (except mer- 
cury). Together, surface tension and density determine how high a liquid rises in a capillary 
system. Capillary movement of water plays a prominent role in the life of plants. Last, 
consider osmosis as it relates to water and, in particular, the bulk movement of water in the 
direction from a dilute aqueous solution to a more concentrated one across a semiperme- 
able boundary. Such bulk movements determine the shape and form of living things. 

Water is truly a crucial determinant of the fitness of the environment. In a very real 
sense, organisms are aqueous systems in a watery world. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 2. 


e Water has unusual properties, and H bonding between water mole- © Weak electrolytes show only a slight tendency to dissociate in water 
cules is the key to understanding these properties. and form ions. 
e Water is an excellent solvent because of its polar nature. e The ionization constant, K,, is a measure of the extent to which a 
bout : ; t forms i in solution. 
e Hydrophobic interactions between nonpolar molecules in aqueous substance forms 1008 im Solution 
solutions occur because of an entropy-driven process based on a net © The Henderson—Hasselbalch equation relates pH to the pK, of a 
decrease in order among water molecules. weak acid, as determined by the relative concentrations of the weak 
@  Amphiphilic molecules have both polar and nonpolar chemical groupings. acid And ats:conjugatg base, 
; i . © Titrati d to determine the pK, val f k 
e Solute molecules affect the properties of the solution through their pee canbe used fordeterming tHe pA Vale o! wea 
interactions with water molecules. oe 
a : ; e Buff luti ini ts of k aci it 
e Colligative properties are properties that depend on the number of e ee $ aero nail ates oe E 
solute molecules in a solvent, not the particular chemical properties hace i added 8 8 P 
of the solute molecules. i 
e- Water can ionize toasmall but finite extent to form fi and Otm ions e The phosphate buffer system is a major intracellular buffer system. 
® Theion product of water, K,,is given by K, = [H*][OH-]. Ky = 10-42 © The imidazole group of histidine can also serve as an intracellular buffer. 
s hwo wo 7 tkw 
at 25°C. e The bicarbonate/carbonic acid system is the major buffering system 
; : : f bl lasmé th tracellular fluids. 
© pH is defined as the negative logarithm (to the base 10) of the H* Of blood plasma and óther extracellular fuid 
concentration. e Not many naturally occurring substances have pĶ, values in the neu- 
TOES ; tral pH biochemists h hesized ith 
® Strong electrolytes are substances that dissociate completely in water eee oe ayè: synthesized new cómpoünds Wi 
ioforniions such properties for in vitro work. 
: ; e Th ties of water give it a significant role in the envi t; 
e Strong electrolytes include strong acids, strong bases, and many salts, € properties o1 Water ee eee Me enr onmen 
including the salts of weak acids The fitness of the environment for living systems depends on the 
properties of water. 
Problems 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Calculating pH from [H+] Calculate the pH of the following. 


a. 5X 1074 M HCI d. 3 X 107? M KOH 
b. 7X105 MNaOH e. 0.04mM HCl 
c. 24M HCl f. 6x 10° M HCI 


2. Calculating [H+] from pH Calculate the following from the pH values 


given in Table 2.3. 

[H+] in vinegar 

[H+] in saliva 

[H+] in household ammonia 
[OH] in milk of magnesia 
[OH] in beer 

f. [H}] inside a liver cell 


sao fF Pf 


. Henderson-Hasselbalch Equation and lonization Consider the 


following equilibrium. 


CHOH CHOH 
(0) (0) 
OH OH + Ht 
OH OH OH OH 
NH} NH, 
pk, = 9.0 


A scientist claims that at pH=7.0, a solution of this compound 
would contain about 90% of the species in the charged form 
(-NH;") and about 10% of the species in the uncharged form 
(CNH). Is this scientist correct? If not, provide the correct percent- 
ages for these two species in the solution. 


4. Effect of pH on Secondary Interactions Three organic compounds are 


shown. Based upon the list of acidic compounds in Table 2.4, deter- 
mine which of these molecules is/are acidic. 


(a) (b) (c) 


CHO OH COOH 


. Effect of Solvent on Acidity The pX, values that are typically discussed 


are only applicable in water. If an acidic compound is dissolved in 
an organic solvent (like liquid hexane, CsH,4), would this decrease or 
increase the acid’s pK, value? Explain. 


. Calculating [H+] and pK, from the pH of a Solution of Weak Acid The 


pH of a 0.02 M solution of an acid was measured at 4.6. 
a. What is the [H*] in this solution? 
b. Calculate the acid dissociation constant K, and pK, for this acid. 


. Calculating the pH of a Solution of a Weak Acid; Calculating the pH of 


the Solution after the Addition of Strong Base The K, for formic acid 

is 1.78 X 10-4 M. 

a. What is the pH of a 0.1 M solution of formic acid? 

b. 150 mL of 0.1 M NaOH is added to 200 mL of 0.1 M formic 
acid, and water is added to give a final volume of 1 L. What is 
the pH of the final solution? 


. Prepare a Buffer by Combining a Solution of Weak Acid with a Solution 


of the Salt of the Weak Acid Given 0.1 M solutions of acetic acid and 
sodium acetate, describe the preparation of 1 L of 0.1 M acetate 
buffer at a pH of 5.4. 
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10. 


Tl. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


Chapter 2 Water: The Medium of Life 


. Calculate the pH in a Muscle Cell from the HPO,?- /H,PO,~ Ratio 


If the internal pH of a muscle cell is 6.8, what is the [HPO,?-|/ 
[H2PO, ] ratio in this cell? 


Preparing a Phosphate Buffer Solution of pH 7.5 from Solutions of 
Na3PO, and HPO, Given 0.1 M solutions of Na3;3PO, and H;POu,, 
describe the preparation of | L of a phosphate buffer at a pH of 7.5. 
What are the molar concentrations of the ions in the final buffer 
solution, including Nat and H+? 

Polyprotic Acids: Phosphate Species Abundance at Different pHs What 
are the approximate fractional concentrations of the following 
phosphate species at pH values of 0, 2, 4, 6, 8, 10, and 12? 

a. H;PO, 


b. HPO; 
C. HPO,- 
d. PO~ 


Polyprotic Acids: Citric Acid Species at Various pHs Citric acid, a 
tricarboxylic acid important in intermediary metabolism, can be 
symbolized as H;A. Its dissociation reactions are 


H3A s= H+ + HA~ pki = 3.13 
HA- ==> Ht + HA2- pko = 4.76 
HA- == Ht + A? pK; = 6.40 


If the total concentration of the acid and its anion forms is 0.02 M, 
what are the individual concentrations of H;3A, HsA~, HA”, and 
A? at pH 5.2? 


Calculate the pH Change in a Phosphate Buffer When Acid or Base Is 

Added 

a. If 50 mL of 0.01 M HCl is added to 100 mL of 0.05 M 
phosphate buffer at pH 7.2, what is the resultant pH? What are 
the concentrations of HPO, and HPO, in the final 
solution? 

b. If 50 mL of 0.01 M NaOH is added to 100 mL of 0.05 M 
phosphate buffer at pH 7.2, what is the resultant pH? What are 
the concentrations of HPO; and HPO,” in this final 
solution? 


CO, Buffer System A consequence of the condition known as COPD 
(chronic obstructive pulmonary disease) is the weakened capacity to 
exhale CO, that accumulates as a consequence of metabolism. This 
leads to an increased reabsorption of bicarbonate (HCO; ) into the 
circulation by the kidneys. Explain why this would happen. 


Explore the Bicarbonate/Carbonic Acid Buffering System of Blood 
Plasma At 37°C, if the plasma pH is 7.4 and the plasma concentra- 
tion of HCO; is 15 mM, what is the plasma concentration of 
H,CO;? What is the plasma concentration of COnissolveay? If 
metabolic activity changes the concentration of COnxissoivea) to 3 mM 
and [HCO ] remains at 15 mM, what is the pH of the plasma? 


How to Prepare a Buffer Solution: An Anserine Buffer Draw the 
titration curve for anserine (Figure 2.16). The isoelectric point of 
anserine is the pH where the net charge on the molecule is zero; what 
is the isoelectric point for anserine? Given a 0.1 M solution of 
anserine at its isoelectric point and ready access to 0.1 M HCI, 0.1 M 
NaOH, and distilled water, describe the preparation of 1 L of 0.04 M 
anserine buffer solution, pH 7.2. 


On the basis of Figure 2.12, what will be the pH of the acetate—acetic 
acid solution when the ratio of [acetate]/[acetic acid] is 10? 


a. 3.76 


b. 4.76 
c. 5.76 
d. 11.24 


Determination of the Molecular Weight of a Solute by Freezing Point 
Depression A 100 g amount of a solute was dissolved in 1000 g of 
water. The freezing point of this solution was measured accurately 
and determined to be —1.12°C. What is the molecular weight of the 
solute? 


19. 


20. 


21. 


22. 


23. 


24. 


How to Prepare a Triethanolamine Buffer Shown here is the structure 
of triethanolamine in its fully protonated form: 


CH,CH,OH 
+ 
HOCH,CH, — N — CH,CH,OH 
| 
H 


Its pK, is 7.8. You have available at your lab bench 0.1 M solutions 
of HCl, NaOH, and the uncharged (free base) form of triethanol- 
amine, as well as ample distilled water. Describe the preparation of 
a 1 L solution of 0.05 M triethanolamine buffer, pH 7.6. 


How to Prepare a Tris Buffer Solution Zris-hydroxymethyl aminometh- 
ane (Tris) is widely used for the preparation of buffers in biochemical 
research. Shown here is the structure of Tris in its protonated form: 


TNH; 


| 

HOCH, — C — CHOH 
| 
CHOH 


Its acid dissociation constant, K,, is 8.32 X 107° M. You have available 
at your lab bench a 0.1 M solution of Tris in its protonated form, 

0.1 M solutions of HCI and NaOH, and ample distilled water. Describe 
the preparation of a 1-L solution of 0.02 M Tris buffer, pH 7.8. 


Plot the Titration Curve for Bicine and Calculate How to Prepare a pH 
7.5 Bicine Buffer Solution Bicine (N, N—bis (2-hydroxyethyl) glycine) is 
another commonly used buffer in biochemistry labs. The structure of 
bicine in its fully protonated form is shown here: 


CH CHOH 


H*N — CH,COOH 
| 


CH,CH,OH 


a. Draw the titration curve for bicine, assuming the pK, for its free 
COOH group is 2.3 and the pK, for its tertiary amino group is 8.3. 

b. Draw the structure of the fully deprotonated form (completely 
dissociated form) of bicine. 

c. You have available a 0.1 M solution of bicine at its isoelectric point 
(pH)), 0.1 M solutions of HCl and NaOH, and ample distilled water. 
Describe the preparation of 1 L of 0.04 M bicine buffer, pH 7.5. 

d. What is the concentration of the fully protonated form of bicine 
in your final buffer solution? 


Calculate the Concentration of CI- in Gastric Juice Hydrochloric acid 
is a significant component of gastric juice. If chloride is the only 
anion in gastric juice, what is its concentration if pH = 1.2? 


Calculate the Concentration of Lactate in Blood Plasma at pH 7.4 if 
[Lactic Acid] = 1.5 uM From the pK, for lactic acid given in Table 2.4, 
calculate the concentration of lactate in blood plasma (pH = 7.4) if 
the concentration of lactic acid is 1.5 yM. 


Draw the Titration Curve for a Weak Acid and Determine Its pK, from 
the Titration Curve When a 0.1 M solution of a weak acid was titrated 
with base, the following results were obtained: 


Equivalents 
of base added pH observed 
0.05 3.4 
0.15 3.9 
0.25 4.2 
0.40 4.5 
0.60 4.9 
0.75 5.2 
0.85 5.4 
0.95 6.0 
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Plot the results of this titration and determine the pK, of the weak 
acid from your graph. 


25. Calculate the Rate of an Enzymatic Reaction Where H* Is a Product The 
enzyme alcohol dehydrogenase catalyzes the oxidation of ethyl alcohol 
by NAD* to give acetaldehyde plus NADH and a proton: 

CH;CH,OH + NAD* — CH;CHO + NADH + H* 


The rate of this reaction can be measured by following the change 
in pH. The reaction is run in 1 mL of 10 mM Tris buffer at pH 8.6. 
If the pH of the reaction solution falls to 8.4 after 10 minutes, what 
is the rate of alcohol oxidation, expressed as nanomoles of ethanol 
oxidized per mL per sec of reaction mixture? 


26. The Bicarbonate Buffer System of Blood Plasma The important buffer 
system of blood plasma is the bicarbonate/ carbonic acid couple: 


HCO; == Ht + HCO, 
The relevant pKa, pK, for carbonic acid, is 3.77 at 25°C. 
a. If [HCO; ] = 24 mM, what is [H,CO;]? 


b. What would be the effect on the pH of the blood plasma 
following a sudden influx of 4 x 1076 MOH~? 


However, remember that the concentration of HCO; is itself buffered 
by the available pools of CO), which is in equilibrium with H,CO;;: 


H20 + CO; == H2CO; 


Under the conditions prevailing in mammalian body fluids, the equi- 
librium constant for this reaction, Kj, is 0.003 at 37°C. That is, the 
equilibrium lies far to the left, such that more than 300 CO, molecules 
are present in solution for every molecule of H2CO3. 


K, = [H2CO3)/[ CO] 
Thus, 
[H2CO3] = Ky[COz] 


Putting this value for [H2CO3] into the expression for the first disso- 
ciation of HCO; gives 


K, = ((A*][HCO; K [CO] 


Therefore, the overall equilibrium constant for the ionization of 
H,CO; in equilibrium with CO, is given by 


KK, = (TH PHCOs ])/[CO] 
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and K,K;, the product of two constants, can be defined as a new equi- 
librium constant, Koverat- 

c. What is the value of Kyeran? 

d. What is the value of pK vera? 

e. If [CO,] = 1.2 mM, what is [HCO; ]? 


27. Blood pH and Respiration Hyperventilation, defined as a breathing 
rate more rapid than necessary for normal CO, elimination from the 
body, can result in an inappropriately low [CO,] in the blood. Central 
nervous system disorders such as meningitis, encephalitis, or cerebral 
hemorrhage as well as a number of drug- or hormone-induced physi- 
ological changes, can lead to hyperventilation. 

a. Which component of the bicarbonate/carbonic acid buffering 
system will be affected first? 


b. What is the [H*] at pH 7.4? What is the [H+] at pH 7.74? 


28. Hypoventilation and Blood pH Hypoventilation is the opposite of 
hyperventilation and is characterized by an inability to excrete CO, 
rapidly enough to meet physiological needs. Hypoventilation can be 
caused by narcotics, sedatives, anesthetics, and depressant drugs; 
diseases of the lung also lead to hypoventilation. Which component 
of the bicarbonate/carbonic acid buffering system will be affected first 
during hypoventilation? 


Think-Pair-Share Question 


Titration of an Unknown Acid A 20 mL solution containing an unknown, 
diprotic acid (AH,) was titrated with 0.2 M NaOH. The following figure 
shows the pH value of the mixture as a function of the volume (in mL) of 
0.2 M NaOH added. 


a. Calculate the concentration of the unknown acid. 
b. Estimate the values of pK, and pKa. for this unknown acid. 
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Publiphoto/Science Source 


Thermodynamics 
of Biological Systems 


Essential Question 


Living things require energy. Many life processes involve conversions of one kind of 
energy into another. To study such bioenergetic phenomena requires familiarity with 
thermodynamics. Thermodynamics also allows us to determine whether chemical 
processes and reactions occur spontaneously. What are the laws and principles of 
thermodynamics that allow us to describe the flows and interchanges of heat, energy, 
and matter in biochemical systems? 


his chapter presents the basic thermodynamic principles underlying the concepts 

of heat flow, entropy, and free energy functions and the relationship between en- 
tropy and information. In addition, some related concepts are considered, including 
the concept of standard states, the effect of pH on standard-state free energies, the 
effect of concentration on the net free energy change of a reaction, and the impor- 
tance of coupled processes in living things. The chapter concludes with a discussion 
of adenosine triphosphate (ATP) and other energy-rich compounds. 


3.1 | What Are the Basic Concepts of Thermodynamics? 


In any consideration of thermodynamics, a distinction must be made between the 
system and the surroundings. The system is that portion of the universe with which 
we are concerned. It might be a mixture of chemicals in a test tube, or a single cell, 
or an entire organism. The surroundings include everything else in the universe 
(Figure 3.1). The nature of the system must also be specified. There are three basic 
kinds of systems: isolated, closed, and open. An isolated system cannot exchange 
matter or energy with its surroundings. A closed system may exchange energy, but not 
matter, with the surroundings. An open system may exchange matter, energy, or both 


A theory is the more impressive the greater is the 
simplicity of its premises, the more different are the 
kinds of things it relates and the more extended is 
its range of applicability. Therefore, the deep 
impression which classical thermodynamics made 
upon me. It is the only physical theory of universal 
content which I am convinced, that within the 


framework of applicability of its basic concepts, 


will never be overthrown. 


Albert Einstein 


The sun is the source of energy for virtually all life. We 
even harvest its energy in the form of electricity by using 
windmills driven by air heated by the sun. 


Key Questions 


3.1 What Are the Basic Concepts of 
Thermodynamics? 

3.2 What Is the Effect of Concentration 
on Net Free Energy Changes? 

3.3 How May Standard State, Equilibrium, 
and Cellular Conditions Be Compared? 

3.4 What Are the Characteristics of High- 
Energy Biomolecules? 

3.5 How Do Group Transfer Potentials 
Quantify the Reactivity of Functional 
Groups? 

3.6 What Are Reduction Potentials, and 
How Are They Used to Account for 
Free Energy Changes in Redox 
Reactions? 

3.7 Why Are Coupled Processes 
Important to Living Things? 

3.8 What Is the Daily Human 
Requirement for ATP? 
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Isolated system: 


No exchange of matter or energy 


Closed system: Open system: 


Energy exchange may occur Energy exchange and/or matter exchange may occur 


Surroundings 


Isolated system 


Surroundings Surroundings 


Open syste 


Figure 3.1 The characteristics of isolated, closed, and open systems. Isolated systems exchange neither 
matter nor energy with their surroundings. Closed systems may exchange energy, but not matter, with 
their surroundings. Open systems may exchange either matter or energy with the surroundings. 


with the surroundings. Living things are typically open systems that exchange matter 
(nutrients and waste products) and energy (heat from metabolism, for example) with 
their surroundings. 


3.la Three Quantities Describe the Energetics 


of Biochemical Reactions 


The energy changes in chemical and biological reactions and processes are described by 
three quantities: enthalpy, entropy, and free energy: 


Enthalpy, H—the heat content of a system, which depends on the types and num- 
bers of bonds in the reactants and products of a reaction. The enthalpy change, AH, 
is the heat transferred in a constant pressure process. When heat is absorbed by a 
system, AH is positive. By contrast, AH is negative when a process involves loss of 
heat by the system (transfer of heat from system to surroundings). 

Entropy, S—a measure of disorder or randomness in the system. An organized or 
ordered state is a low-entropy state, whereas a disordered state is a high-entropy state. 


S=kinw (3.1) 
and 
AS = kin Wnai — kiln Winitial (3.2) 


where Wina and Winitia are the final and initial number of microstates, respectively, 
and where k is Boltzmann’s constant (k = 1.38 x 10°73 J/K). 

Entropy represents energy dispersion: The dispersion of energy among molecular 
motions in discrete states (microstates). An increase in entropy is just an increase in 
the number of microstates in any macrostate. On the other hand, if only one micro- 
state corresponds to a given macrostate, then the system has no freedom to choose its 
microstate—and it has zero entropy. Entropy change measures the dispersal of energy 
in a process. Dispersal of energy into a larger number of microstates results in an 
increase in entropy. 

Gibbs free energy, G—a function that describes the amount of energy that can be 
used to do work in a thermodynamic system. The Gibbs free energy is defined as 


G=H-TS (3.3) 


where T is temperature in Kelvin. For any process A = B at constant pressure and 
temperature, the free energy change, AG, is given by 


AG = AH — TAS (3.4) 


If a process results in a transfer of free energy from the system to the surroundings, 
it has a negative AG, and it is said to be exergonic. On the other hand, processes that 
transfer free energy from the surroundings to the system are said to be endergonic 
and have positive AG values. 
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All three of these quantities—H, S, and G—are state functions, meaning that they only 
depend on the state of a system, and not on how the system got there. We say that these 
and other state functions are independent of path. 


Chemistry in History 


Credit: The Asahi Shimbun / Contributor 


Satoshi Omura 


Satoshi Omura (1935—present) grew up in a poor 
farming community in rural Japan. As a boy he was 
fascinated by nature and what it could provide for 
humankind and other living things. His curiosity 
about nature led him to a career combining 
chemistry and pharmacology. He received a Ph.D. 
in Pharmaceutical Sciences from the University 
of Tokyo in 1968 and a Ph.D. in Chemistry at the 
Tokyo University of Science in 1970. 

In his research, Omura searched for novel strains 
of Streptomyces bacteria as a source for new 
bioactive compounds. He isolated and cultured 
many thousands of microbes from soil samples in 
Japan and characterized their antibacterial activity. 


In 1970, Omura isolated a strain of Streptomyces 
avermitilis from woodland soil near a golf course 
along the southeast coast of Honshu, Japan. 
Omura sent the bacteria to William Campbell 
at Merck in the U.S., who isolated avermectin 
and its derivative ivermectin from the bacterium. 
Ivermectin has been highly effective in the treatment 
and eradication of river blindness, lymphatic 
filariasis and other parasite infestations. Satoshi 
Omura and William Campbell shared in the 2015 
Nobel Prize in Physiology or Medicine. 


“T have long been convinced that nature has all the 
solutions we need to solve our past, present and future 
problems — we just need to find them.” Satoshi Omura 


3.1b All Reactions and Processes Follow 
the Laws of Thermodynamics 


Three fundamental principles—the laws of thermodynamics—govern all chemical, 
physical, and biological processes. The first law of thermodynamics states that the total 
energy of an isolated system is conserved. There are several important consequences 
(or alternative expressions) of this law: The total energy of the universe must be 
constant; and though energy may be converted from one form to another, used to do 
work, or moved within a system or between system and surroundings, it cannot be 
destroyed or created in any ordinary chemical process. 

The second law of thermodynamics has been described and expressed in many differ- 
ent ways, including the following: 


1. Systems tend to proceed from ordered (low-entropy or low-probability) states to 
disordered (high-entropy or high-probability) states. 

2. The entropy of the system plus surroundings is unchanged by reversible processes; 
the entropy of the system plus surroundings increases for irreversible processes. 

3. All naturally occurring processes proceed toward equilibrium, that is, to a state of 
minimum potential energy. Energy flows spontaneously so as to become diffused, 
dispersed, or spread out. Energy dispersal results in entropy increase. 


The third law of thermodynamics states that the entropy of any crystalline, perfectly 
ordered substance must approach zero as the temperature approaches 0 K, and, atT = 0K, 
entropy is exactly zero. Based on this, it is possible to establish a quantitative, absolute 
entropy scale for any substance as 


T 

S -Í Cpd In T (3.5) 
0 

where Cp is the heat capacity at constant pressure. The heat capacity of any substance 

is the amount of heat 1 mole of it can store as the temperature of that substance is 

raised by 1 degree. For a constant pressure process, this is described mathematically as 


_ dH 


Ce = OF 


(3.6) 
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A Deeper Look 


Entropy, Information, and the Importance of “Negentropy” 


When a thermodynamic system undergoes an increase in entropy, it 
becomes more disordered. On the other hand, a decrease in entropy 
reflects an increase in order. A more ordered system is more highly 
organized and possesses a greater information content. To appreciate 
the implications of decreasing the entropy of a system, consider the 
random collection of letters in the figure. This disorganized array of 
letters possesses no inherent information content, and nothing can 
be learned by its perusal. On the other hand, this particular array of 
letters can be systematically arranged to construct the first sentence 
of the Einstein quotation that opened this chapter: “A theory is the 
more impressive the greater is the simplicity of its premises, the more 
different are the kinds of things it relates and the more extended is its 
range of applicability.” 

Arranged in this way, this same collection of 151 letters possesses 
enormous information content—the profound words of a great sci- 
entist. Just as it would have required significant effort to rearrange 
these 151 letters in this way, so large amounts of energy are required 
to construct and maintain living organisms. Energy input is required 
to produce information-rich, organized structures such as proteins 
and nucleic acids. Information content can be thought of as negative 
entropy. In 1944, Erwin Schrödinger took time out from his studies of 
quantum mechanics to publish a delightful book titled What Is Life? 
In it, Schrödinger wrote that negative entropy confers organization 
and information content to living organisms. Schrödinger pointed 
out that organisms must acquire negative entropy to sustain life. 


French physicist Léon Brillouin later shortened the term to “negent- 
ropy.” To summarize, living systems are highly organized, low-entropy 
systems, far displaced from equilibrium. 
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If the heat capacity can be evaluated at all temperatures between 0 K and the tempera- 
ture of interest, an absolute entropy can be calculated. For biological processes, entropy 
changes are more useful than absolute entropies. The entropy change for a process can 
be calculated if the enthalpy change and free energy change are known. 


3.1c Free Energy Provides a Simple Criterion for Equilibrium 


An important question for chemists, and particularly for biochemists, is, “Will the reac- 
tion proceed in the direction written?” J. Willard Gibbs, one of the founders of thermo- 
dynamics, realized that the change in the Gibbs free energy (Equation 3.4) provided an 
answer to this question. 

If AG is equal to 0, the process is at equilibrium and there is no net flow either in 
the forward or reverse direction. When AG = 0, AS = AH/T and the enthalpic and 
entropic changes are exactly balanced. Any process with a nonzero AG proceeds 
spontaneously to a final state of lower free energy. If AG is negative, the process 
proceeds spontaneously in the direction written. If AG is positive, the reaction or 
process proceeds spontaneously in the reverse direction. Spontaneity is concerned 
only with whether the reaction is thermodynamically favorable (that is, AG is nega- 
tive) or not. Spontaneity, in the thermodynamic sense, should not be confused with 
rate, a term in kinetics indicating how fast the reaction goes. If the process has a 
negative AG, it is said to be exergonic, whereas processes with positive AG values are 
endergonic. The sign and value of AG do not allow us to determine how fast the 
process will go. 


The Standard State In order to compare the thermodynamic parameters of different 
reactions, it is convenient to define a standard state. For solutes in a solution, the stan- 
dard state is normally unit activity (often simplified to 1 M concentration). Enthalpy, 
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entropy, and other thermodynamic quantities are often given or determined for stan- 
dard-state conditions and are then denoted by a superscript degree sign °, as in AHY, 
AS’, and so on. 


The Standard-State Free Energy Change The free energy change, AG, for any reaction 
depends upon the nature of the reactants and products, but it is also affected by the 
conditions of the reaction, including temperature, pressure, pH, and the concentrations 
of the reactants and products. As explained earlier, it is useful to define a standard state 
for such processes. If the free energy change for a reaction is sensitive to solution condi- 
tions, what is the particular significance of the standard-state free energy change? To 
answer this question, consider a reaction between two reactants A and B to produce the 


products C and D. 
A+B=—=C+D (3.7) 
The free energy change for non-standard-state concentrations is given by 
C][D 
AG=AG° + RT In (R) (3.8) 
At equilibrium, AG = 0 and [C][D]/[A][B] = Keq. We then have 
AG® = —RT In Keg (3.9) 
or, in base 10 logarithms, 
AG? = —2.3RT logio Keq (3.10) 
This can be rearranged to 
Keg = 10-49 2387 (3.11) 


In any of these forms, this relationship allows the standard-state free energy change for 
any process to be determined if the equilibrium constant is known. More importantly, 
it states that the point of equilibrium for a reaction in solution is a function of the standard- 
state free energy change for the process. That is, A G° is another way of writing an equi- 
librium constant. 

Equation 3.9 has an additional meaning: A G° is a measure of how far a reaction is 
from equilibrium under standard-state conditions. If AG® is close to zero, the reaction 
is close to equilibrium under standard-state conditions. If AG° is large (whether 
positive or negative), the reaction is far from equilibrium under standard-state 
conditions. 


3.2 | What Is the Effect of Concentration 
| on Net Free Energy Changes? 


Equation 3.8 shows that the free energy change for a reaction can be very different from 
the standard-state value if the concentrations of reactants and products differ signifi- 
cantly from unit activity.! The effects can often be dramatic. Consider the hydrolysis of 
phosphocreatine: 


Phosphocreatine + H,O —> creatine + P; (3.12) 
This reaction is strongly exergonic, and the standard-state free energy change under 


physiological conditions at 37°C is —42.8 kJ/mol. Physiological concentrations of phos- 
phocreatine, creatine, and inorganic phosphate are normally between 1 and 10 mM. 


'Strictly speaking, unit activity is defined as 1 m (molal). In the dilute solutions typical of biochemical 
situations, molarity (M) and molality can be used interchangably. 
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Assuming 1 mM concentrations and using Equation 3.8, the AG for the hydrolysis of 
phosphocreatine is 


(3.13) 


AG = —42.8 KJ/mol + (8.314 J/mol - K)(310 K) In (eee) 


[0.001] 
AG = —60.5 kJ/mol (3.14) 


At 37°C, the difference in A G between standard-state and 1 mM concentrations for such 
a reaction is thus approximately —17.7 kJ/mol. 

For biochemical reactions in which hydrogen ions (H*) are consumed or produced, 
the usual definition of the standard state is awkward. Standard state for the H* ion is 
1 M, which corresponds to pH 0. At this pH, nearly all enzymes would be denatured 
and biological reactions would not occur. It makes more sense to use free energies 
and equilibrium constants determined at pH 7. Biochemists have thus adopted a 
modified standard state, designated with prime (') symbols, as in AG”, Keq', AH®’, and 
so on. For values determined in this way, a standard state of 10-7 M H* and unit activ- 
ity (1 M for solutions, 1 atm for gases and pure solids defined as unit activity) for all 
other components (in the ionic forms that exist at pH 7) is assumed. 


3.3 | How May Standard State, Equilibrium, and Cellular 
Conditions Be Compared? 


Any biochemical reaction may be imagined to occur under several different conditions, 
including standard state, equilibrium, and cellular conditions. How can these various 
states be compared from an energetic perspective? 

Consider a typical reaction—the conversion of dihydroxyacetonephosphate to 
glyceraldehye-3-phosphate in glycolysis. The reaction at standard state (Figure 3.2) 
has a free energy change (A G°’) of +7.56 kJ/mol. This is the free energy change when 
all the reactants and products are at 1 M concentration (but with [H*] at 107 M). 
Though important as a point of reference, such conditions never occur in any organ- 
ism. Now, if the system, initially at standard state, is left alone, it will eventually 
reach a state of equilibrium (the left-most case in Figure 3.2). Since the A G” is posi- 
tive, at equilibrium, [DHAP] > [G3P]. It is valuable to appreciate that, for any reaction, 
the magnitude of AG” is a measure of how far standard-state conditions are from 


Dihydroxyacetonephosphate < Glyceraldehyde-3-phosphate 
(DHAP) (G3P) 
Equilibrium Standard state Erythrocyte 
AG=0 AG" = +7.56 kJ/mol AG = +2.41 kJ/mol 

S a 
[DHAP] = 1.91 M [DHAP] =1 M 
[G3P] = 0.09 M [G3P] =1M 

_ teri [G3P] _ 
Kea= [DHAP] 7 0.047 [DHAP] — 5 

we ë ë 
ta AG! [G3P] 
AG" =—RT In Keq AG=AG + Rtn Ory 


Figure 3.2 Comparison of standard state (center), equilibrium (/eft), and cellular (right) conditions for a 
typical metabolic reaction. 
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3.4 What Are the Characteristics of High-Energy Biomolecules? 


| Example | The phosphoglucoisomerase reaction in glycolysis converts glucose- 


6-phosphate to fructose-6-phosphate: 
Glucose-6-P == Fructose-6-P 


The AG” for this reaction as written above is + 1.67 kJ/mol. The concentrations of 
glucose-6-P and fructose-6-P in human erythrocytes are 0.083 mM and 0.014 mM, 
respectively. What is the value of AG for this reaction in erythrocytes at human body 
temperature (310 K)? 


Solution 


Step 1: Set up the equation. 
Using Equation 3.8: 


AG = AG" + RT1 [Fructose-6-P] 
7 | ý [Glucose-6-P] 


Step 2: Solve for AG. 
AG = +1.67 kJ/mol + (8.314 J/mol - K)(310 om 


0.014 X 10° M 
0.083 X 10°37 M 


AG = 1670 J/mol + (8.314 J/mol + K)(310 K)(—1.78) 
AG = —2917 J/mol = —2.92 kJ/mol 


equilibrium! Moreover the simple relation AG” = -RT In K., (Equation 3.9) is the 
quantitative connection between equilibrium and standard-state conditions. 
In a similar manner, Equation 3.8 is the quantitative connection between standard 
state and actual conditions in living systems. For the present reaction, we have 
_ EP [G3P] 
AG =AG*' + RT In Lae! (3.15) 
It is important to note that the concentrations in this equation are the cellular condi- 
tions, which are established by the many processes extant in the cell. The cell is not at 
equilibrium, nor is it at standard state. Importantly, the magnitude of the cellular AG is 
a measure of how far cellular conditions are from standard state. In the present case, the 
value of AG of +2.41 kJ/mol lies between those of the equilibrium and standard states. 


3.4 | What Are the Characteristics of High-Energy 
Biomolecules? 


Organisms are highly organized chemical engines whose existence requires an external 
source of energy. Either directly or indirectly, virtually all life on earth depends on energy 
from the sun. Among life forms, there is a hierarchy of energetics: Certain organisms 
capture solar energy directly, whereas others derive their energy from this group in 
subsequent processes. Organisms that absorb light energy directly are called phototrophic 
organisms. These organisms store solar energy in the form of various organic molecules. 
Organisms that feed on these latter molecules, releasing the stored energy in a series of 
oxidative reactions, are called chemotrophic organisms. Despite these differences, both 
types of organisms share common mechanisms for generating a useful form of chemical 
energy. Once captured in chemical form, energy can be released in controlled exergonic 
reactions to drive a variety of life processes that require energy. A small family of 
universal biomolecules mediates the flow of energy from exergonic reactions to the 
energy-requiring processes of life. These high-energy molecules are compounds whose 
reaction with a substance normally present in the environment (for example, HO 
[hydrolysis reaction] or O» [oxidation reaction]) occurs with a AG?’ more negative than 
—25 kJ/mol. Phosphate compounds are considered high energy if they exhibit large 
negative free energies of hydrolysis (that is, if AG°’ is more negative than —25 kJ/mol). 
Oxidation of reduced coenzymes (NADH, NADPH, and FADH,) typically occurs with 
a free energy change of —150 to —220 kJ/mol. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


58 Chapter 3 Thermodynamics of Biological Systems 


3.4a High Energy Phosphate Compounds 


Table 3.1 lists the most important high-energy phosphate compounds. Such molecules 
include phosphoric anhydrides (ATP, ADP), an enol phosphate (PEP), acyl phosphates 
(such as acetyl phosphate), and guanidino phosphates (such as creatine phosphate). Also 
included are thioesters, such as acetyl-CoA, which do not contain phosphorus but have a 
large negative free energy of hydrolysis. As noted earlier, the exact amount of chemical 
free energy available from the hydrolysis of such compounds depends on concentration, 
pH, temperature, and so on, but the AG®’ values for hydrolysis of these substances are 
substantially more negative than those for most other metabolic species. 

There are two important points to note here: First, high-energy phosphate compounds 
are not long-term energy storage substances. They are transient forms of stored energy, 
meant to carry energy from point to point, from one enzyme system to another, in the 
minute-to-minute existence of the cell. (As we shall see in subsequent chapters, other mol- 
ecules bear the responsibility for long-term storage of energy supplies.) Second, the term 
high-energy compound should not be construed to imply that these molecules are unstable 
and hydrolyze or decompose unpredictably. ATP, for example, is quite a stable molecule. A 
substantial activation energy must be delivered to ATP to hydrolyze the terminal, or y, 
phosphate group. In fact, as shown in Figure 3.3, the activation energy that must be 
absorbed by the molecule to break the O—P, bond is normally 200 to 400 kJ/mol, which 
is substantially larger than the net 30.5 kJ/mol released in the hydrolysis reaction. The en- 
ergetics of metabolism is more concerned with the net release of 30.5 kJ/mol than with the 
activation energy for the reaction (because the role of enzymes is to lower the energy of 
activation). The net release of large quantities of free energy distinguishes the high-energy 
phosphoric anhydrides from their “low-energy” ester cousins, such as glycerol-3-phosphate 
(Table 3.1). The next section provides a quantitative framework for understanding these 
comparisons. 


| Table 3.1 | Free Energies of Hydrolysis of Some High-Energy Compounds* 


Compound and Hydrolysis Reaction AG” (kj/mol) 
Phosphoenolpyruvate —> pyruvate + P; —62.2 
1,3-Bisphosphoglycerate —> 3-phosphoglycerate + P; —49.6 
Creatine phosphate —> creatine + P; —43.3 
Acetyl phosphate —> acetate + P; —43.3 
Adenosine-5'-triphosphate —> ADP + P; —35.7t 
Adenosine-5'-triphosphate —> ADP + P; —30.5 
(with excess Mg?*) 

Adenosine-5'-diphosphate —> AMP + P; 3501 
Pyrophosphate —> P; + P; (in 5 mM Mg?*) —33.6 
Adenosine-5'-triphosphate —> AMP + PP; (excess Mg?*) =32:3 
Uridine diphosphoglucose —> UDP + glucose —31.9 
Acetyl-coenzyme A —> acetate + CoA —31.5 
S-adenosylmethionine — methionine + adenosine —25.6% 
Glucose-1-phosphate —> glucose + P; —21.0 
Sn-Glycerol-3-phosphate —> glycerol + P; =9.2 
Adenosine-5'-monophosphate —> adenosine + P; —9.2 


*Source: Handbook of Biochemistry and Molecular Biology, 1976, 3rd ed. In Physical and Chemical Data, G. Fasman, ed., 
Vol. 1, pp. 296-304. Boca Raton, FL: CRC Press. 

tSource: Gwynn, R. W., and Veech, R. L., 1973. The equilibrium constants of the adenosine triphosphate hydrolysis and 
the adenosine triphosphate-citrate lyase reactions. Journal of Biological Chemistry 248:6966-6972. 

tSource: Mudd, H., and Mann, J., 1963. Activation of methionine for transmethylation. Journal of Biological Chemistry 
238:2164-2170. 
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eee 
transfer potential 
= -30.5 kJ/mol 


sphoryl group Products 


Figure 3.3 The activation energies for phosphoryl group transfer reactions (200-400 kJ/mol) are 


substantially larger than the free energy of hydrolysis of ATP (—30.5 kJ/mol). 


A Deeper Look 


Why Nature Chose Phosphates 


Phosphoryl transfer reactions form the heart of energy metabolism 
in cells, including both the formation of phosphorylated metabolites 
that then become substrates for energy-yielding pathways and 
activation of metabolites that serve as building blocks in energy- 
consuming biosynthetic processes. Why did these reactions based 
on phosphorus evolve to be so ubiquitous and important? The 
answer lies in two consequences of the charge-charge repulsions 
embodied in phosphate esters and phosphoric anhydrides. On the 
one hand, charge-charge repulsions between electron-rich reactants 
and negatively charged phosphates make phosphate esters among 
the most inert of common biochemical compounds. (The very high 


free energy of activation for ATP hydrolysis, shown in Figure 3.3 is 
just one example.) On the other hand, the very same charge-charge 
repulsion also makes it possible to favor either phosphate ester 
hydrolysis or synthesis by adjusting the electrostatic environment in 
the active sites of enzymes mediating phosphoryl transfer reactions. 
This elegant capacity to adjust and regulate electrostatic effects 
means that phosphate esters are ideal compounds to participate in 
either energy-producing or energy-consuming metabolic reactions. 


Westheimer, F. H., 1987. Why nature chose phosphates. Science 235:1173-1178. 


Kamerlin, S. C. L., Sharma, P. K., Prasad, R. B., and Warshel, A., 2013. Why nature really 
chose phosphates. Quarterly Review of Biophysics 46:1—-132. 


3.4b ATP Is an Intermediate Energy-Shuttle Molecule 


One last point about Table 3.1 deserves mention. Despite the central importance of 
ATP as a high-energy phosphate, it holds an intermediate place in the rank of high- 
energy phosphates. PEP, 1,3-BPG, creatine phosphate, acetyl phosphate, and 
pyrophosphate all exhibit higher values of AG°’. This is not a biological anomaly. 
ATP is uniquely situated between the very-high-energy phosphates synthesized in the 
breakdown of fuel molecules and the numerous lower-energy acceptor molecules 
that are phosphorylated in the course of further metabolic reactions. ADP can ac- 
cept both phosphates and energy from the higher-energy phosphates, and the ATP 
thus formed can donate both phosphates and energy to the lower-energy molecules 
of metabolism. The ATP/ADP pair is an intermediately placed acceptor/donor sys- 
tem among high-energy phosphates. In this context, ATP functions as a very versatile 
but intermediate energy-shuttle device that interacts with many different energy- 


coupling enzymes of metabolism. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


60 Chapter 3 Thermodynamics of Biological Systems 


Figure 3.4 The triphosphate chain of ATP contains 
two phosphoric anhydride linkages (also known as 
pyrophosphate linkages), both of which release large 
amounts of energy upon hydrolysis. The high energy 
bonds are emphasized with yellow shading. 


3.4c The Hydrolysis of Phosphoric Acid Anhydrides 
Is Highly Favorable 


ATP contains two pyrophosphoryl or phosphoric acid anhydride linkages, as shown in 
Figure 3.4. Other common biomolecules possessing phosphoric acid anhydride linkages 
include ADP, GTP, GDP, and the other nucleoside diphosphates and triphosphates, 
sugar nucleotides such as UDP-glucose, and inorganic pyrophosphate itself. All exhibit 
large negative free energies of hydrolysis, as shown in Table 3.1. Three factors, described 
in detail below, account for the large negative A G°’ values for the hydrolysis reactions: 
Destabilization of the reactant due to bond strain caused by electrostatic repulsion, 
stabilization of the products by ionization and resonance, and entropy factors due to 
hydrolysis and subsequent ionization. 


Destabilization Due to Electrostatic Repulsion Electrostatic repulsion in the reactants is 
best understood by comparing these phosphoric anhydrides with other reactive anhy- 
drides, such as acetic anhydride. As shown in Figure 3.5a, the electronegative carbonyl 
oxygen atoms withdraw electrons from the C=O bonds, producing partial negative 
charges on the oxygens and partial positive charges on the carbonyl carbons. Each of 
these electrophilic carbonyl carbons is further destabilized by the other acetyl group, 
which is also electron withdrawing in nature. As a result, acetic anhydride is unstable 
with respect to its hydrolysis products. 

The situation with phosphoric anhydrides is similar. The phosphorus atoms of the 
pyrophosphate anion are electron withdrawing and destabilize PP; with respect to its 
hydrolysis products. Furthermore, the reverse reaction, reformation of the anhydride 
bond from the two anionic products, requires that the electrostatic repulsion between 
these anions be overcome (see following). 


Stabilization of Hydrolysis Products by lonization and Resonance The pyrophosphoryl 
group possesses two negative charges at pH values above 7.5 or so (Figure 3.5a). The 
hydrolysis products, two phosphate esters, each carry about two negative charges at 
PH values above 7.2. The increased ionization of the hydrolysis products helps stabi- 
lize the electrophilic phosphorus nuclei. 

Stabilization of the products by resonance and electrostatic effects, the second type of 
stabilization, is illustrated in Figure 3.5b by the reactant anhydrides. The unpaired elec- 
trons of the bridging oxygen atom in acetic anhydride (and phosphoric anhydride) can- 
not participate in resonance structures with both electrophilic centers at once. This 
competing resonance situation is relieved in the product acetate or phosphate molecules. 


Entropy Factors Arising from Hydrolysis and lonization For the phosphoric anhydrides, 
and for most of the high-energy compounds discussed here, there is an addi- 
tional entropic contribution to the free energy of hydrolysis. Most of the hydrolysis re- 
actions of Table 3.1 result in an increase in the number of molecules in solution. 
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3.4 What Are the Characteristics of High-Energy Biomolecules? 


(a) 
Acetic anhydride: Phosphoric anhydrides: 
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(b) 


Competing resonance in acetic anhydride: 
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P y 


These can only occur alternately 


Simultaneous resonance in the hydrolysis products: 


These resonances can occur simultaneously 


Figure 3.5 (a) Electrostatic repulsion between adjacent partial positive charges (on carbon and 
phosphorus, respectively) is relieved upon hydrolysis of the anhydride bonds of acetic anhydride and 
phosphoric anhydrides. The pink and blue shadings represent two mutually exclusive or competing 
resonance possibilities. (b) The competing resonances of acetic anhydride and the simultaneous 
resonance forms of the hydrolysis product, acetate. 


As shown in Figure 3.6, the hydrolysis of ATP (at pH values above 7) creates three spe- 
cies—ADP, inorganic phosphate (P;), and a hydrogen ion—from only two reactants 
(ATP and H,0). The entropy of the solution increases because the more particles, the 
more disordered the system. (This effect is ionization dependent because at low pH, the 
hydrogen ion created in many of these reactions simply protonates one of the phos- 
phate oxygens, and one fewer species results from the hydrolysis.) 


3.4d The Hydrolysis AG” of ATP and ADP 
Is Greater Than That of AMP 


The concepts of destabilization of reactants and stabilization of products described for 
pyrophosphate also apply to ATP and other phosphoric anhydrides (Figure 3.6). ATP 
and ADP are destabilized relative to the hydrolysis products by electrostatic repulsion, 
competing resonance, and entropy. AMP, on the other hand, is a phosphate ester (not 
an anhydride) possessing only a single phosphoryl group and is not markedly different 
from the product inorganic phosphate in terms of electrostatic repulsion and resonance 
stabilization. Thus, the AG°’ for hydrolysis of AMP is much smaller than the corre- 
sponding values for ATP and ADP. 


3.4e Acetyl Phosphate and 1,3-Bisphosphoglycerate 
Are Phosphoric-Carboxylic Anhydrides 
The mixed anhydrides of phosphoric and carboxylic acids, frequently called acyl phos- 


phates, are also energy rich. Two biologically important acyl phosphates are acetyl 
phosphate and 1,3-bisphosphoglycerate. Hydrolysis of these species yields acetate and 
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Figure 3.6 Hydrolysis of ATP to ADP (and/or 
of ADP to AMP) leads to relief of electrostatic 
repulsion. 


3-phosphoglycerate, respectively, in addition to inorganic phosphate (Figure 3.7). Once 
again, the large A G°’ values indicate that the reactants are destabilized relative to prod- 
ucts. This arises from bond strain, which can be traced to the partial positive charges on 
the carbonyl carbon and phosphorus atoms of these structures. The energy stored in the 
mixed anhydride bond (which is required to overcome the charge-charge repulsion) is 
released upon hydrolysis. Increased resonance possibilities in the products relative to the 
reactants also contribute to the large negative A G°’ values. The value of A G°’ depends 
on the pK, values of the starting anhydride and the product phosphoric and carboxylic 
acids as well as the pH of the medium. 


3.4f Enol Phosphates Are Potent Phosphorylating Agents 


The largest value of AG®’ in Table 3.1 belongs to phosphoenolpyruvate or PEP, an enolic 
phosphate. This molecule is an important intermediate in carbohydrate metabolism, and 
due to its large negative A G°’, it is a potent phosphorylating agent. PEP is formed via the 
dehydration of 2-phosphoglycerate by enolase during fermentation and glycolysis. PEP is 
subsequently transformed into pyruvate upon the transfer of its phosphate to ADP by 
pyruvate kinase (Figure 3.8). The very large negative value of A G°’ for the latter reaction 
is, to a large extent, the result, of a secondary reaction of the eno/ form of pyruvate. Upon 
hydrolysis, the unstable enolic form of pyruvate immediately converts to the keto form 
with a resulting large negative A G°’ (Figure 3.9). Together, the hydrolysis and subsequent 
tautomerization result in an overall A G°’ of —62.2 kJ/mol. 
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3.4 What Are the Characteristics of High-Energy Biomolecules? 
I 
+ H,O Á> CH;—C—O + + Ht 
AG*= -43.3 kJ/mol 
I 
+ HO —W-+> C—O + + Ht 
AG*= -49.6 kJ/mol | 
HCOH 
folie 
1,3-Bisphosphoglycerate 3-Phosphoglycerate 
Figure 3.7 The hydrolysis reactions of acetyl phosphate and 1,3-bisphosphoglycerate. 
+ 
Enolase 
CH COT aae L HC =C — COO7 
S Eerie Mg** Phosphoenolpyruvate 
AG*'= +1.8 kJ/mol (PEP) 
i. Pyruvate O 
ties... | 
H,C=C—COO™ = H,C— C— COO% 
Mg’, Kt 
Phosphoenolpyruvate AG*'= -31.7 kJ/mol Pyruvate 
(PEP) 
Figure 3.8 Phosphoenolpyruvate (PEP) is produced by the enolase reaction (in glycolysis) and in 
turn drives the phosphorylation of ADP to form ATP in the pyruvate kinase reaction. 
OH Oo 
| Tautomerization Il 
+ HO > + H,C=C— COO > HsC—C—COO- 
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Figure 3.9 Hydrolysis and the subsequent tautomerization account for the very large AG?’ of PEP. 
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Figure 3.10 Adenosine-5'-triphosphate (ATP). 
Color indicates the locations of the dissociable 
protons of ATP. 
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Figure 3.11 The pH dependence of the free energy 
of hydrolysis of ATP. Because pH varies only 
slightly in biological environments, the effect on 
AG is usually small. 


3.4g The Complex Equilibria Involved 
in ATP Hydrolysis 


So far, the hydrolyses of ATP and other high-energy phosphates have been portrayed as 
simple processes. The situation in a real biological system is far more complex, due to the 
operation of several ionic equilibria. First, ATP, ADP, and the other species in Table 3.1 
can exist in several different ionization states that must be accounted for in any quantita- 
tive analysis. Second, phosphate compounds bind a variety of divalent and monovalent 
cations with substantial affinity, and the various metal complexes must also be considered 
in such analyses. Consideration of these special cases makes the quantitative analysis far 
more realistic. The importance of these ionic equilibria in group transfer reactions is il- 
lustrated for the hydrolysis of ATP, but the principles and methods presented are general 
and can be applied to any similar hydrolysis reaction. 


The AG” of Hydrolysis for ATP Is pH-Dependent ATP has four dissociable protons, as 
indicated in Figure 3.10. Three of the protons on the triphosphate chain dissociate at 
very low pH. The last proton to dissociate from the triphosphate chain possesses a 
pKa of 6.95. At higher pH values, ATP is completely deprotonated. ADP and phos- 
phoric acid also undergo multiple ionizations. These multiple ionizations make the 
equilibrium constant for ATP hydrolysis more complicated than the simple expres- 
sion in Equation 3.21. Multiple ionizations must also be taken into account when the 
pH dependence of AG” is considered. The calculations are beyond the scope of this 
text, but Figure 3.11 shows the variation of AG° as a function of pH. The free energy 
of hydrolysis is nearly constant from pH 4 to pH 6. At higher values of pH, AG° varies 
linearly with pH, becoming more negative by 5.7 kJ/mol for every pH unit of increase 
at 37°C. Because the pH of most biological tissues and fluids is near neutrality, the 
effect on AG” is relatively small, but it must be taken into account in certain 
situations. 

Most biological environments contain substantial amounts of divalent and monova- 
lent metal ions, including Mg?*, Ca?*, Na*, K+, and so on. In most real biological en- 
vironments (with pH near 7 and Mg?*concentrations of 5 mM or more), the free energy 
of hydrolysis of ATP is altered more by metal ions than by protons. A widely used 
“consensus value” for AG” of ATP in biological systems is —30.5 kJ/mol (Table 3.1). 
This value, cited in the 1976 Handbook of Biochemistry and Molecular Biology (3rd ed., 
Physical and Chemical Data, Vol. 1, pp. 296-304, Boca Raton, FL: CRC Press), was 
determined at 37°C in the presence of “excess Mg?*.” This is the value we use for meta- 
bolic calculations in the balance of this text. 


Concentration Affects the Free Energy of Hydrolysis of ATP Through all these calcula- 
tions of the effect of pH and metal ions on the ATP hydrolysis equilibrium, we have 
assumed standard conditions with respect to concentrations of all species except for 
protons. The levels of ATP, ADP, and other high-energy metabolites never even begin 
to approach the standard state of 1 M. In most cells, the concentrations of these spe- 
cies are more typically 1 to 5 mM or even less. Earlier, we described the effect of 
concentration on equilibrium constants and free energies (Equation 3.8). For the 
present case, we can rewrite this as 


(3.16) 


AG=AG + RTIn aa 


[ZATP] 


where the terms in brackets represent the sum (È) of the concentrations of all the ionic 
forms of ATP, ADP, and P;. 

It is clear that changes in the concentrations of these species can have large effects on 
AG. The concentrations of ATP, ADP, and P; may, of course, vary in real biological envi- 
ronments. However, for the sake of some model calculations, we assume that all three con- 
centrations are equal; then the effect of concentration on AG is as shown in Figure 3.12. 
The free energy of hydrolysis of ATP, which is —35.7 kJ/mol at 1 M, becomes —49.4 kJ/mol 
at 5 mM (that is, the concentration for which pC = —2.3 in Figure 3.12). At 1 mM ATP, 
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ADP, and P;, the free energy change becomes even more negative at — 53.6 kJ/mol. A useful 
approximation for the AG of hydrolysis under cellular conditions is —50 to —55 kJ/mol. 

All the other high- and low-energy phosphates experience roughly similar changes 
in concentration under physiological conditions and thus similar changes in their 
free energies of hydrolysis. The roles of the very-high-energy phosphates (PEP, 
1,3-bisphosphoglycerate, and creatine phosphate) in the synthesis and maintenance 
of ATP in the cell are considered in later discussions of metabolic pathways. In the 
meantime, several of the problems at the end of this chapter address such coupled 
reactions (see Section 3.7). 


3.5 | How Do Group Transfer Potentials Quantify 
the Reactivity of Functional Groups? 


Many reactions in biochemistry involve the transfer of a functional group from a donor 
molecule to a specific receptor molecule or to water. The concept of group transfer 
potential explains the tendency for such reactions to occur. Biochemists define the group 
transfer potential as the free energy change that occurs upon hydrolysis, that is, upon 
transfer of the particular group to water. This concept and its terminology are prefera- 
ble to the more qualitative notion of high-energy bonds. 

The concept of group transfer potential has general application. Other kinds of trans- 
fer (of hydrogen ions and electrons, for example) are commonly characterized in terms of 
appropriate measures of transfer potential (pK, and reduction potential, é., respectively). 
As shown in Table 3.2, the notion of group transfer is fully analogous to those of ioniza- 
tion potential and reduction potential. The similarity is anything but coincidental because 
all of these are really specific instances of free energy changes. If we write 


AH — > A’ + H+ (3.17a) 


we do not really mean that a proton has literally been removed from the acid AH. In the gas 
phase at least, this would require the input of approximately 1200 kJ/mol! What we really 
mean is that the proton has been transferred to a suitable acceptor molecule, usually water: 


AH + HO —> A” + H;,0* (3.17b) 


The appropriate free energy relationship is, of course, 


_ 4G 


n= 2.303 RT (18) 


Similarly, the release of free energy that occurs upon the hydrolysis of ATP and other 
“high-energy phosphates” can be treated quantitatively in terms of group transfer. It is 
common to write for the hydrolysis of ATP 


ATP + HO — > ADP +P; (3.19) 
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Where C = concentration of 
ATP, ADP, and P; 


Figure 3.12 The free energy of hydrolysis of ATP 
as a function of concentration at 38°C, pH 7.0. The 
plot follows the relationship described in Equation 
3.16, with the concentrations [C] of ATP, ADP, and 
P; assumed to be equal. 


| Table 3.2 | Types of Transfer Potential 


Group Transfer Potential Standard Reduction Potential 
Proton Potential (Acidity) (High-Energy Bond) (Electron-Transfer Potential) 
Simple equation AH =— A` + Ht A~P = A +P; A= Atte 
Equation including acceptor AH + H,O == A~PO, + HO == A + H+ == At+3H, 
A- + H;,0* A—H + HPO,- 
Measure of transfer potential pK, = AG In Keg = —AG AG, = -AG 
2.303 RT RT nF 

Free energy change of transfer AG? per mole of AG’? per mole of phosphate AG’? per mole of 
is given by: H* transferred transferred e transferred 


[Source: Klotz, |. M., 1986. Introduction to Biomolecular Energetics. New York: Academic Press.] 
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Figure 3.13 Experimental apparatus used to 
measure the standard reduction potential of the 
indicated redox couples: (a) the acetaldehyde/ethanol 
couple and (b) the fumarate/succinate couple. 


The free energy change, which we henceforth call the group transfer potential, is given by 
AG? = -RT In Keq (3.20) 
where K,, is the equilibrium constant for the group transfer, which is normally written as 


x. = LADPIPI] 
“ [ATP][H,0] 
Even this set of equations represents a simplification because ATP, ADP, and P; all exist 
in solutions as a mixture of ionic species. This problem is discussed in a later section. For 
now, it is enough to note that the free energy changes listed in Table 3.1 are the group 
transfer potentials observed for transfers to water under physiological conditions at 25°C. 
Similarly, in the case of an oxidation—reduction reaction 


A — At +e 


(3.21) 


(3.22a) 


we do not really mean that A oxidizes independently. What we really mean (and what is 
much more likely in biochemical systems) is that the electron is transferred to a suitable 
acceptor: 


A+H+t — A*++ 3H, (3.22b) 
and the relevant free energy relationship is 
ie 22 
o = Tg (3.22c) 


where ©, is the standard reduction potential, n is the number of electron equivalents 
transferred, and F is Faraday’s constant. 


3.6 | What Are Reduction Potentials, and How 
Are They Used to Account for Free Energy 
Changes in Redox Reactions? 


NADH and reduced flavoproteins ((FADH;]) are two reduced compounds that appear 
often in biological redox (electron transfer) reactions. These compounds have a strong 
tendency to be oxidized—that is, to transfer electrons to other species. Oxidative phos- 
phorylation is the metabolic process by which the energy released in a series of electron- 
transfer reactions is captured through the phosphorylation of ADP to form ATP. Just 
as the group transfer potential was used in Section 3.5 to quantitate the energy of phos- 
phoryl transfer, the standard reduction potential, denoted by €,', quantitates the ten- 
dency of chemical species to be reduced or oxidized. The standard reduction potential 
difference describing electron transfer between two species, 


Reduced donor > C Oxidized acceptor 
ne 


Oxidized donor Reduced acceptor 


is related to the free energy change for the process by 


AG” = —nFAZ,' (3.23) 


where n represents the number of electrons transferred; ¥ is Faraday’s constant, 96,485 
J/V - mol; and A,’ is the difference in reduction potentials between the donor and ac- 
ceptor. This relationship is straightforward, but it depends on a standard of reference by 
which reduction potentials are defined. 


3.6a Standard Reduction Potentials Are Measured 
in Reaction Half-Cells 
Standard reduction potentials are determined by measuring the voltages generated in 


reaction half-cells (Figure 3.13). A half-cell consists of a solution containing 1 M concentra- 
tions of both the oxidized and reduced forms of the substance whose reduction 
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potential is being measured and a simple electrode. (Together, the oxidized and reduced 
forms of the substance are referred to as a redox couple.) Such a sample half-cell is con- 
nected to a reference half-cell and electrode via a conductive bridge (usually a salt- 
containing agar gel). A sensitive potentiometer (voltmeter) connects the two electrodes 
so that the electrical potential (voltage) between them can be measured. The reference 
half-cell normally contains 1 M H* in equilibrium with H; gas at a pressure of | atm. 
The H*/H;j reference half-cell is arbitrarily assigned a standard reduction potential of 
0.0 V. The standard reduction potentials of all other redox couples are defined relative 
to the H*/H; reference half-cell on the basis of the sign and magnitude of the voltage 
(electromotive force, emf) registered on the potentiometer (Figure 3.13). 

If electron flow between the electrodes is toward the sample half-cell, reduction occurs 
spontaneously in the sample half-cell and the reduction potential is said to be positive. If 
electron flow between the electrodes is away from the sample half-cell and toward the ref- 
erence cell, the reduction potential is said to be negative because electron loss (oxidation) 
is occurring in the sample half-cell. Strictly speaking, the standard reduction potential, 
Eo’, is the electromotive force generated at 25°C and pH 7.0 by a sample half-cell (contain- 
ing 1 M concentrations of the oxidized and reduced species) with respect to a reference 
half-cell. (Note that the reduction potential of the hydrogen half-cell is pH dependent. The 
standard reduction potential, 0.0 V, assumes | M H*. The hydrogen half-cell measured at 
pH 7.0 has an @,’ of —0.421 V.) 


How Standard Reduction Potentials Are Determined: Two Examples Figure 3.13a shows a 
sample/reference half-cell pair for measurement of the standard reduction potential of 
the acetaldehyde/ethanol couple. Because electrons flow toward the reference half-cell 
and away from the sample half-cell, the standard reduction potential is negative, spe- 
cifically —0.197 V. In contrast, the fumarate/succinate couple (Figure 3.13b) causes 
electrons to flow from the reference half-cell to the sample half-cell; that is, reduction 
occurs, and the reduction potential is thus positive. For each half-cell, a half-cell reac- 
tion describes the reaction taking place. For the fumarate/succinate half-cell coupled to 
a H*/H; reference half-cell, the reaction occurring is indeed a reduction of fumarate: 


Fumarate + 2 H* + 2 e7 —> succinate €E = +0.031 V (3.24) 


However, the reaction occurring in the acetaldehyde/ethanol half-cell is the oxidation of 
ethanol: 


Ethanol —> acetaldehyde + 2 H* + 2 e7 o = —0.197 V (3.25) 


3.6b €,’ Values Can Be Used to Predict the Direction 
of Redox Reactions 


Some typical half-cell reactions and their respective standard reduction potentials are 
listed in Table 3.3. Whenever reactions of this type are tabulated, they are uniformly 
written as reduction reactions, regardless of what occurs in the given half-cell. The sign 
of the standard reduction potential indicates which reaction really occurs when the 
given half-cell is combined with the reference hydrogen half-cell (see Figure 3.13). Redox 
couples that have large positive reduction potentials have a strong tendency to accept 
electrons, and the oxidized form of such a couple (O2, for example) is a strong oxidizing 
agent. Redox couples with large negative reduction potentials have a strong tendency to 
undergo oxidation (that is, donate electrons), and the reduced form of such a couple 
(NADPH, for example) is a strong reducing agent. 


3.6c ,' Values Can Be Used to Analyze Energy Changes 

in Redox Reactions 
The half-reactions and reduction potentials in Table 3.3 can be used to analyze energy 
changes in redox reactions. The oxidation of NADH to NAD* can be coupled with the 
reduction of a-ketoglutarate to isocitrate: 


NAD* + isocitrate —> NADH + H* + a-ketoglutarate + CO, (3.26) 
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Table 3.3 


Reduction Half-Reaction E V) 
50, +2H* + 2e —> H,O 0.816 
Fe3* + e —> Fe?* 0.771 
Photosystem P700 0.430 
Cytochrome f (F&e*) + e~ —> cytochrome f (Fe?*) 0.365 
Cytochrome a; (Fe**) + e~ —> cytochrome a;(Fe?*) 0.350 
Cytochrome a (Fe3*) + e~ —> cytochrome a(Fe?*) 0.290 
Cytochrome c (Fe**) + e~ ——> cytochrome c (Fe?*) 0.254 
UQH: + H+ + e —> UQH,) (UQ = coenzyme Q) 0.190 
UQ+2Ht +2e — > UQH, 0.060 
Cytochrome by(Fe**) + e7 —> cytochrome by(Fe?*) 0.050 
Fumarate + 2 H* + 2 e7 —> succinate 0.031 
UQ + H+ + e —> UQH: 0.030 
[FAD] + 2 Ht + 2 e —> [FADH] 0.003 to 0.091* 
Cytochrome b,(Fe**) + e~ —> cytochrome by (Fe?*) —0.100 
Oxaloacetate + 2 Ht + 2 e7 —> malate —0.166 
Pyruvate + 2 H* + 2 e —> lactate —0.185 
Acetaldehyde + 2 H* + 2 e —> ethanol —0.197 
FMN + 2 H* + 2 e —» FMNH, —0.219 
FAD + 2 Ht + 2e° —> FADH, —0.219 
1,3-Bisphosphoglycerate + 2 Ht + 2 e —> —0.290 
glyceraldehyde-3-phosphate + P; 
NAD? + 2 Ht + 2 e —-> NADH + H* —0.320 
NADP? + 2 H* + 2 e ——> NADPH + Ht —0.320 
Lipoyl dehydrogenase [FAD] + 2 Ht + 2 e —~> —0.340 
lipoyl dehydrogenase [FADH)] 
a-Ketoglutarate + CO, + 2 Ht + 2 e ——  isocitrate —0.380 
2H*+2e — H, —0.421 
Ferredoxin (spinach) (Fe**) + e~ —> ferredoxin (spinach) (Fe?*) —0.430 
Succinate + CO, + 2 H* + 2 e —~a-ketoglutarate + H,O —0.670 


*Typical values for reduction of bound FAD in flavoproteins such as succinate dehydrogenase (see Bonomi, F., 
Pagani, S., Cerletti, P., and Giori, C., 1983. Modification of the thermodynamic properties of the electron-transferring 
groups in mitochondrial succinate dehydrogenase upon binding of succinate. European Journal of Biochemistry 
134:439--445). 


This is the isocitrate dehydrogenase reaction of the citric acid cycle. Writing the two 
half-cell reactions, we have 


NAD* + 2 H? + 2 e —-> NADH + H+ 


E = —0.32 V (3.27) 
a-Ketoglutarate + CO, + 2 H* + 2 e —> isocitrate 
o = —0.38 V (3.28) 


In a spontaneous reaction, electrons are donated by (flow away from) the half-reaction 
with the more negative reduction potential and are accepted by (flow toward) the half- 
reaction with the more positive reduction potential. In this example, isocitrate donates 
electrons and NAD* accepts electrons. The convention defines Ag,’ as 


AZ,’ = €,' (acceptor) — @,’ (donor) (3.29) 
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Since isocitrate is the donor and NAD* the acceptor, so we write 
AG,’ = —0.32 V — (—0.38 V) = +0.06 V (3.30) 


From Equation 3.23, we can now calculate A G°’ as 
AG” = —(2)(96.485 kJ/V - mol)(0.06 V) (3.31) 


AG” = —11.58 kJ/mol 


Note that a reaction with a net positive AS,’ yields a negative A G°’, indicating a spon- 
taneous reaction. 


3.6d The Reduction Potential Depends on Concentration 


We have already noted that the standard free energy change for a reaction, A G°’, does not 
reflect the actual conditions in a living cell, where reactants and products are not at 
standard-state concentrations (1 M). Equation 3.8 was introduced to permit calculations 
of actual free energy changes under non-standard-state conditions. Similarly, standard 
reduction potentials for redox couples must be modified to account for the actual concen- 
trations of the oxidized (ox) and reduced (red) species. For any redox couple, 


ox + ne~ red (3.32) 
the actual reduction potential is given by 
[ox] 
E = Eo + (RTInF) In | —— (3.33) 
[red] 


Reduction potentials can also be quite sensitive to the molecular environment. The 
influence of environment is especially important for flavins, such as FAD/FADH, and 
FMN/FMNH.;. When these species are bound to their respective flavoproteins, their 
reduction potentials can be very different. As an example, see the standard reduction 
potentials shown for FAD and lipoyl dehydrogenase in Table 3.3. 


3.7 | Why Are Coupled Processes Important 
to Living Things? 


Many of the reactions necessary to keep cells and organisms alive must run against 
their thermodynamic potential, that is, in the direction of positive AG. Among these 
are the synthesis of ATP and other high-energy molecules as well as the creation of 
ion gradients in all mammalian cells. These processes are driven in the thermody- 
namically unfavorable direction via coupling with highly favorable processes. Many 
such coupled processes are discussed later in this text. They are crucially important in 
intermediary metabolism, oxidative phosphorylation, and membrane transport. 

We can predict whether pairs of coupled reactions will proceed spontaneously by 
simply summing the free energy changes for each reaction. For example, consider the 
reaction from glycolysis (discussed in Chapter 18) involving the conversion of phospho- 
enolpyruvate (PEP) to pyruvate (Figure 3.14). The hydrolysis of PEP is energetically 
very favorable, and it is used to drive phosphorylation of adenosine diphosphate (ADP) 
to form ATP, a process that is energetically unfavorable. Using values of AG that would 
be typical for a human erythrocyte: 


PEP + H,O —> pyruvate + P; AG = —78 kJ/mol (3.34) 
ADP + P, — ATP + H,O AG = +55 kJ/mol (3.35) 
PEP + ADP —> pyruvate + ATP Total AG = —23 kJ/mol (3.36) 


The net reaction catalyzed by this enzyme depends upon coupling between the two 
reactions shown in Equations 3.34 and 3.35 to produce the net reaction shown in 
Equation 3.36 with a net negative AG. Many other examples of coupled reactions are 
considered in our later discussions of intermediary metabolism. In addition, many of 


69 


Figure 3.14 The pyruvate kinase reaction. 
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the complex biochemical systems discussed in the later chapters of this text involve 
reactions and processes with positive AG values that are driven forward by coupling to 
reactions with a negative AG. 


| Example | The phosphofructokinase reaction in glycolysis is: 


Fructose-6-P + ATP = — Fructose-1,6-bisphosphate + ADP + H* 


The standard-state free energy change for this reaction is — 14,200 J/mol. 


a. Calculate the standard-state free energy of hydrolysis of fructose-1,6-bisphos- 
phate to fructose-6-P. 
Solution 
Step 1: Realize that the phosphofructokinase reaction can be combined with the hydro- 
lysis of ATP to obtain (by subtraction) the hydrolysis of fructose-1,6-bisphosphate: 


F1,6 — bisP + ADP —> F6P + ATP, AG? = +14,200 J/mol 
ATP + HO —> ADP + P, AG° = —30,500 J/mol 


Step 2: Add these two equations and add the values of A G° to obtain the answer. 


F1,6 — bisP + MO- F6P + P;, AG? = 14,200 — 30,500 
= —16,300 = —16.3 kJ/mol 
AG = —16.3 kJ/mol 


b. The human red blood cell concentrations of various metabolites are: 


[ATP] = 1.85 mM, [ADP] = 0.14 mM, [fructose-6-P] = 0.014 mM, 
[fructose-1,6-bisphosphate] = 0.031 mM 


What is the cellular free energy change for the phosphofructokinase reaction at 
IIE? 
Solution 


Step 1: Use Equation 3.8 and insert the concentrations of the metabolites in molar 
concentrations: 


[F1,6 — bisP|[ADP] 
AG = AG + RTIn 


[FOP][ATP] 
Step 2: Insert the correct values and solve for the answer. 


AG = —14.2 kJ/mol + (8.314 J/mol  K)(310 K) 
(ae x 10-3 M][0.14 X al 
n 


[0.014 X 10-3 MJ[1.85 xX 10°37 M] 
AG = —18.8 kJ/mol 


c. What is the equilibrium constant for the phosphofructokinase reaction? 
Solution 
Step 1: Write down the appropriate equation: 

AG” = -RT In Keg 
Step 2: Rearrange the equation and solve. 
14,200 J/mol 

(8.314 J/mol + K)(298 K) ~ 
K oS 


In Keg = 5.73 


Notice that the temperature used in this calculation is the standard-state tempera- 
ture, 298 K. Unless otherwise noted, equilibrium constants are always measured at 
standard-state temperature. 
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A Deeper Look 


Coupling a Reaction to ATP Hydrolysis Changes Its Keq by a Factor of 10° 


Consider a process, A —— B. It could be a biochemical reaction, or 
the transport of an ion against a concentration gradient, or even a 
mechanical process (such as muscle contraction). Assume that it is 
a thermodynamically unfavorable reaction. Let’s say for purposes of 
illustration that AG°’ = +13.8 kJ/mol. From the equation, 
AG” = -RT In Keg 
we have 
+13,800 = —(8.31 J/K - mol)(298 K) In Keg 
which yields 
In Keg = —5.57 

Therefore, 

Keq = 0.0038 = [Beq]/[Acg] 


This reaction is clearly unfavorable (as we could have foreseen from 
its positive A G°’). At equilibrium, there is one molecule of product B 
for every 263 molecules of reactant A. Not much A was transformed 
to B. 

Now suppose the reaction A —— B is coupled to ATP hydrolysis, 
as is often the case in metabolism: 


A + ATP =— B + ADP + P; 


The thermodynamic properties of this coupled reaction are the 
same as the sum of the thermodynamic properties of the partial 
reactions: 


A==B AG” = +13.8 kJ/mol 

ATP + H,O == ADP + P; AG?’ = —30.5 kJ/mol 

A+ ATP + H,O ==B+ ADP + P; AG” = —16.7 kJ/mol 
That is, 


AG" overall) = —16.7 kJ/mol 
So 


~16,700 = —RT In Kg = —(8.31)(298)In Keg 
In Keg = —16,700/—2476 = 6.75 
Keg = 850 


Using this equilibrium constant, let’s now consider the cellular situ- 
ation in which the concentrations of A and B are brought to equi- 
librium in the presence of typical prevailing concentrations of ATP, 
ADP, and P;.* 


*The concentrations of ATP, ADP, and P; in a normal, healthy bacterial cell 
growing at 25°C are maintained at roughly 8 mM, 8 mM, and | mM, respec- 
tively. Therefore, the ratio [ADP][P;/[ATP] is about 10~*. Under these con- 
ditions, AG for ATP hydrolysis is approximately —47.6 kJ/mol. 


_ [Begl[ADPIPi] 
“1 [Ag [ATP] 


[B.ql[8 x 10~3][10~7] 
[A.gI[8 x 1073] 


850 


[Beql/[Aca] = 850,000 


Comparison of the [Beg]/[Acq] ratio for the simple A == B reaction 
with the coupling of this reaction to ATP hydrolysis gives 


850,000 _ 


2.2 x 108 
0.0038 


The equilibrium ratio of B to A is more than 10° times greater 
when the reaction is coupled to ATP hydrolysis. A reaction that 
was clearly unfavorable (K., = 0.0038) has become emphatically 
spontaneous! 

The involvement of ATP has raised the equilibrium ratio of B/A 
by more than 200-million fold. It is informative to realize that this 
multiplication factor does not depend on the nature of the reaction. 
Recall that we defined A —— B in the most general terms. Also, 
the value of this equilibrium constant ratio, some 2.2 X 108, is not 
at all dependent on the particular reaction chosen or its standard 
free energy change, AG°’. You can satisfy yourself on this point by 
choosing some value for A G°’ other than + 13.8 kJ/mol and repeat- 
ing these calculations (keeping the concentrations of ATP, ADP, 
and P; at 8, 8, and 1 mM, as before). 


NHo 
Phosphoric anhydride Na DR 
linkages L | 
=A Sy N 
[eel 
~O—P P 7 P—O—CH, 

| | o 

On On op 
OHTOH 
ATP 


(adenosine-5'-triphosphate) 


3.8 | What Is the Daily Human Requirement for ATP? 


This discussion of ATP and the other important high-energy compounds in biology in- 
vites an examination of the daily metabolic consumption of ATP by humans. An approxi- 
mate calculation gives a somewhat surprising and impressive result. Assume that the 
average adult human consumes approximately 11,700 kJ (2800 kcal, that is, 2800 Calories) 
per day. Assume also that the metabolic pathways leading to ATP synthesis operate at a 
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thermodynamic efficiency of approximately 50%. Thus, of the 11,700 kJ a person con- 
sumes as food, about 5850 kJ end up in the form of synthesized ATP. As indicated earlier, 
the hydrolysis of 1 mole of ATP yields approximately 50 kJ of free energy under cellular 
conditions. This means that the body cycles through 5850/50 = 117 moles of ATP each 
day. The disodium salt of ATP has a molecular weight of 551 g/mol. Using this value in a 
rough calculation, an average person hydrolyzes about 


551g 
(117 moles) male 64,467 g of ATP per day 


The average adult human, with a typical weight of 70 kg or so, thus consumes ap- 
proximately 65 kg of ATP per day, an amount nearly equal to their own body weight! 
Fortunately, we have a highly efficient recycling system for ATP/ADP utilization. The 
energy released from food is stored transiently in the form of ATP. Once ATP energy 
is used and ADP and phosphate are released, our bodies recycle it to ATP through 
intermediary metabolism so that it may be reused. The typical 70 kg body contains 
only about 50 grams of ATP/ADP total. Therefore, each ATP molecule in our bodies 
must be recycled nearly 1300 times each day! Were it not for this fact, at current com- 
mercial prices for ATP of about $60 per gram, our ATP “habit” would cost almost 
$4 million per day! 


Foundational Biochemistry Things You Should Know after Reading Chapter 3. 


Isolated systems exchange neither matter nor energy with their sur- 
roundings. 


Closed systems exchange energy but not matter with their surround- 
ings. 


Open systems exchange both matter and energy with their surround- 
ings. 


The first law of thermodynamics—the total energy of an isolated 
system is conserved. 


Enthalpy, H, is the heat content of a system. 


The second law of thermodynamics—systems tend toward disorder 
and randomness. 


Entropy, S, represents energy dispersion. 


The third law of thermodynamics—the entropy of a crystalline, per- 
fectly ordered substance is exactly zero at 0 K. 


Free energy is the capacity to do work. 


Free energy depends on reactant and product concentrations 
(Equation 3.8). 


The equilibrium for a reaction in solution is a function of AG® 
(Equation 3.9). 


Free energy is pH dependent if protons are produced or consumed 
in a reaction. 


AG” symbolizes the standard-state free energy change at pH 7. 


Reduced coenzymes and high-energy phosphates mediate flow of 
energy from exergonic reactions to endergonic reactions. 


e High-energy phosphates include phosphoric anhydrides, enol phos- 
phates, acyl phosphates, and guanidine phosphates. 

e High-energy phosphates are not long-term energy storage substances, 
but rather, transient forms of stored energy. 

e ATP is an intermediate energy-shuttle molecule. 

èe The AG” for ATP hydrolysis used in this text is —30.5 kJ/mol. 

¢ =Phospho(enol)pyruvate (PEP) has the largest AG°’of hydrolysis of 
the high-energy phosphates (— 62.2 kJ/mol). 

e Group transfer potentials quantify the reactivity of functional groups. 

e The favorable AG°'of hydrolysis of phosphoric anhydrides is due to 
electrostatic, ionization, and resonance effects. 

e The AG” of ATP depends on pH and on the concentration of metal 
ions. 

© = The cellular AG of ATP hydrolysis depends on the concentrations of 
ATP, ADP, and P;. 

e Reactions and processes with unfavorable AG can be made favorable 
by coupling with favorable processes, such as the hydrolysis of ATP. 

e Humans typically consume their own body weight in ATP each day. 

e The typical human body contains about 50 grams of ATP. Recycling 
of each molecule approximately 1300 times per day satisfies the body’s 
ATP needs. 

©  ' values can be used to predict the direction of redox reactions. 

e 


Éo’ values can be used to analyze energy changes in redox reactions. 
The reduction potential depends on concentration. 
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Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. 


Using Thermodynamic Equations A reaction was studied in the 
laboratory and the following thermodynamic data were obtained at 
20°C: AH = 600 kJ/mol and AS = 2000 J/mol. This resulted in an 
estimate for AG = 14 kJ/mol, making the reaction “non-spontaneous.” 
However, in cells of the human body this reaction may actually occur 
spontaneously, even without the help of any proteins, enzymes, or 
other biomolecules. Explain and show your calculations. 


. Calculating Keq and AG from Concentrations An enzymatic hydrolysis 


of fructose-1-P, 
Fructose-1-P + HO =— fructose + P; 


was allowed to proceed to equilibrium at 25°C. The original concentra- 
tion of fructose-1-P was 0.2 M, but when the system had reached equi- 
librium, the concentration of fructose-1-P was only 6.52 X 1075 M. 
Calculate the equilibrium constant for this reaction and the free energy 
of hydrolysis of fructose-1-P. 


. Calculating AG° and AS’ from AH’? The equilibrium constant for some 


process A == B is 0.5 at 20°C and 10 at 30°C. Assuming that A H° is 
independent of temperature, calculate A H° for this reaction. 
Determine A G° and A S° at 20°C and at 30°C. Why is it important in 
this problem to assume that A H° is independent of temperature? 


. Calculating AG from AG” The standard-state free energy of hydroly- 


sis for acetyl phosphate is AG°’ = —42.3 kJ/mol. 
Acetyl-P + HO —> acetate + P; 


Calculate the free energy change for acetyl phosphate hydrolysis 
in a solution of 2 mM acetate, 2 mM phosphate, and 3 nM acetyl 
phosphate. 


. Understanding State Functions Define a state function. Name three 


thermodynamic quantities that are state functions and three that 
are not. 


. Thermodynamics and Spontaneous/Non-Spontaneous Processes 


Passive diffusion, the movement of molecules from an area of high 
concentration to an area of low concentration, is an important 
transport mechanism in biochemistry. In almost all cases, passive 
diffusion is a spontaneous process. Using the thermodynamic terms 
discussed in this chapter, explain why this process would be 
spontaneous. 


. Calculating the Effect of pH on AG’ ATP hydrolysis at pH 7.0 is 


accompanied by release of a hydrogen ion to the medium 
ATP* + HO == ADP? + HPO,- + H* 
If the AG?’ for this reaction is —30.5 kJ/mol, what is A G° (that is, the 


free energy change for the same reaction with all components, includ- 
ing H*, at a standard state of 1 M)? 


. Calculating Keq and AG” for Coupled Reactions For the process 


A == B, Ke4 (AB) is 0.02 at 37°C. For the process B == C, Keq 

(BC) is 1000 at 37°C. 

a. Determine Keq (AC), the equilibrium constant for the overall 
process A == C, from Keq (AB) and Keq (BC). 

b. Determine standard-state free energy changes for all three 
processes, and use AG°(AC) to determine Keq (AC). Make sure that 
this value agrees with that determined in part a of this problem. 


. AG of Combination of Reactions An important step of the glycolysis 


pathway is the conversion of 1,3-bisphosphoglycerate (BPG) into 
3-phosphoglycerate (3-PG) with the release of ATP: BPG + 

ADP =— 3-PG + ATP. This reaction typically functions close to 
equilibrium, with AG ~ 0 kJ/mol. Consider a case where a second 


10. 


Tl. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


enzyme is present that hydrolyzes ATP into AMP and pyrophosphate 
(PP;): ATP —> AMP + PP.. 


a. Would the presence of the second enzyme increase or decrease 
the value of AG for the conversion of BPG into 3-PG? 

b. What would be the sign of AG, negative or positive, for the 
combined reaction: BPG + ADP —> 3-PG + AMP + PP? 
Explain. 


Understanding Resonance Structures Draw all possible resonance 
structures for creatine phosphate and discuss their possible effects on 
resonance stabilization of the molecule. 


Calculating Keq from AG” Write the equilibrium constant, Keg, for the 
hydrolysis of creatine phosphate and calculate a value for Keq at 25°C 
from the value of A G°’ in Table 3.2. 


Imagining Creatine Phosphate and Glycerol-3-Phosphate as Energy 
Carriers Imagine that creatine phosphate, rather than ATP, is the 
universal energy carrier molecule in the human body. Repeat the 
calculation presented in Section 3.8, calculating the weight of creatine 
phosphate that would need to be consumed each day by a typical 
adult human if creatine phosphate could not be recycled. If recycling 
of creatine phosphate were possible, and if the typical adult human 
body contained 20 grams of creatine phosphate, how many times 
would each creatine phosphate molecule need to be turned over or 
recycled each day? Repeat the calculation assuming that glycerol- 
3-phosphate is the universal energy carrier and that the body contains 
20 grams of glycerol-3-phosphate. 


Calculating AG in a Rat Liver Cell Calculate the free energy of 
hydrolysis of ATP in a rat liver cell in which the ATP, ADP, and P; 
concentrations are 3.4, 1.3, and 4.8 mM, respectively. 


Calculating AG” and Keq from Cellular Concentrations Hexokinase 
catalyzes the phosphorylation of glucose from ATP, yielding glucose- 
6-P and ADP. The standard-state free energy change for hydrolysis of 
glucose-6-P is — 13.9 kJ/mol. Calculate the standard-state free energy 
change and equilibrium constant for the hexokinase reaction. 


Oxidation/Reduction Potential and Thermodynamics According to the 
data presented in Table 3.3, oxygen has a very high reduction 
potential, meaning that it is a potent oxidizing agent. However, oxygen 
does not spontaneously oxidize nutrients. Refer to Section 3.4 and 
explain what impact enzymes may have. 


Evaluating Reactivity of High-Energy Molecules Would you expect the 
free energy of hydrolysis of acetoacetyl-coenzyme A (see diagram) to 
be greater than, equal to, or less than that of acetyl-coenzyme A? 
Provide a chemical rationale for your answer. 
(0) O 
Il | 


CH,— C — CHy— C—S— CoA 


Assessing the Reactivity of Carbamoyl Phosphate Consider carbamoyl 
phosphate, a precursor in the biosynthesis of pyrimidines: 


\ g= 
O—PO,” 


Based on the discussion of high-energy phosphates in this chapter, 
would you expect carbamoyl phosphate to possess a high free energy 
of hydrolysis? Provide a chemical rationale for your answer. 


Assessing the Effect of Temperature on Equilibrium You are studying 
the various components of the venom of a poisonous lizard. One of 
the venom components is a protein that appears to be temperature 
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20. 


21. 


22. 
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sensitive. When heated, it denatures and is no longer toxic. The 
process can be described by the following simple equation: 


T (toxic) == N (nontoxic) 


There is only enough protein from this venom to carry out two equi- 

librium measurements. At 298 K, you find that 98% of the protein is 

in its toxic form. However, when you raise the temperature to 320 K, 

you find that only 10% of the protein is in its toxic form. 

a. Calculate the equilibrium constants for the T to N conversion at 
these two temperatures. 


b. Use the data to determine the A H°, A S°, and AG? for this 
process. 


Assessing the Effect of pH on Metabolic Reactions The difference 
between A G° and A G°’ was discussed in Section 3.3. Consider the 
hydrolysis of acetyl phosphate (Figure 3.7) and determine the value 
of AG? for this reaction at pH 2, 7, and 12. The value of A G°’ for the 
enolase reaction (Figure 3.8) is 1.8 kJ/mol. What is the value of A G° 
for enolase at pH 2, 7, and 12? Why is this case different from that of 
acetyl phosphate? 


Analyzing the Energetics of Coupled Reactions What is the significance 
of the magnitude of A G°’ for ATP hydrolysis in the calculations in the 
Deeper Look box “ATP Changes the K,, by a Factor of 108”? 

Repeat these calculations for the case of coupling of a reaction to 
1,3-bisphosphoglycerate hydrolysis to see what effect this reaction 
would have on the equilibrium ratio for components A and B under 
the conditions stated on this page. 


The acyl-CoA synthetase reaction is: 


O O 
Y/ A 
R—C + CoASH + ATP R—C + AMP + PP; 
Oo SCoA 
Fatty Acid Fatty-acyl CoA 


a. AG” = —0.8 kJ/mol. Use data from Table 3.1 to determine the 
standard-state free energy of hydrolysis of fatty-acyl CoA. 
The concentrations of various metabolites in the human red 
blood cell are: [CoASH] = 0.12 mM, [fatty acid] = 0.001 mM, 
[AMP] = 0.14 mM, [ATP] = 1.85 mM, PP; = [0.5 mM] 

b. What is the equilibrium constant for the fatty-acyl CoA 
synthetase reaction? 

c. What is the cellular free energy change for the fatty-acyl CoA 
synthetase reaction at 37°C? 


The standard-state free energy change (A G°’) for hydrolysis of 
carbamoyl phosphate is —40.47 kJ/mol. 


oO 
I i Il 
H,N—C—O—P—O- t HO 2>HN—C—O + R 
| 
o- 
Carbamoyl phosphate Carbamate 


a. What is the free energy change (AG) for carbamoyl phosphate 
hydrolysis in a solution in which the concentrations are 


[carbamoyl] phosphate] = 2.5 mM, [carbamate] = 1.5 mM, 
[P] = 1 mM? 

b. What is the equilibrium constant for the carbamoyl phosphate 
hydrolysis reaction? 

c. In the carbamoyl phosphate synthetase reaction, ADP is 
phosphorylated by carbamoyl phosphate to form ATP: 


ADP + carbamoyl phosphate —> ATP + carbamate 
What is the standard-state free energy change for this reaction? 


d. What is the equilibrium constant for the carbamoyl phosphate 
synthetase? 


23. The hydrolysis of 1,3-bisphosphoglycerate is favorable, due in part to 
the increased resonance stabilization of the products of the reaction. 
Draw resonance structures for the reactant and the products of this 
reaction to establish that this statement is true. 


24. The acyl-CoA synthetase reaction activates fatty acids for oxidation 
in cells: 


R-COO™ + CoASH + ATP —> R-COSCoA + AMP + pyrophosphate 


The reaction is driven forward in part by hydrolysis of ATP to AMP 
and pyrophosphate. However, pyrophosphate undergoes further cleav- 
age to yield two phosphate anions. Discuss the energetics of this reac- 
tion both in the presence and absence of pyrophosphate cleavage. 


25. You have been hired to work in an independent drug- and chemical- 
testing lab and your first assignment involves creatine, a molecule 
found in skeletal muscle that reacts with ATP to produce creatine 
phosphate and ADP (the so-called creatine kinase reaction, which has 
a AG in skeletal muscle of 8040 J/mol). During the first few seconds of 
intense muscular effort, this reaction runs in reverse, producing extra 
ATP to support muscle contraction. For your first assignment, your 
supervisor asks you to determine the concentration of creatine in 
resting human muscle. She tells you that the muscle tissue contains 
1.85 mM ATP, 0.14 mM ADP, and 25 mM creatine phosphate. Use 
the information provided here as well as in Table 3.1 to determine the 
creatine concentration in muscle. 


Think-Pair-Share Question 


Create An Electron Transport Pathway Consider the figure shown. 
Electrons need to move from NADH to O, through a number of 
intermediates. The following intermediates are available (in no 
particular order): UQ, Cytochrome c (Fe**), FMN, Cytochrome b; 
(Fe3*). Consult information given in Table 3.3. 

a. Assign appropriate intermediates to beakers B, C, D, and E. 


b. For each pair of beakers, write a balanced equation and calculate the 
overall reduction potential AG’ of the redox reactions. 

c. For the entire system, write a balanced equation and calculate the 
overall reduction potential A60’. 


(a) (b) (c) (d) (e) (£) 
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David W. Grisham 


Amino Acids and the 
Peptide Bond 


i. a = pills i 

Te 4 < All objects have mirror images. Like many 
molecules, amino acids exist in 
mirror-image forms (stereoisomers) that 
are not superimposable. Only the 
L-isomers of amino acids commonly occur 
in nature. (Three Sisters Wilderness, 
central Oregon. The Middle Sister, 
reflected in an alpine lake.) 


Proteins are the indispensable agents of biological function, and amino acids are the build- 
ing blocks of proteins. Why are amino acids uniquely suited to their role as the building 
blocks of proteins? 


he stunning diversity of the thousands of proteins found in nature arises from the 

intrinsic properties of only 20 commonly occurring amino acids. These features 
include (1) the capacity to polymerize, (2) novel acid—base properties, (3) varied 
structure and chemical functionality in the amino acid side chains, and (4) chirality. 
This chapter describes each of these properties, laying a foundation for discussions of 
protein structure, enzyme function, and many other subjects in later chapters. 


4.1 | What Are the Structures and Properties 


of Amino Acids? 


The functions of all proteins depend on the structures and properties of the twenty 
common amino acids. This section describes the essential features of these amino 
acid building blocks. 


4.1a Typical Amino Acids Contain a Central Tetrahedral Carbon Atom 


The structure of a single typical amino acid is shown in Figure 4.1. Central to this 
structure is the tetrahedral alpha (a) carbon (Ca), which is covalently linked to both 
the amino group and the carboxyl group. Also bonded to this a-carbon are a hydro- 
gen and a variable side chain. It is the side chain, the so-called R group, that gives each 
amino acid its identity. The detailed acid—base properties of amino acids are discussed 
in the following sections. It is sufficient for now to realize that in a neutral solution 


4 


To hold, as ‘twere, the mirror up to nature. 


William Shakespeare 
Hamlet 
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of Amino Acids? 
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Careers in Chemistry 


Vanessa A. 


(pH 7), the carboxyl group exists as —COO™ and the amino group as —NH;°. Be- 
cause the resulting amino acid contains one positive and one negative charge, it is a 
neutral molecule called a zwitterion. Amino acids are also chiral molecules. With four 
different groups attached to it, the a-carbon is said to be asymmetric. The two possible 
configurations for the a-carbon constitute nonsuperimposable mirror-image isomers 
or enantiomers. Details of amino acid stereochemistry are discussed in Section 4.4. 


4.1b Amino Acids Can Join via Peptide Bonds 


The crucial feature of amino acids that allows them to polymerize to form peptides 
and proteins is the existence of their two identifying chemical groups: the amino 
(—NH;*) and carboxyl (—COO_) groups, as shown in Figure 4.2. The amino and 
carboxyl groups of amino acids can react in a head-to-tail fashion, eliminating a 
water molecule and forming a covalent amide linkage, which, in the case of peptides 
and proteins, is typically referred to as a peptide bond. The equilibrium for this reac- 
tion in aqueous solution favors peptide bond hydrolysis. Thus, peptide bond forma- 
tion requires energy input. 

Repetition of the reaction shown in Figure 4.2 produces polypeptides and proteins. The 
remarkable properties of proteins, described throughout this text, all depend in one way or 
another on the unique properties and chemical diversity of the 20 common amino acids. 


Nadal 
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at Fordham University where she specializes in the 
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4.lc There Are 20 Common Amino Acids 


The structures and abbreviations for the 20 amino acids commonly found in proteins are 
shown in Figure 4.3. All the amino acids except proline have both free a-amino and free 
a-carboxyl groups (Figure 4.1). There are several ways to classify the common amino acids. 
The most useful of these classifications is based on the polarity of the side chains. Thus, the 
structures shown in Figure 4.3 are grouped into the following categories: (1) nonpolar or 
hydrophobic amino acids, (2) neutral (uncharged) but polar amino acids, (3) acidic amino 
acids (which have a net negative charge at pH 7.0), and (4) basic amino acids (which have a 


Side 
H R^ chain 
a-Carbon ali Z 
a 
tf o\ = 
Figure 4.1 Anatomy of an amino acid. Except for H3N COO 
proline and its derivatives, all of the amino acids 
commonly found in proteins possess this type of Amino Carboxyl Ball-and-stick Amino acids are 
structure. group group model tetrahedral structures 
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Removal of a water 
molecule leads to formation 
of the CO —NH bond. 


Peptide bond © 


net positive charge at pH 7). In later chapters, the importance of this classification system 
for predicting protein properties becomes clear. Also shown in Figure 4.3 are the three- 
letter and one-letter codes used to represent the amino acids. These codes are useful when 
displaying and comparing the sequences of proteins in shorthand form. (Note that several 
of the one-letter abbreviations are phonetic in origin: arginine = “Rginine” = R, phenyl- 
alanine = “Fenylalanine” = F, aspartic acid = “asparDic” = D.) 


Nonpolar Amino Acids The nonpolar amino acids (Figure 4.3a) are critically impor- 
tant for the processes that drive protein chains to fold, that is to form their natural 
(and functional) structures, as shown in Chapter 6. Amino acids termed nonpolar in- 
clude all those with alkyl chain R groups (alanine, valine, leucine, and isoleucine), as 
well as proline (with its unusual cyclic structure); methionine (one of the two sulfur- 
containing amino acids); and two aromatic amino acids, phenylalanine and trypto- 
phan. Tryptophan is sometimes considered a borderline member of this group because 
it can interact favorably with water via the N—H moiety of the indole ring. Proline, 
strictly speaking, is not an amino acid but rather an a-imino acid. 


Polar, Uncharged Amino Acids All the polar, uncharged amino acids (Figure 4.3b), except 
for glycine, contain R groups that can (1) form hydrogen bonds with water, and (2) play 
a variety of nucleophilic roles in enzyme reactions. These amino acids are usually more 
soluble in water than the nonpolar amino acids. The amide groups of asparagine and 
glutamine; the hydroxyl groups of tyrosine, threonine, and serine; and the sulfhydryl 
group of cysteine are all good hydrogen bond-forming moieties. Glycine, the simplest 
amino acid, has only a single hydrogen for an R group, and this hydrogen is not a good 
hydrogen bond former. Glycine’s solubility properties are mainly influenced by its polar 
amino and carboxyl groups, and thus glycine is best considered a member of the polar, 
uncharged group. It should be noted that tyrosine has significant nonpolar characteris- 
tics due to its aromatic ring and could arguably be placed in the nonpolar group. How- 
ever, with a pK, of 10.1, tyrosine’s phenolic hydroxyl is a charged, polar entity at high pH. 


Acidic Amino Acids There are two acidic amino acids—aspartic acid and glutamic 
acid—whose R groups contain a carboxyl group (Figure 4.3c). These side-chain car- 
boxyl groups are weaker acids than the a-COOH group but are sufficiently acidic to 
exist as —COO™ at neutral pH. Aspartic acid and glutamic acid thus have a net nega- 
tive charge at pH 7. These forms are appropriately referred to as aspartate and gluta- 
mate. These negatively charged amino acids play several important roles in proteins. 
Many proteins that bind metal ions for structural or functional purposes possess 
metal-binding sites containing one or more aspartate and glutamate side chains. The 
acid—base chemistry of such groups is considered in detail in Section 4.2. 


Figure 4.2 The a-COOH and a-NH;°* groups of 
two amino acids can react with the resulting loss of 
a water molecule to form a covalent amide bond. 
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(a) 


Nonpolar (hydrophobic) 


Proline (Pro, P) 


Glycine (Gly, G) 


Alanine (Ala, A) Valine (Val, V) 
(b) Polar, uncharged 
COOH ieee 
H3Nt ! SE 
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Serine (Ser, S) 


coon 
H;N+— 


Asparagine (Asn, N) 


Glutamine (Gln, Q) 


(c) 


Acidic 


a 


H;N+— C —H 


Aspartic acid (Asp, D) 


Glutamic acid (Glu, E) 


Figure 4.3 The 20 amino acids that are building blocks of most proteins can be classified as (a) nonpolar (hydrophobic); (b) polar, neutral; 
(c) acidic; or (d) basic. The side chains (R groups) are highlighted in yellow. Note that all these amino acids are shown in their (lowest pH) most 
protonated forms, for consistency. The various protonation states are discussed in Section 4.2. 
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Basic Amino Acids Three of the common amino acids have side chains with net positive 
charges at neutral pH: histidine, arginine, and lysine (Figure 4.3d). Histidine contains an 
imidazole group, arginine contains a guanidino group, and lysine contains a protonated 
alkyl amino group. The side chains of the latter two amino acids are fully protonated at 
pH 7, but histidine, with a side-chain pK, of 6.0, is only 10% protonated at pH 7. With a 
pX, near neutrality, histidine side chains play important roles as proton donors and accep- 
tors in many enzyme reactions. Histidine-containing peptides are important biological 
buffers, as discussed in Chapter 2. Arginine and lysine side chains, which are protonated 
under physiological conditions, participate in electrostatic interactions in proteins. 


4.1d Are There Other Ways to Classify Amino Acids? 


There are alternative ways to classify the 20 common amino acids. For example, it 
would be reasonable to imagine that the amino acids could be described as hydrophobic, 
hydrophilic, or amphipathic: 


Hydrophobic: Hydrophilic: Amphipathic: 
Alanine Proline Arginine Glutamine Lysine 
Glycine Valine Asparagine Histidine Methionine 
Isoleucine Aspartic acid Serine Tryptophan 
Leucine Cysteine Threonine Tyrosine 
Phenylalanine Glutamic acid 


Lysine can be considered amphipathic because its R group consists of an aliphatic 
side chain, which can interact with hydrophobic amino acids in proteins, and an 
amino group, which is normally charged at neutral pH. Methionine is the least polar 
of the amphipathic amino acids, but its thioether sulfur can be an effective metal 
ligand in proteins. Cysteine can deprotonate at pH values greater than 7, and the thio- 
late anion is the most potent nucleophile that can be generated among the 20 common 
acids. The imidazole ring of histidine has two nitrogen atoms, each with an H. The pK 
for dissociation of the first of these two H is around 6. However, once one N—H has 
dissociated, the pK value for the other becomes greater than 10. 


A Deeper Look 
Why Nature Chose Selenium 


Small amounts of selentum—mostly as selenocysteine (Sec)—are 
found in all three domains of life—Eukarya, Archaea, and Bacte- 
ria. Humans have 25 genes for selenoproteins, and half of these are 
enzymes involved in homeostasis of oxidation—reduction processes. 
What chemical properties of selenium explain the role of selenocys- 
teine in biology? 

Selenium is typically a replacement for its Group 16 cousin, sul- 
fur, and one of the most notable differences is that the pK, of the 
selenocysteine-Se-H group (5.2) is much lower than that of cysteine- 
SH (8.3). Thus, Sec is fully deprotonated at physiological pH and 
Sec is an even better nucleophile than Cys. On the other hand, 
redox chemistry (both for one-electron and two-electron redox reac- 
tions) is the biggest difference between the two. The outer valence 
electrons of selenium are more loosely held than those of sulfur, 
and selenium cannot form 7 bonds of any type. As a much better 
nucleophile, Se will react faster than S with reactive oxygen species. 
And, the lack of 7 bond character in the Se-O bond means that the 
Se-oxides are much more readily reduced than comparable S-oxides. 
This combination of properties means that the replacement of 
sulfur with selenium creates molecules more resistant to oxidation. 


Human selenoenzymes are involved in peroxide removal, activation 
and inactivation of thyroid hormones, repair of oxidized methionine res- 
idues in proteins, immune system regulation, mammalian development, 
and both pro- and anti-cancer activities. For example, methionine 
sulfoxide (see figure) accumulates in proteins and tissues with time and 
is thought to contribute to many aspects of aging. A selenoprotein, 
methionine sulfoxide reductase, reverses this process as shown. 


COOH COOH 
HN H F 1 Ə Oxidative processes — HN H 
Ci 
2 ~ Methionine sulfoxide 
s reductase o=s 
\ \ 
CH; CH3 


Hatfield, D. L., Tsuji, P. A., Carlson, B. A., and Gladyshev, V. N., 2014. Selenium and 
selenocysteine: roles in cancer, health, and development. Trends in Biochemical Sciences 
39:112-120. 

Labunksyy, V. M., Hatfield, D. L., and Gladyshev, V. N., 2014. Selenoproteins: molecular 
pathways and biological roles. Physiological Review 94:739-777. 

Reich, H. J., and Hondal, R. J., 2016. Why Nature Chose Selenium. ACS Chemical Biology 
11:821-841. 
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Figure 4.4 The structures of several amino acids that are less common but nevertheless found in 
certain proteins. Hydroxylysine and hydroxyproline are found in connective-tissue proteins; 
pyroglutamic acid is found in bacteriorhodopsin (a protein in Halobacterium halobium). 


4.le Amino Acids 21 and 22—and More? 


Although uncommon, natural amino acids beyond the well-known 20 actually do 
occur. Selenocysteine (Figure 4.4) was first identified in 1986 and it has since been 
found in a variety of organisms. 

More recently, Joseph Krzycki and his colleagues at Ohio State University have 
discovered a lysine derivative—pyrrolysine—in several archaeal species, including 
Methanosarcina barkeri, a bottom-dwelling microbe of fresh-water lakes. Pyrrolysine 
(Figure 4.4) and selenocysteine both are incorporated naturally into proteins thanks to 
specific modifications that occur during the reactions of protein synthesis. 

Both selenocysteine and pyrrolysine bring novel structural and chemical features to 
the proteins that contain them. How many more unusual amino acids might be incorpo- 
rated in proteins in a similar manner? 


4.1f Several Amino Acids Occur Only Rarely in Proteins 


There are several amino acids that occur only rarely in proteins and are produced by 
modifications of one of the 20 amino acids already incorporated into a protein 
(Figure 4.4). These include hydroxylysine and hydroxyproline, which are found mainly 
in the collagen and gelatin proteins; pyroglutamic acid, which is found in a light-driven 
proton-pumping protein called bacteriorhodopsin; and y-carboxyglutamic acid, which 
is found in calcium-binding proteins. 


4.2 | What Are the Acid-Base Properties of Amino Acids? 


The functions of proteins are almost always dependent on the protonation states of their 
ionizable side chains. This section describes the acid—base properties and behavior of the 
amino acids. 


4.2a Amino Acids Are Weak Polyprotic Acids 


From a chemical point of view, the common amino acids are all weak polyprotic 
acids. The ionizable groups are not strongly dissociating ones, and the degree of dis- 
sociation thus depends on the pH of the medium. All the amino acids contain at least 
two dissociable hydrogens. 

Consider the acid—base behavior of glycine, the simplest amino acid. At low pH, 
both the amino and carboxyl groups are protonated and the molecule has a net positive 
charge. If the counterion in solution is a chloride ion, this form is referred to as glycine 
hydrochloride. If the pH is increased, the carboxyl group is the first to dissociate, yield- 
ing the neutral zwitterionic species Gly? (Figure 4.5). A further increase in pH eventually 


Pyroglutamic acid 
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Figure 4.5 The ionic forms of the amino acids that 
lack an ionizable sidechain. The cationic form is the | 
low pH form, and the titration of the cationic R 
species with base yields the zwitterion and finally the 
anionic form. 
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Critical Developments in Biochemistry 


Adding New Chemistry to Proteins with Unnatural Amino Acids 


The astounding diversity of protein structure and function in the 
biosphere derives almost entirely from the 20 common amino acids. 
Nonetheless, it is intriguing to imagine how protein function can be 
changed by creating and introducing new ‘unnatural’ amino acids. 
Peter Schultz and his coworkers at the Scripps Research Institute 
have developed powerful methodologies that facilitate the incorpo- 
ration of more than 160 novel and unnatural amino acids (UAAs) 
into proteins in Escherichia coli, yeast, and mammalian cells. In- 
corporation of these UAAs (two are shown here) at unique sites in 
proteins enables an array of new methods for the study of structure 
and function in these proteins, including the introduction of new 
catalytic functionality. For example, aniline is known to catalyze the 
formation of hydrazones by condensation of carbonyl compounds 
with hydrazines (figure, part a). Insertion of p-aminophenylalanine 
(pAF), an amino acid analogue of aniline, into LmrR, a bacterial 
protein with no catalytic function, creates a designer enzyme that 
carries out the same type of reaction. Similarly, 8-hydroxyquinoline 
(figure, part b) is known to bind tightly to a variety of metals, in- 
cluding Cu**, Zn?*, and Rh**. Incorporation of an amino acid ana- 
logue of 8-hydroxyquinoline (HQ-Ala) into the same noncatalytic 
LmrR protein produces instead a metalloenzyme that binds Cu** 
tightly and carries out the double-bond hydration reaction shown, 
and other Cu?*-dependent reactions. UAA methodology is enabling 
the creation of many new designer enzymes. It also has therapeutic 
potential for the development of novel antibodies, immunotoxins, 
and vaccines as well as new medical imaging probes. 
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The chemistry of some unnatural amino acids. (a) Structures of aniline and 
p-aminophenylalanine (pAF), and the reaction between 4-nitrobenzaldehyde 
(4-NBA) and 4-hydrazino-7-nitro-2,1,3-benzoxadiazole to form the 
corresponding hydrazone. (b) Structures of 8-hydroxyquinoline and 2-amino- 
3-(8-hydroxyquinolin-3-yl)propanoic acid (HQ-Ala), and the reaction of an 
a,B-unsaturated 2-acyl pyridine to form the corresponding B-hydroxy ketone 
product. 
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pK, Values of Common Amino Acids 


Amino Acid a-COOH pK, a-NH,* pK, R group pK, 
Alanine 2.4 9.7 

Arginine 2:2 9.0 12.5 
Asparagine 2.0 8.8 

Aspartic acid 2.1 9.8 3.9 
Cysteine 1.7 10.8 8.3 
Glutamic acid 2.2 9.7 4.3 
Glutamine 2.2 9.1 

Glycine 2.3 9.6 

Histidine 1.8 9.2 6.0 
Isoleucine 2.4 9.7 

Leucine 2.4 9.6 

Lysine 2.2 9.0 10.5 
Methionine 2.3 9.2 

Phenylalanine 1.8 9.1 

Proline 2.1 10.6 

Serine 2.2 9.2 = 13 
Threonine 2.6 10.4 ~13 
Tryptophan 2.4 9.4 

Tyrosine 22 9.1 10.1 
Valine 23 9.6 


results in dissociation of the amino group to yield the negatively charged glycinate. If 
we denote these three forms as Gly*, Gly?, and Gly~, we can write the first dissociation 
of Gly* as 


Gly* + HO == Gly’ + H,0* 
and the dissociation constant K; as 


_ [Gly°1[H,0*1] 
! [Gly*] 


Values for K; for the common amino acids are typically 0.4 to 1.0 x 10? M, so that 
typical values of pK; are in the range of 2.0 to 2.4 (Table 4.1). In a similar manner, we 
can write the second dissociation reaction as 


Gly? + HO == Gly~ + H;0* 
and the dissociation constant K, as 
_ [Gly ][H;0*] 
K, = —— 
[Gly°] 


Typical values for pK» are in the range of 9.0 to 9.8. At physiological pH, the 
a-carboxyl group of a simple amino acid (with no ionizable side chains) is completely 
dissociated, whereas the a-amino group has not really begun its dissociation. 
The titration curve for such an amino acid is shown in Figure 4.6. Note that the 
isoelectric point, pI, the pH where glycine has a net charge of 0, can be calculated as 
(pK, + pK2)/2. 

Note that the dissociation constants of both the a-carboxyl and a-amino groups 
are affected by the presence of the other group. The adjacent a-amino group makes the 
a-COOH group more acidic (that is, it lowers the pK,), so it gives up a proton more 
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Glyt Gly? Gly 
COOH COO7 COO- 
a= —— | 
H,N*— CHo H,N*— CHo HoN — CHo 


Isoelectric 
point 


1.0 < 0 > 1.0 
Equivalents of H* Equivalents of OH™ 


0 1.0 2.0 
Equivalents of OH” added > 


2.0 1.0 0 
~<— Equivalents of Ht added 


Figure 4.6 Titration of glycine, a simple amino acid. 


| Example | What is the pH of a glycine solution in which the a-NH;* group is one- 


third dissociated? 


Answer 

Step 1: Use the appropriate Henderson-Hasselbalch equation: 

[Gly] 

[Gly°] 

Step 2: What is the pK, for the a-NH;* group? 9.6 (Table 4.1) 

Step 3: If the a-amino group is one-third dissociated, there is one part Gly” for 


every two parts Gly®. 
Step 4: Inserting these values into the Henderson-Hasselbalch equation yields 


z 1] 
pH = 9.6 + logy, H 
[2] 


pH = pK, + logio 


pH = 9.3 


readily than simple alkyl carboxylic acids. Thus, the pK, of 2.0 to 2.1 for a-carboxyl 
groups of amino acids is substantially lower than that of acetic acid (pK, = 4.76), for 
example. What is the chemical basis for the low pK, of the a-COOH group of amino 
acids? The a-NH;3* (ammonium) group is strongly electron withdrawing, and the posi- 
tive charge of the amino group exerts a strong field effect and stabilizes the carboxylate 
anion. (The effect of the a-COO™ group on the pK, of the a-NH3°* group is the basis for 
Problem 6 at the end of this chapter.) 


4.2b Side Chains of Amino Acids Undergo Characteristic lonizations 


The side chains (R groups) of amino acids give them their unique character and func- 
tions. The side chains of several of the amino acids also contain dissociable groups. 
For example, aspartic and glutamic acids contain an additional carboxyl group, and 
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Lys Lys Lys Lys” 
COOH CcOO- COO (0010 
Glut Glu’ Glu” Glu2- H,N‘t-C—H HsN*-C—H H.N—C—H H.N—C—H 
oo ia aba ~ CH, CH, CHo CHo 
HN™—C—H H,N™— 7 —H HN í —H HN—C—H CHo CHo CH, CH, 
CHo CH CH, CHo CH, CH, CHo CH, 
CH qh: m CH, CH, CHo CHo CH, 
COOH COOH COO- (000 NH;* NHst NHst NHg 
14 14 


Isoelectric 
point 


pH pH 
6 6 
4 4 
2 point 2 
0 0 
0 1.0 2.0 3.0 0 1.0 2.0 3.0 
Equivalents of OH” added > Equivalents of OH” added —> 


Figure 4.7 Titrations of glutamic acid and lysine. 


lysine possesses an aliphatic amino group. Histidine contains an ionizable imidazo- 
lium proton, and arginine carries a guanidinium group. Typical pK, values of these 
groups are shown in Table 4.1. The B-carboxyl group of aspartic acid and the 
y-carboxyl side chain of glutamic acid exhibit pK, values intermediate to the a- COOH 
on one hand and typical aliphatic carboxyl groups on the other hand. In a similar 
fashion, the e-amino group of lysine exhibits a pX, that is higher than that of the 
a-amino group but similar to that for a typical aliphatic amino group. These interme- 
diate side-chain pK, values reflect the slightly diminished effect of the a-carbon dis- 
sociable groups that lie several carbons removed from the side-chain functional 
groups. Figure 4.7 shows typical titration curves for glutamic acid and lysine, along 
with the ionic species that predominate at various points in the titration. The only 
other side-chain groups that exhibit any significant degree of dissociation are the 
para-OH group of tyrosine and the —SH group of cysteine. The pK, of the cysteine 
sulfhydryl is 8.32, so it is about 5% dissociated at pH 7. The tyrosine para-OH group 
is a very weakly acidic group, with a pK, of about 10.1. This group is essentially fully 
protonated and uncharged at pH 7. 

It is important to note that side-chain pK, values for amino acids in proteins can be dif- 
ferent from the values shown in Table 4.1. On average, values for side chains in proteins are 
one pH unit closer to neutrality compared with the free amino acid values. Moreover, 
the local chemical environment of an amino acid in a protein can change its pK, dra- 
matically. For example, Lys residues in staphylococcal nuclease variants have been 
found to have pK, values as low as 5.3, whereas a mutant thioredoxin possesses an Asp 
residue with a pK, of 9.4. 
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[=< 


Calculating the pH of amino acid solutions: 


a. What is the pH of a 0.3 M solution of leucine hydrochloride? 

b. What is the pH of a 0.3 M sodium leucinate solution? 

c. What is the pH of isoelectric leucine? 
Solution 
Part a: Note that “hydrochloride” means that the leucine has been titrated to its 
fully protonated form, which will be called H,A*. 


Step 1: Begin by deriving the K, from the pK, given in Table 4.1. 
K, = 107PXa = 10724 = 3,98 x 1073 


Step 2: Write the K, expression: 


K, = HOHA’ — 3 9g x 10-3 
[H2A*] 


Step 3: Assume that only a small amount of H»A* dissociates: 


_[A*JHAY e 


= 3.98 x 1073 
[HLA] 03 —x 


Ka 


Step 4: Assume x is small compared to 0.3 M and solve for x: 


x2 = (0.3)(3.98 X 1073) = 1.19 x 10-3 
x = 3.45 x 10° 
pH = —logo(3.45 X 1072) = 1.46 


Part b: Note that “sodium leucinate” means that the leucine has been titrated to the 
fully deprotonated form, which will be called A~. 


Step 1: The reaction for the formation of A7 is: 

HA” + H,O —A + H;0* 
Step 2: Important—a K, expression cannot be used for the fully deprotonated 
form because the concentration of HA” (the denominator in a K, expression) goes 


to zero when this reaction goes to completion. So a K, expression, based on this 
equation, must be used: 


A” + HO —> HA” + OH- 
Step 3: Write the K, expression: 
g, - HAON 
[A] 
Step 4: Derive the K, from the pK, for the second dissociation of leucine from 
Table 4.1: 
pK, = 9.6, so pK, = 14 — 9.6 = 4.4 
Then Ke = 104 = 398 x 107 

Step 5: Write the K, expression: 

0 a 

me [HA lies I2 
[A] 


3.98 X 10> 


Now assume that only a small amount of A` protonates: 


_[HAJOH]_— * 
[A] 0.3 =x 


Ky = 3.98 x 10> 
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Step 6: Assume x is small compared to 0.3 M and solve for x: 


x? = (0.3)(3.98 X 10-5) = 1.19 x 10-5 
x = 3.46 X 10-3 = [OH-] 
pOH = 2.46 


Then pH = 14 — pOH = 14 — 2.46 = 11.54 


Part c: Note that the isoelectric point is the point where leucine has a net zero charge. 
That form, HA”, predominates at the first equivalence point in the leucine titration. 
The isoelectric pH is calculated by determining the average of the two adjacent 
pKa values. : 
24m9 
H =e Z 


7 6.0 


4.3 | What Reactions Do Amino Acids Undergo? 


The a-carboxyl and a-amino groups of all amino acids exhibit similar chemical reactiv- 
ity. The side chains, however, exhibit specific chemical reactivities, depending on the 
nature of the functional groups. Whereas all of these reactivities are important in 
the study and analysis of isolated amino acids, it is the characteristic behavior of the 
side chain that governs the reactivity ofamino acids incorporated into proteins. There are 
three reasons to consider these reactivities. Proteins can be modified in very specific 
ways by taking advantage of the chemical reactivity of certain amino acid side chains. 
The detection and quantitation of amino acids and proteins often depend on reactions 
that are specific to one or more amino acids and that result in color, radioactivity, or 
some other quantity that can be easily measured. Finally and most importantly, the bio- 
logical functions of proteins depend on the behavior and reactivity of specific R groups. 

A number of reactions of amino acids are noteworthy because they are essential to 
the degradation, sequencing, and chemical synthesis of peptides and proteins. One of 
these, the reaction with phenylisothiocyanate, or Edman reagent, involves nucleophilic 
attack by the amino acid a-amino nitrogen, followed by cyclization, to yield a phenyl- 
thiohydantoin (PTH) derivative of the amino acid (Figure 4.8a). PTH-amino acids can 
be easily identified and quantified, as shown in Section 4.6. An important amino acid 
side-chain reaction is formation of disulfide bonds via reaction between two cysteines. 
In proteins, cysteine residues form disulfide linkages that stabilize protein structure 
(Figure 4.8b). Related reactions are discussed in Chapter 5. 


4.4 | What Are the Optical and Stereochemical 
Properties of Amino Acids? 


The structures of proteins are influenced strongly by the asymmetric nature or ‘hand- 
edness’ of their amino acid components. This section describes the essential features 
and consequences of this aspect of their chemistry. 


44a Amino Acids Are Chiral Molecules 


Except for glycine, all of the amino acids isolated from proteins have four different 
groups attached to the a-carbon atom. In such a case, the a-carbon is said to be asym- 
metric or chiral (from the Greek cheir, meaning “hand”), and the two possible con- 
figurations for the a-carbon constitute nonsuperimposable mirror-image isomers, or 
enantiomers (Figure 4.9). Enantiomeric molecules display a special property called 
optical activity—the ability to rotate the plane of polarization of plane-polarized 
light. Clockwise rotation of incident light is referred to as dextrorotatory behavior, 
and counterclockwise rotation is called levorotatory behavior. The magnitude and 
direction of the optical rotation depend on the nature of the amino acid side chain 
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es HN derivative acid 
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HEN R'— CH 
H—N | 
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R’— CH | 
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n, c N C Figure 4.8 Some reactions of amino acids. (a) Edman reagent, 
“wt he 1 l phenylisothiocyanate, reacts with the a-amino group of an amino 
| | . acid or peptide to produce a phenylthiohydantoin (PTH) derivative. 
H O Disulfide Notably proline residues are not removed completely under these 
Cys residues in conditions. See reference at end of chapter. (b) A pair of cysteines 
two peptide chains can be oxidized to form a disulfide. 


and can be used to reveal certain details of molecular structure. Some protein-derived 
amino acids at a given pH are dextrorotatory and others are levorotatory, even 


WwW WwW 

2 i though all of them are of the L-configuration. The direction of optical rotation can be 
Xes 2 es ae - aR specified in the name by using a (+) for dextrorotatory compounds and a (—) for levo- 

Y y rotatory compounds, as in L(+)-leucine. 


Perspective drawing 


4.4b Chiral Molecules Are Described by the p,t and (R,S) Naming Conventions 


Two systems are commonly used to name and describe chiral molecules—the so- 
called D,L system and the (R, S) system. 

In the D,L system of nomenclature, the (+) and (—) isomers of glyceraldehyde are 
denoted as D-glyceraldehyde and L-glyceraldehyde, respectively (see Critical Develop- 
Fischer projections ments in Biochemistry box “Rules for Description of Chiral Centers in the (R,S) 
System” in this section). Absolute configurations of all other carbon-based molecules 
are referenced to D- and L-glyceraldehyde. When sufficient care is taken to avoid racemi- 
zation of the amino acids during hydrolysis of proteins, it is found that all of the amino 
acids derived from natural proteins are of the L-configuration. Amino acids of the 
D-configuration are nonetheless found in nature, especially as components of certain 
peptide antibiotics, such as valinomycin, gramicidin, and actinomycin D, and in the cell 
walls of certain microorganisms. 

Although the D,L system is commonly used, it can be ambiguous for molecules with 
two or more chiral centers. To address such problems, the (R,.S) system of nomenclature 
for chiral molecules was proposed in 1956 by Robert Cahn, Sir Christopher Ingold, and 
Vladimir Prelog. In this more versatile system, priorities are assigned to each of the 
groups attached to a chiral center on the basis of atomic number; atoms with higher 
atomic numbers have higher priorities. The newer (R,S) system of nomenclature is 


WwW WwW 
X | Z Z | X 
Y Y 


Figure 4.9 Enantiomeric molecules based on a chiral 
carbon atom. Enantiomers are nonsuperimposable 
mirror images of each other. 
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Critical Developments in Biochemistry 


Green Fluorescent Protein—The “Light Fantastic”’—from Jellyfish to Gene Expression 


Aequorea victoria, a species of jellyfish found in the northwest (a) 
Pacific Ocean, contains a green fluorescent protein (GFP) that 
works together with another protein, aequorin, to provide a defense 
mechanism for the jellyfish. When the jellyfish is attacked or 
disturbed, aequorin produces a blue light. This light energy is cap- 
tured by GFP, which then emits a bright green flash that presum- 
ably blinds or startles the attacker. The light-transducing capability 
of GFP is the result of a reaction between three amino acids in the 
protein itself. As shown in the diagram, adjacent serine, tyrosine, 
and glycine in the sequence of the protein react to form the pigment 
complex—termed a chromophore (figure, part (a)). No enzymes are 
required; the reaction is autocatalytic. 

Because the light-transducing function of GFP depends only 
on the protein itself GFP has quickly become a darling of genetic 
engineering laboratories. The promoter of any gene whose cellular 
expression is of interest can be fused to the DNA sequence coding 
for GFP (see Figure 12.18). 

Amino acid substitutions in GFP can tune the color of emit- 
ted light; examples include YFP, CFP, and BFP (yellow, cyan, and 
blue fluorescent protein, respectively). One remarkable use of this 
multi-colored palette of fluorescent proteins involves the “brain- 
bow” method that enables many distinct cells within a brain circuit 
to be viewed at one time (figure, part (b)). The technique involves 
random expression of combinations of three or more fluorescent 
proteins in neighboring neurons in a transgenic mouse (see figure). 
The ability of the brainbow system to label uniquely many indi- 
vidual neurons may facilitate the analysis of neuronal circuitry on 
a large scale. 


i \ 
o, S Gin 
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64 69 Ny 
HO oO 
H, o 


N `H 
_Phe 


Chromophore 


Individual neurons in a mouse 
brain appear in different colors in 
a fluorescence microscope. 
(Source: Figure 4c, Livet, J., et al., 
2007. Transgenic strategies for 
combinatorial expression of 
fluorescent proteins in the nervous 
system, Nature 450:56—62.) 


superior to the older D,L system in one important way: The configuration of molecules 
with more than one chiral center can be more easily, completely, and unambiguously 
described with (R,S) notation. Several amino acids, including isoleucine, threonine, 
hydroxyproline, and hydroxylysine, have two chiral centers. In the (R,S) system, 
L-threonine is (2S,3.R)-threonine. 


4.5 | What Are the Spectroscopic Properties 
of Amino Acids? 


One of the most important and exciting advances in modern biochemistry has been 
the application of spectroscopic methods, which measure the absorption and emission 
of energy of different frequencies by molecules and atoms. Spectroscopic studies of 
proteins, nucleic acids, and other biomolecules are providing many new insights into 
the structure and the wide range of essential motions in these molecules. 
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4.5a Phenylalanine, Tyrosine, and Tryptophan 
Absorb Ultraviolet Light 


Many details of the structure and chemistry of the amino acids have been eluci- 
dated, or at least confirmed, by spectroscopic measurements. None of the amino 
acids absorb light in the visible region of the electromagnetic spectrum. Several of 
the amino acids, however, do absorb ultraviolet radiation, and all absorb in the in- 
frared region. Only the aromatic amino acids phenylalanine, tyrosine, and trypto- 
phan exhibit significant ultraviolet absorption above 250 nm, as shown in 
Figure 4.10. These strong absorptions can be used for spectroscopic determinations 
of protein concentration. The aromatic amino acids also exhibit relatively weak 
fluorescence, and it has recently been shown that tryptophan can exhibit phospho- 
rescence—a relatively long-lived emission of light. These fluorescence and 
phosphorescence properties are especially useful in the study of protein structure 
and dynamics. 


4.5b Amino Acids Can Be Characterized 
by Nuclear Magnetic Resonance 


The development in the 1950s of nuclear magnetic resonance (NMR), a spectroscopic 
technique that involves the absorption of radio-frequency energy by certain nuclei in the 
presence of a magnetic field, played an important part in the chemical characterization 
of amino acids and proteins. Several important principles emerged from these studies. 
First, the chemical shift! of amino acid protons depends on their particular chemical 
environment and thus on the state of ionization of the amino acid. Second, the change 
in electron density during a titration is transmitted throughout the carbon chain in the 
aliphatic amino acids and the aliphatic portions of aromatic amino acids, as evidenced 
by changes in the chemical shifts of relevant protons. Finally, the magnitude of the 
coupling constants between protons on adjacent carbons depends in some cases on 
the ionization state of the amino acid. This apparently reflects differences in the 
preferred conformations in different ionization states. Proton NMR spectra of two 
amino acids are shown in Figure 4.11. Because they are highly sensitive to their envi- 
ronment, the chemical shifts of individual NMR signals can detect the pH-dependent 
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Figure 4.10 The ultraviolet absorption spectra of the aromatic amino acids at pH 6. (Source: 
Wetlaufer, D. B., 1962. Ultraviolet spectra of proteins and amino acids. Advances in Protein Chemistry 
17:303-390.) 


'The chemical shift for any NMR signal is the difference in resonant frequency between the observed 
signal and a suitable reference signal. If two nuclei are magnetically coupled, the NMR signals of these 
nuclei split, and the separation between such split signals, known as the coupling constant, is likewise 
dependent on the structural relationship between the two nuclei. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1-Alanine 
COOH 
e | 
H,N—C—H 
| 
CH; 


Relative intensity 


BSS Ss ag ee ee ee Se ee 
10 9 8 7 6 5 4 3 2 1 0 
ppm 


4.5 What Are the Spectroscopic Properties of Amino Acids? 93 


Relative intensity 
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Figure 4.11 Proton NMR spectra of several amino acids. Zero on the chemical shift scale is 
defined by the resonance of tetramethylsilane (TMS). (The large resonance at approximately 
5 ppm is due to the normal HDO impurity in the D20 solvent.) (Adapted from Aldrich Library 


of NMR Spectra.) 


Critical Developments in Biochemistry 


Rules for Description of Chiral Centers in the (R,S) System 


Naming a chiral center in the (R, S) system is accomplished by view- 
ing the molecule from the chiral center to the atom with the lowest 
priority. If the other three atoms facing the viewer then decrease in 
priority in a clockwise direction, the center is said to have the (R) 
configuration (where R is from the Latin rectus, meaning “right”). 
If the three atoms in question decrease in priority in a counterclock- 
wise fashion, the chiral center is of the (S) configuration (where S 
is from the Latin sinistrus, meaning “left”). If two of the atoms co- 
ordinated to a chiral center are identical, the atoms bound to these 


CHO OH 
CH,OH OHC ~—7 CHOH 
L-Glyceraldehyde (S)-Glyceraldehyde 


COOH 


+ 2 


L-Alanine 


— > 


two are considered for priorities. For such purposes, the priorities 
of certain functional groups found in amino acids and related mol- 
ecules are in the following order: 


SH > OH > NH, > COOH > CHO > CH,OH > CH; 


From this, it is clear that D-glyceraldehyde is (R)-glyceraldehyde 
and L-alanine is (S)-alanine (see figure). Interestingly, the a-carbon 
configuration of all the L-amino acids except for cysteine is (S). 
Cysteine, by virtue of its thiol group, is, in fact, (R)-cysteine. 


HOH;C = ‘CHO 


CHO 


H= C—O0H —W—> 


mi O me 


CH,OH 


p-Glyceraldehyde (R)-Glyceraldehyde 


+ 
NH; 


-00C —7°CH, 


(S)-Alanine 


The assignment of (R) and (S) notation for glyceraldehyde and L-alanine. 
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ionizations of amino acids and other aspects of their molecular environment. More 
sophisticated NMR measurements at very high magnetic fields are also used to deter- 
mine the three-dimensional structures of peptides and proteins. Kurt Wiithrich 
received the 2002 Nobel Prize in Chemistry for developing NMR methods for deter- 
mining protein structure. 


4.6 | How Are Amino Acid Mixtures 
Separated and Analyzed? 


The characterization of newly discovered proteins depends on analyses that reveal 
their amino acid compositions. Biochemists employ a wide variety of techniques to 
elucidate such information. 


4.6a Amino Acids Can Be Separated by Chromatography 


A wide variety of methods is available for the separation and analysis of amino acids 
(and other biological molecules and macromolecules). All of these methods take ad- 
vantage of the relative differences in the physical and chemical characteristics of 
amino acids, particularly ionization behavior and solubility characteristics. Separa- 
tions of amino acids are usually based on partition properties (the tendency to associ- 
ate with one solvent or phase over another) and separations based on electrical charge. 
In all of the partition methods discussed here, the molecules of interest are allowed 
(or forced) to flow through a medium consisting of two phases—solid—liquid, liquid— 
liquid, or gas—liquid. The molecules partition, or distribute themselves, between the 
two phases in a manner based on their particular properties and their consequent 
preference for associating with one or the other phase. 

In 1903, a separation technique based on repeated partitioning between phases was 
developed by Mikhail Tsvet for the separation of plant pigments (carotenes and 
chlorophylls). Due to the colorful nature of the pigments thus separated, Tsvet called 
his technique chromatography. This term is now applied to a wide variety of separation 
methods. The success of many chromatography techniques depends on the different 
affinities of molecules in a mixture for two different phases—for example, solid and 
liquid, or liquid and gas—a phenomenon termed “partitioning.” The more frequently 
this partitioning can be made to occur within a given time span or over a given volume, 
the more efficient is the resulting separation. Chromatographic methods have advanced 
rapidly in recent years, due in part to the development of sophisticated new solid-phase 
materials. Methods important for amino acid separations include ion-exchange chro- 
matography, gas chromatography (GC), and high-performance liquid chromatography 
(HPLC). 

A typical HPLC chromatogram using precolumn modification of amino acids to 
form phenylthiohydantoin (PTH) derivatives is shown in Figure 4.12. HPLC is the chro- 
matographic technique of choice for most modern biochemists. The very high resolution, 
excellent sensitivity, and high speed of this technique usually outweigh the disadvantage 
of relatively low capacity. 


4.7 | What Is the Fundamental Structural 
Pattern in Proteins? 


Proteins play an essential role in nearly all aspects of cell structure and function. 
Chemically, proteins are unbranched polymers of amino acids linked head to tail, 
from carboxyl group to amino group, through the formation of covalent peptide 
bonds, a type of amide linkage (Figure 4.13). 

Peptide bond formation results in the release of H,O. The peptide “backbone” of a 
protein consists of the repeated sequence —N—C,—C,—, where the N represents the 
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Figure 4.12 Gradient separation of common PTH-amino acids, which absorb UV light. Absorbance 
was monitored at 269 nm. PTH peaks are identified by single-letter notation for amino acid residues 
and by other abbreviations. D, Asp; CMC, carboxymethyl Cys; E, Glu; N, Asn; S, Ser; Q, Gln; H, 
His; T, Thr; G, Gly; R, Arg; MO;, Met sulfoxide; A, Ala; Y, Tyr; M, Met; V, Val; P, Pro; W, Trp; K, 
Lys; F, Phe; I, Ile; L, Leu. See Figure 4.8a for PTH derivatization. (Adapted from Persson, B., and 
Eaker, D., 1990. An optimized procedure for the separation of amino acid phenylthiohydantoins by 
reversed phase HPLC. Journal of Biochemical and Biophysical Methods 21:341-350.) 


+ 
+ HN 


Amino acid 1 Amino acid 2 Dipeptide 


Figure 4.13 Peptide formation is the creation of an amide bond between the carboxyl group of one 
amino acid and the amino group of another amino acid. 


amide nitrogen, the C, is the a-carbon atom of an amino acid in the polymer chain, and 
the final C, is the carbonyl carbon of the amino acid, which in turn is linked to the 
amide N of the next amino acid down the line. The geometry of the peptide backbone 
is shown in Figure 4.14. Note that the carbonyl oxygen and the amide hydrogen are trans 
to each other in this figure. This conformation is favored energetically because it results 
in less steric hindrance between nonbonded atoms in neighboring amino acids. Because 
the a-carbon atom of the amino acid is a chiral center (in all amino acids except glycine), 
the polypeptide chain is inherently asymmetric. 


4.7a The Peptide Bond Has Partial Double-Bond Character 


The peptide linkage is usually portrayed by a single bond between the carbonyl car- 
bon and the amide nitrogen (Figure 4.15a). Therefore, in principle, rotation may occur 
about any covalent bond in the polypeptide backbone because all three kinds of 
bonds (N—C,, C,—C,, and the C,—N peptide bond) are single bonds. In this 
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Figure 4.14 The peptide bond is shown in its usual trans conformation of carbonyl O and amide H. 
The C, atoms are the a-carbons of two adjacent amino acids joined in peptide linkage. The 
dimensions and angles are the average values observed by crystallographic analysis of amino acids 
and small peptides. The peptide bond is the light-colored bond between C and N. (Adapted from 
Ramachandran, G. N., et al., 1974. The mean geometry of the peptide unit from crystal structure 
data. Biochimica et Biophysica Acta 359:298-302.) 
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A pure double bond between C 
and O would permit free rotation 
around the C—N bond. 
(b) 
Cc, H 
\ +7 
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This resonance form would prohibit 
C—N bond rotation but would 
place too great a charge on O and N. 
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The true electron density is intermediate. 


Figure 4.15 The partial double-bond character of the peptide bond. Resonance interactions among 
the carbon, oxygen, and nitrogen atoms of the peptide group can be represented by two resonance 
extremes (a and b). (a) Left - The way the peptide atoms are normally drawn. (b) In an equally 
feasible form, the peptide bond is now a double bond; the amide N bears a positive charge and the 
carbonyl O has a negative charge. (c) The actual peptide bond is best described as a resonance hybrid 
of the forms in (a) and (b). Significantly, all of the atoms associated with the peptide group are 
coplanar, rotation about C,—N is restricted, and the peptide is distinctly polar. 
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representation, the C, and N atoms of the peptide group are both in planar sp? hy- 
bridization and the C, and O atoms are linked by a m bond, leaving the nitrogen with 
a lone pair of electrons in a 2p orbital. However, another resonance form for the pep- 
tide bond is feasible in which the C, and N atoms participate in a 7 bond, leaving a 
lone e~ pair on the oxygen (Figure 4.15b). This structure prevents free rotation about 
the C,—N peptide bond because it becomes a double bond. The real nature of the 
peptide bond lies somewhere between these extremes; that is, it has partial double- 
bond character, as represented by the intermediate form shown in Figure 4.15c. 

Peptide bond resonance has several important consequences. First, it restricts free 
rotation around the peptide bond and leaves the peptide backbone with only two ro- 
tational degrees of freedom per amino acid group: rotation around the N—C, bond 
and rotation around the C,—C, bond. Second, the six atoms composing the peptide 
bond group tend to be coplanar, forming the so-called amide plane of the polypeptide 
backbone (Figure 4.16). Third, the C,—N bond length is 0.133 nm, which is shorter 
than normal C—N bond lengths (for example, the C,—N bond of 0.145 nm) but lon- 
ger than typical C=N bonds (0.125 nm). Based on this measurement, the peptide 
bond is estimated to have 40% double-bond character. The barrier to C—N bond 
rotation of about 88 kJ/mol is enough to keep the amide group planar. 


4.7b The Polypeptide Backbone Is Relatively Polar 


Peptide bond resonance also causes the peptide backbone to be relatively polar. As 
shown in Figure 4.15b, the amide nitrogen is in a protonated or positively charged 
form, and the carbonyl oxygen is a negatively charged atom in this double-bonded 
resonance state. In actuality, the hybrid state of the partially double-bonded peptide 
arrangement gives a net positive charge of 0.28 on the amide N and an equivalent net 
negative charge of 0.28 on the carbonyl O. The presence of these partial charges 
means that the peptide bond has a permanent dipole. Nevertheless, the peptide back- 
bone is relatively unreactive chemically, and protons are gained or lost by the peptide 
groups only at extreme pH conditions. 


4.7c Peptides Can Be Classified According to How Many Amino 
Acids They Contain 
Peptides are short polymers of amino acids. They are classified according to the num- 


ber of amino acid units in the chain. Each unit is called an amino acid residue. The 
term describes what is left after the release of HO when an amino acid forms a 


a-Carbon 


a-Carbon 


Figure 4.16 The coplanar relationship of the atoms in the amide group is highlighted as an imaginary 
shaded plane lying between two successive a-carbon atoms in the peptide backbone. 
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peptide link upon joining the peptide chain. Dipeptides have two amino acid residues, 
tripeptides have three, tetrapeptides four, and so on. After about 12 residues, this 
terminology becomes cumbersome, so peptide chains of more than 12 and less than 
about 20 amino acid residues are usually referred to as oligopeptides. When the chain 
exceeds several dozen amino acids in length, the term polypeptide is used. The distinc- 
tions between these terms are not precise. 


4.74 The Amino Acids in Polypeptides Can Be Released 
by Acid Hydrolysis 


The peptide bonds of polypeptides and proteins can be hydrolyzed by strong acid to 
liberate the individual amino acids. Typically, a sample of the protein is treated with 
6 N HCI for 24 hours at 110°C in a sealed vial. Tryptophan is destroyed by acid and 
must be estimated by other means (usually UV spectrophotometry; see Figure 4.10) to 
determine its contribution to the total amino acid composition. The OH-containing 
amino acids serine and threonine are slowly destroyed. In contrast, peptide bonds 
involving hydrophobic residues such as valine and isoleucine are only slowly hydro- 
lyzed in acid. Another complication arises because the B- and y- amide linkages in 
asparagine (Asn) and glutamine (Gln) are acid labile. The side-chain amino nitrogen 
is released as free ammonium, and all of the Asn and Gln residues of the protein are 
converted to aspartic acid (Asp) and glutamic acid (Glu), respectively. The amount of 
ammonium released during acid hydrolysis gives an estimate of the total number of 
Asn and Gln residues in the original protein, but not the amounts of either. 

The complex amino acid mixture in the hydrolysate obtained after digestion of a 
protein in 6 M HCI can be separated into the component amino acids by using either 
ion-exchange chromatography or reversed-phase high-pressure liquid chromatogra- 
phy (HPLC) (see Figure 4.12 and the A Deeper Look box in Chapter 5: Techniques 
Used in Protein Purification). The amount of each amino acid can then be deter- 
mined. Amino acid analysis provides no information on the order or sequence of 
amino acid residues in the polypeptide chain. 

Amino acids almost never occur in equimolar ratios in proteins, indicating that 
proteins are not composed of repeating arrays of amino acids. There are a few excep- 
tions to this rule. Collagen, for example, contains large proportions of glycine and 
proline, and much of its structure is composed of (Gly-x-Pro) repeating units, where x 
is any amino acid. Other proteins show unusual abundances of various amino acids. 
For example, histones are rich in positively charged amino acids such as arginine and 
lysine. Histones are a class of proteins found associated with the anionic phosphate 
groups of eukaryotic DNA. 


4.7e Proteins Are Composed of One or More Polypeptide Chains 


The term protein broadly defines molecules composed of one or more polypeptide 
chains. The terms polypeptide and protein are used interchangeably in discussing 
single polypeptide chains. Proteins with one polypeptide chain are monomeric pro- 
teins. Proteins composed of more than one polypeptide chain are multimeric proteins. 
Multimeric proteins may contain only one kind of polypeptide, in which case they 
are homomultimeric, or they may be composed of several different kinds of polypep- 
tide chains, in which instance they are heteromultimeric. Greek letters and subscripts 
are used to denote the polypeptide composition of multimeric proteins. Thus, an 
a@-type protein is a dimer of identical polypeptide subunits, or a homodimer. For 
example, hemoglobin (Table 4.2), which consists of four polypeptides of two different 
kinds, is an œ2ß2 heteromultimer. 

Polypeptide chains of proteins typically range in length from about 100 amino acids 
to around 2000, the number found in each of the two polypeptide chains of myosin, the 
contractile protein of muscle. However, exceptions abound, including human cardiac 
muscle titin, which has 26,926 amino acid residues and a molecular weight of 2,993,497. 
The average molecular weight of polypeptide chains in eukaryotic cells is about 31,700, 
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Size of Protein Molecules* 


Protein M, Number of Residues Subunit 
per Chain Organization 
Insulin (bovine) 5733 21 (A) aß 
30 (B) 
Cytochrome c (equine) 12,500 104 a 
Ribonuclease A 12,640 124 æ 
(bovine pancreas) 
Lysozyme (egg white) 13,930 129 a 
Myoglobin (horse) 16,980 153 ay 
Chymotrypsin 22,600 13 (a) apy 
(bovine pancreas) 132 (8) 
97 (y) 
Hemoglobin (human) 64,500 141 (a) aß 
146 (B) 
Serum albumin 68,500 550 a 
(human) 
Hexokinase (yeast) 96,000 200 a4 
y-Globulin (horse) 149,900 214 (a) aß 
446 (B) 
Glutamate 332,694 500 a6 
dehydrogenase (liver) 
Myosin (rabbit) 470,000 2000 (heavy, /) haa’ >B> 
190 (a) 
149 (a’) 
160 (8) 
Ribulose bisphosphate 560,000 475 (a) agbs 
carboxylase (spinach) 123 (8) 
Glutamine synthetase 600,000 468 æ 
(E. coli) 
Insulin 


Cytochrome ¢ Ribonuclease i 
Lysozyme Myoglobin 


Hemoglobin 


Immunoglobulin 


Glutamine synthetase 


*|Ilustrations of selected proteins listed in Table 4.2 are drawn to constant scale. 
Adapted from Goodsell, D. S., and Olson, A. J., 1993. Soluble proteins: Size, shape and function. Trends in Biochemical 
Sciences 18:65-68. 
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corresponding to about 270 amino acid residues. Table 4.2 is a representative list of 
proteins according to size. The molecular weights (M,) of proteins can be estimated by 
a number of physicochemical methods such as polyacrylamide gel electrophoresis or 
ultracentrifugation (see A Deeper Look box “Techniques Used in Protein Purification” 
in Section 5.2). Protein molecular masses can be determined precisely by a simple cal- 
culation based on knowledge of their amino acid sequence, which is often available in 
genome databases. No simple generalizations correlate the size of proteins with their 
functions. For instance, the same function may be fulfilled in different cells by proteins 
of different molecular weight. The Escherichia coli enzyme responsible for glutamine 
synthesis (a protein known as glutamine synthetase) has a molecular weight of 600,000, 
whereas the analogous enzyme in brain tissue has a molecular weight of 380,000. 


Foundational Biochemistry Things You Should Know after Reading Chapter 4. 


The common amino acids contain a central a-carbon covalently 
linked to an amino group, a carboxyl group, a hydrogen, and a side 
chain, also known as an R group. 


The amino group of one amino acid and the carboxyl group of an- 
other amino acid can link together covalently to form an amide bond, 
also termed a peptide bond. 


The nonpolar amino acids include alanine, isoleucine, leucine, me- 
thionine, phenylalanine, proline, tryptophan, and valine. 


The polar, uncharged amino acids include asparagine, cysteine, gluta- 
mine, glycine, serine, threonine, and tyrosine. 


The acidic amino acids include aspartic acid and glutamic acid. 
The basic amino acids include arginine, histidine, and lysine. 


The common amino acids are referred to in proteins by their three- 
letter and one-letter codes. 


Amino acids that occur less commonly in proteins include selenocys- 
teine, hydroxylysine, y-carboxyglutamate, and pyroglutamate. 


The pK, values of the a-carboxyl groups in amino acids are approxi- 
mately 2. 


The pKa values of the a-amino groups in amino acids are approxi- 
mately 9. 


The pK, values of the side chains in amino acids are 3.9, 4.3, 6.0, 8.3, 
10.1, 10.5, 12.5, 13, and 13 for Asp, Glu, His, Cys, Tyr, Lys, Arg, Ser, 
and Thr, respectively. 


The a-amino groups of amino acids react with phenylisothiocyanate 
(Edman’s reagent) to form PTH derivatives. 


e All the common amino acids are chiral molecules. All the amino acids 
in natural proteins are of the L-configuration. 

e Phenylalanine, tyrosine, and tryptophan absorb ultraviolet radiation. 
Absorbance at 280 nm due to tyrosine and tryptophan is diagnostic 
for the presence of protein in an unknown solution and can be used 
to estimate the concentration of protein. 

e Amino acids can be characterized with NMR spectroscopy. Multidi- 
mensional NMR studies can be used to determine the structures of 
peptides and proteins in solution. 

e Mixtures of amino acids can be separated and characterized using 
various chromatography methods. 

e Proteins are unbranched polymers of amino acids linked head-to-tail 
through the formation of peptide bonds. 

e The peptide bond has approximately 40% double-bond character. 

e The six atoms comprising the peptide bond group are coplanar and 
form the so-called peptide plane. 

e Peptides consisting of 20 or so amino acids are termed oligopeptides. 
Longer peptides are termed polypeptides. 

e The polypeptide backbone is relatively polar. 

e Proteins with one polypeptide chain are monomeric proteins. Pro- 
teins composed of more than one polypeptide chain are multimeric 
proteins. 

e 


Multimeric proteins composed of only one kind of polypeptide are 
homomultimers. Proteins composed of two or more different poly- 
peptides are heteromultimers. 
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Problems 101 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Drawing Fischer Projection Formulas for Amino Acids Without 
consulting chapter figures, draw Fischer projection formulas for 
glycine, aspartate, leucine, isoleucine, methionine, and threonine. 


2. Knowing Abbreviations for Amino Acids Without reference to the text, 
give the one-letter and three-letter abbreviations for asparagine, 
arginine, cysteine, lysine, proline, tyrosine, and tryptophan. 


3. Recognizing Peptide Structures Consider the peptide structures shown 
and give the sequence of amino acids (from N-terminal to 
C-terminal) in these peptides. 


(0) oO o 
|l H | H | 
C—N—CH—C—N—CH—C—OH 
T . j CH—OH CH; 
H,N* — CH— C—N—CH—C—N i 
CHo CHo 
| | 
CH—CH, C=O Ss 
| | 
CH; OH OH 
Hy 
Hc E “çh 
Ho n ~nn 
C~.¢ C 
~oc 
HC=( \ 
ane NH “So 
Nc” HN 
H \ 
on 7 oh 
CH; Cc \ NH, 
HO,, 7 `O \__ HC o’ 
CH \ NH í 
\ Geri \ 
CH 7 CH O 
HOW,” N \ 
C H CH, 
\ 
(0) 


4. Recognizing Zwitterions Consider the structure shown. 
a. Would you consider this a zwitterion or not? Explain. 


b. If not, would the pH have to increase or decrease to make it a 
zwitterion? Explain. 


5. Writing Dissociation Equations for Amino Acids Write equations for 
the ionic dissociations of alanine, glutamate, histidine, lysine, and 
phenylalanine. 


6. Understanding Chemical Effects on Amino Acid pK, Values How is the 
pK, of the a-NH;* group affected by the presence on an amino acid 
of the a-COO~? 


7. pH and Charges in Amino Acid Side Chains Consider the structure of 
histidine shown in Figure 4.3. One of the two nitrogen atoms in the 
side chain can act as a proton donor/acceptor and the pK, value has 
been determined as 6.0. 


a. Draw a titration curve for histidine. 


b. Using the Henderson-Hasselbalch equation from Chapter 2, 
calculate the ratio of the charged and uncharged states of the 
side chain of histidine at pH 7.4. 


8. Drawing Titration Curves for the Amino Acids (Integrates with Chapter 2.) 
Draw an appropriate titration curve for aspartic acid, labeling the axes 
and indicating the equivalence points and the pK, values. 


9. Calculating Concentrations of Species in Amino Acid Solutions 
(Integrates with Chapter 2.) Calculate the concentrations of all ionic 
species in a 0.25 M solution of histidine at pH 2, pH 6.4, and pH 9.3. 


10. Calculating pH in Amino Acid Solutions | (Integrates with Chapter 2.) 
Calculate the pH at which the y-carboxyl group of glutamic acid is 
two-thirds dissociated. 


11. Calculating pH in Amino Acid Solutions II (Integrates with Chapter 2.) 
Calculate the pH at which the e-amino group of lysine is 20% 
dissociated. 


12. Calculating pH in Amino Acid Solutions III (Integrates with 
Chapter 2.) Calculate the pH of a 0.3 M solution of (a) valine 
hydrochloride, (b) sodium valinate, and (c) isoelectric valine. 


13. Understanding Stereochemical Transformations of Amino Acids 
Absolute configurations of the amino acids are referenced to D- and 
L-glyceraldehyde on the basis of chemical transformations that can 
convert the molecule of interest to either of these reference isomeric 
structures. In such reactions, the stereochemical consequences for the 
asymmetric centers must be understood for each reaction step. 
Propose a sequence of reactions that would demonstrate that 
L(—)-serine is stereochemically related to L(—)-glyceraldehyde. 


14. Understanding Amino Acid Stereochemistry Describe the stereochemi- 
cal aspects of the structure of cystine, the structure that is a disulfide- 
linked pair of cysteines. 


15. Amino Acids and Stereochemistry The genetic code is a system in 
which each amino acid in a protein is determined by the sequence of 
nucleotides in DNA. Our genome codes for the 20 amino acids 
considered in this chapter. If we used all possible isoforms 
(stereoisomers) for each amino acid, how many amino acids would 
need to be encoded in our genome in total? 


16. Understanding Amino Acid Reaction Mechanisms Draw a simple 
mechanism for the reaction of a cysteine sulfhydryl group with 
iodoacetamide. 


17. Determining Tyrosine Content of an Unknown Protein A previously 
unknown protein has been isolated in your laboratory. Others in your 
lab have determined that the protein sequence contains 172 amino 
acids. They have also determined that this protein has no tryptophan 
and no phenylalanine. You have been asked to determine the possible 
tyrosine content of this protein. You know from your study of this 
chapter that there is a relatively easy way to do this. You prepare a 
pure 50 mM solution of the protein, and you place it in a sample cell 
with a l-cm path length, and you measure the absorbance of this 
sample at 280 nm in a UV-visible spectrophotometer. The absorbance 
of the solution is 0.372. Are there tyrosines in this protein? How 
many? (Hint: You will need to use Beer’s law, which is described in 
any good general chemistry or physical chemistry textbook. You will 
also find it useful to know that the units of molar absorptivity are 
M~'cm7!.) 
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A Rule of Thumb for Amino Acid Content in Proteins The simple 
average molecular weight of the 20 common amino acids is 138, but 
most biochemists use 110 when estimating the number of amino acids 
in a protein of known molecular weight. Why do you suppose this is? 
(Hint: There are two contributing factors to the answer. One of them 
will be apparent from a brief consideration of the amino acid 
compositions of common proteins. See, for example, Figure 5.16 of 
this text.) 


Understanding the Chemistry of the Dipeptide Sweetener Aspartame 
The artificial sweeteners Equal and NutraSweet contain aspartame, 
which has the structure: 


COs 
| 
CHo O CHo O 
TNE | | 
HN — CH— C— NH— CH— C— OCH, 
Aspartame 


What are the two amino acids that are components of aspartame? 
What kind of bond links these amino acids? What do you suppose 
might happen if a solution of aspartame was heated for several hours 
at a pH near neutrality? Suppose you wanted to make hot chocolate 
sweetened only with aspartame, and you stored it in a thermos for 
several hours before drinking it. What might it taste like? 


Understanding a Defect of Amino Acid Metabolism Individuals with 
phenylketonuria must avoid dietary phenylalanine because they are 
unable to convert phenylalanine to tyrosine. Look up this condition 
and find out what happens if phenylalanine accumulates in the body. 
Would you advise a person with phenylketonuria to consume foods 
sweetened with aspartame? Why or why not? 


Distinguishing Prochiral Isomers In this chapter, the concept of 
prochirality was discussed. Citrate (see Figure 19.2) is a prochiral 
molecule. Describe the process by which you would distinguish 
between the (R—) and (S—) portions of this molecule and how an 
enzyme could discriminate between similar but distinct moieties. 


Understanding Buffer Capacity of Amino Acids Amino acids are 
frequently used as buffers. Describe the pH range of acceptable 
buffering behavior for the amino acids alanine, histidine, aspartic 
acid, and lysine. 


Determining Absolute Configuration in Amino Acids The different 
isomeric forms of chiral molecules are remarkably similar in their 
structures and physical properties. How can the different isomeric 
forms be characterized and distinguished? Consult suitable references 
to learn about the work of Jean-Baptiste Biot and Louis Pasteur that 
first revealed evidence for “optical activity.” Read also about the work 
of Emil Fischer that came very close to determining the correct 
structure of natural glucose. Finally, look up the work of J. M. 
Bijvoet that confirmed Emil Fischer’s determination of the structure 
of glucose. 


The Origin of p,. Nomenclature Some chemists and biochemists 
mistakenly attribute the system of D,L nomenclature to Emil Fischer. 
Consult suitable reference works to learn who actually devised the D,L 
system for naming chiral molecules. 


The Advantage of Replacing Cysteine with Selenocysteine in a Redox 
Enzyme More than a hundred types of selenocysteine-containing 


proteins have been characterized. One of these is the enzyme named 
methionine-S-sulfoxide reductase. In several of these, the replacement 
of cysteine with selenocysteine confers a catalytic advantage to the 
enzyme. What is the chemical basis for this catalytic advantage? 
Consult a suitable reference to explain how this works. (one good 
reference: Kim, H.-Y., Fomenko, D. E., Yoon, Y.-E., and 

Gladyshev, V. N., 2006. Biochemistry 45:13697-13704.) 


The Murchison Meteorite—a Possible Explanation for L-lsomer 
Predominance in Earthly Amino Acids The predominance of L-amino 
acids in biological systems is one of life’s intriguing features. Hints as 
to the origins of this preference have come to Earth in the form of 
meteorites that contain enantiomeric excesses of particular amino 
acids. One of these is the famous Murchison meteorite that fell in 
Murchison, Australia in 1969, but many others have now also been 
studied. Consult suitable references to learn more about these events 
and their analysis and summarize what is known about enantiomeric 
preferences in interstellar objects such as meteorites. (Good 
references: Elsila, J. E., et al., 2016. Meteoric amino acids: diversity in 
compositions reflects parent body histories. ACS Central Science 
2:370-379; Pizzarello, S., 2006. The chemistry of life’s origin: a 
carbonaceous meteorite perspective. Accounts of Chemical Research 
39:23 1-237; Schmitt-Kopplin, P., et al., 2011. High molecular 
diversity of extraterrestrial organic matter in Murchison meteorite 
revealed 40 years after its fall. Proceedings of the National Academy of 
Sciences, USA 107:2763-2768.) 


. Although the other common amino acids are used as buffers, cysteine 
is rarely used for this purpose. Why? 


26. 


28. Draw all the possible isomers of threonine and assign (R, S) nomen- 


clature to each. 


Think-Pair-Share Question 


Amino Acid Analysis Using RP-HPLC A student was tasked with the 
development of a new chromatography protocol to analyze amino 
acids (see Section 4.6a). The student was asked to evaluate the proto- 
col by analyzing all twenty amino acids at least twice. Faced with the 
prospect of having to perform forty analyses, the student took a 
“combinatorial” approach to the request. The results of the combina- 
torial approach as applied to nine different amino acids is shown be- 
low. Six mixtures were prepared containing three different amino acids 
and analyzed. The mixtures and the observed retention times are listed 
in the table. 


Mixture Amino acids present Retention times (min) 
l Phe Asp Pro 1.08 2.78 3.82 
II Met Lys Ser 2.29 3.46 4.46 
IlI Ala Val Leu 2.89 3.33 367 
IV Val Asp Lys 1.11 335 3:67 
V Pro Ser Leu 2.31 2.79 3.68 
VI Met Phe Ala 2.91 3.47 3.83 


For each mixture the retention times cannot be assigned to any indi- 
vidual amino acid present. However, the student claims that the six 
analyses resulted in a duplicate estimate of each amino acid’s retention 
time. With a fellow student, determine how this system works and 
what the retention time results were for each amino acid. 
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Proteins: Their Primary 
Structure and Biological 
Functions 


Essential Questions 


Proteins are polymers composed of hundreds or even thousands of amino acids linked in 
series by peptide bonds. What structural forms do these polypeptide chains assume, how 
can the sequence of amino acids in a protein be determined, and what are the biological 
roles played by proteins? 


roteins are a diverse and abundant class of biomolecules, making up more than 

50% of the dry weight of cells. Their diversity and abundance reflect the central 
role of proteins in virtually all aspects of cell structure and function. An extraordi- 
nary repertoire of cellular activity is possible only because of the versatility inherent 
in proteins, each of which is specifically tailored to its biological role. The pattern by 
which each is tailored resides within the genetic information of cells, encoded in a 
specific sequence of nucleotide bases in DNA. Each such segment of encoded infor- 
mation defines a gene, and expression of the gene leads to synthesis of the specific 
protein encoded by it, endowing the cell with the functions unique to that particular 
protein. Proteins are the agents of biological function; they are also the expressions of 
genetic information. 


5.1 | What Architectural Arrangements 
Characterize Protein Structure? 


Proteins are among the most complex molecular structures known. How can their 
structural properties be dissected and understood? First, protein structures can be 
classified according to their overall shape and solubility. Then, deeper inspection 
reveals that protein structures can be described by organizational patterns based on 
the amino acid sequence of their polypeptide chains and on how noncovalent interac- 
tions lead these chains to fold into unique three-dimensional architectures. 


...by small and simple things are great things 
brought to pass. 


ALMA 37.6 
The Book of Mormon 


Helices, which sometimes appear as decorative or 
utilitarian motifs in manmade structures, are a common 
structural theme in biological macromolecules— 
proteins, nucleic acids, and even polysaccharides. 


Key Questions 


5.1 What Architectural Arrangements 
Characterize Protein Structure? 


5.2 How Are Proteins Isolated and 
Purified from Cells? 


5.3 How Is the Primary Structure of 
a Protein Determined? 

5.4 What Is the Nature of Amino Acid 
Sequences? 

5.5 Can Polypeptides Be Synthesized in 
the Laboratory? 

5.6 Do Proteins Have Chemical Groups 
Other Than Amino Acids? 

5.7 What Are the Many Biological 
Functions of Proteins? 


5.8 What Is the Proteome and What 
Does It Tell Us? 
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(a) 


Collagen, 
a fibrous protein 


Myoglobin, a globular protein 


Bacteriorhodopsin, a membrane protein 


Figure 5.1 Three classes of proteins. (a) Collagen, a major component of animal connective tissue, is 
the most abundant protein in mammals. Proteins having structural roles in cells are typically fibrous 
and often water insoluble. (b) Myoglobin is a globular protein. (c) Membrane proteins fold so that 
hydrophobic amino acid side chains are exposed in their membrane-associated regions. 
Bacteriorhodopsin binds the light-absorbing pigment, cis-retinal, shown here in blue. 


5.la Proteins Fall into Three Basic Classes 
According to Shape and Solubility 


As a first approximation, proteins can be assigned to one of three global classes on 
the basis of shape and solubility: fibrous, globular, or membrane (Figure 5.1). Fibrous 
proteins tend to have relatively simple, regular linear structures. These proteins often 
serve structural roles in cells. Typically, they are insoluble in water or in dilute salt 
solutions. In contrast, globular proteins are roughly spherical in shape. The polypep- 
tide chain is compactly folded so that hydrophobic amino acid side chains are in the 
interior of the molecule and the hydrophilic side chains are on the outside exposed to 
the solvent, water. Consequently, globular proteins are usually very soluble in aque- 
ous solutions. Most soluble proteins of the cell, such as the cytosolic enzymes, are 
globular in shape. Membrane proteins are found in association with the various mem- 
brane systems of cells. For interaction with the nonpolar phase within membranes, 
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5.1 What Architectural Arrangements Characterize Protein Structure? 


membrane proteins have hydrophobic amino acid side chains oriented outward. As 
such, membrane proteins are insoluble in aqueous solutions but can be solubilized in 
solutions of detergents. Membrane proteins characteristically have fewer hydrophilic 
amino acids than cytosolic proteins. 


5.1b Protein Structure Is Described in Terms 
of Four Levels of Organization 


The architecture of protein molecules is quite complex. Nevertheless, this complexity 
can be resolved by defining various levels of structural organization. 


Primary Structure The amino acid sequence is, by definition, the primary (1°) structure 
of a protein. For example, the primary structure of bovine pancreatic R Nase is shown 
in Figure 5.2. The amino acid sequence of a protein is its defining feature. All other 
structural properties, as well as the protein’s function, follow from its primary 
structure. 


Secondary Structure Through hydrogen-bonding interactions between adjacent 
amino acid residues, the polypeptide chain can arrange itself into characteristic heli- 
cal or pleated segments. These segments constitute structural conformities, so-called 
regular structures, which extend along one dimension, like the coils of a spring. Such 
architectural features of a protein are designated secondary (2°) structures (Figure 5.3). 
Secondary structures are just one of the higher levels of structure that represent the 
three-dimensional arrangement of the polypeptide in space. 


50 


Figure 5.2 Bovine pancreatic ribonuclease A is this specific sequence of 124 amino acid residues, 
none of which are tryptophan. Four intrachain disulfide bridges (S—S) form crosslinks in this 
polypeptide between Cys% and Cyst, Cys*? and Cys, Cys: and Cys!!°, and Cys® and Cys”. 
Amino acids with nonpolar R groups are in orange, those with polar uncharged R groups are in 
green, with positively charged R groups in blue, and with negatively charged R groups, red. 
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a-Helix B-Strand 

Only the N — Cy — C backbone The N — Cy — C backbone as well 

is represented. The vertical line as the Cg of R groups are represented 
is the helix axis. here. Note that the amide planes 


are perpendicular to the page. 


Symbolic representation of an a-helix Symbolic representation of a B-strand 


Figure 5.3 The a-helix and the B-pleated strand are 
the two principal secondary structures found in 


proteins. Simple representations of these structures 
are a flat, helical ribbon or a cylinder for the a-helix 


and a flat, wide arrow for B-structures. 


(a) Chymotrypsin tertiary structure 


Chymotrypsin ribbon 


(c) 


Figure 5.4 Folding of the polypeptide chain into a compact, roughly spherical conformation creates the 
tertiary level of protein structure. Shown here are three different representations of the tertiary structure 
of the protein chymotrypsin: (a) A tracing showing the position of all of the C, carbon atoms, (b) a 
ribbon diagram that shows the three-dimensional track of the polypeptide chain, and (c) a space-filling 


representation of the atoms as spheres. Chymotrypsin space-filling model 
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Tertiary Structure When the polypeptide chains of protein molecules bend and B-Chains Heme 
fold in order to assume a more compact three-dimensional shape, the tertiary 
(3°) level of structure is generated (Figure 5.4). It is by virtue of their tertiary structure 
that proteins adopt a globular shape. A globular conformation gives the lowest 
surface-to-volume ratio, minimizing interaction of the protein with the surrounding 
environment. 


Quaternary Structure Many proteins consist of two or more interacting polypeptide 
chains of characteristic tertiary structure, each of which is commonly referred to 
as a subunit of the protein. Subunit organization constitutes another level in the 
hierarchy of protein structure, defined as the protein’s quaternary (4°) structure 
(Figure 5.5). The protein’s quaternary structure includes the various kinds of sub- 
units within a protein molecule, the number of each, and the ways in which they 
interact with one another. 


a-Chains 
5.1c Noncovalent Forces Drive Formation of the Higher Orders Figure 5.5 Hemoglobin is a tetramer consisting of 
of Protein Structure two a and two B polypeptide chains. 


Whereas the primary structure of a protein is determined by the covalently linked 
amino acid residues in the polypeptide backbone, secondary and higher orders of 
structure are determined principally by noncovalent forces such as hydrogen bonds 
as well as ionic, van der Waals, and hydrophobic interactions. It is important to em- 
phasize that all the information necessary for a protein molecule to achieve its intricate 
architecture is contained within its 1° structure, that is, within the amino acid sequence 
of its polypeptide chain(s). 


5.1d A Protein’s Conformation Can Be Described 

as Its Overall Three-Dimensional Structure 
The overall three-dimensional architecture of a protein is generally referred to as its 
conformation. This term is not to be confused with configuration, which denotes the 
geometric possibilities for a particular set of atoms (Figure 5.6). In going from one 
configuration to another, covalent bonds must be broken and rearranged. 


(c) 


D-Glyceraldehyde L-Glyceraldehyde 
(b) 
cl H 
\ > 
e—onk 
H” | \ 
H cl 
1,2-Dichloroethane 


Figure 5.6 Configuration and conformation are not synonymous. (a) Rearrangements between 
configurational alternatives of a molecule can be achieved only by breaking and remaking bonds, as 
in the transformation between the D- and L-configurations of glyceraldehyde. (b) The intrinsic free 
rotation around single covalent bonds creates a great variety of three-dimensional conformations, 
even for relatively simple molecules, such as 1,2-dichloroethane. (c) Imagine the conformational 
possibilities for a protein in which two of every three bonds along its backbone are freely rotating 
single bonds. (Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. Not to be 
reproduced without permission.) 
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Figure 5.7 The solubility of most globular proteins 
is markedly influenced by pH and ionic strength. 

This figure shows the solubility of a typical protein 
as a function of pH and various salt concentrations. 


In contrast, the conformational possibilities of a molecule are achieved without 
breaking any covalent bonds. In proteins, rotations about each of the single bonds 
along the peptide backbone have the potential to alter the course of the polypeptide 
chain in three-dimensional space. These rotational possibilities create many possi- 
ble orientations for the protein chain, referred to as its conformational possibilities. 
Of the great number of theoretical conformations a given protein might adopt, only 
a very few are favored energetically under physiological conditions. At this time, the 
rules that direct the folding of protein chains into energetically favorable conforma- 
tions are still not entirely clear; accordingly, they are the subject of intensive con- 
temporary research. Chapter 6 presents a detailed discussion of the 2°, 3°, and 4° 
structure of protein molecules. 


5.2 | How Are Proteins Isolated and Purified from Cells? 


Cells contain thousands of different proteins. The first step in studying a specific 
protein is to purify it, so it is not contaminated by the presence of any other proteins. 
Because the biological function of a protein depends on its native (occasionally called 
physiological) structure, techniques employed in protein purification should not de- 
nature the protein (Figure 1.17). The starting material for protein purification is usu- 
ally a cell type or tissue of interest; alternatively, cloned cells engineered to mass 
produce the protein are used. Proteins can be separated and purified on the basis of 
their two prominent physical properties: size and electrical charge. A direct approach 
is to use affinity purification strategies that take advantage of the ligand-binding 
properties of a protein (see A Deeper Look box: Techniques Used in Protein 
Purification). 


5.2a A Number of Protein Separation Methods 
Exploit Differences in Size and Charge 


Separation methods based on size include size exclusion chromatography, ultrafiltra- 
tion, and ultracentrifugation (see A Deeper Look box: Techniques Used in Protein 
Purification). The charge or ionic properties of peptides and proteins are determined 
principally by their complement of amino acid side chains. Furthermore, the ioniza- 
tion of these groups is pH dependent. 

A variety of procedures has been designed to exploit the electrical charges on a 
protein as a means to separate proteins in a mixture. These procedures include ion 
exchange chromatography, electrophoresis (see A Deeper Look box: Techniques 
Used in Protein Purification), and separation by solubility. Proteins tend to be least 
soluble at their isoelectric point, the pH value at which the sum of their positive and 
negative electrical charges is zero. At this pH, electrostatic repulsion between protein 
molecules is minimal and they are more likely to coalesce and precipitate out of 
solution. Ionic strength also profoundly influences protein solubility. Most globular 
proteins tend to become increasingly soluble as the ionic strength is raised. This phe- 
nomenon, the salting-in of proteins, is attributed to the diminishment of electrostatic 
attractions between protein molecules by the presence of abundant salt ions. Such 
electrostatic interactions between the protein molecules would otherwise lead to pre- 
cipitation. However, as the salt concentration reaches high levels (greater than 1 M), 
the effect may reverse so that the protein is salted out of solution. In such cases, the 
numerous salt ions begin to compete with the protein for waters of solvation, and as 
they win out, the protein becomes insoluble. The solubility properties of a typical 
protein are shown in Figure 5.7. 

Although the side chains of nonpolar amino acids in soluble proteins are usually 
buried in the interior of the protein away from contact with the aqueous solvent, a por- 
tion of them may be exposed at the protein’s surface, giving it a partially hydrophobic 
character. Hydrophobic interaction chromatography is a protein purification technique 
that exploits this hydrophobicity (see A Deeper Look box: Techniques Used in Protein 
Purification). 
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A Deeper Look 


Estimation of Protein Concentrations in Solutions of Biological Origin 


Biochemists are often interested in knowing the protein concentra- 
tion in various preparations of biological origin. Such quantitative 
analysis is not straightforward. Cell extracts are complex mixtures 
that typically contain protein molecules of many different molecular 
weights, so the results of protein estimations cannot be expressed on 
a molar basis. Also, aside from the rather unreactive repeating pep- 
tide backbone, little common chemical identity is seen among the 


Brilliant Blue G-250, which undergoes a change in its color upon bind- 
ing to proteins. The binding is quantitative and takes place through 
hydrophobic interactions as well as ionic bonding with basic amino 
groups. The color change is easily measured by a spectrophotometer. 
A similar, very sensitive method capable of quantifying nanogram 
amounts of protein is based on the shift in color of colloidal gold 
upon binding to proteins. 


many proteins found in cells that might be readily exploited for exact 
chemical analysis. Most of their chemical properties vary with their 
amino acid composition, for example, nitrogen or sulfur content or 
the presence of aromatic, hydroxyl, or other functional groups. 

Several methods rely on the reduction of Cu?” ions to Cu* by read- 
ily oxidizable protein components, such as cysteine or the phenols and 
indoles of tyrosine and tryptophan. For example, bicinchoninic acid 
(BCA) forms a purple complex with Cu* in alkaline solution, and the 
amount of Cu* formed upon the reaction of a protein sample with 
Cu?* can be easily measured spectrophotometrically to provide an 
estimate of protein concentration. 

Other assays are based on dye binding by proteins. The Bradford 


assay is a rapid and reliable technique that uses a dye called Coomassie BCA-Cut complex 


5.2b A Typical Protein Purification Scheme 
Uses a Series of Separation Methods 


Most purification procedures for a particular protein are developed in an empirical 
manner, the overriding aim being isolation of the protein in a pure state with ac- 
ceptable yield. The source material is harvested cells, an organ, or an entire organ- 
ism thought to contain the protein of interest. The first step is to break open the 
cells so that the protein can be extracted. Table 5.1 presents a summary of a purifi- 
cation scheme for a desired enzyme. Note that the specific activity of the enzyme in 
the immunoaffinity purified fraction (fraction 5) has been increased 152/0.108, or 
1407 times the specific activity in the crude extract (fraction 1). Thus, the concentra- 
tion of this protein has been enriched more than 1400-fold by the purification 
procedure. 


| Table 5.1 | Example of a Protein Purification Scheme: Purification of an Enzyme from a Cell Extract 


Volume Total Total Specific Percent 
Fraction (mL) Protein (mg) Activity* Activity? Recovery? 
1. Crude extract 3800 22,800 2460 0.108 100 
2. Salt precipitate 165 2800 1190 0.425 48 
3. Ion exchange chromatography 65 100 720 ed 29 
4. Molecular sieve chromatography 40 14.5 555 38.3 23 
5. Immunoaffinity chromatography’ 6 1.8 275 152 11 


*The relative enzymatic activity of each fraction is cited as arbitrarily defined units. One unit here is 1 nanomole/mL of substrate converted per minute. The enzyme is xanthine dehydro- 
genase from the bread mold Neurospora crassa. 

iThe specific activity is the total activity of the fraction divided by the total protein in the fraction. This value gives an indication of the increase in purity attained during the course of the 
purification as the samples become enriched for the enzyme. 

The percent recovery of total activity is a measure of the yield of the desired enzyme. 

‘The last step in the procedure is an affinity method in which antibodies specific for the enzyme are covalently coupled to a chromatography matrix and packed into a glass tube to make 
a chromatographic column through which fraction 4 is passed. The enzyme is bound by this immunoaffinity matrix while other proteins pass freely out. The enzyme is then recovered 
by passing through the column a strong salt solution, which dissociates the enzyme-antibody complex. 
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A Deeper Look 


Techniques Used in Protein Purification’ 


Dialysis and Ultrafiltration 

If asolution of protein is separated from a bathing solution by a semi- 
permeable membrane, small molecules and ions can pass through 
the semipermeable membrane to equilibrate between the protein 
solution and the bathing solution, called the dialysis bath or dialy- 
sate (Figure 1). This method is useful for removing small molecules 
from macromolecular solutions or for altering the composition of 
the protein-containing solution. 

Ultrafiltration is an improvement on the dialysis principle. Filters 
with pore sizes over the range of biomolecular dimensions are 
used to filter solutions to select for molecules in a particular size 
range. Because the pore sizes in these filters are microscopic, high 
pressures are often required to force the solution through the 
filter. This technique is useful for concentrating dilute solutions of 
macromolecules. The concentrated protein can then be diluted 
into the solution of choice. 


Ion Exchange Chromatography Can Be Used 

to Separate Molecules on the Basis of Charge 

Charged molecules can be separated using ion exchange chromatog- 
raphy, a process in which the charged molecules of interest (ions) 
are exchanged for another ion (usually a salt ion) on a charged solid 
support. In a typical procedure, solutes in a liquid phase, usually wa- 
ter, are passed through a column filled with a porous solid phase com- 
posed of synthetic resin particles containing charged groups. Resins 
containing positively charged groups attract negatively charged sol- 
utes and are referred to as anion exchange resins. Resins with nega- 
tively charged groups are cation exchangers. Weakly acidic or basic 
groups on ion exchange resins exhibit charges that are dependent on 


Semipermeable bag 
containing protein solution 


Magnetic stirrer 
for mixing 


Figure 1 A dialysis experiment. The solution of macromolecules to be 
dialyzed is placed in a semipermeable membrane bag, and the bag is 
immersed in a bathing solution. A magnetic stirrer gently mixes the solution 
to facilitate equilibrium of diffusible solutes between the dialysate and the 
solution contained in the bag. 


‘Although this box is titled Techniques Used in Protein Purification, these 
methods are also applicable to other macromolecules such as nucleic acids. 


the pH of the bathing solution. Changing the pH will alter the ionic 
interaction between the resin groups and the bound ions. In all cases, 
the bare charges on the resin particles must be counterbalanced by 
oppositely charged ions in solution (counterions); salt ions (e.g., Nat 
or Cl-) usually serve this purpose. The hypothetical separation of a 
mixture of several proteins on a column of cation exchange resin is 
illustrated in Figure 2. Increasing the salt concentration in the 
solution passing through the column leads to competition between 
the cationic proteins bound to the column and the cations in the 
salt for binding to the column. Bound cationic proteins that in- 
teract weakly with the charged groups on the resin wash out first, 
and those interacting strongly are washed out only at high salt 
concentrations. 
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Figure 2 The separation of proteins on a cation exchange column. 
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Size Exclusion Chromatography 

Size exclusion chromatography is also known as gel filtration chro- 
matography or molecular sieve chromatography. In this method, fine, 
porous beads are packed into a chromatography column. The pore 
sizes of these beads approximate the dimensions of macromolecules. 
The total bed volume (Figure 3) of the packed chromatography col- 
umn, V, is equal to the volume outside the porous beads (V,) plus the 
volume inside the beads (V;) plus the volume actually occupied by the 
bead material (V): Vi = Vo + Vit Vz. (Ve is typically less than 1% of 
V, and can be conveniently ignored in most applications.) 

As a solution of molecules is passed through the column, the 
molecules passively distribute between V, and V;, depending on their 
ability to enter the pores (that is, their size). If a molecule is too large 
to enter at all, it is totally excluded from V; and emerges first from the 
column at an elution volume, V., equal to Vo. If a particular molecule 
can enter the pores in the gel, its distribution is given by the distribu- 
tion coefficient, Kp: 


Kp = (Ve — Vo)! Vi 


where V, is the molecule’s characteristic elution volume. The chroma- 
tography run is complete when a volume of solvent equal to V, has 
passed through the column. 


Electrophoresis 

Electrophoretic techniques are based on the movement of ions in 
an electrical field. An ion of charge q experiences a force F given by 
F = Eq/d, where E is the voltage (or electrical potential) and d is the 
distance between the electrodes. In a vacuum, F would cause the 
molecule to accelerate. In solution, the molecule experiences frictional 
drag, F;, due to the solvent: 
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Figure 3 Gel filtration separates proteins according to size. (a) A gel 
filtration chromatography column. Larger molecules are excluded from 
the gel beads and emerge from the column sooner than smaller molecules, 
whose migration is slowed because they can enter the beads. (b) An 
elution profile. 
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F; = 6arnv 


where r is the radius of the charged molecule, 7 is the viscosity of the 
solution, and v is the velocity at which the charged molecule is mov- 
ing. So, the velocity of the charged molecule is proportional to its 
charge q and the voltage E, but inversely proportional to the viscosity 
of the medium 7 and d, the distance between the electrodes. 

Generally, electrophoresis is carried out not in free solution but in 
a porous support matrix such as polyacrylamide or agarose, which 
slows the movement of molecules according to their dimensions rela- 
tive to the size of the pores in the matrix. 


SDS-Polyacrylamide Gel Electrophoresis (SDS-PAGE) 

SDS is sodium dodecylsulfate (sodium lauryl sulfate) (Figure 4). The 
hydrophobic tail of dodecylsulfate interacts strongly with polypep- 
tide chains. The number of SDS molecules bound by a polypeptide 
is proportional to the length (number of amino acid residues) of the 
polypeptide. Each dodecylsulfate contributes two negative charges. 
Collectively, these charges overwhelm any intrinsic charge that the 
protein might have. SDS is also a detergent that disrupts protein fold- 
ing (protein 3° structure). SDS-PAGE is usually run in the presence 
of sulfhydryl-reducing agents such as B-mercaptoethanol so that any 
disulfide links between polypeptide chains are broken. The electro- 
phoretic mobility of proteins upon SDS-PAGE is inversely propor- 
tional to the logarithm of the protein’s molecular weight (Figure 5). 
SDS-PAGE is often used to determine the molecular weight of a 
polypeptide chain. 


Isoelectric Focusing 

Isoelectric focusing is an electrophoretic technique for separating 
proteins according to their isoelectric points (pls). A solution of 
ampholytes (amphoteric electrolytes) is first electrophoresed through 
a gel, usually contained in a small tube. The migration of these 
substances in an electric field establishes a pH gradient in the tube. 
Then a protein mixture is applied to the gel, and electrophoresis is 
resumed. As the protein molecules move down the gel, they experi- 
ence the pH gradient and migrate to a position corresponding to 
their respective pIs. At its pI, a protein has no net charge and thus 
moves no farther. 
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Figure 4 The structure of sodium dodecylsulfate (SDS). 
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Figure 5 A plot of the relative electrophoretic mobility of proteins in 
SDS-PAGE versus the log of the molecular weights of the individual 
polypeptides. 
(Continued) 
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Two-Dimensional Gel Electrophoresis 

This separation technique uses isoelectric focusing in one dimen- 
sion and SDS-PAGE in the second dimension to resolve protein 
mixtures. The proteins in a mixture are first separated according 
to pI by isoelectric focusing in a polyacrylamide gel in a tube. The 
gel is then removed and laid along the top of an SDS-PAGE slab, 
and the proteins are electrophoresed into the SDS polyacrylamide 
gel, where they are separated according to size (Figure 6). The 
gel slab can then be stained to reveal the locations of the indi- 
vidual proteins. Using this powerful technique, researchers have 
the potential to visualize and construct catalogs of virtually all 
the proteins present in particular cell types. The ExPASy server 
at http-//us.expasy.org provides access to a two-dimensional poly- 
acrylamide gel electrophoresis database named SWISS-2DPAGE. 
This database contains information on proteins, identified as spots 
on two-dimensional electrophoresis gels, from many different cell 
and tissue types. 


Hydrophobic Interaction Chromatography 

Hydrophobic interaction chromatography (HIC) exploits the hydro- 
phobic nature of proteins in purifying them. Proteins are passed over 
a chromatographic column packed with a support matrix to which 
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hydrophobic groups are covalently linked. Phenyl Sepharose, an aga- 
rose support matrix to which phenyl groups are attached, is a prime 
example of such material. In the presence of high salt concentrations, 
proteins bind to the phenyl groups by virtue of hydrophobic interac- 
tions. Proteins in a mixture can be differentially eluted from the phe- 
nyl groups by lowering the salt concentration or by adding solvents 
such as polyethylene glycol to the elution fluid. 


High-Performance Liquid Chromatography 

The principles exploited in high-performance (or high-pressure) liquid 
chromatography (HPLC) are the same as those used in the common 
chromatographic methods such as ion exchange chromatography 
or size exclusion chromatography. Very-high-resolution separations 
can be achieved quickly and with high sensitivity in HPLC using 
automated instrumentation. Reverse-phase HPLC is a widely used 
chromatographic procedure for the separation of nonpolar solutes. 
In reverse-phase HPLC, a solution of nonpolar solutes is chroma- 
tographed on a column having a nonpolar liquid immobilized on an 
inert matrix; this nonpolar liquid serves as the stationary phase. A 
more polar liquid that serves as the mobile phase is passed over the 
matrix, and solute molecules are eluted in proportion to their solubil- 
ity in this more polar liquid. 
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Figure 6 A two-dimensional electrophoresis separation. A mixture of macromolecules is first separated according to charge by isoelectric focusing in a 
tube gel. The gel containing separated molecules is then placed on top of an SDS-PAGE slab, and the molecules are electrophoresed into the SDS-PAGE gel, 
where they are separated according to size. Shown on the right is an actual 2D-gel electrophoretogram of mouse liver proteins from the Swiss Institute of 


Bioinformatics database. 
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Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Affinity Chromatography 

Affinity purification strategies for proteins exploit the biological 
function of the target protein. In most instances, proteins carry out 
their biological activity through binding or complex formation with 
specific small biomolecules, or /igands, as in the case of an enzyme 
binding its substrate. If this small molecule can be immobilized 
through covalent attachment to an insoluble matrix, such as a chro- 
matographic medium like cellulose or polyacrylamide, then the pro- 
tein of interest, in displaying affinity for its ligand, becomes bound 
and immobilized itself. It can then be removed from contaminating 
proteins in the mixture by simple means such as filtration and wash- 
ing the matrix. Finally, the protein is dissociated or eluted from the 
matrix by the addition of high concentrations of the free ligand in 
solution. Figure 7 depicts the protocol for such an affinity chroma- 
tography scheme. Because this method of purification relies on the 
biological specificity of the protein of interest, it is a very efficient 
procedure, and proteins can be purified several thousand-fold in a 
single step. 


Ultracentrifugation 

Centrifugation methods separate macromolecules on the basis of their 
characteristic densities. Particles tend to “fall” through a solution if 
the density of the solution is less than the density of the particle. The 
velocity of the particle through the medium is proportional to the dif- 
ference in density between the particle and the solution. The tendency 
of any particle to move through a solution under centrifugal force is 
given by the sedimentation coefficient, S: 


S= (Pp = Pm) V/f 


where pp is the density of the particle or macromolecule, pm is the 
density of the medium or solution, V is the volume of the particle, 
and f is the frictional coefficient, given by 


f = Fr/v 


where v is the velocity of the particle and F; is the frictional drag. 
Nonspherical molecules have larger frictional coefficients and thus 
smaller sedimentation coefficients. The smaller the particle and the 
more its shape deviates from spherical, the more slowly that particle 
sediments in a centrifuge. 

Centrifugation can be used either as a preparative technique for 
separating and purifying macromolecules and cellular components 
(see Figure 17.16) or as an analytical technique to characterize the 
hydrodynamic properties of macromolecules such as proteins and 
nucleic acids. See also the A Deeper Look box: The Buoyant Density 
of DNA in Section 11.3. 


5.3 


How Is the Primary Structure of a Protein Determined? 


A protein interacts with a metabolite. The 
metabolite is thus a ligand that binds specifically 
to this protein 


G+ 9=@ 


Protein Metabolite 


The metabolite can be immobilized by covalently 
coupling it to an insoluble matrix such as an 
agarose polymer. Cell extracts containing many 
individual proteins may be passed through 

the matrix. 


Specific protein binds to ligand. All other 
unbound material is washed out of the matrix. 


Adding an excess of free metabolite that will 
compete for the bound protein dissociates the 
protein from the chromatographic matrix. The 
protein passes out of the column complexed with 
metabolite. 


Purifications of proteins as 
much as 1,000-fold or more are 
routinely achieved in a single 
affinity chromatographic step 
like this. 


© 
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Figure 7 Diagram illustrating affinity chromatography. 
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5.3 | How Is the Primary Structure 
of a Protein Determined? 


The defining characteristic of each protein is the distinctive sequence of amino acid 
residues in its polypeptide chain(s). Indeed, it is the amino acid sequence of proteins 
that is encoded by the nucleotide sequence of DNA. This amino acid sequence, then, 
is a form of genetic information. Because polypeptide chains are unbranched, a poly- 
peptide chain has only two ends, an amino-terminal, or N-terminal, end and a 
carboxy-terminal, or C-terminal, end. By convention, the amino acid sequence is read 
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Figure 5.8 The hormone insulin consists of two 
polypeptide chains, A and B, held together by two 
disulfide cross-bridges (S—S). The A chain has 

21 amino acid residues and an intrachain disulfide; 
the B polypeptide contains 30 amino acids. The 
sequence shown is for bovine insulin. 


from the N-terminal end of the polypeptide chain through to the C-terminal end. 
As an example, every molecule of ribonuclease A from bovine pancreas has the same 
amino acid sequence, beginning with N-terminal lysine at position 1 and ending with 
C-terminal valine at position 124 (Figure 5.2). Given the possibility of any of the 
20 amino acids at each position, the number of unique amino acid sequences is astro- 
nomically large. The astounding sequence variation possible within polypeptide 
chains provides a key insight into the incredible functional diversity of protein mole- 
cules in biological systems discussed later in this chapter. 


5.3a Sanger Was the First to Determine the Sequence of a Protein 


In 1953, Frederick Sanger of Cambridge University in England reported the amino 
acid sequences of the two polypeptide chains composing the protein insulin 
(Figure 5.8). Not only was this a remarkable achievement in analytical chemistry, 
but it helped demystify speculation about the chemical nature of proteins. Sanger’s 
results clearly established that all of the molecules of a given protein have a defined 
amino acid sequence, and therefore an invariant molecular weight. In short, proteins 
are well defined chemically. Today, the amino acid sequences of millions of proteins are 
known. Although many sequences have been determined from application of 
the principles first established by Sanger, most are now deduced from knowledge of 
the nucleotide sequence of the gene that encodes the protein. In addition, in recent 
years, the application of mass spectrometry to the sequence analysis of proteins has 
largely superseded the protocols based on chemical and enzymatic degradation of 
polypeptides that Sanger pioneered. Either mass spectrometry or these chemical and 
enzymatic protocols are useful in confirming gene sequence data and either can 
reveal posttranslational modifications of proteins, which often play a critical role in 
protein function (see Section 5.6). 


5.3b Both Chemical and Enzymatic Methodologies 
Are Used in Protein Sequencing 


The chemical strategy for determining the amino acid sequence of a protein involves 
six basic steps: 


1. If the protein contains more than one polypeptide chain, the chains are separated 

and purified. 

2. Intrachain S—S (disulfide) cross-bridges between cysteine residues in the poly- 
peptide chain are cleaved. (If these disulfides are interchain linkages, then step 2 
precedes step 1.) 

. The N-terminal and C-terminal residues are identified. 

4. Each polypeptide chain is cleaved into smaller fragments, and the amino acid com- 

position and sequence of each fragment are determined. 

5. Step 4 is repeated, using a different cleavage procedure to generate a different and 

therefore overlapping set of peptide fragments. 

6. The overall amino acid sequence of the protein is reconstructed from the sequences 

in overlapping fragments. 


w 


Each of these steps is discussed in greater detail in the following sections. 


5.3c Step 1. Separation of Polypeptide Chains 


If the protein of interest is a heteromultimer (composed of more than one type of poly- 
peptide chain), then the protein must be dissociated into its component polypeptide 
chains, which then must be separated from one another and sequenced individually. 
Because subunits in multimeric proteins typically associate through noncovalent 
interactions, most multimeric proteins can be dissociated by exposure to pH ex- 
tremes, 8 M urea, 6 M guanidinium hydrochloride, or high salt concentrations. (All 
of these treatments disrupt polar interactions such as hydrogen bonds, both within 
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A Deeper Look 
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The Virtually Limitless Number of Different Amino Acid Sequences 


Given 20 different amino acids, a polypeptide chain of n residues For a polypeptide chain of 100 residues in length, a rather modest 
can have any one of 20” possible sequence arrangements. To por- size, the number of possible sequences is 20!, or because 20 = 
tray this, consider the number of tripeptides possible if there were 10!3, 101° unique possibilities. These numbers are more than astro- 
only three different amino acids, A, B, and C (tripeptide = 3 =n; nomical! Because an average protein molecule of 100 residues would 


3” = 33 = 27): have a mass of 12,000 daltons (assuming the average molecular 
AAA BBB ccc mass of an amino acid residue = 120), 101° such molecules would 
AAB BBA CCA have a mass of 1.2 X 1014 daltons. The mass of the observable 
AAC BBC CCB universe is estimated to be 10% proton masses (about 10% daltons). 
ABA BAB CBC Thus, the universe lacks enough material to make just one molecule 
ACA BCB CAC of each possible polypeptide sequence for a protein only 100 resi- 
ABC BAA CBA dues in length. 


ACB BCC CAB 
ABB BAC CBB 
ACC BCA CAA 


the protein molecule and between the protein and the aqueous solvent.) Once 
dissociated, the individual polypeptides can be isolated from one another on the basis 
of differences in size and/or charge. 


5.3d Step 2. Cleavage of Disulfide Bridges 


Occasionally, heteromultimers are linked together by interchain S—S bridges. In 
such instances, these crosslinks must be cleaved before dissociation and isolation of 
the individual chains. The methods described under step 2 are applicable for this 
purpose. A number of methods exist for cleaving disulfides. An important consid- 
eration is to carry out these cleavages so that the original or even new S—S links do 
not form. Oxidation of a disulfide by performic acid results in the formation of two 
equivalents of cysteic acid (Figure 5.9a). Because these cysteic acid side chains 
are ionized SO; groups, electrostatic repulsion (as well as altered chemistry) 
prevents S—S recombination. Alternatively, sulfhydryl compounds such as 
2-mercaptoethanol or dithiothreitol (DTT) readily reduce S—S bridges to regener- 
ate two cysteine—SH side chains, as in a reversal of the reaction shown in Fig- 
ure 4.8b. However, these SH groups recombine to re-form either the original disul- 
fide link or, if other free Cys—SHs are available, new disulfide links. To prevent 
this, S—S reduction must be followed by treatment with alkylating agents such as 
iodoacetate or 3-bromopropylamine, which modify the SH groups and block disul- 
fide bridge formation (Figure 5.9b). 


5.3e Step 3. N- and C-Terminal Analysis 


N-Terminal Analysis The amino acid residing at the N-terminal end of a protein can 
be identified in a number of ways; one method, Edman degradation, has become the 
procedure of choice. This method is preferable because it allows the sequential iden- 
tification of a series of residues beginning at the N-terminus. In weakly basic 
solutions, phenylisothiocyanate, or Edman reagent (phenyl— N—C=S), combines 
with the free amino terminus of a protein (see Figure 4.8a), which can be excised from 
the end of the polypeptide chain and recovered as a PTH derivative. Chromatographic 
methods can be used to identify this PTH derivative. Importantly, in this procedure, 
the rest of the polypeptide chain remains intact and can be subjected to further 
rounds of Edman degradation to identify successive amino acid residues in the chain. 
Often, the carboxyl terminus of the polypeptide under analysis is coupled to an 
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Figure 5.9 Methods for cleavage of disulfide bonds in proteins. (a) Oxidative cleavage by reaction 
with performic acid. (b) Disulfide bridges can be broken by reduction with sulfhydryl agents such as 
B-mercaptoethanol or dithiothreitol (as in the reverse of the reaction shown in Figure 4.8b). Because 
reaction between the newly reduced —SH groups to reestablish disulfide bonds is a likelihood, S—S 
reduction must be followed by —SH modification: (1) Alkylation with iodoacetate (ICH,COOH) or 
(2) modification with 3-bromopropylamine (Br—(CH2);—NHb). 


insoluble matrix, allowing the polypeptide to be easily recovered by filtration or 
centrifugation following each round of Edman reaction. Thus, the Edman reaction 
not only identifies the N-terminal residue of proteins but through successive reaction 
cycles can reveal further information about sequence. Automated instruments (so- 
called Edman sequenators) have been designed to carry out repeated rounds of the 
Edman procedure. In practical terms, such devices require 3 picomoles (about 6 
nanograms) of a polypeptide and yield about 15 residues of sequence information. 
Today, mass spectrometry, not Edman sequencing, is the preferred way to obtain the 
amino acid sequence of a protein (see Section 5.3h). 


C-Terminal Analysis For the identification of the C-terminal residue of polypeptides, 
an enzymatic approach is commonly used. Carboxypeptidases are enzymes that 
cleave amino acid residues from the C-termini of polypeptides in a successive fashion. 
Several carboxypeptidases are in general use: A, B, and Y (sometimes designated C). 
Carboxypeptidase A (from bovine pancreas) works well in hydrolyzing the C-terminal 
peptide bond of all residues except proline, aspartate, glutamate, arginine, and lysine. 
The analogous enzyme from hog pancreas, carboxypeptidase B, is effective only when 
Arg or Lys are the C-terminal residues. Carboxypeptidase Y from yeast acts on any 
C-terminal residue. Because the nature of the amino acid residue at the end often 
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determines the rate at which it is cleaved and because these enzymes remove residues 
successively, care must be taken in interpreting results. Carboxypeptidase Y cleavage 
has been adapted to an automated protocol analogous to that used in Edman 
sequenators. 


5.3f Steps 4 and 5. Fragmentation of the Polypeptide Chain 


The aim in these steps is to produce fragments useful for sequence analysis. The cleav- 
age methods employed are usually enzymatic, but proteins can also be fragmented by 
specific or nonspecific chemical means (such as partial acid hydrolysis). Proteolytic 
enzymes offer an advantage in that many hydrolyze only specific peptide bonds, and 
this specificity immediately gives information about the peptide products. As a first 
approximation, fragments produced upon cleavage should be small enough to yield 
their sequences through end-group analysis and Edman degradation, yet not so small 
that an overabundance of products must be resolved before analysis. 


Using Proteases to Cleave Protein Chains The digestive enzyme trypsin is the most 
commonly used reagent for specific proteolysis. Trypsin will only hydrolyze peptide 
bonds in which the carbonyl function is contributed by an arginine or a lysine residue. 
That is, trypsin cleaves on the C-side of Arg or Lys, generating a set of peptide frag- 
ments having Arg or Lys at their C-termini. The number of smaller peptides resulting 
from trypsin action is equal to the total number of Arg and Lys residues in the protein 
plus one—the protein’s C-terminal peptide fragment (Figure 5.10). 

Other proteases used in sequencing include chymotrypsin, which shows a strong pref- 
erence for hydrolyzing peptide bonds formed by the carboxyl groups of the aromatic 
amino acids—phenylalanine, tyrosine, and tryptophan, and to a lesser extent, by leucine 
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Figure 5.10 Trypsin cleavage in protein sequencing. 
(a) Trypsin is a proteolytic enzyme, or protease, that 
specifically cleaves only those peptide bonds in 
which arginine or lysine contributes the carbonyl 
function. (b) The products of the reaction are a 
mixture of peptide fragments with C-terminal Arg 
or Lys residues and a single peptide derived from the 
polypeptide’s C-terminal end. 
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| Table 5.2 | Specificity of Representative Polypeptide Cleavage Procedures 


Used in Sequence Analysis 
Peptide Bond on 


Carboxyl (C) or Amino (N) Susceptible 
Method Side of Susceptible Residue Residue(s) 
Proteolytic enzymes* 
Trypsin ÇC Arg and Lys 
Chymotrypsin C Phe, Trp, and Tyr; Leu 
Clostripain C Arg 
Staphylococcal protease C Asp and Glu 
Endopeptidase Lys-C C Lys 
Chemical methods 
Cyanogen bromide C Met 
NHOH Asn-Gly bonds 
pH 2.5, 40°C Asp-Pro bonds 


*Some proteolytic enzymes, including trypsin and chymotrypsin, will not cleave peptide bonds where proline is the 
amino acid contributing the N-atom. 


(Table 5.2). Various other endopeptidases (proteases that, like trypsin and chymotryp- 
sin, cleave peptide bonds within the interior of a polypeptide chain) are also used in 
sequence investigations. These include clostripain, which acts only at Arg residues; en- 
dopeptidase Lys-C, which cleaves only at Lys residues; and staphylococcal protease, 
which acts at the acidic residues, Asp and Glu. Because these endoproteases yield differ- 
ent sets of products, treatment of separate samples of a protein with different enzymes 
generates fragments whose sequences overlap. Resolution of the order of amino acid 
residues in the fragments yields the amino acid sequence in the original protein. 


Chemical methods of proteolysis Several highly specific chemical methods of proteoly- 
sis are available, the most widely used being cyanogen bromide (CNBr) cleavage. 
CNBr acts upon methionine residues (Figure 5.11). The nucleophilic sulfur atom of 
Met reacts with CNBr, yielding a sulfonium ion that undergoes a rapid intramolecular 
rearrangement to form a cyclic iminolactone. Water readily hydrolyzes this iminolac- 
tone, cleaving the polypeptide and generating peptide fragments with C-terminal 
homoserine lactone residues at the former Met positions. A number of other chemical 
methods can also give specific fragmentation of polypeptides, including cleavage at 
asparagine—glycine bonds by hydroxylamine (NHOH) at pH 9 and selective hydroly- 
sis at aspartyl—prolyl bonds under mildly acidic conditions. Table 5.2 summarizes the 
various procedures described here for polypeptide cleavage. These methods are only 
a partial list of the arsenal of reactions available to protein chemists. Cleavage prod- 
ucts generated by these procedures must be isolated and individually sequenced to 
accumulate the information necessary to reconstruct the protein’s complete amino 
acid sequence. Sequencing of peptide fragments is still occasionally done by Edman 
degradation. However, far more often, intact proteins are fragmented and sequenced 
directly within mass spectrometers (see following discussion). 


5.3g Step 6. Reconstruction of the Overall Amino Acid Sequence 


The sequences obtained for the sets of fragments derived from two or more cleavage 
procedures are now compared, with the objective being to find overlaps that establish 
continuity of the overall amino acid sequence of the polypeptide chain. The strategy 
is illustrated by the example shown in Figure 5.12. Peptides generated from specific 
fragmentation of the polypeptide can be aligned to reveal the overall amino acid 
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Figure 5.11 Cyanogen bromide (CNBr) is a highly selective reagent for cleavage of peptides only at 
methionine residues. (1) Nucleophilic attack of the Met S atom on the —C=N carbon atom, with 
displacement of Br. (2) Nucleophilic attack by the Met carbonyl oxygen atom on the R group. The 
cyclic derivative is unstable in aqueous solution. (3) Hydrolysis cleaves the Met peptide bond. 
C-terminal homoserine lactone residues occur where Met residues once were. 
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Figure 5.12 Summary of the sequence analysis of 
catrocollastatin-C, a 23.6 kDa protein found in the 
venom of the western diamondback rattlesnake 
Crotalus atrox. Sequences shown are given in the 
one-letter amino acid code. The overall amino acid 
sequence (216 amino acid residues long) for 
catrocollastatin-C as deduced from the overlapping 
sequences of peptide fragments is shown on the lines 
headed CAT-C. The other lines report the various 
sequences used to obtain the overlaps. These 
sequences were obtained from (a) N-term: Edman 
degradation of the intact protein in an automated 
Edman sequenator; (b) M: proteolytic fragments 
generated by CNBr cleavage, followed by Edman 
sequencing of the individual fragments (numbers 
denote fragments M1 through M5); (c) K: proteolytic 
fragments from endopeptidase Lys-C cleavage, 
followed by Edman sequencing (only fragments 

K3 through K6 are shown); (d) E: proteolytic 
fragments from Staphylococcus protease digestion of 
catrocollastatin sequenced in the Edman sequenator 
(only E13 through E15 are shown). (Adapted from 
Shimokawa, K., et al., 1997. Sequence and biological 
activity of catrocollastatin-C: A disintegrin-like/ 
cysteine-rich two-domain protein from Crotalus 
atrox venom. Archives of Biochemistry and 
Biophysics 343:35—43.) 
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sequence. Such comparisons are also useful in eliminating errors and validating the 
accuracy of the sequences determined for the individual fragments. 


5.3h The Amino Acid Sequence of a Protein Can Be Determined 
by Mass Spectrometry 


Mass spectrometry technology has evolved rapidly over the past decades and now 
dominates sequence determination as well as the new field of global protein study 
known as proteomics (Section 5.8). What follows here is a discussion of the basic 
principles of mass spectrometry, as applied to proteins. 

Mass spectrometers exploit the difference in the mass-to-charge (m/z) ratio of 
ionized atoms or molecules to separate them from each other. The m/z ratio of a mole- 
cule is also a highly characteristic property that can be used to acquire chemical and 
structural information. Furthermore, molecules can be fragmented in distinctive ways 
in mass spectrometers, and the fragments that arise also provide quite specific structural 
information about the molecule. The basic operation of a mass spectrometer is to (1) 
evaporate and ionize molecules in a vacuum, creating gas-phase ions; (2) separate the 
ions in space and/or time based on their m/z ratios; and (3) measure the amount of ions 
with specific m/z ratios. Because proteins (as well as nucleic acids and carbohydrates) 
decompose upon heating rather than evaporating, methods to ionize such molecules for 
mass spectrometry (MS) analysis require innovative approaches. Two prominent MS 
modes for protein analysis are summarized in Table 5.3. 

Figure 5.13 illustrates the basic features of electrospray mass spectrometry (ESI 
MS). In this technique, the high voltage at the electrode causes proteins to pick up pro- 
tons from the solvent, such that, on average, individual protein molecules acquire about 
one positive charge (proton) per kilodalton, leading to the spectrum of m/z ratios for a 
single protein species (Figure 5.14). Computer analysis can convert these data into a 
single spectrum that has a peak at the correct protein mass (Figure 5.14, inset). 


| Table 5.3 | The Two Most Common Methods of Mass Spectrometry 
for Protein Analysis 
Electrospray lonization (ESI-MS) 


A solution of macromolecules is sprayed in the form of fine droplets from a glass capillary 
under the influence of a strong electrical field. The droplets pick up positive charges as they 
exit the capillary; evaporation of the solvent leaves multiply charged molecules. The typical 
20 kDa protein molecule will pick up 10 to 30 positive charges. The MS spectrum of this 
protein reveals all of the differently charged species as a series of sharp peaks whose 
consecutive m/z values differ by the charge and mass of a single proton (see Figure 5.14). 
Note that decreasing m/z values signify increasing number of charges per molecule, z. 
Tandem mass spectrometers downstream from the ESI source (ESI-MS/MS) can analyze 
complex protein mixtures (such as tryptic digests of proteins or chromatographically 
separated proteins emerging from a liquid chromatography column [LC/MS/MS]), selecting 
a single m/z species for collision-induced dissociation and acquisition of amino acid 
sequence information. 


Matrix-Assisted Laser Desorption lonization-Time of Flight (MALDI-TOF MS) 


The protein sample is mixed with a chemical matrix that includes a light-absorbing 
substance excitable by a laser. A laser pulse is used to excite the chemical matrix, creating a 
microplasma that transfers the energy to protein molecules in the sample, ionizing them and 
ejecting them into the gas phase. Among the products are protein molecules that have 
picked up a single proton. These positively charged species can be selected by the MS for 
mass analysis. MALDI-TOF MS is very sensitive and very accurate. 
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Countercurrent 
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Mass spectrometer 


(c) 


(a) High voltage 


(b) Vacuum 
interface 


Figure 5.13 The three principal steps in electrospray ionization mass spectrometry (ESI-MS). 

(a) Small, highly charged droplets are formed by electrostatic dispersion of a protein solution through 
a glass capillary subjected to a high electric field; (b) protein ions are desorbed from the droplets into 
the gas phase (assisted by evaporation of the droplets in a stream of hot N> gas); and (c) the protein 
ions are separated in a mass spectrometer and identified according to their m/z ratios. (Adapted from 
Figure | in Mann, M., and Wilm, M., 1995. Electrospray mass spectrometry for protein 
characterization. Trends in Biochemical Sciences 20:219-224.) 
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Figure 5.14 Electrospray ionization mass spectrum of the protein aerolysin K. The attachment of many 
protons per protein molecule (from fewer than 30 to more than 50 here) leads to a series of m/z peaks 
for this single protein. The equation describing each m/z peak is: m/z = [M + n(mass of proton)/ 
n(charge on proton), where M = mass of the protein and n = number of positive charges per protein 
molecule. Thus, if the number of charges per protein molecule is known and m/z is known, M can be 
calculated. The inset shows a computer analysis of the data from this series of peaks that generates a 
single peak at the correct molecular mass of the protein. (Adapted from Figure 2 in Mann, M., and 
Wilm, M., 1995. Electrospray mass spectrometry for protein characterization. Trends in Biochemical 
Sciences 20:219-224.) 
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Sequencing by Tandem Mass Spectrometry Tandem MS (or MS/MS) allows sequencing 
of proteins by hooking two mass analyzers in tandem. The first is used as a filter to 
sort the oligopeptide fragments in a protein digest based on differences in their m/z 
ratios. Typically, the fragments are generated by trypsin digestion of the protein of 
interest. Such fragments are usually about | kD in mass and thus, pick up about one 
additional positive charge during ESI (in addition to the positive -NH;* terminus). 
Each tryptic peptide selected by the first MS is directed toward the second mass ana- 
lyzer; on the way, this peptide is fragmented by collision with inert gas molecules (a 
process called collision-induced dissociation, or CID), and the fragments are analyzed 
by the second MS (Figure 5.15). Fragmentation occurs primarily at the peptide bonds 
linking successive amino acids in the oligopeptide. Thus, the products include a series 
of fragments that represent a nested set of peptides differing in size by one amino acid 
residue. The various members of this set of fragments differ in mass by 56 atomic mass 
units [amu, the mass of the peptide backbone atoms (NH—CH—CO)] plus the mass 
of the R group at each position, which ranges from | amu (Gly) to 130 (Trp). Figure 
5.16 illustrates MS/MS sequencing results obtained for a specific tryptic peptide. 

MS sequencing has the advantages of very high sensitivity, fast sample processing, 
and the ability to work with mixtures of proteins. Sub-femtomole amounts (less than 
107! moles) of peptide can be analyzed with these spectrometers. The protein need not 
be pure. Capillary HPLC-separated trypsin digests of protein mixtures can be directly 
loaded into the tandem MS spectrometer. Alternatively, separation of a complex mixture 


(a) Electrospray Ionization Tandem Mass Spectrometer 
Electrospray Collision cell Detector 
ionization 
source 

(b) Collision 
cell 
Pi 
P, 
P, MS-2 
Py 
P, 


or 


Electrospray 
ionization 
(c) R O R, O R, O 
‘'—N—C—GA_N—C— GN C— Gs 
H H lh H H H 
Fragmentation 
at peptide 


bonds 


Figure 5.15 Tandem mass spectrometry. (a) The basic configuration used in tandem MS. 

(b) Schematic description of tandem MS: Tandem MS involves electrospray ionization of a protein 
digest (IS in this figure), followed by selection of a single peptide ion mass for collision with inert gas 
molecules (He or N2) and mass analysis of the fragment ions resulting from the collisions. 

(c) Fragmentation usually occurs at peptide bonds, as indicated. (Adapted from Yates, J. R., 1996. 
Protein structure analysis by mass spectrometry. Methods in Enzymology 271:351—376; and Gillece- 
Castro, B. L., and Stults, J. T., 1996. Peptide characterization by mass spectrometry. Methods in 
Enzymology 271:427-447.) 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


5.3 How Is the Primary Structure of a Protein Determined? 125 


| Example! How do we know the sequence for the peptide in Figure 5.16 is 


-SPTDIEEGMR-coop? 


Step 1: Reading from left to right on the m/z axis, the first peak has an m/z of 175.11, 
the molecular mass of arginine-H*. This R represents the C-terminal amino acid in 
this tryptic peptide. 

Step 2: The next peak, m/z = 306.16, is the peak of the tryptic peptide’s C-terminal 
dipeptide, whose sequence can be represented as XR. The unknown amino acid, 
X, has a molecular mass of 306.16 minus 175.11, or 131.05, which is the molecular 
mass of a methionine residue, so the dipeptide sequence is MR. 

Step 3: The next peak, m/z = 363.17, represents the C-terminal tripeptide XMR. X 
here has a molecular mass of 363.17 minus 306.16, or 57.01, the mass of a glycine 
residue, so the C-terminal tripeptide is GMR. 

Step 4: The remaining amino acids in the sequence: Repeating this operation for the 
next seven peaks reveals the overall tryptic peptide sequence: -SPTDIEEGMR* 
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Figure 5.16 Mass spectrum of a selected tryptic peptide from a tandem MS (MS/MS) peptide 
sequencing experiment. The mass differences between the series yield the amino acid sequence, 
which is shown below the spectrum in reverse (C—>N) from the conventional N->C direction. For 
example, 1134.58 minus 1047.43 equals 87.15; the mass of a serine residue; 1047.63 minus 950.38 
equals 97.05, the mass of a proline residue. (Adapted from Figure 3c in Steen, H., and Mann, M., 
2004. The abc’s (and xyz’s) of peptide sequencing. Nature Reviews Molecular Cell Biology 
5:599-611.) 


*Since the molecular masses of leucine (L) and isoleucine (I) are identical, there is uncertainty 
whether the sequence is SPTDIEEGMR or SPTDLEEGMR. 


of proteins from a whole-cell extract by two-dimensional gel electrophoresis (see 
A Deeper Look box: Techniques Used in Protein Purification), followed by trypsin 
digestion of a specific protein spot on the gel and injection of the digest into the 
HPLC/tandem MS, gives information that can be used to identify the protein within 
sequence databases. 


5.31 Sequence Databases Contain the Amino Acid 
Sequences of Millions of Different Proteins 
The first protein sequence databases were compiled by protein chemists using chemi- 


cal sequencing methods. Today, most protein sequence information has been derived 
from translating the nucleotide sequences of genes into codons and, thus, amino acid 
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Figure 5.17 Amino acid composition: frequencies of 
the various amino acids in proteins for all the 
proteins in the SWISS-PROT protein knowledge 
base. These data are derived from the amino acid 


composition of more than 600,000 different proteins. 


The range is from leucine at 9.66% to tryptophan at 
1.09% of all residues. 


sequences (see Chapter 30). Sequencing the order of nucleotides in cloned genes is a 
more rapid, efficient, and informative process than determining the amino acid 
sequences of proteins by chemical methods. Several electronic databases containing 
continuously updated sequence information are accessible by personal computer. 
Prominent among these is the SWISS-PROT protein sequence database on the ExPASy 
(Expert Protein Analysis System) Molecular Biology server and the PIR (Protein Iden- 
tification Resource Protein Sequence Database) at /ttp.//pir.georgetown.edu. Mass 
spectrometric proteomics data are freely available from a number of repositories, in- 
cluding PRIDE (PRoteomics IDentification database) at http://www.ebi.ac.uk/pridel, 
PeptideAtlas at http-//peptideatlas.org, and the Global Proteome Machine at /ttp:// 
gpmdb.thegpm.org. Further, protein information from genomic sequences is available 
in databases such as GenBank, accessible via the National Center for Biotechnology 
Information (NCBI) website located at http://www.ncbi.nlm.nih.gov. The protein 
sequence databases contain millions of entries. 

In addition to sequence information, protein structural information can be found in 
the Worldwide Protein Data Bank (wwPDB), an ensemble of repositories composed of 
the Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB 
PDB at Attp://www.rcsb.org/pdb), the Protein Data Bank Europe (PDBe), the Protein 
Data Bank Japan (PDBj), and the BioMagResBank (BMRB), which specifically houses 
NMR-derived structural information. The RCSB PDB site contains three-dimensional 
structural information on hundreds of thousands of proteins and nucleic acids. 


5.4 | What Is the Nature of Amino Acid Sequences? 


Figure 5.17 illustrates the relative frequencies of the amino acids in proteins. It is very 
unusual for a globular protein to have an amino acid composition that deviates sub- 
stantially from these values. Apparently, these abundances reflect a distribution of 
amino acid polarities that is optimal for protein stability in an aqueous milieu. Mem- 
brane proteins tend to have relatively more hydrophobic and fewer ionic amino acids, 
a condition consistent with their location. Fibrous proteins may show compositions 
that are atypical with respect to these norms, indicating an underlying relationship 
between the composition and the structure of these proteins. 

Proteins have unique amino acid sequences, and it is this uniqueness of sequence that 
ultimately gives each protein its own particular personality. Because the number of pos- 
sible amino acid sequences in a protein is astronomically large, the probability that two 
proteins will, by chance, have similar amino acid sequences is negligible. Consequently, 
sequence similarities between proteins imply evolutionary relatedness. 
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5.4a Homologous Proteins from Different Organisms Have 
Homologous Amino Acid Sequences 


Proteins sharing a significant degree of sequence similarity and structural resemblance 
are said to be homologous. Proteins that perform the same function in different organ- 
isms are also referred to as homologous. For example, the oxygen transport protein 
hemoglobin serves a similar role and has a similar structure in all vertebrates. The study 
of the amino acid sequences of homologous proteins from different organisms provides 
very strong evidence for their evolutionary origin within a common ancestor. Homolo- 
gous proteins characteristically have polypeptide chains that are nearly identical in 
length, and the degree to which their sequences share identity provides a direct measure 
of the evolutionary relationship between the species from which they are derived. 

Homologous proteins can be further subdivided into orthologous and paralogous 
proteins. Orthologous proteins are proteins from different species that have homologous 
amino acid sequences (and often a similar function). Orthologous proteins arose from a 
common ancestral gene during evolution. Paralogous proteins are proteins found within 
a single species that have homologous amino acid sequences; paralogous proteins arose 
through gene duplication. For example, the œ- and B-globin chains of hemoglobin are 
paralogs. 


5.4b Computer Programs Can Align Sequences 
and Discover Homology between Proteins 


How is homology revealed? Protein and nucleic acid sequence databases (see 
Section 5.3) provide enormous resources for sequence comparisons. If two proteins 
share homology, it can be revealed through alignment of their sequences using power- 
ful computer programs. In such studies, a given amino acid sequence is used to query 
the databases for proteins with similar sequences. BLAST (Basic Local Alignment 
Search Tool) is one commonly used program for rapid searching of sequence data- 
bases. The BLAST program detects local as well as global alignments where se- 
quences are in close agreement. Even regions of similarity shared between otherwise 
unrelated proteins can be detected. Discovery of sequence similarities between pro- 
teins can be an important clue to the function of uncharacterized proteins. Similari- 
ties are also useful in assigning related proteins to protein families. 

The process of sequence alignment is an operation akin to sliding one sequence 
along another in a search for regions where the two sequences show a good match. Posi- 
tive scores are assigned everywhere the amino acid in one sequence is similar to or iden- 
tical with the amino acid in the other; the greater the overall score, the better the match 
between the two protein sequences. Sometimes two sequences match well at several 
places along their lengths, but, in one of the proteins, the matching segments are inter- 
rupted by a sequence that is dissimilar. When such an interruption is found by the com- 
puter program, it inserts a gap in the uninterrupted sequence to bring the matching 
segments of the two sequences into better alignment (Figure 5.18). Because any two 
sequences would show similarity if a sufficient number of gaps were introduced, a 
gap penalty is imposed for each gap. Gap penalties are negative numbers that lower the 
overall similarity score. Gaps arise naturally during evolution through insertion and 
deletion mutations, so-called indels, which add or remove residues in the gene and, con- 
sequently, the protein. The optimal sequence alignment between two proteins is one that 
maximizes sequence alignments while minimizing gaps. 

Methods for alignment and comparison of protein sequences depend upon some quan- 
titative measure of how similar any two sequences are. One way to measure similarity is to 
use a matrix that assigns scores for all possible substitutions of one amino acid for another. 
BLOSUM62 is the substitution matrix most often used with BLAST. This matrix assigns 
a probability score for each position in an alignment based on the frequency with which 
that substitution occurs in the consensus sequences of related proteins. BLOSUM is an 
acronym for Blocks Substitution Matrix, a matrix that scores each position on the basis of 
observed frequencies of different amino acid substitutions within blocks of local 
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S. acidocaldarius FPIAKGGTAAIPGPFGSGKTVTLQSLAKWSAAK--—-VVIYVGCGERGNEMTD 
E. coli CPFAKGGKVGLFGGAGVGKTVNMMELIRNIAIEHSGYSVFAGVGERTREGND 


Figure 5.18 Alignment of the amino acid sequences of two protein homologs using gaps. Shown are 
parts of the amino acid sequences of the catalytic subunits from the major ATP-synthesizing enzyme 
(ATP synthase) in a representative archaea (Sulfolobus acidocaldarius) and a bacterium (Escherichia 
coli). These protein segments encompass the nucleotide-binding site of these enzymes. Identical 
residues in the two sequences are shown in red. Introduction of a three-residue-long gap in the 
archaeal sequence optimizes alignment of the two sequences. 
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Figure 5.19 The BLOSUM62 substitution matrix provides scores for all possible exchanges of one 
amino acid with another. (From Henikoff, S., and Henikoff, J. G., 1992. Amino acid substitution 
matrices from protein blocks. Proceedings of the National Academy of Sciences, USA 89:10915-10919.) 
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alignments in related proteins. In the BLOSUM62 matrix, the scores are derived using 
sequences sharing no more than 62% identity (Figure 5.19). BLOSUM substitution scores 
range from —4 (lowest probability of substitution) to 11 (highest probability of substitu- 
tion). For example, to look up the value corresponding to the substitution of an aspara- 
gine (N) by a tryptophan (W), or vice versa, find the intersection of the “N” column with 
the “W” row in Figure 5.19. The value —4 means that the substitution of N for W, or vice 
versa, is not very likely. On the other hand, the substitution of V for I (BLOSUM score: 3), 
or vice versa, is very likely. Amino acids whose side chains have unique qualities (such as 
C, H, P, or W) have high BLOSUM62 scores because replacing them with any other amino 
acid may change the protein significantly. Amino acids that are similar (such as R and K; 
or D and E; or A, V, L, and I) have low scores, since one can replace the other with less 
likelihood of serious change to the protein structure. 


Cytochrome c as an Example of Orthology The electron transport protein cytochrome 
c, found in the mitochondria of all eukaryotic organisms, provides a well-studied 
example. Amino acid sequencing of cytochrome c from more than 40 different species 
has revealed that there are 28 positions in the polypeptide chain where the same amino 
acid residues are always found (Figure 5.20). These invariant residues serve roles crucial 
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5.4 What Is the Nature of Amino Acid Sequences? 


to the biological function of this protein, and thus substitutions of other amino acids Figure 5.20 The sequence of 
at these positions cannot be tolerated. The number of amino acid differences between CYtochrome c from more than 40 

: ` sg: different species reveals that 28 
two cytochrome c sequences is proportional to the phylogenetic difference between residues are invariant THERE 
the species from which they are derived. Cytochrome c in humans and in chimpanzees invariant residues sme scattered 
is identical; human and another mammalian (sheep) cytochrome c differ at 10 resi- irregularly along the polypeptide 
dues. The human cytochrome c sequence has 14 variant residues from a reptile se- chain, except for a cluster between 
quence (rattlesnake), 18 from a fish (carp), 29 from a mollusc (snail), 31 from an insect "esidues 70 and 80. All cytochrome 


: . c polypeptide chains have a cysteine 
(moth), and more than 40 from yeast or higher plants (cauliflower). residue at position 17, and all but 


: : ; š : one have another Cys at position 
The Phylogenetic Tree for Cytochrome c Figure 5.21 displays a phylogenetic tree (a dia- 14. These Cys residues serve to link 


gram illustrating the evolutionary relationships among a group of organisms) CON- the heme prosthetic group of 
structed from the sequences of cytochrome c. The tips of the branches are occupied cytochrome c to the protein. 
by contemporary species whose sequences have been determined. The tree has been 

deduced by computer analysis of these sequences to find the minimum number of 

mutational changes connecting the branches. Other computer methods can be used to 

infer potential ancestral sequences represented by nodes, or branch points, in the tree. 

Such analysis ultimately suggests a primordial cytochrome c sequence lying at the 

base of the tree. Evolutionary trees constructed in this manner, that is, solely on the 

basis of amino acid differences occurring in the primary structure of one selected 

protein, show remarkable agreement with phylogenetic relationships derived from 

more classic approaches and have given rise to the field of molecular evolution. 
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Figure 5.21 This phylogenetic tree depicts the evolutionary relationships among organisms as 
determined by the similarity of their cytochrome c amino acid sequences. The numbers along the 
branches give the amino acid changes between a species and a hypothetical progenitor. Note that 
extant species are located only at the tips of branches. (Adapted from Creighton, T. E., 1983. Proteins: 
Structure and Molecular Properties. San Francisco: W. H. Freeman.) 
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a-chain of horse methemoglobin 


5.4c Related Proteins Share a Common Evolutionary Origin 


Amino acid sequence analysis reveals that proteins with related functions often show 
a high degree of sequence similarity. Such findings suggest a common ancestry for 
these proteins. 


Oxygen-Binding Heme Proteins Myoglobin and the a- and -globin chains of hemo- 
globin constitute a set of paralogous proteins. Myoglobin, the oxygen-binding heme 
protein of muscle, consists of a single polypeptide chain of 153 residues. Hemoglobin, 
the oxygen transport protein of erythrocytes, is a tetramer composed of two a-chains 
(141 residues each) and two B-chains (146 residues each). These globin paralogs— 
myoglobin, a-globin, and B-globin—share a strong degree of sequence and structural 
homology (Figure 5.22). Human myoglobin and the human a-globin chain show 38 
amino acid identities, whereas human a-globin and human B-globin have 64 residues 
in common. The relatedness suggests an evolutionary sequence of events in which 
chance mutations led to amino acid substitutions and divergence in primary structure. 
The ancestral myoglobin gene diverged first, after duplication of a primordial globin 
gene had given rise to its progenitor and an ancestral hemoglobin gene (Figure 5.23). 
Subsequently, the ancestral hemoglobin gene duplicated to generate the progenitors of 


B-chain of horse methemoglobin Sperm whale myoglobin 


Figure 5.22 The structures of the globin chains of human hemoglobin and myoglobin show a strong 
degree of homology. The amino acid sequences of a- and B-globin chains share 64 residues of their 
approximately 140 residues in common. Myoglobin and the a-globin chain have 38 amino acid 
sequence identities. This homology is further reflected in these proteins’ tertiary structure. 


Myoglobin B a 


Ancestral 
a-globin 


Ancestral 
B-globin 


Figure 5.23 This evolutionary tree is inferred 
from the homology between the amino acid 
sequences of the a-globin, B-globin, and 
myoglobin chains. Duplication of an ancestral 
globin gene allowed the divergence of the 
myoglobin and ancestral hemoglobin genes. 
Another gene duplication event subsequently 
gave rise to ancestral a and B forms, as 
indicated. Gene duplication is an important 
evolutionary force in creating diversity. 
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the present-day a-globin and B-globin genes. The ability to bind O; via a heme pros- 
thetic group is retained by all three of these polypeptides. 


Serine Proteases Whereas the globins provide an example of gene duplication giving 
rise to a set of proteins in which the biological function has been highly conserved, 
other sets of proteins united by strong sequence homology show variation in their 
properties. Trypsin, chymotrypsin, and elastase are members of a class of proteolytic 
enzymes called serine proteases because of the central role played by specific serine 
residues in their catalytic activity. Thrombin, an essential enzyme in blood clotting, is 
also a serine protease. These enzymes show sufficient sequence homology to conclude 
that they arose via duplication of a progenitor serine protease gene, even though their 
substrate preferences are now quite different. 


5.4d Apparently Different Proteins May Share a Common Ancestry 


A more remarkable example of evolutionary relatedness is inferred from sequence 
homology between hen egg white lysozyme and human milk a@-lactalbumin, proteins 
that have different biological activity. Lysozyme (129 residues) and a-lactalbumin 
(123 residues) are identical at 48 positions. Lysozyme hydrolyzes the polysaccharide 
wall of bacterial cells, whereas a-lactalbumin regulates milk sugar (lactose) synthesis 
in the mammary gland. Although both proteins act in reactions involving carbohy- 
drates, their functions show little similarity otherwise. Nevertheless, their tertiary 
structures are strikingly similar (Figure 5.24). It is conceivable that many proteins are 
related in this way, but time and the course of evolutionary change erased most evi- 
dence of their common ancestry. 


5.4e A Mutant Protein Is a Protein with a Slightly Different 
Amino Acid Sequence 


Ina large population of individuals, a considerable number of sequence variants can be 
found for a protein. These variants are a consequence of mutations in a gene (alterations 
in DNA nucleotide sequence) that have arisen naturally within the population. 
Gene mutations lead to mutant forms of the protein in which the amino acid sequence 
is altered at one or more positions. Many of these mutant forms are “neutral” in that 
the functional properties of the protein are unaffected by the amino acid substitution. 
Others may be nonfunctional (if loss of function is not lethal to the individual), and still 
others may display a range of aberrations between these two extremes. The severity of 
the effects on function depends on the nature of the amino acid substitution and its role 
in the protein. These conclusions are exemplified by the hundreds of human hemoglo- 
bin variants that have been discovered. 

A variety of effects on the hemoglobin molecule are seen in these mutants, including 
alterations in oxygen affinity, heme affinity, stability, solubility, and subunit interactions 
between the a-globin and B-globin polypeptide chains. Some variants show no apparent 
changes, whereas others, such as HbS, sickle-cell hemoglobin, result in serious illness. 
Sickle cell anemia is due to the substitution of the glutamate residue at position 6 in the 
B-globin amino acid sequence with a valine residue (E°V). This diversity of response in- 
dicates that some amino acid changes are relatively unimportant, whereas others drasti- 
cally alter one or more functions of a protein. 


5.5 | How Are Polypeptides Synthesized 


in the Laboratory? 


Chemical synthesis of peptides and polypeptides of defined sequence can be carried 
out in the laboratory. Chemical synthesis allows scientists to create polypeptides of 
any desired sequence, an advantage in testing ideas regarding relationships between 
protein structure and function. Formation of peptide bonds linking amino acids 


123 
C c 129 
Human milk Hen egg 
a-lactalbumin white lysozyme 


Figure 5.24 The tertiary structures of hen egg white 
lysozyme and human a-lactalbumin are very similar. 
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together is not a chemically complex process, but making a specific peptide can be 
challenging because various functional groups present on side chains of amino acids 
may also react under the conditions used to form peptide bonds. Furthermore, if cor- 
rect sequences are to be synthesized, the a-COOH group of residue x must be linked 
to the a-NH, group of neighboring residue y in a way that prevents reaction of the 
amino group of x with the carboxyl group of y. In essence, any functional groups to 
be protected from reaction must be blocked while the desired coupling reactions pro- 
ceed. Also, the blocking groups must be removable later under conditions in which the 
newly formed peptide bonds are stable. 


5.5a Orthogonal Synthesis of Peptides 


An ingenious synthetic strategy to circumvent these technical problems is orthogonal 
synthesis. An orthogonal system is defined as a set of distinctly different blocking 
groups—one for side-chain protection, another for a-amino protection, and a third for 
a-carboxyl protection or anchoring to a solid support (see following discussion). Ideally, 
any of the three classes of protecting groups can be removed in any order and in the 
presence of the other two, because the reaction chemistries of the three classes are suf- 
ficiently different from one another. In peptide synthesis, all reactions must proceed 
with high yield if peptide recoveries are to be acceptable. Peptide formation between 
amino and carboxyl groups is not spontaneous under normal conditions (see Chapter 4), 
so one or the other of these groups must be activated to facilitate the reaction. Despite 
these difficulties, biologically active peptides and polypeptides have been re-created by 
synthetic organic chemistry. Milestones include the pioneering synthesis of the nona- 
peptide posterior pituitary hormones oxytocin and vasopressin by Vincent du Vigneaud 
in 1953 and, in later years, larger proteins such as insulin (21 A-chain and 30 B-chain 
residues), ribonuclease A (124 residues), and HIV protease (99 residues). 


5.5b Solid-Phase Methods Are Very Useful in Peptide Synthesis 


Bruce Merrifield and his collaborators pioneered a clever solution to the problem of 
recovering intermediate products in the course of a synthesis. The carboxyl-terminal 
residues of synthesized peptide chains are covalently anchored to an insoluble resin 
(polystyrene particles) that can be removed from reaction mixtures simply by filtra- 
tion. After each new residue is added successively at the free amino-terminus, the 
elongated product is recovered by filtration and readied for the next synthetic step. 
Because the growing peptide chain is coupled to an insoluble resin bead, the method 
is called solid-phase synthesis. The procedure is detailed in Figure 5.25. This cyclic 
process is automated and computer controlled so that the reactions take place in a 
small cup with reagents being pumped in and removed as programmed. 


5.6 | Do Proteins Have Chemical Groups 


| 
| Other Than Amino Acids? 


Many proteins consist of only amino acids and contain no other chemical groups. 
The enzyme ribonuclease and the contractile protein actin are two such examples. 
Such proteins are called simple proteins. However, many other proteins contain vari- 
ous chemical constituents as an integral part of their structure. Some of these con- 
stituents arise through covalent modification of amino acid side chains in proteins 
after the protein has been synthesized. Such alterations are called post-translational 
modifications. For example, the reaction of two cysteine residues in a protein to form 
a disulfide linkage (Figure 4.8b) is a post-translational modification. Many of the 
prominent post-translational modifications, such as those listed in Table 5.4, can act 
as “on—off switches” that regulate the function or cellular location of the protein. 
Over 600 post-translational modifications are listed in the RESID database accessi- 
ble through the Georgetown University Protein Information Resource at Attp://pir 
georgetown. edulresidlresid. shtml. 
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Figure 5.25 Solid-phase synthesis of a peptide. The 9-fluorenylmethoxycarbony] Amino-blocked | 
i è : tripeptidyl- Fmoc — NHCHC NHCHCN HCHC 
(Fmoc) group is an excellent orthogonal blocking group for the a-amino group of resin paride i i i 
amino acids during organic synthesis because it is readily removed under basic Oo Oo oO 
conditions that don’t affect the linkage between the insoluble resin and the a-carboxyl 
group of the growing peptide chain. (inset) N,N'-diisopropylcarbodiimide (DIPCDI) 5) Base 
is one agent of choice for activating carboxyl groups to condense with amino groups pee 
: é $ removal 
to form peptide bonds. (1) The carboxyl group of the first amino acid (the carboxyl- 
terminal amino acid of the peptide to be synthesized) is chemically attached to an e s A 
insoluble resin particle (the aminoacyl-resin particle). (2) The second amino acid, with 3 2 1 
its amino group blocked by a Fmoc group and its carboxyl group activated with Tp piyee HyNCHCNHCHCNHCHC 
DIPCDL, is reacted with the aminoacyl-resin particle to form a peptide linkage, with PE I I Il 
O Oo 


elimination of DIPCDI as diisopropylurea. (3) Then, basic treatment (with 
piperidine) removes the N-terminal Fmoc blocking group, exposing the N-terminus 
of the dipeptide for another cycle of amino acid addition. (4) Any reactive side 
chains on amino acids are blocked by addition of acid-labile tertiary butyl (tBu) 
groups as orthogonal protective functions. (5) After each step, the peptide product is 
recovered by collection of the insoluble resin beads by filtration or centrifugation. 
Following cyclic additions of amino acids, the completed peptide chain is hydrolyzed 
from linkage to the insoluble resin by treatment with HF; HF also removes any 

tBu protecting groups from side chains on the peptide. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


134 Chapter 5 Proteins: Their Primary Structure and Biological Functions 


Table 5.4 Some Prominent Post-Translational Modifications Found in Proteins 
Amino Acid Side 

Name Nonprotein Part Chain Modified Examples 

Phosphorylation —PO;- S, T, Y Hormone receptors, 
regulatory enzymes 

Acetylation —CH,COO- K Histones, metabolic enzymes 

Methylation —CH; K,R Histones 

Acylation Palmitic acid C G-protein-coupled receptors 

Prenylation Prenyl group C Ras p21 

ADP-ribosylation ADP-ribose H,R G proteins, eukaryotic 
elongation factors 

Adenylylation AMP Y Glutamine synthetase 


| Table 5.5 | Some Common Conjugated Proteins 


Name 


Lipoproteins 
Nucleoproteins 
Glycoproteins 


Metalloproteins and metal-activated 
proteins 


Hemoproteins 


Flavoproteins 


Nonprotein Part Association Examples 

Lipids Covalent or noncovalent Blood lipoprotein complexes (HDL, LDL) 
RNA, DNA Noncovalent Ribosomes, chromosomes 

Carbohydrate groups Covalent Immunoglobulins, LDL receptor 

Ca?*, Kt, Fe?*, Zn?*, Covalent or noncovalent Metabolic enzymes, kinases, phosphatases, 
Co?*, others among others 

Heme group Covalent or noncovalent Hemoglobin, cytochromes 

FMN, FAD Covalent or noncovalent Electron transfer enzymes 


The general term for proteins containing nonprotein constituents is conjugated 
proteins (Table 5.5). Because association of the protein with the conjugated group does 
not occur until the protein has been synthesized, these associations are post-translational 
as well, although such terminology is usually not applied to these proteins (with the 
possible exception of glycoproteins). As Table 5.5 indicates, conjugated proteins are 
typically classified according to the chemistry of the nonprotein part. If the nonprotein 
part participates in the protein’s function, it is referred to as a prosthetic group. Conjuga- 
tion of proteins with these different nonprotein constituents dramatically enhances the 
repertoire of functionalities available to proteins. 

A common form of post-translational modification not to be found in such a data- 
base or in Table 5.4 is the removal of amino acids from the protein by proteolytic cleav- 
age. Many proteins localized in specific subcellular compartments have N-terminal 
signal sequences that stipulate their proper destination, much like zipcodes specify postal 
regions. Such signal sequences typically are clipped off during their journey. Other pro- 
teins, such as some hormones or potentially destructive proteases, are synthesized in an 
inactive form and converted into an active form through proteolytic removal of some of 
their amino acids. 


5.7 | What Are the Many Biological Functions of Proteins? 


Proteins are the agents of biological function. Virtually every cellular activity is de- 
pendent on one or more particular proteins. Thus, a convenient way to classify the 
enormous number of proteins is to group them according to the biological roles 
they serve. 
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5.7 What Are the Many Biological Functions of Proteins? 


5.7a Proteins Are the Agents of Biological Function 


Proteins fill essentially every biological role, with the exception of information stor- 
age. The ability to bind other molecules (ligands) is common to many proteins. Bind- 
ing proteins typically interact noncovalently with their specific ligands. Transport 
proteins are one class of binding proteins. Transport proteins include membrane pro- 
teins that transport substances across membranes as well as soluble proteins that de- 
liver specific nutrients or waste products throughout the body. Scaffold proteins are 
a class of binding proteins that uses protein-protein interactions to recruit other pro- 
teins into multimeric assemblies. Such assemblies coordinate the flow of information 
in cells. Catalytic proteins (enzymes) mediate almost every metabolic reaction. Regu- 
latory proteins that bind to specific nucleotide sequences within DNA control gene 
expression. Hormones are another kind of regulatory protein; they convey informa- 
tion about the environment and deliver this information to cells when they bind to 
specific receptors. Switch proteins such as G-proteins can switch between two con- 
formational states—an “on” state and an “off” state—and act via this conforma- 
tional switching as regulatory proteins. Structural proteins give form to cells and 
subcellular structures. The great diversity in function that characterizes biological 
systems is based on attributes that proteins possess. 

The human genome contains slightly more than 19,000 protein-encoding genes and 
the proteins they encode can be grouped according to biological function. The three 
categories with the most entries are enzymes, nucleic acid-binding proteins, and proteins 
involved in signal transduction, but more than a dozen other categories can be consid- 
ered, including transport proteins, structural proteins, and motor proteins. Remarkably, 
the biological function remains unknown for some 15% or so of the proteins encoded 
by the human genome. 


5.7b All Proteins Function through Specific Recognition and 
Binding of Some Target Molecule 


Although the classification of proteins according to function has advantages, many 
proteins are not assigned readily to one of the traditional groupings. Further, clas- 
sification can be somewhat arbitrary, because many proteins fit more than one 
category. However, for all categories, the protein always functions through specific 
recognition and binding of some other molecule, although for structural proteins, it 
is usually self-recognition and assembly into stable multimeric arrays. Protein be- 
havior provides the cardinal example of molecular recognition through structural 
complementarity, a fundamental principle of biochemistry that was presented in 
Chapter 1. 


Protein Binding Protein—ligand interactions may be weak or they may be so strong that 
it is very difficult to dislodge the ligand from the protein. The interaction of a protein 
with its target often can be described in simple quantitative terms. Let’s explore the 
simplest situation in which a protein has a single binding site for the molecule it binds 
(its ligand; Chapter 1). If we treat the interaction between the protein (P) and the 
ligand (L) as a dissociation reaction: 


PL=—=P+L 
The equilibrium constant for the reaction as written 
Keg = [P][LV[PL] 


is a dissociation constant, because it describes the dissociation of the ligand from the 
protein. Biochemists typically use dissociation constants (Kp) to describe binding 
phenomena. Because brackets ([ ]) denote molar concentrations, dissociation con- 
stants have the units of M. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


135 


136 


Chapter 5 Proteins: Their Primary Structure and Biological Functions 


ol 


Ye 
O nm 


0 Ligand concentration ([L]) 


Figure 5.26 Saturation curve or binding isotherm. 


Typically, the ligand concentration is much greater than the protein concentration. 
Under such conditions, a plot of the moles of ligand bound per mole of protein 
(defined as [PL]/([P] + [PL])) versus [L] yields a hyperbolic curve known as a saturation 
curve or binding isotherm (Figure 5.26). 

If we define the fractional saturation of P with L, [PL]/([P] + [PL]), as v, a little 
algebra yields 


v = [L]/(Kp + [L] 
Thus, when v = 0.5, 
[L] = Kp 


That is, the concentration of L where half the protein has L bound is equal to the 
value of Kp. The smaller this number is, the better the ligand binds to the protein; that 
is, a small Kp means that the protein is half-saturated with L at a low concentration 
of L. In other words, if Kp is small, the protein binds the ligand avidly. Typical Kp 
values fall in a range from 10-7 M to 107" M. As a rule of thumb, Kp values greater 
than 1074 M indicate weak protein:ligand affinity, 107 M—average affinity, and less 
than 10-? M—high affinity. 


The Ligand-Binding Site Ligand binding occurs through noncovalent interactions 
between the protein and ligand. The lack of covalent interactions means that binding 
is readily reversible. Proteins display specificity in ligand binding because they pos- 
sess a specific site, the binding site, within their structure that is complementary to 
the structure of the ligand, its charge distribution, and any H-bond donors or accep- 
tors it might have. Structural complementarity within the binding site is achieved 
because part of the three-dimensional structure of the protein provides an ensemble 
of amino acid side chains (and polypeptide backbone atoms) that establish an inter- 
active cavity complementary to the ligand molecule. When a ligand binds to the 
protein, the protein usually undergoes a conformational change. This new protein 
conformation provides an even better fit with the ligand than before. Such changes 
are called ligand-induced conformational changes, and the result is an even more stable 
interaction between the protein and its ligand. 

Thus, in a general sense, most proteins are binding proteins because ligand binding is 
a hallmark of protein function. Catalytic proteins (enzymes) bind substrates; regulatory 
proteins bind hormones or other proteins or regulatory sequences in genes; structural 
proteins bind to and interact with each other; and the many types of transport proteins 
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bind ligands, facilitating their movement from one place to another. Many proteins 
accomplish their function through the binding of other protein molecules, a phenomenon 
called protein-protein interaction. Some proteins engage in protein-protein interactions 
with proteins that are similar or identical to themselves so that an oligomeric structure is 
formed, as in hemoglobin. Other proteins engage in protein-protein interactions with 
proteins that are very different from themselves, as in the anchoring proteins or the scaf- 
folding proteins of signaling pathways. 


5.8 | What Is the Proteome and What Does It Tell Us? 


The essential questions in cellular biology and biochemistry are: What is the cell 
doing, and why is it doing it? The full genetic potential of a cell (what it is capable of 
doing) is contained within its genome, but not all genes are expressed at any moment 
in time. Those genes that are being expressed are defined by the transcriptome, but 
what the cell is doing is not directly determined by the transcriptome. A more 
accurate reflection of what a cell is doing at any moment in time is found in the 
proteome because proteins are the agents of cellular function. Further, the pro- 
teome is much more complex than the genome. Whereas there are only 20,000 or so 
protein-coding genes in the human genome, estimates suggest that there are hun- 
dreds of thousands of different proteins, perhaps even a million or more. This 
discrepancy exists because one gene may give rise to a large number of protein 
products through a range of processes, such as post-translational modification or 
alternative RNA splicing or RNA editing. 


5.8a The Proteome Is Large and Dynamic 


Proteins are synthesized, processed, delivered to appropriate subcellular compartments, 
assembled into complexes, and degraded. Different cells have different sets of proteins. 
The same cell has different proteins at different times. The lifetimes of different pro- 
teins vary from minutes to years. Defining the proteome requires techniques that can 
unambiguously identify each protein and determine how much of it is present. 
Bacterial cells contain about 2 million protein molecules per 10715 L (approximately 
the volume of an E. coli cell). Indeed, cells from bacteria to yeast to mammals seem to 
be rather uniform in containing 2 to 4 million protein molecules per 10715 L. A liver 
cell is about 8000 times larger than an Æ. coli cell, so it would contain as many as 
30 billion protein molecules. These numbers reflect total number of proteins/cell. In 
actuality, in human cells, the number of molecules of each protein can vary from as 
low as 10 or so copies per cell to as much as 100 billion, by some estimates. Thus, the 
concentrations of different proteins in a human cell can vary over ten orders of mag- 
nitude. Clearly, proteomics poses a daunting analytical challenge. The tools of pro- 
teomics are global purification strategies to separate complex mixtures, followed by 
sequence determination using mass spectroscopy to both identify and quantify each 
of the different proteins present. 


5.8b Determining the Proteome of a Cell 


Proteomics seeks to describe the full complement of proteins present in a specific cell 
type under a defined set of conditions. In a typical experimental approach to this 
problem, proteins from a cellular extract are first separated by two independent chro- 
matographic methods within a single chromatographic column (Figure 5.27). The 
upper part of the column contains an ion exchange resin that separates proteins based 
on their charge, and the lower part contains a reverse phase separation material that 
separates proteins on the basis of their solubility. As the separated proteins emerge 
from the column, they are led into an ESI mass spectrometer that identifies them by 
mass and provides information regarding their abundance. The MS data can then be 
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Figure 5.27 In a typical proteomics study, proteins obtained from a cell extract are cleaved into 
peptides by proteases, the proteolytic fragments are separated by HPLC using two different 
chromatographic media (such as ion exchange and reversed-phase), and the resolved fragments are 
analyzed by mass spectrometry (LC/MS/MS). The sequences of individual, cleaved peptides from two 
different proteolytic digestions provide the overlaps necessary to construct the sequence of each 
protein in the original extract. This reconstruction is performed by computer algorithms. The 
complete protein sequence of every protein in the original extract is then used to identify the protein 
within a proteomic or genomic database. 


Critical Developments in Biochemistry 


Two New Suffixes in Molecular Biology and Biochemistry: 


“ome” and “-omics” 


The emergence of technologies to identify all of the molecules or func- 
tions of a particular class within a given cell or organism is exemplified 
by the Human Genome Project (1991 to 2003). This project revealed 
the identity of all of the human genes (see the Human Biochemistry 
box: The Human Genome Project in Chapter 11). The suffix “-ome” 
is now applied to the entire set of a particular class of molecules, in 
terms such as “transcriptome” (all the RNA transcripts of genes in a 
cell at a particular moment in time), “lipidome” (all the lipids in a cell 
at a given moment), “glycome” (all the carbohydrates), or the one that 
interests us here, the “proteome” (all of the proteins present in a cell or 
organism under defined conditions at a specified time). 
Methodologies aimed at identifying and characterizing the 
global set of a particular class of molecules or functions led to 


the establishment of new fields of science, such as genomics— 
the study of all of the genes in a particular cell, organism, or 
group of organisms—or proteomics—the scientific field devoted 
to the study of all of the proteins in a cell or organism, usually 
under a specified set of conditions of growth. These fields rely 
on large-scale, comprehensive determination of which molecules 
of a particular class are present and how much of each. Com- 
parative -omic studies reveal more information, often of medical 
relevance, such as a comparison of all of the proteins in a liver 
cell versus all of the proteins in a hepatoma cell (a cancerous 
liver cell). 

Thus, “-ome” defines the ensemble of molecules and “-omics” 
refers to the field that studies them. 


analyzed by comparison with known protein databases to reveal the identity and 
amount of every protein in the cellular extract, and thus, in the cells of interest. 
Proteomics databases are found in the PRIDE Archive http-//www.ebi.ac.uk/pride 
/archive/ that arranges entries by species, MAPU 2.0, the Max Planck Institute Uni- 
fied Proteome Database http-//www.mapuproteome.com/ that includes bodily fluid 
and organelle proteomes, and the human proteome database at Proteomics DB 
Attps://www.proteomicsdb.org/, among others. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


A Deeper Look 


139 


Foundational Biochemistry 


The Human Proteome Project and the Human Protein Atlas 


Initiated in 2010, the Human Proteome Project (HPP) serves as an 
international framework for global collaboration and data sharing 
in order to define the complete human genome encoded proteome 
(see https://hupo.org). The HPP counts 19,773 protein-coding 
genes in the human genome and as of 2020, has documented mass 
spectroscopic evidence of proteins produced by 17,874 of these 
genes. It is estimated that only 10,000 or so genes are expressed in 
any particular human cell at any time, so the proteome of a cell 
never reflects the complete genome. On the other hand, since pro- 
teins can exist in multiple forms as a result of post-translational 
modifications, the scale of the proteome is much greater than the 
scale of the genome. The work of the HPP is significant to health 
and medicine. While the human genome specifies the full range 


of human biology, the proteome is a dynamic report on what a 
particular cell type or tissue might be doing at a particular mo- 
ment in time. 

The Europe-based Human Protein Atlas is a similar project and 
is aimed at mapping all the proteins in cells, tissues, and organs 
under normal as well as pathological states. Launched in 2003, it 
relies on an integrated transcriptomics and proteomics approach. 
In 2020, version 20 of the Human Protein Atlas was announced. 
It includes a number of sub-atlases: the Single Cell Type Atlas, the 
Tissue Atlas, the Pathology Atlas, the Blood Atlas, the Brain Atlas, 
and the Cell Atlas, each of which is a compendium of the proteins 
expressed within the stated category. See the online resource Human 
Protein Atlas portal at www.proteinatlas.org. 


Foundational Biochemistry Things You Should Know after Reading Chapter 5. 


Proteins fall into three general classes on the basis of shape and solu- 
bility: fibrous, globular, and membrane. 
Protein structure is described in terms of four levels of organization: 
primary (1°)—amino acid sequence 
secondary (2°)—patterns arising by hydrogen bonding 
tertiary (3°)—folding of the polypeptide chain 
quaternary (4°)—subunit organization 


Noncovalent forces underlie the higher (2°, 3°, 4°) levels of protein 
structure. 


The overall three-dimensional structure of a protein is called its 
conformation. 


Proteins can be purified from cellular sources by exploiting their size, 
charge, and affinity differences. 


The amino acid compositions of proteins differ, but soluble proteins 
tend to have about one-third charged amino acid residues, one-third 
hydrophobic, and one-third polar but uncharged. 


The amino acid sequence of a protein is the defining characteristic 
of a protein. 


Chemical and enzymatic methodologies can be used to determine the 
sequence of amino acids along a polypeptide chain. 


The amino acid sequence of a protein is more often determined by 
mass spectrometric analysis. 


Most of the available amino acid sequence information for proteins 
was obtained from nucleotide sequence analysis of genes and genomes. 


Homologous proteins share sequence similarity and structural resem- 
blance and often perform similar functions across different organisms. 


e Homologous proteins can be subclassified as orthologous or paralogous: 


Orthologous proteins are proteins that perform the same function 
in different cells. 


Paralogous proteins are proteins found within the same species that 
have homologous amino acid sequences; paralogous proteins arise 
through gene duplication. 


e Related proteins shared a common evolutionary origin. 


e A mutant protein is a protein with a slightly different amino acid 
sequence. 


e Polypeptides of defined sequences can be chemically synthesized; the 
most effective strategy for chemical synthesis relies on solid-phase 
methods. 


e = Post-translational modification of proteins can introduce new proper- 
ties into proteins. 


e Proteins may have other chemical groups attached to them, such as 
carbohydrates to form glycoproteins. 


e Proteins are the agents of biological function. 


e All proteins function through recognition and binding of some target 
molecule. 


e The proteome is the entire set of different proteins found in a particu- 
lar cell at a specific moment in time. 


e Proteomics is the science of describing the proteome. 


© The tools of proteomics are protein separation, followed by identifica- 
tion, usually by mass spectrometry. 
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Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Determining the Subunit Organization of a Protein From Its Molecular 
Weight and Metal Content The element molybdenum (atomic weight 
95.95) constitutes 0.08% of the weight of nitrate reductase. If the 
molecular weight of nitrate reductase is 240,000, what is its likely 
quaternary structure? 


2. Solving the Sequence of an Oligopeptide from Sequence Analysis 
Data Amino acid analysis of an oligopeptide seven residues 
long gave 


Asp Leu Lys Met Phe Tyr 


The following facts were observed: 
a. Trypsin treatment had no apparent effect. 
b. The phenylthiohydantoin released by Edman degradation was 


c. Brief chymotrypsin treatment yielded several products, including 
a dipeptide and a tetrapeptide. The amino acid composition of 
the tetrapeptide was Leu, Lys, and Met. 


d. Cyanogen bromide treatment yielded a dipeptide, a tetrapeptide, 
and free Lys. 


What is the amino acid sequence of this heptapeptide? 
3. Solving the Sequence of an Oligopeptide from Sequence Analysis Data 


Amino acid analysis of a decapeptide revealed the presence of the 
following products: 


NH,* Asp Glu Tyr Arg 
Met Pro Lys Ser Phe 
The following facts were observed: 


a. Neither carboxypeptidase A nor B treatment of the decapeptide 
had any effect. 


b. Trypsin treatment yielded two tetrapeptides and free Lys. 

c. Clostripain treatment yielded a tetrapeptide and a hexapeptide. 
Cyanogen bromide treatment yielded an octapeptide and a 
dipeptide of sequence NP (using the one-letter codes). 

e. Chymotrypsin treatment yielded two tripeptides and a tetrapep- 
tide. The N-terminal chymotryptic peptide had a net charge of 
—1 at neutral pH and a net charge of —3 at pH 12. 


f. One cycle of Edman degradation gave the PTH derivative 


(0) 
. 4 
/°~C—CH OH 

N | 
i. 
f H 


What is the amino acid sequence of this decapeptide? 


4. Getting Insights into a Protein's Properties from Its Behavior during 
Purification A protein mixture consisting of proteins A, B, and C was 
subjected to various protein purification steps. First, the protein 
mixture was applied to an ion exchange column having anionic 
groups. When the column was washed with an increasing concentra- 
tion gradient of chloride ions, the order of elution of the three 
proteins was B, then C, and finally, A. Next, the protein mixture was 
chromatographed on a gel filtration column. The order of elution of 
the three proteins from this column was C, A, B. Finally, the protein 


mixture was passed over a chromatography column containing a 

flavin affinity matrix. Only protein A was retained by the column. 
What can you infer about the relative properties of proteins A, B, 
and C? 


. N-Terminal Analysis A protein was purified and confirmed to consist 


of only a singlemolecule with a single molecular mass. Yet N-terminal 
analyses resulted in the identification of two different amino acids. 
Give a possible explanation. 


. Solving the Sequence of an Oligopeptide from Sequence Analysis 


Data Amino acid analysis of an octapeptide revealed the following 
composition: 


2Arg 1Gly 1Met 1Trp 1Tyr 1Phe 1Lys 
The following facts were observed: 
a. Edman degradation gave 


q n 
/ ~C—H 
2 
ee 
f H 


b. CNBr treatment yielded a pentapeptide and a tripeptide 
containing phenylalanine. 


c. Chymotrypsin treatment yielded a tetrapeptide containing a 
C-terminal indole amino acid and two dipeptides. 


d. Trypsin treatment yielded a tetrapeptide, a dipeptide, and free 
Lys and Phe. 


e. Clostripain yielded a pentapeptide, a dipeptide, and free Phe. 
What is the amino acid sequence of this octapeptide? 


. Solving the Sequence of an Oligopeptide from Sequence Analysis 


Data Amino acid analysis of an octapeptide gave the following 

results: 

1Ala 1Arg 1Asp 1 Gly 3Ile 1Val 1NH,* 

The following facts were observed: 

a. Trypsin treatment yielded a pentapeptide and a tripeptide. 

b. Chemical reduction of the free a-COOH and subsequent acid 
hydrolysis yielded 2-aminopropanol. 

c. Partial acid hydrolysis of the tryptic pentapeptide yielded, 
among other products, two dipeptides, each of which contained 
C-terminal isoleucine. One of these dipeptides migrated as an 
anionic species upon electrophoresis at neutral pH. 

d. The tryptic tripeptide was degraded in an Edman sequenator, 
yielding first A, then B: 


wT 

pe —CH, 
A. N | 

Diane CH, 
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What is an amino acid sequence of the octapeptide? Four sequences 
are possible, but only one suits the authors. Why? 


. Solving the Sequence of an Oligopeptide from Sequence Analysis 


Data An octapeptide consisting of 2 Gly, 1 Lys, 1 Met, 1 Pro, 1 Arg, 
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1 Trp, and 1 Tyr was subjected to sequence studies. The following 
was found: 


a. Edman degradation yielded 
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b. Upon treatment with carboxypeptidases A, B, and Y, only 
carboxypeptidase Y had any effect. 


c. Trypsin treatment gave two tripeptides and a dipeptide. 


Chymotrypsin treatment gave two tripeptides and a dipeptide. 
Acid hydrolysis of the dipeptide yielded only Gly. 


e. Cyanogen bromide treatment yielded two tetrapeptides. 
f. Clostripain treatment gave a pentapeptide and a tripeptide. 
What is the amino acid sequence of this octapeptide? 


. Solving the Sequence of an Oligopeptide from Sequence Analysis Data 


Amino acid analysis of an oligopeptide containing nine residues 
revealed the presence of the following amino acids: 


Arg Cys Gly Met Pro Tyr 
The following was found: 


Leu Val 


a. Carboxypeptidase A treatment yielded no free amino acid. 
b. Edman analysis of the intact oligopeptide released 


\ T 

i ba Hə—C— CH; 
NON CH; 
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c. Neither trypsin nor chymotrypsin treatment of the nonapeptide 
released smaller fragments. However, combined trypsin and 
chymotrypsin treatment liberated free Arg. 

d. CNBr treatment of the eight-residue fragment left after 
combined trypsin and chymotrypsin action yielded a six-residue 
fragment containing Cys, Gly, Pro, Tyr, and Val and a dipeptide. 

e. Treatment of the six-residue fragment with B-mercaptoethanol 
yielded two tripeptides. Brief Edman analysis of the tripeptide 
mixture yielded only PTH-Cys. (The sequence of each tripep- 
tide, as read from the N-terminal end, is alphabetical if the 
one-letter designation for amino acids is used.) 


What is the amino acid sequence of this nonapeptide? 


Describe the Solid-Phase Chemical Synthesis of a Small Peptide 
Describe the synthesis of the dipeptide Lys-Ala by Merrifield’s 
solid-phase chemical method of peptide synthesis. What pitfalls 
might be encountered if you attempted to add a leucine residue to 
Lys-Ala to make a tripeptide? 


Identify Proteins Using BLAST Searches of Peptide Fragment 
Sequences Go to the National Center for Biotechnology Informa- 
tion website at http://www.ncbi.nlm.nih. gov/. From the menu of 
Popular Resources on the right-hand side, click on “BLAST.” Under 
the Basic BLAST heading on the new page that comes up, click on 
“protein blast.” In the Enter Query Sequence box at the top of the 
page that comes up, enter the following sequence: NQMMKSRN- 
LTKDRCKP. Confirm that the database under “Choose Search Set” 
is set on “nr” (nonredundant protein sequences), then click the 
BLAST button at the bottom of the page to see the results of your 
search. Next, enter this sequence from a different protein: 
SLQTASAPDVYAIGECA. Identify the proteins from which these 
sequences were derived. 


12. 


13. 


14. 


15. 


16. 
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Problems 


Calculate the Mass of a Protein from Mass Spectrometric m/z Values 
Electrospray ionization mass spectrometry (ESI-MS) of the polypep- 
tide chain of myoglobin yielded a series of m/z peaks (similar to those 
shown in Figure 5.14 for aerolysin K). Two successive peaks had m/z 
values of 1304.7 and 1413.2, respectively. Calculate the mass of the 
myoglobin polypeptide chain from these data. 


Phosphorylation of Proteins Introduces New Properties Phospho- 
proteins are formed when a phosphate group is esterified to an 
—OH group of a Ser, Thr, or Tyr side chain. At typical cellular 

pH values, this phosphate group bears two negative charges 
(—OPO;7-). Compare this side-chain modification with the 20 side 
chains of the common amino acids found in proteins and comment 
on the novel properties that it introduces into side-chain 
possibilities. 


Binding of Ligands to a Protein A pharmaceutical company studied 
the binding of three different compounds, X, Y and Z, toa 
particular protein of interest. For each compound, the fractional 
saturation (v; see Section 5.7b) was determined as a function of 
concentration. The results are shown in the figure. Which of the 
three compounds has the strongest affinity for the protein? Explain. 


0.0 
0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 2.2 2.4 2.6 2.8 3.0 


L(uM) 


Using Graphical Analysis to Determine the Kj for the Interaction 
between a Protein and Its Ligand A quantitative study of the 
interaction of a protein with its ligand yielded the following results: 


Ligand concentration (mM) 1 2 3 4 5 6 9 12 
v (moles of ligand 0.28 0.45 0.56 0.60 0.71 0.75 0.79 0.83 
bound per mole of 

protein) 

Plot a graph of [L] versus v. Determine Kp, the dissociation 
constant for the interaction between the protein and its ligand, 

from the graph. 


A protein can bind either of two different ligands, A and B. The 
protein’s Kp for A is 0.3 mM. The protein’s Kp for B is 1 mM. 
Calculate % saturation of the protein with A at concentrations of 
ligand equal to 0.1 mM, 0.2 mM, 0.5 mM, 1 mM, and 2 mM. Repeat 
the calculations for ligand B at these concentrations. Describe the 
relationship between Kp and relative affinity of the protein for ligands 
A and B. 


Biochemistry on the Web 


17. 


Exploring the ExPASy Proteomics Website The human insulin 
receptor substrate-1 (IRS-1) is designated protein P35568 in the 
protein knowledge base on the ExPASy Website. Log onto https:// 
www.expasy.org. Type “peptide mass tool” in the search box and click 
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“Enter” on your computer. In the results page that displays, scroll 
down and click on the “PeptideMass” resource. Then click on 
“Browse the resource website.” In the empty “Search” box, enter 
“P35568” as the accession number and click the “perform” button to 
see the results of trypsin digestion of IRS-1. How many amino acids 
does IRS-1 have? What is the average molecular mass of IRS-1? What 
is the amino acid sequence of the tryptic peptide of IRS-1 that has a 
mass of 1741.9629? 


18. Write a General Chemical Mechanism for Protease Reactions Prote- 
ases such as trypsin and chymotrypsin cleave proteins at different 
sites, but both use the same reaction mechanism. Based on your 
knowledge of organic chemistry, suggest a “universal” protease 
reaction mechanism for hydrolysis of the peptide bond. 


19. Hemoglobin Mutations and Their Effects on Hb Structure and 
Function Hundreds of mutations have been described in the genes 
encoding the a- and B-globin subunits of hemoglobin. 

a. Some of these mutations affect subunit interactions between the 
subunits. In an examination of the tertiary structure of globin 
chains, where would you expect to find amino acid changes in 
mutant globins that affect formation of the hemoglobin af, 
quaternary structure? 

b. Other mutations, such as the S form of the B-globin chain, 
increase the tendency of hemoglobin tetramers to polymerize 
into very large structures. Where might you expect the amino 
acid substitutions to be in these mutants? 


20. S Values The ribosomal complex is often subdivided into the 
so-called 50S and 30S subunits. 
a. What do those “50S” and “30S” terms refer to? 
b. What is the reason for these different “S” values? 


Think-Pair-Share Question 
Evaluation of Peptide Degradation Consider the peptide shown. 


QVDGLMRTSEQMKNSRV 
You are asked to apply three digestion methods sequentially using 
trypsin, chymotrypsin, and staphylococcal protease as the endopepti- 
dases. In between each digestion, you are asked to perform an Edman 
degradation analysis of the N-terminal amino acids of the fragments 
generated. You were not told in which order the three endopeptidases 
are to be applied. There are six possible orders in which the three en- 
dopeptidases could be applied. 


First Second Third 
Order 1 Trypsin Chymotrypsin St. protease 
Order 2 Trypsin St. protease Chymotrypsin 
Order 3 Chymotrypsin St. protease Trypsin 
Order 4 Chymotrypsin Trypsin St. protease 
Order 5 St. protease Trypsin Chymotrypsin 
Order 6 St. protease Chymotrypsin Trypsin 


Select at least three digestion orders from the six shown; another 
student can select the remaining three. For each three-step digestion, 
determine the fragments that would be generated and the results one 
would obtain from the Edman degradation in between each digestion. 
Discuss whether the order of endopeptidase treatment leads to different 
end results (the results for the final Edman degradation reaction). 
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Proteins: Secondary, Tertiary, 6 
and Quaternary Structure 


Growing in size and complexity 
Living things, masses of atoms, DNA, protein 
Dancing a pattern ever more intricate. 
Out of the cradle onto the dry land 
Here it is standing 

Atoms with consciousness 

Matter with curiosity. 

Stands at the sea 

Wonders at wondering 

I 

A universe of atoms 

An atom in the universe. 


Richard P. Feynman (1918-1988) From “The 
Value of Science” in Edward Hutchings, Jr., ed. 
1958. Frontiers of Science: A Survey. New York: 
Basic Books. 


< Like the Greek sea god Proteus, who could assume 
different forms, proteins act through changes in 
conformation. Proteins (from the Greek proteios, meaning 
“primary”) are the primary agents of biological function. 
(“Proteus, Old Man of the Sea, Roman period mosaic, 
from Thessalonika, 1st century a.d. National Archaeological 
Museum, Athens/Ancient Art and Architecture Collection 
Ltd./Bridgeman Art Library, London/New York) 


Proteus, Old Man of the Sea, from Thessalonika (mosaic), Greek, (1st century)/National Archaeological 


Museum, Athens, Greece/Ancient Art and Architecture Collection Ltd./Bridgeman Images 


Proteins are the agents of biological function. They achieve this remarkable status through 6.1 What Noncovalent Interactions 

the intricate architectures they adopt by the process of protein folding. The forces of chemi- Stabilize the Higher Levels of Protein 
cal bonding drive this formation of protein structure and the functions that follow. How do Structure? 

the forces of chemical bonding determine the formation, stability, and myriad functions of 6.2 What Role Does the Amino Acid 
proteins? Sequence Play in Protein Structure? 


6.3 What Are the Elements of Secondary 
Structure in Proteins, and How Are 
They Formed? 


6.4 How Do Polypeptides Fold into Three- 
Dimensional Protein Structures? 


Ne all biological processes involve the specialized functions of one or more pro- 
tein molecules. Proteins function to produce other proteins, control all aspects 
of cellular metabolism, regulate the movement of various molecular and ionic species 


across membranes, convert and store cellular energy, and carry out many other activi- éS. HowDo Protein Subünits neces 
ties. Essentially all of the information required to initiate, conduct, and regulate each the Quaternary Level of Protein 
of these functions must be contained in the structure of the protein itself. The previous Structure? 


chapter described the details of protein primary structure. However, proteins do not 
normally exist as fully extended polypeptide chains, but rather as compact structures 
that biochemists refer to as folded. The ability of a particular protein to carry out its 
function in nature is normally determined by its overall three-dimensional shape, or 
conformation. 

This chapter reveals and elaborates upon the exquisite beauty of protein structures 
and focuses on the overarching principles that link protein structure and function: 


Function depends on structure. 

Structure depends both on amino acid sequence and on weak, noncovalent forces. 
The number of protein folding patterns is very large but finite. 

The structures of globular proteins are marginally stable. 

Marginal stability facilitates motion. 

Motion enables function. 


Sor BN 
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Chapter 6 Proteins: Secondary, Tertiary, and Quaternary Structure 


6.1 | What Noncovalent Interactions Stabilize 
the Higher Levels of Protein Structure? 


The amino acid sequence (primary structure) of any protein is dictated by covalent 
bonds, but the higher levels of structure—secondary, tertiary, and quaternary—are 
formed and stabilized by weak, noncovalent interactions (for example, the ionic 
interaction shown in Figure 6.1). Hydrogen bonds, hydrophobic interactions, elec- 
trostatic bonds, and van der Waals forces all contribute to protein conformation. 
The stabilization free energies afforded by each of these interactions may be highly 
dependent on the local environment within the protein, but certain generalizations 
can still be made. 


6.1a Hydrogen Bonds Are Formed Whenever Possible 


Hydrogen bonds are generally made wherever possible within a given protein struc- 
ture. In most protein structures that have been examined to date, component atoms of 
the peptide backbone tend to form hydrogen bonds with one another. Furthermore, 
side chains capable of forming H bonds are usually located on the protein surface 
and form such bonds either with the water solvent or with other surface residues. The 
strengths of hydrogen bonds depend to some extent on environment. The difference 
in energy between a side chain hydrogen bonded to water and that same side chain 
hydrogen bonded to another side chain is usually quite small. On the other hand, a 
hydrogen bond in the protein interior, away from bulk solvent, can provide substantial 
stabilization energy to the protein. Although each hydrogen bond may contribute an 
average of only a few kilojoules per mole in stabilization energy for the protein struc- 
ture, the number of H bonds formed in the typical protein is very large. For example, in 
a-helices, the C=O and N—H groups of every interior residue participate in H bonds. 
The importance of H bonds in protein structure cannot be overstated. 


6.1b Hydrophobic Interactions Drive Protein Folding 


Hydrophobic “bonds,” or, more accurately, interactions, form because nonpolar side 
chains of amino acids and other nonpolar solutes prefer to cluster in a nonpolar envi- 
ronment rather than to intercalate in a polar solvent such as water. The forming of 
hydrophobic “bonds” minimizes the interaction of nonpolar residues with water and 
is therefore highly favorable. Such clustering is entropically driven, and it is, in fact, the 
principal impetus for protein folding. The side chains of the amino acids in the interior 
or core of the protein structure are almost exclusively hydrophobic. Polar amino acids 
are much less common in the interior of a protein, but the protein surface may consist 
of both polar and nonpolar residues. 


(b) 


Main chain Main chain 
Om / 
NH =e Nat HN 
O=Cc © C=0 
+ = Í A 
TO CHCHCHCHNH3 +++ OT CT CHCH; i 
HN i NH 
Ne Lysine ma Glutamate j 
C= a O=C 
Vi Nat H,O N 


Figure 6.1 An electrostatic interaction between the e-amino group of a lysine and the y-carboxyl 
group of a glutamate. (a) The general case, and (b) the same interaction in a particular protein. 
The protein is IRAK-4 kinase, an enzyme that phosphorylates other proteins (PDB id = 2NRY). 
The interaction shown is between Lys?! (left) and Glu?’ (right). 
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6.1¢ lonic Interactions Usually Occur on the Protein Surface 


Ionic interactions arise either as electrostatic attractions between opposite charges or 
repulsions between like charges. Amino acid side chains can carry positive charges, as 
in the case of lysine, arginine, and histidine, or negative charges, as in aspartate and 
glutamate. In addition, the N-terminal and C-terminal residues of a protein or peptide 
chain usually exist in ionized states and carry positive or negative charges, respectively. 
All of these may experience ionic interactions in a protein structure. Charged residues 
are normally located on the protein surface, where they interact optimally with the 
water solvent. It is energetically unfavorable for an ionized residue to be located in the 
hydrophobic core of the protein. Ionic interactions between charged groups on a pro- 
tein surface are often complicated by the presence of salts in the solution. For example, 
the ability of a positively charged lysine to attract a nearby negative glutamate may 
be weakened by dissolved salts such as NaCl (Figure 6.1). The Na* and Cl ions are 
highly mobile, compact units of charge, compared to the amino acid side chains, and 
thus compete effectively for charged sites on the protein. In this manner, ionic interac- 
tions among amino acid residues on protein surfaces may be damped out by high con- 
centrations of salts. Nevertheless, these interactions are important for protein stability. 


6.1d Van der Waals Interactions Are Ubiquitous 


Both attractive forces and repulsive forces are included in van der Waals interactions. 
The attractive forces are due primarily to instantaneous dipole-induced dipole interac- 
tions that arise because of fluctuations in the electron charge distributions of adjacent 
nonbonded atoms. Individual van der Waals interactions are weak ones (with stabi- 
lization energies of 0.4 to 4.0 kJ/mol), but many such interactions occur in a typical 
protein, and by sheer force of numbers, they can represent a significant contribution 
to the stability of a protein. 


Chemistry in History 


César Milstein 


César Milstein (1927-2002) was born and raised 
in Argentina. An older cousin who studied snake 
venoms aroused César’s interest in science at the 
age of 8. He was active in student politics in college, 
campaigning against the Peronist government’s 
policies of the time. After graduating with a bachelor’s 
in chemistry, Milstein spent a year hitchhiking 
around Europe with his wife Celia. Returning to 
Buenos Aires, he worked in a clinical chemistry lab to 
support his graduate studies on kinetics of aldehyde 
dehydrogenase, earning his PhD. in 1957. Milstein 
moved to England where he earned a second PhD 
from the University of Cambridge, working closely 


with Frederick Sanger on phosphoglucomutase. 
Later at the MRC Laboratory for Molecular Biology, 
Sanger encouraged Milstein to study immunology. 
Milstein and his student Georges Köhler pioneered 
the creation of hybridomas — fusions of antibody- 
producing B cells and myeloma cells that are 
immortal and that produce monoclonal antibodies. 
For this ground-breaking work, Milstein and Köhler 
shared in the Nobel Prize in Medicine or Physiology 
in 1984. Today monoclonal antibodies have found 
hundreds of applications in the treatment of cancer, 
infectious diseases and many other pathologies, 
including COVID-19. 


6.2 | What Role Does the Amino Acid Sequence Play 
in Protein Structure? 


As described in Section 6.1, many different forces work together to determine the over- 
all three-dimensional structure of a protein. These forces operate both within the pro- 
tein structure itself and between the protein and the water solvent. How, then, does 
nature dictate the manner of protein folding to generate the three-dimensional struc- 
ture that optimizes and balances these many forces? All of the information necessary for 
folding the peptide chain into its native structure is contained in the amino acid sequence 
of the peptide. 
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fora a-Carbon 


Side group 


Amide plane 


= 180°, y =180° 


Figure 6.2 The amide or peptide bond planes are 
joined by the tetrahedral bonds of the a-carbon. The 
rotation parameters are ¢ and y. The conformation 
shown corresponds to œ = 180° and y = 180°. Note 
that positive values of @ and y correspond to clock- 
wise rotation as viewed from C,. Starting from 0°, a 
rotation of 180° in the clockwise direction (+ 180°) is 
equivalent to a rotation of 180° in the counterclock- 
wise direction (— 180°). 


contact 
radius 


= 0°, = 180° 


Nonbonded 


Chapter 6 Proteins: Secondary, Tertiary, and Quaternary Structure 


Just how proteins recognize and interpret the information that is stored in the 
amino acid sequence is not yet well understood. Certain loci along the peptide chain 
may act as nucleation points, which initiate folding processes that eventually lead 
to the correct structures. Regardless of how this process operates, it must take the 
protein correctly to the final native structure. Along the way, local energy-minimum 
states different from the native state itself must be avoided. 


6.3 | What Are the Elements of Secondary Structure 
in Proteins, and How Are They Formed? 


Any discussion of protein folding and structure must begin with the peptide bond, the 
fundamental structural unit in all proteins. As we saw in Chapter 4, the resonance struc- 
tures experienced by a peptide bond constrain six atoms—the oxygen, carbon, nitrogen, 
and hydrogen atoms of the peptide group, as well as the adjacent a-carbons—to lie in 
a plane. The resonance stabilization energy of this planar structure is approximately 
88 kJ/mol, and substantial energy is required to twist the structure about the C—N 
bond. A twist of 0 degrees involves a twist energy of 88 sin?@ kJ/mol. 


6.3a The Amide Plane Is a Fundamental Determinant of 
All Protein Structure 


The planarity of the peptide bond means that there are only two degrees of freedom per 
residue for the peptide chain. Rotation is allowed about the bond linking the a-carbon 
with its carbonyl carbon and also about the bond linking the a-carbon with its amide 
nitrogen. As shown in Figure 6.2, each a-carbon is the joining point for two planes 
defined by peptide bonds. The angle about the C,—N bond is denoted by the Greek 
letter @ (phi), and that about the C,—C, is denoted by y (psi). For either of these bond 
angles, a value of 0° corresponds to an orientation with the amide plane bisecting the 
H—C,—R (side-chain) angle and a cis conformation of the main chain around the 
rotating bond in question (Figure 6.3). 


Nonbonded 
contact 
radius 


o =-60°, y = 180° 

A further ¢ rotation of 120° 
removes the bulky carbonyl 
group as far as possible 
from the side chain 


= 180°, y = 0° 


Figure 6.3 Many of the possible conformations about an a-carbon between two peptide planes are 
forbidden because of steric crowding. Several noteworthy examples are shown here. 

Note: The formal IUPAC-IUB Commission on Biochemical Nomenclature convention for the 
definition of the torsion angles ¢ and y in a polypeptide chain (Biochemistry 9:3471—3479, 1970) is 
different from that used here, where the C, atom serves as the point of reference for both rotations, 
but the result is the same. 
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The entire path of the peptide backbone in a protein is known if the ¢ and w rota- 
tion angles are all specified. Some values of @ and y are not allowed due to steric inter- 
ference between nonbonded atoms. As shown in Figure 6.3, values of @ = 180° and 
y = 0° are not allowed because of the forbidden overlap of the N—H hydrogens. Simi- 
larly, 6 = 0° and y = 180° are forbidden because of unfavorable overlap between the 
carbonyl oxygens. 

A plot of ¢ values against y values, first described by G. N. Ramachandran 
and his colleagues in Madras, India, revealed the distribution of allowed values 
in a protein or in a family of proteins. A typical Ramachandran plot is shown in 
Figure 6.4. Note the clustering of p and y values in a few regions of the plot. Most 
combinations of ¢ and w are sterically forbidden, and the corresponding regions of 
the Ramachandran plot are sparsely populated. The combinations that are sterically 
allowed represent the subclasses of structure described in the remainder of this section. 


Ad 
Parallel B-sheet a Left-handed 


Antiparallel B-sheet Collagen oe 
triple helix 


(deg) 


180 


Figure 6.4 A Ramachandran diagram showing the sterically reasonable values of 
the angles ¢ and y. The shaded regions indicate particularly favorable values of 
these angles. Dots in purple indicate actual angles measured for 1000 residues 
(excluding glycine, for which a wider range of angles is permitted) in eight pro- 
teins. The lines running across the diagram (numbered +5 through 2 and —5 
through —3) signify the number of amino acid residues per turn of the helix; “+” 
Closed ring means right-handed helices; “—” means left-handed helices. (After Richardson, 


Right-handed J. S., 1981. The anatomy and taxonomy of protein structure. Advances in Protein 
a-helix Chemistry 34:167-339.) 
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Chapter 6 Proteins: Secondary, Tertiary, and Quaternary Structure 


6.3b The Alpha-Helix Is a Key Secondary Structure 


The term secondary structure describes local conformations of the polypeptide that 
are stabilized by hydrogen bonds. In nearly all proteins, the hydrogen bonds that 
make up secondary structures involve the amide proton of one peptide group and 
the carbonyl oxygen of another, as shown in Figure 6.5. These structures tend 
to form in cooperative fashion and involve substantial portions of the peptide 
chain. When a number of hydrogen bonds form between portions of the pep- 
tide chain in this manner, two basic types of structures can result: a-helices and 
B-pleated sheets. (The R-groups (sidechains) of a protein may slightly alter the struc- 
tures and stability of helices and sheets, but the H bonds between backbone N—H and 
C=O groups are the dominant stabilizing interactions in these structures.) 

The earliest studies of protein secondary structure were those of William Astbury 
at the University of Leeds. Astbury carried out X-ray diffraction studies on wool and 
observed differences between unstretched wool fibers and stretched wool fibers. He pro- 
posed that the protein structure in unstretched fibers was a helix (which he called the 
alpha form). He also proposed that stretching caused the helical structures to uncoil, 


Figure 6.5 A hydrogen bond between the backbone C=O of Ala”?! 
and the backbone N—H of Ser!” in the acetylcholine-binding protein 
of a snail, Lymnaea stagnalis (PDB id = 119B). 


A Deeper Look 


Knowing What the Right Hand and Left Hand Are Doing 


Certain conventions related to peptide bond angles and the “hand- 
edness” of biological structures are useful in any discussion of 
protein structure. To determine the ¢ and angles between peptide 
planes, viewers should imagine themselves at the C, carbon looking 
outward and should imagine starting from the ¢ = 0°, y = 0° con- 
formation. From this perspective, positive values of @ correspond 
to clockwise rotations about the C,—N bond of the plane that 
includes the adjacent N—H group. Similarly, positive values of w 


correspond to clockwise rotations about the C,—C bond of the 
plane that includes the adjacent C=O group. 

Biological structures are often said to exhibit “right-hand” or 
“left-hand” twists. For all such structures, the sense of the twist can 
be ascertained by holding the structure in front of you and look- 
ing along the polymer backbone. If the twist is clockwise as one 
proceeds outward and through the structure, it is said to be right- 
handed. If the twist is counterclockwise, it is said to be left-handed. 
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Critical Developments in Biochemistry 


X-ray Crystallography of Proteins 


Most of the protein structures archived in the Protein Data Bank 
(PDB) were determined by X-ray crystallography. In this technique, 
the protein of interest must be purified and crystallized and then 
subjected to an intense X-ray beam. Atoms in the crystal diffract 
the X-ray beam to produce characteristic spots on a film or X-ray 
detector. Analysis of the resulting diffraction pattern (figure), 
combined with special techniques to determine the phase of the 
X-rays at each spot, reveals the arrangement and distribution of 
electrons in the protein crystal in an electron density map. This 
map can be interpreted to assign the locations of each atom in the 
structure (producing the atomic model). A crystallographic study 
produces two types of data—the coordinate files that include atomic 
positions for the final structure model, and the structure factor file 


X-ray crystallography on a femtosecond timescale has been 
achieved using X-ray free electron lasers (XFEL). In a free electron 
laser, a beam of electrons is accelerated to almost the speed of light 
and passed through a periodic array of magnets, creating pulses of 
radiation that are very short (107! s) and very bright (billions of 
times stronger than conventional X-rays). A stream of tiny crys- 
tals of the protein of interest is passed through the beam, and each 
intense X-ray pulse generates a diffraction pattern, usually burning 
up the crystal in the process. Many hundreds or even tens of thou- 
sands of these diffraction patterns are compiled into data sets that 
provide information on the ultrafast dynamics of protein molecules, 
such as conformation changes and absorption of light by molecular 
groups in the protein termed chromophores. 


that consists of the intensities and phases of the X-ray spots in the 


diffraction pattern. Appropriate software (the AstexViewer on the Sa crystal 
PDB website or PyMol https://pymol.org) can then be used to display 
an image of the protein, in a variety of formats. we | 
The crystallization process is frequently difficult and limits the 
types of proteins that can be studied by X-ray crystallography. Very 
stable, rigid proteins that form well-ordered crystals work well, but pate a 
pattern 


marginally stable, very flexible proteins are less amenable to crys- 
tallographic analysis. Interestingly, many proteins are a mixture of 
very stable and very flexible domains, even in the crystalline state. 
The stable portions may produce good structure information, but 
the flexible portions may be invisible in the electron density maps 
and the resulting structure images, because their electron density 
is spread over a large space. The quality and accuracy of a crys- 
tal structure is defined by two parameters. The first of these is the 
resolution, which is the smallest distance that is resolved in the dif- 
fraction pattern. A value of 4 A means poor resolution, whereas a 
value of 1.5 A represents good resolution. The second criterion of 
accuracy is the R-value, which measures how well the atomic model 
is supported by the experimental data found in the structure fac- 
tor file. A random set of atoms produces an R-value of about 0.63, 3, Model 
whereas a perfect fit yields a value of 0. Typical values for published h ; 
structures are about 0.20. 


electron 
density map 


refinement 


The workflow in X-ray crystallography. 


yielding an extended structure (which he called the beta form). Astbury was the first to 
propose that hydrogen bonds between peptide groups contributed to stabilizing these 
structures. 

In 1951, Linus Pauling, Robert Corey, and their colleagues at the California Insti- 
tute of Technology summarized a large volume of crystallographic data in a set of 
dimensions for polypeptide chains. (A summary of data similar to what they reported is 
shown in Figure 4.14.) With these data in hand, Pauling, Corey, and Herman Branson 
proposed a new model for a helical structure in proteins, which they called the a-helix. 
The report from Caltech was of particular interest to Max Perutz in Cambridge, 
England, a crystallographer who was also interested in protein structure. By taking into 
account a critical but previously ignored feature of the X-ray data, Perutz realized that 
the a-helix existed in keratin, a protein from hair, and also in several other proteins. 
Since then, the a-helix has proved to be a fundamentally important peptide structure. 
Several representations of the a-helix are shown in Figure 6.6. One turn of the helix 
represents 3.6 amino acid residues. (A single turn of the a-helix involves 13 atoms from 
the O to the H of the H bond. For this reason, the a-helix is sometimes referred to as 
the 3.6,; helix.) 
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(a) (b) 


(c) (d) 


a-Carbon 


Side group 
Hydrogen bonds stabilize The helix can be viewed 
the helix structure. as a stacked array of 
peptide planes hinged 


at the a-carbons and 
approximately parallel 
to the helix. 


B-Hemoglobin subunit Myohemerythrin 


Figure 6.7 The three-dimensional structures of two 
proteins that contain substantial amounts of a-helix. 
The helices are represented by the regularly coiled sec- 
tions of the ribbon drawings. Myohemerythrin is the 
oxygen-carrying protein in certain invertebrates, 
including Sipunculids, a phylum of marine worm. 
B-Hemoglobin subunit: PDB id = 1HGA; myohemer- 
ythrin PDB id = 1A7D. 


Figure 6.6 Four different graphic representations of the a-helix. (a) A stick representation with H 
bonds as dotted lines, as originally conceptualized in Pauling’s 1960 The Nature of the Chemical Bond. 
(b) Showing the arrangement of peptide planes in the helix. (c) A space-filling computer graphic pre- 
sentation. (d) A “ribbon structure” with an inlaid stick figure, showing how the ribbon indicates the 
path of the polypeptide backbone. 


Each amino acid residue extends 1.5 A (0.15 nm) along the helix axis. With 3.6 residues 
per turn, this amounts to 3.6 X 1.5 A or 5.4 A (0.54 nm) of travel along the helix axis per 
turn. This is referred to as the translation distance or the pitch of the helix. If one ignores 
side chains, the helix is about 6 A in diameter. The side chains, extending outward from 
the core structure of the helix, are removed from steric interference with the polypep- 
tide backbone. As can be seen in Figure 6.6, each peptide carbonyl is hydrogen bonded to 
the peptide N—H group four residues farther up the chain. Note that all of the H bonds 
lie parallel to the helix axis and all of the carbonyl groups are pointing in one direc- 
tion along the helix axis while the N—H groups are pointing in the opposite direction. 
Recall that the entire path of the peptide backbone can be known if the and y 
twist angles are specified for each residue. The a-helix is formed if the values of ¢ are 
approximately — 60° and the values of y are in the range of —45 to — 50°. Figure 6.7 
shows the structures of two proteins that contain a-helical segments. The number 
of residues involved in a given a-helix varies from helix to helix and from protein to 
protein. On average, there are about 10 residues per helix. Myoglobin, one of the first 
proteins in which a-helices were observed, has eight stretches of a-helix that form a 
box to contain the heme prosthetic group (see Figure 5.1). 

As shown in Figure 6.6, all of the hydrogen bonds point in the same direction along 
the a-helix axis. Each peptide bond possesses a dipole moment that arises from the 
polarities of the N—H and C=O groups, and because these groups are all aligned along 
the helix axis, the helix itself has a substantial dipole moment, with a partial positive 
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charge at the N-terminus and a partial negative charge at the C-terminus (Figure 6.8). 
Negatively charged ligands (e.g., phosphates) frequently bind to proteins near the 
N-terminus of an a-helix. By contrast, positively charged ligands are only rarely found 
to bind near the C-terminus of an a-helix. 

In a typical a-helix of 12 (or n) residues, there are 8 (or n — 4) hydrogen bonds. As 
shown in Figure 6.9, the first 4 amide hydrogens and the last 4 carbonyl oxygens can- 
not participate in helix H bonds. Also, nonpolar residues situated near the helix termini 
can be exposed to solvent. Proteins frequently compensate for these problems by helix 
capping—providing H-bond partners for the otherwise bare N—H and C—O groups 
and folding other parts of the protein to foster hydrophobic contacts with exposed 
nonpolar residues at the helix termini. 

With one exception, all of the twenty common amino acids are found frequently in 
alpha-helices. Proline (and hydroxyproline) act as helix breakers due to their unique 
structure, which limits the value of ¢ (1.e., rotation about the C,—N bond). 


6.3c The B-Pleated Sheet Is a Core Structure in Proteins 


Another type of structure commonly observed in proteins also forms because of local, 
cooperative formation of hydrogen bonds. That is the pleated sheet, or B-structure, often 
called the B-pleated sheet. This structure was also first postulated by Pauling and Corey 
in 1951 and has now been observed in many natural proteins. A B-pleated sheet can be 
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Figure 6.9 Four N—H groups at the N-terminal end of an a-helix (bottom) and four C=O groups at 
the C-terminal end (top) lack partners for H-bond formation. The formation of H bonds with other 
nearby donor and acceptor groups is referred to as helix capping. Capping may also involve appropri- 
ate hydrophobic interactions that accommodate nonpolar side chains at the ends of helical segments. 


(a) 


Dipole 


+0.42 moment 


@ +0.20 


(b) 


Figure 6.8 The arrangement of N—H and C=O 
groups (each with an individual dipole moment) 
along the helix axis creates a large net dipole for 
the helix. Numbers indicate fractional charges on 
respective atoms. 
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Critical Developments in Biochemistry 


In Bed with a Cold, Pauling Stumbles onto the a-Helix and a Nobel Prize* 


As high technology continues to transform the modern biochemical 
laboratory, it is interesting to reflect on Linus Pauling’s discovery of 
the a-helix. It involved only a piece of paper, a pencil, scissors, and 


and bond angles from memory.... He knew that the peptide 
bond, at the carbon-to-nitrogen link, was always rigid: 


a sick Linus Pauling, who had tired of reading detective novels. The O H R H O H R H 

story is told in the excellent book The Eighth Day of Creation by Hl ae d i AeA | 

Horace Freeland Judson: ye >on ONE eI Ne 
From the spring of 1948 through the spring of 1951 ... rivalry ra A i b = Y L d ae y 


sputtered and blazed between Pauling’s lab and (Sir Lawrence) 


Bragg’s—over protein. The prize was to propose and verify 
in nature a general three-dimensional structure for the poly- 


peptide chain. Pauling was working up from the simpler struc- 


tures of components. In January 1948, he went to Oxford as 
a visiting professor for two terms, to lecture on the chemical 
bond and on molecular structure and biological specificity. 
“In Oxford, it was April, I believe, I caught cold. I went to 
bed, and read detective stories for a day, and got bored, and 
thought why don’t I have a crack at that problem of alpha 
keratin.” Confined, and still fingering the polypeptide chain 


in his mind, Pauling called for paper, pencil, and straightedge 


and attempted to reduce the problem to an almost Euclid- 
ean purity. “I took a sheet of paper—I still have this sheet 
of paper—and drew, rather roughly, the way that I thought 
a polypeptide chain would look if it were spread out into a 


And this meant that the chain could turn corners only 

at the alpha carbons.... “I creased the paper in parallel 
creases through the alpha carbon atoms, so that I could 
bend it and make the bonds to the alpha carbons, along the 
chain, have tetrahedral value. And then I looked to see if I 
could form hydrogen bonds from one part of the chain to 
the next.” He saw that if he folded the strip like a chain of 
paper dolls into a helix, and if he got the pitch of the screw 
right, hydrogen bonds could be shown to form, 
N—H::-O—C, three or four knuckles apart along the 
backbone, holding the helix in shape. After several tries, 
changing the angle of the parallel creases in order to adjust 
the pitch of the helix, he found one where the hydrogen 
bonds would drop into place, connecting the turns, as 


plane.” The repetitious herringbone of the chain he could straight lines of the right length. He had a model. 


stretch across the paper as simply as this—putting in lengths 


| *The discovery of the a-helix structure was only one of many achieve- 

C ments that led to Pauling’s Nobel Prize in Chemistry in 1954. The official 

citation for the prize was “for his research into the nature of the chemical 

| bond and its application to the elucidation of the structure of complex 
substances.” 


visualized by laying thin, pleated strips of paper side by side to make a “pleated sheet” of 
paper (Figure 6.10). Each strip of paper can then be pictured as a single peptide strand 
in which the peptide backbone makes a zigzag pattern along the strip, with the a-carbons 
lying at the folds of the pleats. The pleated sheet can exist in both parallel and antiparallel 
forms. 

In the parallel B-pleated sheet, adjacent chains run in the same direction. In 
the antiparallel B-pleated sheet, adjacent strands run in opposite directions. The 
arrangement of successive amide planes in a -sheet has a pleated appearance due 
to the tetrahedral nature of the C, atom. It is important to note that the hydrogen 
bonds in this structure are essentially interstrand rather than intrastrand. Opti- 
mum formation of H bonds in the parallel pleated sheet results in a slightly less 
extended conformation than in the antiparallel sheet. The H bonds thus formed 
in the parallel B-sheet are bent significantly. The distance between residues is 0.347 nm 
for the antiparallel pleated sheet, but only 0.325 nm for the parallel pleated sheet. Note 
that the side chains in the pleated sheet are oriented perpendicular or normal to the 
plane of the sheet, extending out from the plane on alternating sides. 

Parallel B-sheets tend to be more regular than antiparallel B-sheets. As can be seen in 
Figure 6.4, typical #, y values for a parallel B-sheet are p = — 120°, y = 105°, and typical 
values for an antiparallel B-sheet are @ = — 135°, y = 140°. However, the range of @ and 
y angles for the peptide bonds in parallel sheets is much smaller than that for antiparal- 
lel sheets. Parallel sheets are typically large structures; those composed of fewer than 
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(b) C<———_-N 
C<———_N 
(c) N ——> C 


C -<— N 


Figure 6.10 6-sheet structures in proteins. (a) A “pleated sheet” of paper with an antiparallel B-sheet 
drawn on it. (b) Hydrogen bonds in a parallel B-sheet. (c) Hydrogen bonds in an antiparallel B-sheet. 


five strands are rare. Antiparallel sheets, however, may consist of as few as two strands. 
Parallel sheets characteristically distribute hydrophobic side chains on both sides of the 
sheet, whereas antiparallel sheets are usually arranged with all their hydrophobic resi- 
dues on one side of the sheet. This requires an alternation of hydrophilic and hydro- 
phobic residues in the primary structure of peptides involved in antiparallel B-sheets 
because every other side chain projects to the same side of the sheet (Figure 6.10). 
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6.3d Helix-Sheet Composites Provide Flexibility 
and Strength in Spider Silk 


Although the intricate designs of spider webs are eye (and fly) catching, it might be argued 
that the composition of web silk itself is even more remarkable. Spider silk, a form of 
keratin, is synthesized in special glands in the spider’s abdomen. The silk strands pro- 
duced by these glands are both strong and elastic. Dragline silk (that from which the 
spider hangs) has a tensile strength of 200,000 psi (pounds per square inch)—stron- 
ger than steel and similar to Kevlar, the synthetic material used in bulletproof vests! 
This same silk fiber is also flexible enough to withstand strong winds and other natural 
stresses. This combination of strength and flexibility derives from the composite nature 
of spider silk. As keratin protein is extruded from the spider’s glands, it endures shearing 
forces that break the H bonds stabilizing keratin a-helices (Figure 6.11). These regions 
then form microcrystalline arrays of B-sheets. These microcrystals are surrounded 
by the keratin strands, which adopt a highly disordered state composed of a-helices 
and random coil structures. The B-sheet microcrystals contribute strength, and the dis- 
ordered array of helix and coil make the silk strand flexible. The resulting silk strand 
resembles modern human-engineered composite materials and products, such as tennis 
racquets. 


6.3e -Turns Allow the Protein Strand to Change Direction 


Polypeptide chains must possess the capacity to bend, turn, and reorient themselves to 
produce compact, globular structures. A simple structure observed in many proteins is the 
B-turn (also known as the tight turn or B-bend), in which the peptide chain forms a tight 
loop with the carbonyl oxygen of one residue hydrogen bonded with the amide proton of 
the residue three positions down the chain. This H bond makes the B-turn a relatively stable 
structure. As shown in Figure 6.12, the B-turn allows the protein to reverse the direction of 
its peptide chain. This figure shows the two major types of B-turns, but a number of less 
common types are also found in protein structures. Because it lacks a side chain, glycine 


(b) Radial strand 


(c) Ordered B-sheets surrounded (d) B-sheets impart strength and 
by disordered a-helices and a-helices impart flexibility 
B-bends. to the strand. 


Figure 6.11 Spider web silks (a) are composites of a-helices and B-sheets. The radial strands 
of webs (b), (c), (d) must be strong and rigid; silks in radial strands contain a higher per- 
centage of B-sheets. The circumferential strands (termed capture silk) must be flexible (to 
absorb the impact of flying insects); capture silk contains a higher percentage of a-helices. 
Spiders typically have several different silk gland spinnerets, which secrete different silks of 
differing composition. Spiders have inhabited the earth for about 470 million years. 
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(b) 


Type I Type I 


Figure 6.12 The structures of two kinds of B-turns (also called tight turns or B-bends). Four residues 
are required to form a B-turn. (a) Type I; (b) Type II. 


is sterically the most adaptable of the amino acids, and it accommodates conveniently to 
other steric constraints in the B-turn. Proline, however, has a cyclic structure and a fixed f 
angle, so, to some extent, it forces the formation of a B-turn; in many cases this facilitates the 
turning of a polypeptide chain upon itself. Such bends promote formation of antiparallel 
B-pleated sheets. Type I turns (Figure 6.12) are more common than type II. Proline fits 
best in the 3 position of the type I turn. In the type II turn, proline is preferred at the 
2 position, whereas the 3 position prefers glycine or small polar residues. 


6.4 | How Do Polypeptides Fold into Three-Dimensional 
Protein Structures? 


The arrangement of all atoms of a single polypeptide chain in three-dimensional space 
is referred to as its tertiary structure. As discussed in Section 6.2 all of the informa- 
tion needed to fold the protein into its native tertiary structure is contained within the 
primary structure of the peptide chain itself. Sometimes proteins known as chaperones 
assist in the process of protein folding in the cell, but proteins in dilute solution can be 
unfolded and refolded without the assistance of such chaperones. 

The first determinations of the tertiary structure of a protein were by John 
Kendrew and Max Perutz. Kendrew’s structure of myoglobin and Perutz’s structure of 
hemoglobin, reported in the late 1950s, were each the result of more than 20 years of 
work. Ever since these first protein structures were elucidated, biochemists have sought 
to understand the principles by which proteins adopt their remarkable structures. Vigor- 
ous work in many laboratories has slowly brought important principles to light: 


e Secondary structures—helices and sheets—form whenever possible as a conse- 
quence of the formation of large numbers of hydrogen bonds. 

e a-Helices and B-sheets often associate and pack close together in the protein. No 
protein is stable as a single-layer structure, for reasons that become apparent later. 
There are a few common methods for such packing to occur. 

e Because the peptide segments between secondary structures in the protein tend to be 
short and direct, the peptide does not execute complicated twists and knots as it 
moves from one region of secondary structure to another. 

e Proteins generally fold so as to form the most stable structures possible. The stabil- 
ity of most proteins arises from (1) the formation of large numbers of intramolecular 
hydrogen bonds and (2) the reduction in the surface area accessible to solvent that 
occurs upon folding. 
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Two factors lie at the heart of these four principles. First, proteins are typically a 
mixture of hydrophilic and hydrophobic amino acids. Why is this important? Imagine 
a protein that is composed only of polar and charged amino acids. In such a protein, 
every side chain could hydrogen bond to water. This would leave no reason for the pro- 
tein to form a compact, folded structure. 

Now consider a protein composed of a mixture of hydrophilic and hydrophobic 
residues. In this case, the hydrophobic side chains cannot form H bonds with water, and 
their presence will disrupt the hydrogen-bonding structure of water itself. To minimize 
this, the hydrophobic groups will tend to cluster together. This hydrophobic effect induces 
formation of a compact structure—the folded protein. 

A potential problem with this rather simple folding model is that polar backbone 
N—H and C=O groups on the hydrophobic residues accompany the hydrophobic side 
chains into the folded protein interior. This would be energetically costly to the protein, 
but the actual result is that the polar backbone groups form H bonds with one another, 
so that the polar backbone N—H and C—O moieties are stabilized in a-helices and 
B-sheets in the protein interior. 


A Deeper Look 


The Coiled-Coil Motif in Proteins 


The coiled-coil motif was first identified in 1953 by Linus Pauling, 
Robert Corey, and Francis Crick as the main structural ele- 
ment of fibrous proteins such as keratin and myosin. Since 
then, many proteins have been found to contain one or more 
coiled-coil segments or domains. A coiled coil is a bundle of 
a-helices that are wound into a superhelix. Two, three, or four 
helical segments may be found in the bundle, and they may be 
arranged parallel or antiparallel to one another. Coiled coils are 
characterized by a distinctive and regular packing of side chains 
in the core of the bundle. This regular meshing of side chains 


(a) Coiled coil 


requires that they occupy equivalent positions turn after turn. 
This is not possible for undistorted a-helices, which have 3.6 
residues per turn. The positions of side chains on their surface 
shift continuously along the helix surface (see Figure 6.13c). 
However, giving the right-handed a-helix a left-handed twist 
reduces the number of residues per turn to 3.5, and because 
3.5 X 2 = 7.0, the positions of the side chains repeat after two turns 
(7 residues). Thus, a heptad repeat pattern in the peptide sequence 
is diagnostic of a coiled-coil structure. Figure 1 shows a sampling 
of coiled-coil structures (highlighted in color) in various proteins. 


(b) Periodicity of hydrophobic residues 


Undistorted Left-handed coiled coil 


Supercoiled 


Helices with a heptad repeat 
of hydrophobic residues 


Influenza hemagglutinin 


Catabolite activator protein 
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6.4a Fibrous Proteins Usually Play a Structural Role 


In Chapter 5, we saw that proteins can be grouped into three large classes based on 
their structure and solubility: fibrous proteins, globular proteins, and membrane proteins. 
Fibrous proteins contain polypeptide chains organized approximately parallel along a 
single axis, producing long fibers or large sheets. Such proteins tend to be mechanically 
strong and resistant to solubilization in water and dilute salt solutions. Fibrous pro- 
teins often play a structural role in nature (see Chapter 5). 


a-Keratin The a-keratins are the predominant constituents of claws, fingernails, hair, and 
horns in mammals. As their name suggests, the structure of the a-keratins is dominated by 
a-helical segments of polypeptide. The amino acid sequence of a-keratin subunits is com- 
posed of central a-helix—rich rod domains about 311 to 314 residues in length, flanked by 
nonhelical N- and C-terminal domains of varying size and composition (Figure 6.13a). 
The structure of the central rod domain of a typical a-keratin is shown in Figure 6.13b. 
Pairs of right-handed a-helices wrap around each other to form a left-twisted coiled coil. 
X-ray diffraction data show that these structures resemble a-helices, but with a pitch of 
0.51 nm rather than the expected 0.54 nm. This is consistent with a tilt of the helix rela- 
tive to the long axis of the coiled coil (and the keratin fiber) in Figure 6.13. 

The amino acid sequence of the central rod segments of a-keratin consists of quasi- 
repeating seven-residue segments of the form (a-b-c-d-e-f-g),,. These units are not true 
repeats, but residues a and d are usually nonpolar amino acids. In a-helices, with 3.6 
residues per turn, these nonpolar residues are arranged in an inclined row or stripe 
that twists around the helix axis (Figure 6.13c). These nonpolar residues would make 
the helix highly unstable if they were exposed to solvent, but the association of hydro- 
phobic stripes on two a-helices to form the two-stranded coiled coil effectively buries 


(a) N-terminal Rod C-terminal 
domain domain domain 


Keratin 


type I 


Coiled coil of two a-helices 
= 7 WITS 
ANAISOKDREKRE 


ZN N SON EON AN A 


Protofilament (pair of coiled coils) Undistorted Supercoiled Left-handed coiled coil 


Helices with a heptad repeat 
of hydrophobic residues 


Figure 6.13 (a) Both type I and type II a-keratin molecules have sequences consisting of long, central 
rod domains with terminal cap domains. The numbers of amino acid residues in each domain are indi- 
cated. Asterisks denote domains of variable length. (b) The rod domains form coiled coils consisting of 
left-twisted right-handed a-helices. These coiled coils form protofilaments that then wind around each 
other in a right-handed twist. Keratin filaments consist of twisted protofibrils (each a bundle of four 
coiled coils). (c) Periodicity of hydrophobic residues. (Adapted from Steinert, P., and Parry, D., 1985. 
Intermediate filaments: Conformity and diversity of expression and structure. Annual Review of Cell 
Biology 1:41—65; and Cohlberg, J., 1993. Textbook error: The structure of alpha-keratin. Trends in 
Biochemical Sciences 18:360-362.) 
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the hydrophobic residues and forms a highly stable structure (Figure 6.13). The helices 
sacrifice some stability in assuming this twisted conformation, but they gain stabiliza- 
tion energy from the packing of side chains between the helices. In other forms of ker- 
atin, covalent disulfide bonds form between cysteine residues of adjacent molecules, 
making the overall structure even more rigid, inextensible, and insoluble—important 
properties for structures such as claws, fingernails, hair, and horns. How and where 
these disulfides form determines the amount of curling in hair and wool fibers. When 
a hairstylist creates a permanent wave (simply called a perm) in a hair salon, disulfides 
in the hair are first reduced and cleaved, then reorganized and reoxidized to change 
the degree of curl or wave. In contrast, a “set” that is created by wetting the hair, set- 
ting it with curlers, and then drying it represents merely a rearrangement of the hydro- 
gen bonds between helices and between fibers. (On humid or rainy days, the hydrogen 
bonds in curled hair may rearrange, and the hair becomes frizzy.) 


Fibroin and B-Keratin: B-Sheet Proteins The fibroin proteins found in silk fibers in the 
cocoons of the silkworm, Bombyx mori, and also in spiderwebs represent another type 
of fibrous protein. These are composed of stacked antiparallel B-sheets, as shown in 
Figure 6.14. In the polypeptide sequence of silk proteins, there are large stretches in 
which every other residue is a glycine. As previously mentioned, the residues of a -sheet 
extend alternately above and below the plane of the sheet. As a result, the glycines all 
end up on one side of the sheet and the other residues (mainly alanines and serines) com- 
pose the opposite surface of the sheet. Pairs of B-sheets can then pack snugly together 
(glycine surface to glycine surface or alanine—serine surface to alanine—serine surface). 
The £-keratins found in bird feathers are also made up of stacked B-sheets. 


Collagen: A Triple Helix Collagen is a rigid, inextensible fibrous protein that is a principal 
constituent of connective tissue in animals, including tendons, cartilage, bones, teeth, skin, 
and blood vessels. The high tensile strength of collagen fibers in these structures makes pos- 
sible the various animal activities such as running and jumping that put severe stresses on 
joints and skeleton. Broken bones and tendon and cartilage injuries to knees, elbows, and 
other joints involve tears or hyperextensions of the collagen matrix in these tissues. 

The basic structural unit of collagen is tropocollagen, which has a molecular weight 
of 285,000 and consists of three intertwined polypeptide chains, each about 1000 amino 
acids in length. Tropocollagen molecules are about 300 nm long and only about 1.4 nm 


Figure 6.14 Silk fibroin consists of a unique stacked array of B-sheets. The primary structure of 
fibroin molecules consists of long stretches of alternating glycine and alanine or serine residues. When 
the sheets stack, the more bulky alanine and serine residues on one side of a sheet interdigitate with 
similar residues on an adjoining sheet. Glycine hydrogens on the alternating faces interdigitate in a 
similar manner, but with a smaller intersheet spacing. 
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in diameter. Several kinds of collagen have been identified. Type I collagen, which is 
the most common, consists of two identical peptide chains designated a1(I) and one 
different chain designated a2(I). Type I collagen predominates in bones, tendons, and 
skin. Type IT collagen, found in cartilage, and type IH collagen, found in blood vessels, 
consist of three identical polypeptide chains. 

Collagen has an amino acid composition that is unique and is crucial to its three- 
dimensional structure and its characteristic physical properties. Nearly one residue out 
of three is a glycine, and the proline content is also unusually high. Three unusual modi- 
fied amino acids are also found in collagen: 4-hydroxyproline (Hyp), 3-hydroxyproline, 
and 5-hydroxylysine (Hyl) (Figure 4.4). Proline and Hyp together compose up to 30% of 
the residues of collagen. Interestingly, these three amino acids are formed from normal 
proline and lysine after the collagen polypeptides are synthesized. The modifications are 
effected by two enzymes: prolyl hydroxylase and lysyl hydroxylase. The prolyl hydrox- 
ylase reaction (Figure 6.15) requires molecular oxygen, a-ketoglutarate, and ascorbic 
acid (vitamin C) and is activated by Fe?*. The hydroxylation of lysine is similar. 

Because of their high content of glycine, proline, and hydroxyproline, collagen fibers 
are incapable of forming traditional structures such as a-helices and B-sheets. Instead, col- 
lagen polypeptides intertwine to form a unique right-handed triple helix, with each of the 
three strands arranged in a left-handed helical fashion (Figure 6.16). Compared with the 
a-helix, the collagen helix is much more extended, with a rise per residue along the triple 
helix axis of 2.9 A (versus 1.5 A for the a-helix). There are about 3.3 residues per turn of 
each of these helices. The triple helix is a structure that forms to accommodate the unique 
composition and sequence of collagen. Long stretches of the polypeptide sequence are 
repeats of a Gly-x-y motif, where x is frequently Pro and y is frequently Pro or Hyp. In the 
triple helix, every third residue faces or contacts the crowded center of the structure. This 
area is so crowded that only Gly can fit, and thus every third residue must be a Gly (as 
observed). Moreover, the triple helix is a staggered structure, such that Gly residues from 
the three strands stack along the center of the triple helix and the Gly from one strand lies 
adjacent to an x residue from the second strand and a y from the third. This allows the 
N—H of each Gly residue to hydrogen bond with the C=O of the adjacent x residue. The 
triple helix structure is further stabilized and strengthened by the formation of interchain 
H bonds involving hydroxyproline. 
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Figure 6.15 Hydroxylation of proline residues is catalyzed by prolyl hydroxylase. The reaction 
requires a-ketoglutarate and ascorbic acid (vitamin C). 


Figure 6.16 Poly(Gly-Pro-Pro), a collagenlike right- 
handed triple helix composed of three left-handed 
helical chains (PDB id = 1K6F). (Adapted from 
Miller, M. H., and Scheraga, H. A., 1976. Calculation 
of the structures of collagen models. Role of inter- 
chain interactions in determining the triple-helical 
coiled-coil conformation. I. Poly(glycyl-prolyl-prolyl). 
Journal of Polymer Science Symposium 54:171—200.) 
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6.4b Globular Proteins Mediate Cellular Function 


Fibrous proteins, although interesting for their structural properties, represent only a small 
percentage of the proteins found in nature. Globular proteins, so named for their approxi- 
mately spherical shape, are far more numerous. The diversity of protein structures found in 
nature reflects the remarkable variety of functions performed by proteins—binding, cataly- 
sis, regulation, transport, immunity, cellular signaling, and more. The functional diversity 
and versatility derive in turn from the large number of folded structures that polypeptide 
chains can adopt and the varied chemistry of the side chains of the 20 common amino 
acids. Remarkably, this diversity of structure and function derives from a relatively small 
number of principles and themes of protein folding and design. The rest of Chapter 6 
explores and elaborates these principles and themes. 


6.4c Helices and Sheets Make Up the Core of Most Globular Proteins 


Globular proteins exist in an enormous variety of three-dimensional structures, but nearly 
all contain substantial amounts of a-helices and B-sheets folded into a compact structure 
that is stabilized by both polar and nonpolar interactions. A typical example is bovine ribonu- 
clease A, a small protein (12.6 kD, 124 residues) that contains a few short a-helices, a broad 
section of antiparallel B-sheet, a few B-turns, and several peptide segments without defined 
secondary structure (Figure 6.17). The space between the helices and sheets in the protein 
interior is filled efficiently and tightly with mostly hydrophobic amino acid side chains. 


Critical Developments in Biochemistry 
The Protein Data Bank (PDB) 


The Protein Data Bank (PDB, http://www. wwpdb. org) is a worldwide 
archive and repository for information about the three-dimensional 
structures of proteins, nucleic acids, and complex molecular 
assemblies. Established in 1971 with just seven structures, the PDB 
has grown exponentially to contain more than 200,000 entries, with 
more than 10,000 deposited each year. The Research Collaboratory 
for Structural Bioinformatics at http://www.rcsb.org operates the 
U.S. data center for the global PDB archive and makes PDB data 
available at no charge for students and researchers. 

Each structure archived in the PDB receives a four-character 
identifier known as the PDB id. Each PDB file contains informa- 
tion about how the structure was determined, the atoms in each 
protein or nucleic acid, and their three-dimensional locations 


in space as well as the accuracy of the data. A typical PDB file 
includes a large header section of text that summarizes the pro- 
tein, citation information, and the details of the structure solu- 
tion, followed by the sequence and a long list of the atoms and 
their coordinates. (The PDB file for bovine pancreatic ribonuclease 
(Figure 6.17, PDB id = 1FS3) is 47 pages long, with the coordi- 
nates of 1067 atoms.) 

Every macromolecular structure in this text includes the PDB id 
in the figure legend, so that instructors and students can download 
and study the structure using any of the freely available software 
packages for this purpose, including PyMol at Attps.//pymol.org, 
JSmol at Attps://sourceforge.net/projects/jsmoll/, and ICM Browser at 
http:/!www.molsoft.comlicm_browser.html. 


(b) 


Figure 6.17 The three-dimensional structure of bovine ribonuclease A (PDB id = 1FS3). (a) Ribbon 
diagram; (b) space-filling model. 
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Most polar side chains in ribonuclease face the outside of the protein structure and interact 
with solvent water. With its hydrophobic core and a hydrophilic surface, ribonuclease illus- 
trates the typical properties of many folded globular proteins. 

The helices and sheets that make up the core of most globular proteins probably 
represent the starting point for protein folding. Thus, the folding of a globular protein, 
in its simplest conception, could be viewed reasonably as the aggregation of multiple 
elements of secondary structure. Most peptide segments that form helices, sheets, or 
beta turns in proteins are mostly disordered in small model peptides that contain those 
amino acid sequences. Thus, hydrophobic interactions and other noncovalent interac- 
tions with the rest of the protein must stabilize these relatively unstable helices, sheets, 
and turns in the folded protein. 

The cores of most globular and membrane proteins consist almost entirely of a-helices 
and B-sheets. The reason is that the highly polar N—H and C—O moieties of the peptide 
backbone must be neutralized in the hydrophobic core of the protein. The extensively 
H-bonded nature of a-helices and B-sheets is ideal for this purpose, and these structures 
effectively stabilize the polar groups of the peptide backbone in the protein core. Hydro- 
gen bonds in the nonpolar core of globular proteins are typically 5—6 kJ/mole stronger 
than those on the protein surface. 

Whereas the interior of a globular protein is composed of conserved sheets and heli- 
ces, the surface of a globular protein is different in several ways. Typically, much of the 
protein surface is composed of the loops and tight turns that connect the helices and 
sheets of the protein core, although helices and sheets may also be found on the sur- 
face. The result is that the surface of a globular protein is often a complex landscape of 
different structural elements. These complex surface structures can interact in certain 
cases with small molecules or even large proteins that have complementary structure 
or charge (Figure 6.18). These regions of complementary, recognizable structure are 
formed typically from the peptide segments that connect elements of secondary struc- 
ture. They are the basis for enzyme-substrate interactions, protein-protein associations 
in cell signaling pathways, antigen—antibody interactions, and more. 

The segments of the protein that are neither helix, sheet, nor turn have tradition- 
ally been referred to as coil or random coil. Both of these terms are misleading. Most 
of these loop segments are neither coiled nor random in any sense of the words. These 
structures are every bit as organized and stable as the defined secondary structures. 
They just don’t conform to any frequently recurring pattern. These so-called coil struc- 
tures are strongly influenced by side-chain interactions with the rest of the protein. 


6.4d Water Molecules on the Protein Surface Stabilize the Structure 


A globular protein’s surface structure is affected by the surrounding water molecules. 
Many of the polar backbone and side chain groups on the surface of a globular protein 
make H bonds with solvent water molecules. There are often several such water mol- 
ecules per amino acid residue, and some are in fixed positions (Figure 6.19). Relatively 
few water molecules are found inside the protein. 

In some globular proteins (Figure 6.20), it is common for one face of an a-helix to 
be exposed to the water solvent, with the other face toward the hydrophobic interior 
of the protein. The outward face of such an amphiphilic helix consists mainly of polar 
and charged residues, whereas the inward face contains mostly nonpolar, hydrophobic 
residues. A good example of such a surface helix is that of residues 153 to 166 of flavo- 
doxin from Anabaena (Figure 6.20). Note that the helical wheel presentation of this helix 
readily shows that one face contains four hydrophobic residues and that the other is 
almost entirely polar and charged. 


6.4e Packing Considerations 


The secondary and tertiary structures of ribonuclease A (Figure 6.17) and other globu- 
lar proteins illustrate the importance of packing in tertiary structures. Secondary struc- 
tures pack closely to one another and also intercalate with (insert between) extended 
polypeptide chains. If the sum of the van der Waals volumes of a protein’s constituent 
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Figure 6.18 The surfaces of proteins are comple- 
mentary to the molecules they bind. PEP carboxyki- 
nase (shown here, PDB id = 1K3D) is an enzyme 
from the metabolic pathway that synthesizes glucose 
(gluconeogenesis). In the so-called active site (yellow) 
of this enzyme, catalysis depends on complementary 
binding of substrates. Shown in this image are ADP 
(brown), a Mg?! ion (blue), and AIF;~ (a phosphate 
analog, in green, above the Mg?*). 


Figure 6.19 The surfaces of proteins are ideally 
suited to form multiple H bonds with water 
molecules. Shown here are waters (blue and white) 
associated with actinidin, an enzyme from kiwi fruit 
that cleaves polypeptide chains at arginine residues 
(PDB id = 2ACT). The polar backbone atoms and 
side chain groups on the surface of actinidin are 
extensively H-bonded with water. 
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a-Helix from flavodoxin (residues 153-166) 


Figure 6.20 The helical wheel presentation can reveal the polar or nonpolar character of a-helices. 
If the helix is viewed end on, and the residues are numbered with residue 1 closest to the viewer, it is 
easy to see how polar and nonpolar residues are distributed to form a wheel. The a-helix consisting 
of residues 153-166 (red) in flavodoxin from Anabaena is a surface helix and is amphipathic 


(PDB id = IRCF). 


Human Biochemistry 


Collagen-Related Diseases 


Collagen provides an ideal case study of the molecular basis of 
physiology and disease. For example, the nature and extent of col- 
lagen crosslinking depends on the age and function of the tissue. 
Collagen from young animals is predominantly uncrosslinked and 
can be extracted in soluble form, whereas collagen from older ani- 
mals is highly crosslinked and thus insoluble. The loss of flexibility 
of joints with aging is probably due in part to increased crosslinking 
of collagen. 

Several serious and debilitating diseases involving collagen abnor- 
malities are known. Lathyrism occurs in animals due to the regular 
consumption of seeds of Lathyrus odoratus, the sweet pea, and in- 
volves weakening and abnormalities in blood vessels, joints, and bones. 
These conditions are caused by B-aminopropionitrile, which covalently 
inactivates lysyl oxidase, preventing intramolecular crosslinking of 
collagen and causing abnormalities in joints, bones, and blood vessels. 


n 
N = C— CH; — CH; — NH; 


B-Aminopropionitrile 


Scurvy results from a dietary vitamin C deficiency and involves 
the inability to form collagen fibrils properly. This is the result of 
reduced activity of prolyl hydroxylase, which is vitamin C depen- 
dent, as previously noted. Scurvy leads to lesions in the skin and 
blood vessels, and in its advanced stages, it can lead to grotesque 
disfiguration and eventual death. Although rare in the modern 
world, it was a disease well known to sea-faring explorers in earlier 
times who did not appreciate the importance of fresh fruits and 
vegetables in the diet. 

A number of rare genetic diseases involve collagen abnormali- 
ties, including Marfan syndrome and the Ehlers—Danlos syndromes, 
which result in hyperextensible joints and skin. The formation of 
atherosclerotic plaques, which cause arterial blockages in advanced 
stages, is due in part to the abnormal formation of collagenous 
structures in blood vessels. 


amino acids is divided by the volume occupied by the protein, packing densities of 
0.72 to 0.77 are typically obtained. These packing densities are similar to those of a 
collection of solid spheres. This means that even with close packing, approximately 
25% of the total volume of a protein is not occupied by protein atoms. Nearly all of 
this space is in the form of very small cavities. Cavities the size of water molecules or 
larger do occasionally occur, but they make up only a small fraction of the total protein 
volume. It is likely that such cavities provide flexibility for proteins and facilitate con- 
formation changes and a wide range of protein dynamics. 
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Figure 6.21 Ton EBP is a DNA-binding protein consisting of two distinct domains. The N-terminal 
domain is shown here on the right, with DNA (orange) in the middle, and the C-terminal domain on 
the left (PDB id = 1IMH). 


6.4f Protein Domains Are Nature’s Modular Strategy for Protein Design 


Proteins composed of about 250 amino acids or less often have a simple, compact 
globular shape. However, larger proteins are usually made up of two or more rec- 
ognizable and distinct structures, termed domains or modules—compact, folded pro- 
tein structures that are usually stable by themselves in aqueous solution. Figure 6.21 
shows a two-domain DNA-binding protein, TonEBP, in which the two distinct 
domains are joined by a short segment of the peptide chain. Most domains con- 
sist of a single continuous portion of the protein sequence, but in some proteins the 
domain sequence is interrupted by a sequence belonging to some other part of the 
protein that may even form a separate domain (Figure 6.22). In either case, typical 
domain structures consist of hydrophobic cores with hydrophilic surfaces (as was the 
case for ribonuclease; Figure 6.17). Importantly, individual domains often possess 
unique functional behaviors (for example, the ability to bind a particular ligand with 
high affinity and specificity), and an individual domain from a larger protein often 
expresses its unique function within the larger protein in which it is found. Multido- 
main proteins typically possess the sum total of functional properties and behaviors 
of their constituent domains. 

It is likely that proteins consisting of multiple domains (and thus multiple functions) 
evolved by the fusion of genes that once coded for separate proteins. This would require 
gene duplication to be common in nature, and analysis of completed genomes has con- 
firmed that approximately 90% of domains in eukaryotes have been duplicated. Thus, 
the protein domain is a fundamental unit in evolution. Many proteins have been assem- 
bled by duplicating domains and then combining them in different ways. Many proteins 
are assemblies constructed from several individual domains, and some proteins contain 
multiple copies of the same domain. Figure 6.23 shows the tertiary structures of nine 
domains that are frequently duplicated, and Figure 6.24 presents several proteins that 
contain multiple copies of one or more of these domains. 


6.4g Denaturation Leads to Loss of Protein Structure and Function 


Whereas the primary structure of proteins arises from covalent bonds, the secondary, 
tertiary, and quaternary levels of protein structure are maintained by weak, noncovalent 
forces. The environment of a living cell is exquisitely suited to maintain these weak 
forces and to preserve the structures of its many proteins. However, a variety of external 
stresses—for example, heat or chemical treatment—can disrupt these weak forces in a 
process termed denaturation—the loss of protein structure and function. 


Figure 6.22 Malonyl CoA:ACP transacylase (PDB 
id = 1NM2) is a metabolic enzyme consisting of two 
subdomains. The large subdomain (blue) includes 
residues 1-132 and 198-316 and consists of a B-sheet 
surrounded by 12 a-helices. The small subdomain 
(gold = residues 133-197) consists of a four-stranded 
antiparallel B-sheet and two a-helices. 
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Figure 6.23 Ribbon structures of several protein modules used in the construction of complex multi- 
module proteins. (a) The complement control protein module (PDB id = 1HCC). (b) The immunoglobu- 
lin module (PDB id = 1T89). (c) The fibronectin type I module (PDB id = 1Q06). (d) The growth factor 
module (PDB id = 1FSB). (e) The kringle module (PDB id = 1HPK). (f) The GYF module (PDB id = 
1GYF). (g) The y-carboxyglutamate module (PDB id = 1CFI). (h) The FF module (PDB id = 1UZC). 
(i) The DED domain (PDB id = 1A1W). 
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Biochemical Sciences 16:13-17.) 
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Human Biochemistry 
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Domain-Based Engineering of Proteins Forms the Basis of a Novel Cancer Treatment 


The rapidly developing field of synthetic biology aims to design 
and create novel biological molecules, processes, and systems for 
useful purposes. An experimental therapy for acute lymphoblastic 
leukemia (ALL) illustrates this goal. The patient’s own T-cells 
(one type of immune system cell) are extracted (see figure) and 
then genetically engineered, using a disabled virus, to express a 
chimeric antigen receptor (CAR) with a target-binding domain 
that binds to the B-cell antigen CD19. These engineered T-cells 
are able to recognize and kill B-cells in the patient that display the 
CD19 protein. This therapy works because ALL affects B-cells, 
and destruction of B-cells in many cases overcomes the disease. 
The genetic material inserted by the viral vectors has been engi- 
neered to encode several different protein domains: a monoclonal 
antibody domain (the target-binding domain) and two (or more) 
signaling domains (which promote enhanced proliferation, cyto- 
toxicity, and persistence of the T-cells in the treated patient). Also 
required are a linking domain and a transmembrane domain, for 
a total of five or more protein domains! Collectively, the domains 


Reprogrammed cells are 
infused back into the 
patient’s blood. These cells 
search for and destroy cancer 
cells expressing the CD19 
antigen protein targeted 
by the CARs. 


T-cells 


constitute CARs. The target-binding domain of the CAR is extra- 
cellular; the signaling domains are intracellular. 

In clinical trials involving severe cases of ALL, nearly two-thirds 
of patients receiving CAR T-cell therapy have remained in complete 
remission 6 months after being treated. Many of these are still in 
remission more than a year after treatment. 
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Figure 6.26 Proteins can be denatured by heat, with 
commensurate loss of function. Ribonuclease A (blue, 
open circle) and ribonuclease B (red, closed circle) lose 
activity above about 55°C. These two enzymes share 
identical protein structures, but ribonuclease B pos- 
sesses a carbohydrate chain attached to Asn™. 
(Adapted from Arnold, U., and Ulbrich-Hofmann, 
R., 1997. Kinetic and thermodynamic thermal stabili- 
ties of ribonuclease A and ribonuclease B. Biochemis- 
try 36:2166-2172.) 
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Ovalbumin monomer 


Figure 6.25 The proteins of egg white are denatured (as evidenced by their precipitation and 
aggregation) during cooking. More than half of the protein in egg whites is ovalbumin. (Ovalbumin 
PDB id = 1OVA) 


An everyday example is the denaturation of the protein ovalbumin during the cook- 
ing of an egg (Figure 6.25). About 10% of the mass of an egg white is protein, and 54% 
of that is ovalbumin. When a chicken egg is cracked open, the egg white is a nearly 
transparent, viscous fluid. Cooking turns this fluid to a solid, white mass. The egg 
white proteins have unfolded and have precipitated out of solution, and the unfolded 
proteins have aggregated into a solid mass. 

As a typical protein solution is heated slowly, the protein remains in its native 
state until it approaches a characteristic melting temperature, Tm. As the solution is 
heated further, the protein denatures over a narrow range of temperatures around 
Tm (Figure 6.26). Denaturation over a very small temperature range such as this is 
evidence of a two-state transition between the native and the unfolded states of the 
protein, and this implies that unfolding is an all-or-none process. When weak forces 
are disrupted in one part of the protein, the entire structure breaks down. 

Most proteins can also be denatured below the transition temperature by a variety of 
chemical agents, including acid or base, organic solvents, detergents, and particular dena- 
turing solutes. Guanidine hydrochloride and urea are examples of the latter (Figure 6.27). 
Denaturation in all these cases involves disruption of the weak forces that stabilize 
proteins. Covalent bonds are not affected. Acids and bases cause protonation and depro- 
tonation of dissociable groups on the protein, altering ionic interactions and hydrogen 
bonds. Organic solvents and detergents disrupt hydrophobic interactions that bury non- 
polar groups in the protein interior. The effects of guanidine hydrochloride and urea are 
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Figure 6.27 Proteins can be denatured by high concentrations of guanidine-HCl or urea. The dena- 
turation of chymotrypsin in guanidine-HCI (GdmC)l) is plotted here. (Adapted from Fersht, A., 1999. 
Structure and Mechanism in Protein Science. New York: W. H. Freeman.) 
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more complex. These agents denature proteins by both direct effects (binding to hydro- 
philic groups on the protein) and indirect effects (altering the structure and dynamics 
of the water solvent). Also, both guanidine hydrochloride and urea are good H-bond 
donors and acceptors. 


6.4h Anfinsen’s Classic Experiment Proved 
That Sequence Determines Structure 


What chemical forces drive protein folding? As noted earlier (Section 6.2), all the informa- 
tion needed to fold a polypeptide into its physiological structure is contained in the amino 
acid sequence. This simple but profound truth of protein structure was confirmed in the 
1950s by the elegant studies of denaturation and renaturation of proteins by Christian 
Anfinsen and his co-workers at the National Institutes of Health. For their pivotal stud- 
ies, they chose the small enzyme ribonuclease A from bovine pancreas, a protein with 124 
residues and four disulfide bonds (Figures 6.17 and 6.28). (Ribonuclease cleaves chains 
of ribonucleic acid. Only ribonuclease in its native structure possesses enzyme activity, so 
loss of activity in a denaturation experiment was proof of loss of structure.) They treated 
solutions of ribonuclease with a combination of urea, which unfolded the protein, and 
mercaptoethanol, which reduced the disulfide bridges. This treatment destroyed all enzy- 
matic activity of ribonuclease. 

Anfinsen discovered that removing the mercaptoethanol but not the urea restored 
only 1% of the enzyme activity. This was attributed to the formation of random disul- 
fide bridges by the still-denatured protein. With eight Cys residues, there are 105 pos- 
sible ways to make four disulfide bridges; thus, a residual activity of 1% made sense to 
Anfinsen. (The first Cys to form a disulfide has seven possible partners, the next Cys 
has five possible partners, the next has three, and the last Cys has only one choice. 
7X 5X3 xX 1 = 105). However, if Anfinsen removed mercaptoethanol and urea at the 
same time, the polypeptide was able to fold into its native structure, the correct set of 
four disulfides reformed, and full enzyme activity was recovered (Figure 6.28). This 
experiment demonstrated that the information needed for protein folding resided entirely 
within the amino acid sequence of the protein itself. Many subsequent experiments 
with a variety of proteins have confirmed this fundamental postulate. For his studies 
of the relationship of sequence and structure, Anfinsen shared the 1972 Nobel Prize in 
Chemistry with William H. Stein and Stanford Moore. 


6.41 Is There a Single Mechanism for Protein Folding? 


Christian Anfinsen’s experiments demonstrated that proteins can fold reversibly and 
also implies that the native structures of at least some proteins are thermodynami- 
cally stable states. But the matter of how a given protein achieves such a stable state is 
a complex one. Cyrus Levinthal pointed out in 1968 that so many conformations are 
possible for a typical protein that the protein does not have sufficient time to reach 
its most stable conformational state by sampling all the possible conformations. This 
argument, termed Levinthal’s paradox, goes as follows: Consider a protein of 100 
amino acids. Assume that there are only two conformational possibilities per amino 
acid, or 2!°° = 1.27 X 10% possibilities. Allow 107" s for the protein to test each con- 
formational possibility in search of the overall energy minimum: 


(10-13 s)(1.27 X 10%) = 1.27 x 10" s = 4 X 10° years 


Four billion years is the approximate age of the earth. 

Levinthal’s paradox led protein chemists to hypothesize that proteins must fold by 
specific folding pathways, and many research efforts have been devoted to the search 
for these pathways. Several consistent themes have emerged from these studies. Each of 
them may well play a role in the folding process: 


e Secondary structures—helices, sheets, and turns—probably form first. 
e Nonpolar residues may aggregate or coalesce in a process termed hydrophobic 
collapse. 


A collaboration of computer scientists and bio- 
chemists led by David Baker at the University 

of Washington created Foldit, a computer game 
based on crowdsourcing and distributed comput- 
ing. Nearly 200,000 Foldit players have devoted 
millions of hours of computer time to solving 

a variety of protein folding problems. In Foldit, 
players working collaboratively develop a rich 
assortment of new strategies and algorithms to 
better understand the folding of globular pro- 
teins. Foldit can be found at http://fold.it/portal/. 


(Cooper, S., et al., 2009. Predicting protein struc- 
tures with a multiplayer online game. Nature 
466:756-760; Farley, P. C., 2013. Using the com- 
puter Game “Foldlt” to entice students to explore 
external representations of protein structure in a 
biochemistry course for nonmajors. Biochemistry 
and Molecular Biology Education 41:56-57.) 
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Figure 6.28 Ribonuclease can be unfolded by treatment with urea, and B-mercaptoethanol (MCE) cleaves disulfide 
bonds. If B-mercaptoethanol is then removed (but not urea) under oxidizing conditions, disulfide bonds reform in the 
still-unfolded protein (one possible hypothetical inactive form is shown). If urea is removed in the presence of a small 
amount of -mercaptoethanol with gentle warming, ribonuclease returns to its native structure (with the correct set 

of disulfide bonds), and full enzymatic activity is restored. This experiment demonstrated that the information 
required for folding of globular proteins is contained in the primary structure. 


+MCE 
+Urea Hs 
— s 
-MCE 
—Urea 
+ Oxygen 
Ss @ 
o waso 
Active Inactive 

Small amount of MCE —MCE 
w/gentle warming + Oxygen 


Hypothetical Inactive Form 
(Note random formation of disulfides) 


e Subsequent steps probably involve formation of long-range interactions between 
secondary structures or other hydrophobic interactions. 

e The folding process may involve one or more intermediate states, including transi- 
tion states and what have become known as molten globules. 


The folding of most globular proteins may well involve several of these themes. For 
example, even in the denatured state, many proteins appear to possess small amounts 
of residual structure due to hydrophobic interactions, with strong interresidue con- 
tacts between side chains that are relatively distant in the native protein structure. 
Such interactions, together with small amounts of secondary structure, may act as 
sites of nucleation for the folding process. A bit further in the folding process, the 
molten globule is postulated to be a flexible but compact form characterized by sig- 
nificant amounts of secondary structure, virtually no precise tertiary structure, and a 
loosely packed hydrophobic core. Moreover, it is likely that any one of these themes is 
more important for some proteins than for others. The process of folding is clearly 
complex, but sophisticated simulations have already provided reasonable models of 
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folding (and unfolding) pathways for many proteins (Figure 6.29). One school of 
thought suggests that for any given protein there may be multiple folding pathways. 
For these cases, Ken Dill at the University of California, San Francisco has suggested 
that the folding process can be pictured as a funnel of free energies—an energy land- 
scape (Figure 6.30). The rim at the top of the funnel represents the many possible 
unfolded states for a polypeptide chain, each characterized by high free energy and 
significant conformational entropy. Polypeptides fall down the wall of the funnel as 
contacts made between residues establish different folding possibilities. The narrowing 
of the funnel reflects the smaller number of available states as the protein approaches 


c12 
D (94 ns) D (70 ns) I (30 ns) TS (0.26 ns) P (0 ns) 
Barnase 
D (4 ns) D (2 ns) I (0.7 ns) TS (0.15 ns) P (0 ns) 


Unfolded states 


Partially folded states 


Physiological folded states 


Figure 6.29 Computer simulations of folding 
and unfolding of proteins can reveal possible 
folding pathways. Molecular dynamics simula- 
tions of the unfolding of small proteins such as 
chymotrypsin inhibitor 2 (CI2) and barnase are 
presented here on a reversed time scale to show 
how folding may occur. D = denatured, 

I = intermediate, TS = transition state, 

P = physiological. (Adapted from Daggett, V., 
and Fersht, A. R., 2003. Is there a unifying 
mechanism for protein folding? Trends in Bio- 
chemical Sciences 28:18-25. Figures provided by 
Alan Fersht and Valerie Daggett.) 


Figure 6.30 A model for the steps involved in the folding of globular 


proteins. The funnel represents a free energy surface or energy landscape 
for the folding process. The protein folding process is highly cooperative. 
Rapid and reversible formation of local secondary structures is followed 
by a slower phase in which establishment of partially folded 
intermediates leads to the final tertiary structure. 
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Figure 6.31 Proteins are dynamic structures. The 
marginal stability of a tertiary structure leads to 
flexibility and motion in the protein. Determination 
of structures of proteins (such as the SH3 domain of 
the a-chain of spectrin, shown here) by nuclear mag- 
netic resonance produces a variety of stable tertiary 
structures that fit the data. Such structural ensem- 
bles provide a glimpse into the range of structures 
that may be accessible to a flexible, dynamic protein 
(PDB id = 1M8M). 


its final state, and bumps or pockets on the funnel walls represent partially stable 
intermediates in the folding pathway. The most stable (native) folded state of the pro- 
tein lies at the bottom of the funnel. 


6.4) What Is the Thermodynamic Driving Force 
for Protein Folding? 


The free energy change for the folding of a globular protein must be negative if the 
folded state is more stable than the unfolded state. The free energy change for folding 
depends, in turn, on changes in enthalpy and entropy for the process: 


AG = AH —- TAS 


When AH, — TAS, and AG are measured separately for the polar side chains and for 
the nonpolar side chains of the protein, an important insight is apparent. The enthalpy 
and entropy changes for polar residues largely cancel each other out, and the AG of 
folding for the polar residues is approximately zero. 

To understand the behavior of the nonpolar residues, it is helpful to distinguish the 
AH and —TAS contributions for the polypeptide chain and for the water solvent. Both 
AH and —TAS for the nonpolar residues of the peptide chain are positive and thus make 
unfavorable contributions to the folding free energy. However, large numbers of water 
molecules restricted and immobilized around nonpolar residues in the unfolded protein 
are liberated in the folding process. The burying of nonpolar residues in the folded pro- 
tein’s core produces a dramatic entropy increase for these liberated water molecules. This 
is just enough to make the overall AG for folding negative (and thus favorable). The 
crucial results: 


e The largest contribution to the stability of a folded protein is the entropy change for 
the water molecules associated with the nonpolar residues. 

e The overall free energy change for the folding process is not large, typically —20 to 
—40 kJ/mol. 


6.4k Marginal Stability of the Tertiary Structure 
Makes Proteins Flexible 


A typical folded protein is only marginally stable. The hundreds of van der Waals inter- 
actions and hydrogen bonds in a folded structure are compensated and balanced by a 
dramatic loss of entropy suffered by the polypeptide as it assumes a compact folded 
structure. Because stability seems important to protein and cellular function, it is tempt- 
ing to ask what the advantage of marginal stability might be. The answer appears to lie 
in flexibility and motion. All chemical bonds undergo a variety of motions, including 
vibrations and (for single bonds) rotations. This propensity to move, together with the 
marginal stability of protein structures, means that the many noncovalent interactions 
within a protein can be interrupted, broken, and rearranged rapidly. 


6.41 Motion in Globular Proteins 


Proteins are best viewed as dynamic structures. Most globular proteins oscillate and fluc- 
tuate continuously about their average or equilibrium structures (Figure 6.31). This flex- 
ibility is essential for a variety of protein functions, including ligand binding, enzyme 
catalysis, and enzyme regulation, as shown throughout the remainder of this text. 

The motions of proteins may be motions of individual atoms, groups of atoms, 
or even whole sections of the protein. Furthermore, they may arise either from ther- 
mal energy or from specific, triggered conformational changes in the protein. Atomic 
fluctuations such as vibrations typically are random, are very fast, and usually occur 
over small distances, as shown in Table 6.1. These motions arise from the kinetic energy 
within the protein and are a function of temperature. In the tightly packed interior of 
the typical protein, atomic movements of an angstrom or less are typical. The closer 
to the surface of the protein, the more movement can occur, and on the surface atomic 
movements of several angstroms are possible. 
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| Table6.1 | 6.1 Motion and Fluctuations in Proteins 


Spatial 
Displacement Characteristic 
Type of Motion (A) Time (s) Source of Energy 
Atomic vibrations 0.01-1 107-!-10-! Kinetic energy 
Collective motions 0.01-5 1071-1073 Kinetic energy 
1. Fast: Tyr ring flips; Or MOIE 
methyl group rotations 
2. Slow: hinge bending 
between domains 
Triggered conformation changes 0.5-10 10~°-10° Interactions with 
or more triggering agent 
Proline cis-trans 3-10 10'-10+ Kinetic energy or 
isomerization enzyme driven 


Adapted from Petsko, G. A., and Ringe, D., 1984. Fluctuations in protein structure from X-ray diffraction. Annual Review 
of Biophysics and Bioengineering 13:331-371. 


A class of slower motions, which may extend over larger distances, is collective 
motions. These are movements of a group of atoms covalently linked in such a way 
that the group moves as a unit. Such a group can range from a few atoms to hundreds 
of atoms. These motions are of two types: (1) those that occur quickly but infrequently, 
such as tyrosine ring flips, and (2) those that occur slowly, such as the hinge-bending 
movement between protein domains. For example, the two antigen-binding domains 
of immunoglobulins move as relatively rigid units to selectively bind separate antigen 
molecules. These collective motions also arise from thermal energies in the protein and 
operate on a timescale of 10~'* to 107° s. It is often important to distinguish the time 
scale of the motion itself versus the frequency of its occurrence. A tyrosine ring flip 
takes only a picosecond (1 X 107!? s), but such flips occur only about once every 
millisecond (1 X 1077s). 

Conformational changes involve motions of groups of atoms (individual side 
chains, for example) or even whole sections of proteins. These motions occur on a 
time scale of 10~° to 10° s, and the distances covered can be as large as 1 nm. These 
motions may occur in response to specific stimuli or arise from specific interactions 
within the protein (hydrogen bonding, electrostatic interactions, or ligand binding— 
see Chapters 13-15). 

The cis—trans isomerization of proline residues in proteins (Figure 6.32) occurs over 
an even longer time scale—typically 10! to 104s. Conversion of even a single proline 
from its cis to its trans conformation can alter a protein structure dramatically. Proline 
cis—trans isomerizations sometimes act as switches to activate a protein or open a chan- 
nel across a membrane. 


6.4m The Folding Tendencies and Patterns of Globular Proteins 


Globular proteins adopt the most stable tertiary structure possible. The polypeptide 
itself does not usually form simple straight chains. Even in chain segments where 
helices and sheets are not formed, an extended peptide chain, being composed of 
L-amino acids, has a tendency to twist slightly in a right-handed direction. As shown 
in Figure 6.33, this tendency is apparently the basis for the formation of a variety 
of tertiary structures with a right-handed sense. Principal among these are the right- 
handed twists in B-sheets and right-handed cross-overs in parallel B-sheets. Right- 
handed twisted B-sheets are found at the center of a number of proteins (Figure 6.34) 
and provide an extended, highly stable structural core. 

Connections between B-strands are of two types—hairpins and cross-overs. 
Hairpins, as shown in Figure 6.33, connect adjacent antiparallel B-strands. Cross- 
overs are necessary to connect adjacent (or nearly adjacent) parallel B-strands. Nearly 
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Figure 6.32 The cis and trans configurations of pro- 
line residues in peptide chains are almost equally 
stable. Proline cis-trans isomerizations, often occur- 
ring over relatively long time scales, can alter protein 
structure significantly. 
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(a) (b) 


Antiparallel hairpin 


Natural right-handed twist by polypeptide chain 


Cross-overs Parallel, left-handed 


Figure 6.33 (a) The natural right-handed twist exhibited by polypeptide chains, and (b) the types of 
connections between $-strands. 


Hydrophobic residues are 
buried between layers 


(a) Cytochrome c’ (b) Phosphoglycerate kinase (c) Phosphorylase 
(domain 2) (domain 2) 


Figure 6.34 Examples of protein domains with different numbers of layers of backbone structure. 
(a) Cytochrome c’ with two layers of a-helix. (b) Domain 2 of phosphoglycerate kinase, composed 
of a B-sheet layer between two layers of helix, three layers overall. (c) An unusual five-layer struc- 
ture, domain 2 of glycogen phosphorylase, a B-sheet layer sandwiched between four layers of 
a-helix. (d) The concentric “layers” of B-sheet (inside) and a-helix (outside) in triose phosphate 
isomerase. Hydrophobic residues are buried between these concentric layers in the same manner as 
in the planar layers of the other proteins. The hydrophobic layers are shaded yellow. (Original art 
(d) Triosephosphate isomerase courtesy of Jane Richardson.) 
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A Deeper Look 


Metamorphic Proteins—A Consequence of Dynamism and Marginal Stability 


As our knowledge evolves, we find that our understanding of 
protein structure is like the story of the inebriated man wandering 
under a street lamp one night. A passerby asks what he is doing, and 
the man answers that he is looking for his keys. When the passerby 
asks “Where did you lose them?,” the man answers “Down the street 
a ways.” And when the passerby asks, “Why are you looking here?” 
the man replies, “There’s more light here!” 


N 


Until recently, biochemists have been like the inebriated man. 
Proteins that have been most amenable to structural studies, those 
have been the more stable proteins that favor a single conformation. 
As structural methodologies have evolved, we have come to appreci- 
ate that many proteins can exist as an ensemble of structures of more 
or less similar energies and stabilities. Some of the recent work has 
been remarkable. For example, it is becoming clear that a few muta- 
tions can dramatically change the three-dimensional structure of a 
protein. Philip Bryan and co-workers have shown that the conversion 


of Streptococcus protein G, a 56-residue protein, from a structure 
dominated by three a-helices to a structure with four B-strands and 
an a-helix can occur via a single amino acid substitution—changing 
residue 45 from Leu to Tyr. 


References 
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protein metamorphosis. ACS Chem. Biol. 13:1438-1446. 
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(Left) PDB id = 2KDL; (Right) PDB id = 2KDM 


all cross-over structures are right-handed. In many cross-over structures, the cross- 
over connection itself contains an a-helical segment. This creates a BaB-loop. As 
shown in Figure 6.33, the strong tendency in nature to form right-handed cross-overs, 
the wide occurrence of a-helices in the cross-over connection, and the right-handed 
twists of B-sheets can all be understood as arising from the tendency of an extended 
polypeptide chain (composed of L-amino acids) to adopt a right-handed twist struc- 
ture. This is a chiral effect. Proteins composed of D-amino acids would tend to adopt 
left-handed twist structures. 

The second driving force that affects the folding of polypeptide chains is the need 
to bury the hydrophobic residues of the chain, protecting them from solvent water. 
From a topological viewpoint, then, all globular proteins must have an “inside” where 
the hydrophobic core can be arranged and an “outside” toward which the hydrophilic 
groups must be directed. The sequestration of hydrophobic residues away from water 
is the dominant force in the arrangement of secondary structures and nonrepetitive 
peptide segments to form a given tertiary structure. Globular proteins can be classi- 
fied mainly on the basis of the particular kind of core or backbone structure they use 
to accomplish this goal. The term hydrophobic core, as used here, refers to a region 
in which hydrophobic side chains cluster together, away from the solvent. Backbone 
refers to the polypeptide backbone itself, excluding the particular side chains. Globular 
proteins can be pictured as consisting of “layers” of backbone, with hydrophobic core 
regions between them. More than half of the known globular protein structures have 
two layers of backbone (separated by one hydrophobic core). Roughly one-third of the 
known structures are composed of three backbone layers and two hydrophobic cores. 
There are also a few known four-layer structures and at least one five-layer structure. 
A few structures are not easily classified in this way, but it is remarkable that most pro- 
teins fit into one of these classes. Examples of each are presented in Figure 6.34. 
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6.4n Proteins Can Be Classified on the Basis of Folding Patterns 


The astounding diversity of properties and behaviors in living things can now be 
explored through the analysis of vast amounts of genomic information. Assessment of 
sequence and structural data from over 20 million proteins in both protein and genome 
databases has shown that there is a relatively limited number of structurally distinct 
domains in proteins. Moreover, most proteins are structurally similar to at least some 
other proteins, and in many cases are related evolutionarily. Several comprehensive proj- 
ects have organized the available information in defined hierarchies, or even more com- 
plex relationships or levels of protein structure. The Structural Classification of Proteins 
database (SCOP2, https://scop.mrc-lmb.cam.ac.uk/) uses both automated algorithms and 
manual inspection to describe the structural and evolutionary relationships between 
all the proteins whose structures are known. All proteins in SCOP2 are described and 
can be searched by four properties: protein types, structural classes, folding relation- 
ships, and evolutionary relationships. Protein type includes four groups: soluble, fibrous, 
membrane, and intrinsically disordered proteins. Protein structural classes define the 
types and arrangements of secondary structure (Figure 6.35) in four broad groups: the 
all œ proteins and all B proteins (in which the structures are dominated by a-helices and 
B-sheets, respectively) as well as a/B proteins (in which helices and sheets are intermin- 
gled) and a + B proteins (in which a-helical and B-sheet domains are separated for the 
most part). Folding relationships in SCOP2 are based on families, superfamilies, and 
folds. Family groups closely related proteins that show clear evidence of evolutionary 
origin. Superfamily brings together more distantly related protein domains, again based 
on common evolutionary origin. The fold criterion in SCOP2 relates superfamilies on 
the basis of the global structural features shared by the majority of the proteins. A fold 
describes the number, arrangement, and connections of secondary structure elements. 
Fold is an attribute of a superfamily, but the families within superfamilies that have 
evolved distinct structural features can belong to a different fold. 

Although the numbers of unique folds (more than 1500), superfamilies (more than 
2600), and families (more than 5600) increase as more genomes are known and analyzed, 
it has become apparent that the number of protein domains in nature is large but limited. 
How many proteins can we expect to identify and understand someday? The Protein 
Data Bank curates more than 200,000 structures at present. There are approximately 10? 
to 10° genes per organism and approximately 14 million species of living organisms on 
earth (and this latter number is likely an underestimate). Thus, there may be approxi- 
mately (10° X 1.4 X 107) or 10!° to 10! different proteins in all organisms on earth. Still, 
this vast number of proteins may well consist of only a couple thousand folds and a few 
thousand protein families—remarkably small numbers compared to the total number of 
protein-coding genes (see Table 1.5). It is anticipated that most newly identified proteins 
will resemble other known proteins and that most structures can be broken into two or 
more domains, which resemble tertiary structures observed in other proteins. 
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Figure 6.35 Four major classes of protein structure (as defined in the SCOP database). (a) All a 
proteins, where a-helices dominate the structure; (b) All 8 proteins, in which B-sheets are the primary 
feature; (c) a/B proteins, where a-helices and B-sheets are mixed within a domain; (d) a+ proteins, 
in which a-helical and B-sheet domains are separated to at least some extent. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


6.4 How Do Polypeptides Fold into Three-Dimensional Protein Structures? 177 


(a) Same domain type, different functions: 


Triose phosphate isomerase Aldose reductase Phosphotriesterase 


(b) Same function, different structures: 


D-amino acid aminotransferase 


Aspartate aminotransferase 


Among the proteins in the PDB, many domains and folds recur hundreds or even 
thousands of times, yet conservation of amino acid sequence is much less common. 
When sequence and structure are both conserved, the evolutionary relationship is 
stronger. 

Because structure depends on sequence, and because function depends on structure, 
it is tempting to imagine that all proteins of a similar structure should share a common 
function, but this is not always true. For example, the TIM barrel is a common protein 
fold consisting of eight a-helices and eight B-strands that alternate along the peptide 
backbone to form a doughnut-like tertiary structure. The TIM barrel is named for 
triose phosphate isomerase, an enzyme that interconverts ketone and aldehyde sub- 
strates in the breakdown of sugars. However, other TIM barrel proteins carry out very 
different functions (Figure 6.36a), including the reduction of aldose sugars and hydro- 
lysis of phosphate esters. Moreover, not all proteins of similar function possess similar 
domains. Both proteins shown in Figure 6.36b catalyze the same reaction, but they 
bear little structural similarity to each other. 


6.40 Molecular Chaperones Are Proteins That Help Other Proteins to Fold 


All the information necessary to direct the folding of a polypeptide is contained in 
its primary structure. On the other hand, the high protein concentration inside cells 
may adversely affect the folding process because hydrophobic interactions may lead to 
aggregation of some unfolded or partially folded proteins. Also, it may be necessary 
to suppress or reverse incorrect or premature folding. A family of proteins, known as 
molecular chaperones, are essential for the correct folding of certain polypeptide chains 
in vivo; for their assembly into oligomers; and for preventing inappropriate liaisons with 
other proteins during their synthesis, folding, and transport. Many of these proteins 


Figure 6.36 Structure and function are not always 
correlated closely in proteins. (a) Some proteins 
share similar structural features but carry out quite 
different functions (triose phosphate isomerase, 

PDB id = 8TIM; aldose reductase, PDB id = 1ADS; 
phosphotriesterase, PDB id = 1DPM). (b) Proteins 
with quite different structures can carry out similar 
functions (yeast aspartate aminotransferase, 

PDB id = 1YAA); D-amino acid aminotransferase, 
PDB id = 3DAA). 
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were first identified as heat shock proteins, which are induced in cells by elevated tem- 
perature or other stress. The most thoroughly studied proteins are Hsp70, a 70-kDa 
heat shock protein, and the so-called chaperonins, also known as Cpn60s or Hsp60s, a 
class of 60-kDa heat shock proteins. A well-characterized Hsp60 chaperonin is GroEL, 
an E. coli protein that has been shown to affect the folding of several proteins. 


6.4p Some Proteins Are Intrinsically Unstructured 


Remarkably, many proteins exist and function normally in a partially unfolded state. 
Such proteins, termed intrinsically unstructured proteins (IUPs) or intrinsically dis- 
ordered proteins (IDPs), do not possess uniform structural properties but are none- 
theless essential for basic cellular functions. These proteins are characterized by an 
almost complete lack of folded structure and an extended conformation with high 
intramolecular flexibility. Yet they are essential for many basic cellular functions, 
including protein solubility enhancement, regulation of protein lifetimes via control 
of proteolysis, and control of protein—protein interactions. 

The ordered and disordered domains of a protein (Figure 6.37) may function inde- 
pendently, and regions of a protein can undergo order—disorder transitions. These 
transitions underlie biological function, and they can be induced by binding of small 
molecules and/or macromolecular binding partners, or by post-translational modifica- 
tions. For example, order—disorder transitions modulate DNA-protein interactions. In 
addition, the assembly of ribosomes (the centers of cellular protein synthesis) depends 
on order—disorder transitions in ribosomal proteins, essentially all of which have large 
intrinsically disordered regions. 

Intrinsically unstructured proteins are characterized by a unique combination of high 
net charge and low overall hydrophobicity. Compared with ordered proteins, IUPs have 
higher levels of E, K, R, G, Q, S, and P, and low amounts of I, L, V, W, F, Y, C, and N. 
Intrinsically disordered proteins and disordered regions in otherwise structured proteins 
do not fold primarily because they are rich in polar residues and proline and depleted in 
hydrophobic residues. The greater the net charge, the more extended an IDP region is. 

These features provide a rationale for prediction of regions of disorder from amino 
acid sequence information, and experimental evidence shows that such predictions 
are better than 80% accurate. Genomic analysis of disordered proteins indicates that 
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Figure 6.37 Intrinsically unstructured proteins (IUPs) contact their target proteins over a large sur- 
face area. (a) p275! (yellow) complexed with cyclin-dependent kinase 2 (Cdk2, blue) and cyclin A 
(CycA, green). (b) The transactivator domain CBD of TCF3 (yellow) bound to B-catenin (blue). Note: 
Part of the B-catenin has been removed for a clear view of the CBD. (From Tompa, P., 2002. Intrinsi- 
cally unstructured proteins. Trends in Biochemical Sciences 27:527—533.) (c) Some intrinsically un- 
structured proteins (in red and yellow) bind to their targets by wrapping around them. Shown here: 
SNAP-25 wrapped around BoNT/A, (Adapted from Figure | in Tompa, P., 2002. Intrinsically un- 
structured proteins. Trends in Biological Sciences 27:527-533.) 
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the proportion of the genome encoding IUPs and proteins with substantial regions of 
disorder tends to increase with the complexity of organisms. Thus, predictive analysis 
of whole genomes indicates that 2% of archaeal and 4.2% of bacterial proteins prob- 
ably contain long regions of disorder. On the other hand, 25% to 30% of eukaryotic 
proteins are mostly disordered, and more than half of eukaryotic proteins have long 
regions of disorder. Depending on which predictor is used, the human proteome is esti- 
mated to have between 35% and 50% disordered residues. More than 70% of human 
signaling proteins have long disordered regions. 

A given IDP or IDP region can change its shape in order to bind to multiple differ- 
ent partners—a phenomenon termed one-to-many binding. An example of such behav- 
ior is the tumor suppressor protein p53. This 53 kDa protein consists of an N-terminal 
disordered domain of 93 residues, a central DNA-binding domain of 200 residues, 
and a C-terminal disordered domain of about 100 residues. The N-terminal domain 
apparently binds to more than 40 different protein partners, and the C-terminal domain 
binds to an even larger number of target proteins (Figure 6.38). This large number of 
interactions is consistent with involvement of p53 in multiple signaling and regulatory 
pathways in cells and also suggests a complicated competition between the various pro- 
tein targets for p53. Indeed, p53 is a crucial transcription factor whose activity is mod- 
ulated by a wide spectrum of signals that affect cell proliferation and genome integrity. 

The prevalence of disordered segments in proteins may reflect two different cellular 
needs. First, disordered proteins are more malleable and thus can adapt their struc- 
tures to bind to multiple ligands, including other proteins. Each such interaction could 
provide a different function in the cell. Second, compared with compact, folded pro- 
teins, disordered segments in proteins appear to be able to form larger intermolecular 
interfaces to which ligands, such as other proteins, could bind (Figure 6.37). Folded 
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Figure 6.38 Molecular disorder of tumor suppressor protein p53. The sequence consists of a 93-resi- 
due, mostly disordered N-terminal segment, a central, DNA-binding domain of 200 residues, and a 
mostly disordered C-terminal segment of about a hundred residues. Gray arrows indicate binding in- 
teractions of each domain with partner proteins. Binding partners of the N-terminal domain include 
(clockwise from lower right) high-mobility group protein B1 (PDB id = 2LY4), Taz2 domain of p300 
(PDB id = 2K8F), nuclear coactivator—binding domain of p300 (PDB id = 2L14), MDM2 (PDB id = 
1YCR), N terminus of replication protein A (PDB id = 2B3G), and PH domain of RNA polymerase 
II transcription factor B subunit 1 (PDB id = 2GS0). Eight partners of the C terminal domain are 
(clockwise from upper left) histone acetyltransferase domain of Tetrahymena general control nondere- 
pressor 5 (PDB id = 1Q2D), SET9 (PDB id = 1XQH), CDK2/cyclin A (PDB id = 1H26), Sir2 
(PDB id = 1MA3), bromodomain of CBP (PDB id = IJSP), S1OOB(BB) (PDB id = 1DT7), Tudor2 
domain of PHF20 (PDB id = 2LDM), and p53 homotetramerization (PDB id = 3SAK). (Adapted 
from Oldfield, C. J. and Dunker, K. E., 2014. Intrinsically Disordered Proteins and Intrinsically 
Disordered Protein Regions. Annual Review of Biochemistry 83:553-584.) 
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proteins might have to be two to three times larger to produce the binding surface pos- 
sible with a disordered protein. Larger proteins would increase cellular crowding or 
could increase cell size by 15% to 30%. The flexibility of disordered proteins may thus 
reduce protein, genome, and cell sizes. 


6.4q Artificial Intelligence Algorithms Predict Protein Structures Accurately 


AlphaFold is an artificial intelligence program developed by Alphabet/Google’s 
DeepMind subsidiary that can predict three-dimensional structures of proteins with 
high accuracy. Millions of structures of proteins from the human proteome and other 
key proteins of interest have been generated, based only on the primary structures of 
these proteins. The AlphaFold2 software is open-source and the AlphaFold Protein 
Structure Database (https: //alphafold. ebi.ac.uk/) provides open access to protein models 
generated with this software. 


Critical Developments in Biochemistry 


NMR and Cryo-EM — Revolutionary Methods that Probe Protein Structure and Dynamics 


Nuclear magnetic resonance (NMR) spectroscopy is particularly 
suited for the study of macromolecular structure and dynamics in 
solution. Nuclei with an odd number of protons and/or neutrons 
possess a nuclear spin angular momentum, termed nuclear spin, 
and behave as magnetic dipoles. Certain of these nuclei ('H, 13C, 
ISN, °F, 3!P), when placed in a strong magnetic field, can absorb 
short pulses (~10—20 us) of electromagnetic energy in the radio 
frequency range. The absorption spectrum yields information about 
chemical environment, and the relaxation from high energy states 
provides information about dynamics in the molecule. 'H, °C, and 
ISN are all suitable for NMR studies of proteins, but the natural 
abundance of °C (1%) and N (0.4%) are so low that proteins must 
be expressed in bacteria grown in }C- and !*N-enriched media, to 
achieve the highest possible sensitivity in NMR experiments. 


Two-dimensional NMR can reveal interactions between two 
nuclei (proton-proton, proton-carbon, or proton-nitrogen) in a 
protein, in two ways. Couplings of nuclear spins through cova- 
lent bonds in total correlation spectroscopy (TOCSY) or hetero- 
nuclear single quantum coherence (HSQC) spectroscopy provides 
local structure information, and spin-spin coupling through space 
(the nuclear Overhauser effect, or NOE) identifies pairs of nuclei 
that are close in space but may be distant in the protein sequence in 
NOESY experiments. 

The real advantage of NMR, over other structural measure- 
ments, is in studies of dynamics in proteins. Many proteins are 
ensembles of interchanging states in solution, and the NMR spec- 
trum is a population-weighted average over this ensemble. NMR 
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Figure 1 (a) 'H-SN HSQC spectra of the TAZ1 domain of CREB-binding protein (CBP) 

(a human transcription factor) alone (black) and in complex with two intrinsically disordered 
proteins, HIF-la (gold) and CITED? (blue). Assignments are shown for selected crosspeaks. 
The tryptophan indole resonance is shown as an inset in the bottom left corner. (Adapted 
from Dyson, H. J. and Wright, P. E., Perspective: the essential role of NMR in the discovery 
and characterization of intrinsically disordered proteins. J. Biomol. NMR 73:651-659.) 

(b) NMR structure of TAZ1 in complex with HIF-la. 
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provides quantitative information about ensembles that is not avail- 
able from any other technique. 

Moreover, NMR parameters are now used routinely for charac- 
terization of intrinsically disordered proteins (IDPs, Figure 1) and 
form the basis for quantitative modeling of the conformational 
ensemble and energy landscape. NMR can be used to characterize 
the conformation and dynamics of the free and bound states of 
the IDP, the binding process, and the binding affinity. NMR relax- 
ation data can be used to characterize the kinetics and mechanism 
of coupled folding and binding processes. NMR offers a valu- 
able technology for characterization of conformational ensem- 
bles of proteins, their dynamics, and their complexes with other 
biomolecules. 

Three-dimensional electron microscopy, more commonly known 
as cryo-EM, can determine three-dimensional structures of large 
macromolecules. The methods of cryo-EM-single particle analysis, 
electron crystallography and electron tomography—all involve scat- 
tering of a coherent electron beam to image the biomolecule which 
has been flash-frozen into thin disks of ice using liquid ethane, to 
prevent the formation of crystalline ice. After the electron beam 
passes through the sample, a system of lenses converts the scattered 
signal into a magnified image recorded on a detector. Computerized 
signal processing then generates the three-dimensional structure of 
the sample. In single particle methods, many thousands of differ- 
ent single particles are preserved in a thin layer of non-crystalline 


(a) : (b) 


ice. If the imaged particles show the molecule in many different 
orientations, computational methods can produce a three-dimen- 
sional mass density map, and an atomic model can be fit to the 
map. In electron crystallography, diffraction of electrons from two- 
dimensional or three-dimensional crystals can be used to determine 
three-dimensional structure, in much the same way as X-ray crys- 
tallography. In electron tomography, the electron beam is passed 
through the sample at a series of different tilt angles, and the images 
obtained from multiples copies of the macromolecule are averaged 
by computer. 

Recent technical developments, including improvements in sam- 
ple preparation and preservation, improved detectors and electron 
optics, faster computers and improved data processing software, 
have advanced the power and resolution of cryo-EM dramatically 
(Figure 2). Both single particle cryo-EM and electron diffrac- 
tion methods have now achieved resolution (1.2-1.7 A) rivaling 
X-ray crystallography. For very large macromolecular assemblies, 
cryo-EM data can be combined with data from NMR spectros- 
copy, X-ray crystallography, mass spectrometry and computa- 
tional techniques to elucidate structural details. Such integrative 
methods (IM) have been applied to the structures of ribosomes, 
muscle structures, and tRNA complexes, among others. PDB-Dev 
at https://pdb-dev. wwpdb. org is a prototype archiving system for 
structural models obtained using integrative or hybrid modeling. 


Figure 2 Development of mRNA vaccines for the novel coronavirus, 
SARS -CoV-2, benefitted from years of work on other virus vaccines, 
including cryo-EM studies of virus spike protein structures. Coronavirus 
particles introduced into a host must fuse with (bind to) receptor pro- 
teins on the host’s cells. Prior to this binding, the spike protein is in its 
prefusion conformation, with a portion of its structure in the down po- 
sition (a, PDB id = 6VXX). This form cannot bind to host receptor 
proteins. A second prefusion state, referred to as the up conformation 
(b, PDB id = 6VSB) can bind to those receptors. An mRNA vaccine 
must produce spike proteins in the host that mimic the up prefusion 
conformation, to raise effective antibodies to the virus itself. Barney 
Graham and Jason McLellan produced a mutant spike protein with two 
added prolines that locked the protein into the up prefusion state. The 
mRNA for that “2P” mutant is the basis for both the Pfizer and 
Moderna COVID-19 vaccines. (Wrapp, D. Wang, N., Corbett, K. S. 

et al., 2020. Cryo-EM structure of the 2019-nCoV spike in the prefusion 
conformation. Science 367:1260—1263.) 


6.5 
Level of Protein Structure? 


How Do Protein Subunits Interact at the Quaternary 


Many proteins exist in nature as oligomers, complexes composed of (often sym- 
metric) noncovalent assemblies of two or more monomer subunits. In fact, subunit 
association is a common feature of macromolecular organization in biology. Most 
intracellular enzymes are oligomeric and may be composed either of a single type 
of monomer subunit (homomultimers) or of several different kinds of subunits 
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Figure 6.39 The quaternary structure of liver alcohol dehydrogenase. Within each subunit is a six- 
stranded parallel sheet. Between the two subunits is a two-stranded antiparallel sheet (PDB id = 1CDO). 


(heteromultimers). Liver alcohol dehydrogenase is a homodimer, a protein composed 
of identical subunits (Figure 6.39). Alcohol consumed in a beer or mixed drink is 
oxidized in the liver by alcohol dehydrogenase. Hormonal signals modulate blood 
sugar levels by controlling the activity of glycogen phosphorylase, another homodi- 
meric enzyme. Oxygen is carried in the blood by hemoglobin, which contains two 
each of two different subunits (heterotetramer). A counterpoint to these small clus- 
ters is made by the proteins that form large polymeric aggregates. Proteins are syn- 
thesized on large complexes of many protein units and several RNA molecules called 
ribosomes. Muscle contraction depends on large polymer clusters of the protein myo- 
sin sliding along filamentous polymers of another protein, actin. The way in which 


Human Biochemistry 


Diseases of Protein Folding 


A number of human diseases are linked to abnormalities of protein the onset of disease. The following table summarizes several other 
folding. Protein misfolding may cause disease by a variety of mecha- | mechanisms and provides an example of each. 
nisms. For example, misfolding may result in loss of function and 


Disease Affected Protein Mechanism 

Alzheimer’s disease B-Amyloid peptide (derived from Misfolded B-amyloid peptide accumulates in human neural tissue, 
amyloid precursor protein) forming deposits known as neuritic plaques. 

Familial amyloidotic Transthyretin Aggregation of unfolded proteins. Nerves and other organs are dam- 

polyneuropathy aged by deposits of insoluble protein products. 

Cancer p53 p53 prevents cells with damaged DNA from dividing. One class of 


p53 mutations leads to misfolding; the misfolded protein is unstable 
and is destroyed. 


Creutzfeldt-Jakob disease Prion Prion protein with an altered conformation (PrPS°) may seed confor- 
(human equivalent of mational transitions in normal PrP (PrP°) molecules. 
mad cow disease) 
Hereditary emphysema a,-Antitrypsin Mutated forms of this protein fold slowly, allowing its target, elastase, 
to destroy lung tissue. 
Cystic fibrosis CFTR (cystic fibrosis transmembrane Folding intermediates of mutant CFTR forms don’t dissociate freely 
conductance regulator) from chaperones, preventing the CFTR from reaching its destination 


in the membrane. 


Aguzzi, A., and O’Connor, T., 2010. Protein aggregation diseases: pathogenicity and therapeutic perspectives. Nature Reviews Drug Discovery 9:237-248. 
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Human Biochemistry 


Structural Genomics 


The prodigious advances in genome sequencing in recent years, 
together with advances in techniques for protein structure determi- 
nation, not only have provided much new information for biochem- 
ists but have also spawned a new field of investigation—structural 
genomics, the large-scale analysis of protein structures and functions 
based on gene sequences. The scale of this new endeavor is daunt- 
ing: hundreds of thousands of gene sequences are rapidly being 
determined, and current estimates suggest that there are probably 
fewer than 10,000 distinct and stable polypeptide folding patterns 
in nature. The feasibility of large-scale, high-throughput structure 
determination programs is being explored in a variety of pilot stud- 
ies in Europe, Asia, and North America. These efforts seek to add 
20,000 or more new protein structures to our collected knowledge 
in the near future; from this wealth of new information, it should 
be possible to predict and determine new structures from sequence 
information alone. This effort will be vastly more complex and more 
expensive than the Human Genome Project. It presently costs about 
$100,000 to determine the structure of the typical globular protein, 


and one of the goals of structural genomics is to reduce this number 
to $20,000 or less. Advances in techniques for protein crystallization, 
X-ray diffraction, and NMR spectroscopy, the three techniques es- 
sential to protein structure determination, will be needed to reach 
this goal in the near future. 

The payoffs anticipated from structural genomics are substan- 
tial. Access to large amounts of new three-dimensional structural 
information should accelerate the development of new families 
of drugs. The ability to scan databases of chemical entities for 
activities against drug targets will be enhanced if large numbers 
of new protein structures are available, especially if complexes of 
drugs and target proteins can be obtained or predicted. The im- 
pact of structural genomics will also extend, however, to functional 
genomics—the study of the functional relationships of genomic 
content—which will enable the comparison of the composite 
functions of whole genomes, leading eventually to a complete 
biochemical and mechanistic understanding of all organisms, 
including humans. 


separate folded monomeric protein subunits associate to form the oligomeric protein 
constitutes the quaternary structure of that protein. Table 6.2 lists several proteins 
and their subunit compositions (see also Table 4.2). Proteins with two to four sub- 
units predominate in nature, but many cases of higher numbers exist. 

The subunits of an oligomeric protein typically fold independently and then interact 
with other subunits. The surfaces at which subunits interact are similar in nature to the 
interiors of the individual subunits—closely packed with both polar and hydrophobic 
interactions. Interacting surfaces must therefore possess complementary arrangements 
of polar and hydrophobic groups. 


6.5a There Is Symmetry in Quaternary Structures 


Many multimeric proteins are symmetric arrangements of asymmetric objects (the mono- 
mer subunits). All of the polypeptide’s a-carbons are asymmetric, and the polypeptide 
nearly always folds to form a low-symmetry structure. Thus, protein subunits do not have 
mirror reflection planes, points, or axes of inversion. The only symmetry operation pos- 
sible for protein subunits is a rotation. The most common symmetries observed for mul- 
tisubunit proteins are cyclic symmetry and dihedral symmetry. In cyclic symmetry, the 
subunits are arranged around a single rotation axis, as shown in Figure 6.40. If there are 
two subunits, the axis is referred to as a twofold rotation axis. Rotating the quaternary 
structure 180° about this axis gives a structure identical to the original one. With three 
subunits arranged about a threefold rotation axis, a rotation of 120° about that axis gives 
an identical structure. Dihedral symmetry occurs when a structure possesses at least one 
twofold rotation axis perpendicular to another n-fold rotation axis. This type of subunit 
arrangement (Figure 6.40) occurs in annexin XII (where n = 3). 


6.5b Quaternary Association Is Driven by Weak Forces 


Weak forces stabilize quaternary structures. Typical dissociation constants for simple 
two-subunit associations range from 10~* to 107'® M. These values correspond to 
free energies of association of about 50 to 100 kJ/mol at 37° C. Dimerization of sub- 
units is accompanied by both favorable and unfavorable energy changes. The favor- 
able interactions include van der Waals interactions, hydrogen bonds, ionic bonds, and 
hydrophobic interactions. However, considerable entropy loss occurs when subunits 
interact. When two subunits move as one, three translational degrees of freedom (that 
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of Globular Proteins 


| Table 6.2 | 6.2 Aggregation Symmetries 


Number of 
Protein Subunits 
Alcohol dehydrogenase 2 
Malate dehydrogenase 2 
Superoxide dismutase 2 
Triose phosphate isomerase 2 
Glycogen phosphorylase 2 
Aldolase 3 
Bacteriochlorophyll protein 3 
Concanavalin A 4 
Glyceraldehyde-3-phosphate 4 
dehydrogenase 
Immunoglobulin 4 
Lactate dehydrogenase 4 
Prealbumin 4 
Pyruvate kinase 4 
Phosphoglycerate mutase 4 
Hemoglobin 2+2 
Insulin 6 
Aspartate transcarbamoylase 6+6 
Glutamine synthetase 12 
TMV protein disc 17 
Apoferritin 24 
Coat of tomato bushy stunt virus 180 
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Dimer Trimer Tetramer Pentamer 
ARNT PAS-B E. blattae acid phosphatase Bacillus anthracis dihydrodipicolinate Shiga-like toxin I B 
(PDB id = 2HV1) (PDB id = 2EOI) synthase (PDB id = 1XL9) (PDB id = 1CZG) 


Cyclic hexamer Trimer of dimers Bundled hexamer 
Circadian clock protein KaiC Annexin XII Uridylate kinase 
(PDB id = 1TF7) (PDB id = 1DM5) (PDB id = 2A1F) 


Ñ 
Heptamer Octamer Dodecamer 
Mycobacterium tuberculosis chaperonin-10 Limulus polyphemus SAP-like pentraxin Lactococcus lactis MG1363 DpsB protein 
(PDB id = 1HX5) (PDB id = 1QTJ) (PDB id = 1ZS3) 


Figure 6.40 Multimeric proteins are symmetric arrangements of asymmetric objects. A variety of 
symmetries is displayed in these multimeric structures. 
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is, the ability to move freely in three dimensions) are lost for one subunit because it 
is constrained to move with the other one. In addition, many residues at the subunit 
interface, which were previously free to move on the protein surface, now have their 
movements restricted by the subunit association. This unfavorable energy of associa- 
tion is in the range of 80 to 120 kJ/mol for temperatures of 25° to 37°C. Thus, to 
achieve stability, the dimerization of two subunits must involve approximately 130 to 
220 kJ/mol of favorable interactions.! Van der Waals interactions at protein interfaces 
are numerous, often running to several hundred for a typical monomer—monomer 
association. This would account for about 150 to 200 kJ/mol of favorable free energy 
of association. However, when solvent is removed from the protein surface to form 
the subunit—subunit contacts, nearly as many van der Waals associations are lost as 
are made. One subunit is simply trading water molecules for residues in the other sub- 
unit. As a result, the energy of subunit association due to van der Waals interactions 
actually contributes little to the stability of the dimer. Hydrophobic interactions at the 
subunit—subunit interface, however, are generally very favorable. For many proteins, 
the subunit association process effectively buries as much as 20 nm? of surface area 
previously exposed to solvent, resulting in as much as 100 to 200 kJ/mol of favorable 
hydrophobic interactions. Together with whatever polar interactions occur at the pro- 
tein—protein interface, this is sufficient to account for the observed stabilization that 
occurs when two protein subunits associate. 

An additional and important factor contributing to the stability of subunit asso- 
ciations for some proteins is the formation of disulfide bonds between different sub- 
units. All antibodies are a28>-tetramers composed of two heavy chains (53 to 75 kDa) 
and two light chains (23 kDa). In addition to intrasubunit disulfide bonds (four per 
heavy chain, two per light chain), two intersubunit disulfide bridges hold the two 
heavy chains together and a disulfide bridge links each of the two light chains to a 
heavy chain (Figure 6.41). 


6.5c Open Quaternary Structures Can Polymerize 


All of the quaternary structures we have considered to this point have been closed 
structures, with a limited capacity to associate. (“Closed” here means that addi- 
tional subunits cannot be bound.) Many proteins in nature associate to form open 
heterologous structures, which can polymerize more or less indefinitely, creating 
structures that are both esthetically attractive and functionally important to the 
cells or tissue in which they exist. One such protein is tubulin, the aB-dimeric pro- 
tein that polymerizes into long, tubular structures that are the structural basis 
of cilia, flagella, and the cytoskeletal matrix. The microtubule thus formed 
(Figure 6.42) consist of 13 parallel filaments arising from end-to-end aggregation 
of the tubulin dimers. Human immunodeficiency virus, HIV, the causative agent of 
AIDS is enveloped by a spherical shell composed of hundreds of coat protein sub- 
units, a large-scale, but closed, quaternary association. 


6.5d There Are Structural and Functional Advantages 
to Quaternary Association 


There are several important consequences when protein subunits associate in oligo- 
meric structures. 


Stability One general benefit of subunit association is a favorable reduction of the pro- 
tein’s surface-to-volume ratio. The surface-to-volume ratio becomes smaller as the radius 
of any particle or object becomes larger. (This is because surface area is a function of 


'For example, 130 kJ/mol of favorable interaction minus 80 kJ/mol of unfavorable interaction equals a 
net free energy of association of 50 kJ/mol. 
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3.5 nm to 4.0 nm 
subunit 


Figure 6.42 The structure of a typical microtubule, 
showing the arrangement of the a- and 
B-monomers of the tubulin dimer. 


Figure 6.41 Schematic drawing of an immunoglobulin molecule, showing the intermolecular and intra- 
molecular disulfide bonds. Two identical L chains are joined with two identical H chains. Each L chain is 
held to an H chain via an interchain disulfide bond. The variable regions of the four polypeptides lie at 
the ends of the arms of the Y-shaped molecule. These regions are responsible for the antigen recognition 
function of antibody molecules. For purposes of illustration, some features are shown on only one or the 


other L chain or H chain, but all features are common to both chains. 


Human Biochemistry 


Faster-Acting Insulin: Genetic Engineering Solves a Quaternary Structure Problem 


Insulin is a peptide hormone secreted by the pancreas that regulates 
glucose metabolism in the body. Insufficient production of insulin 
or failure of insulin to stimulate target sites in liver, muscle, and adi- 
pose tissue leads to the serious metabolic disorder known as diabetes 
mellitus. Diabetes afflicts millions of people worldwide. Diabetic 
individuals typically exhibit high levels of glucose in the blood, but 
insulin injection therapy allows these individuals to maintain nor- 
mal levels of blood glucose. 

Insulin is composed of two peptide chains covalently linked by 
disulfide bonds (see Figure 5.8). This monomer of insulin is the ac- 
tive form that binds to receptors in target cells. However, in solution, 
insulin spontaneously forms dimers, which themselves aggregate 
to form hexamers. The surface of the insulin molecule that self- 
associates to form hexamers is also the surface that binds to insulin 
receptors in target cells. Thus, hexamers of insulin are inactive. 

Insulin released from the pancreas is monomeric and acts rapidly at 
target tissues. However, when insulin is administered by injection to a 
diabetic patient, the insulin hexamers dissociate slowly and the patient’s 
blood glucose levels typically drop slowly (over several hours). 


In 1988, researchers at Novo Research Institute in Denmark 
showed that insulin could be genetically engineered to prefer the 
monomeric (active) state. They used recombinant DNA technology 
to produce insulin with an aspartate residue replacing a proline 
at the contact interface between adjacent subunits. The negative 
charge on the Asp side chain creates electrostatic repulsion between 
subunits and increases the dissociation constant for the hexamer 
= — monomer equilibrium. Injection of this mutant insulin into 
test animals produced more rapid decreases in blood glucose than 
did ordinary insulin. This mutant insulin, known as insulin aspart, 
marketed by the Danish pharmaceutical company Novo as Novo- 
Log in the United States and as NovoRapid in Europe, has several 
advantages over ordinary insulin in the treatment of diabetes. 
NovoLog (http.//www.novolog.com) has a faster rate of absorption, 
a faster onset of action, and a shorter duration of action than 
regular human insulin. It is particularly suited for mealtime dosing 
to control postprandial glycemia, the rise in blood sugar following 
consumption of food. Regular human insulin acts more slowly, so 
patients must usually administer it 30 minutes before eating. 
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6.5 How Do Protein Subunits Interact at the Quaternary Level of Protein Structure? 


the radius squared and volume is a function of the radius cubed.) Because interactions 
within the protein usually tend to stabilize the protein and because the interaction of the 
protein surface with solvent water is often energetically unfavorable, decreased surface-to- 
volume ratios usually result in more stable proteins. Subunit association may also serve to 
shield hydrophobic residues from solvent water. Subunits that recognize either themselves 
or other subunits avoid any errors arising in genetic translation by binding mutant forms 
of the subunits less tightly. 


Genetic Economy and Efficiency Oligomeric association of protein monomers is geneti- 
cally economical for an organism. Less DNA is required to code for a monomer that 
assembles into a homomultimer than for a large polypeptide of the same molecular mass. 
Another way to look at this is to realize that virtually all of the information that deter- 
mines oligomer assembly and subunit—subunit interaction is contained in the genetic 
material needed to code for the monomer. For example, HIV protease, an enzyme that 
is a dimer of identical subunits, performs a catalytic function similar to homologous cel- 
lular enzymes that are single polypeptide chains of twice the molecular mass. 


Bringing Catalytic Sites Together Many enzymes derive at least some of their catalytic 
power from oligomeric associations of monomer subunits. This can happen in several 
ways. The monomer may not constitute a complete enzyme active site. Formation of 
the oligomer may bring all the necessary catalytic groups together to form an active 
enzyme. For example, the active sites of bacterial glutamine synthetase are formed from 
pairs of adjacent subunits. The dissociated monomers are inactive. 

Oligomeric enzymes may also carry out different but related reactions on different 
subunits. Thus, tryptophan synthase is a tetramer consisting of pairs of different sub- 
units, @(3>. Purified a-subunits catalyze the following reaction: 


Indoleglycerol phosphate —— indole + glyceraldehyde-3-phosphate 
and the B-subunits catalyze this reaction: 
Indole + L-serine == L-tryptophan 


Indole, the product of the a-reaction and the reactant for the B-reaction, is passed 
directly from the a-subunit to the B-subunit and cannot be detected as a free 
intermediate. 


Cooperativity There is another, more important consequence when monomer subunits 
associate into oligomeric complexes. Most oligomeric enzymes regulate catalytic activity 
by means of subunit interactions, which may give rise to cooperative phenomena. Multi- 
subunit proteins typically possess multiple binding sites for a given ligand. If the binding 
of ligand at one site changes the affinity of the protein for ligand at the other binding 
sites, the binding is said to be cooperative. Information transfer in this manner across 
long distances in proteins is termed allostery, literally action “at another site.” Increases 
in affinity at subsequent sites represent positive cooperativity, whereas decreases in affin- 
ity correspond to negative cooperativity. The points of contact between protein subunits 
provide a mechanism for this signal transduction through the protein structure and for 
communication between the subunits. This in turn provides a way in which the bind- 
ing of ligand to one subunit can influence the binding behavior at the other subunits. 
Such cooperative behavior is the underlying mechanism for regulation of many biologi- 
cal processes. 
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Chapter 6 Proteins: Secondary, Tertiary, and Quaternary Structure 


Foundational Biochemistry Things You Should Know after Reading Chapter 6. 


The secondary, tertiary, and quaternary structures of proteins are sta- 
bilized by weak forces. 


Secondary structures arise through hydrogen bond formation. 
Protein folding is driven predominantly by hydrophobic interactions. 


Charged residues are found mainly on the surface of proteins; hydro- 
phobic residues are located mainly in the protein interior. 


All the information necessary for folding a polypeptide into its physi- 
ological structure is contained in the amino acid sequence of the 
peptide. 


The conformation of the peptide backbone is known if the ¢ and w 
angles of each amide plane are known. 


The nature and dimensions of the a—helix and -sheet and B-turn 
and how each is stabilized by hydrogen bonds. 


The nature and structure of a— and 6-keratin and the collagen helix. 


The reason why helices and sheets comprise the core of most globular 
proteins. 


Waters on the protein surface stabilize globular structures. 


Packing densities of globular proteins are typically 0.75; most of the 
space is small cavities that provide flexibility and facilitate protein 
dynamics. 


Protein domains are nature’s modular strategy for protein design. 


Many proteins, especially proteins larger than 250 residues, consist of 
two or more domains. 


Denaturation leads to loss of structure and function; proteins can be 
renatured under certain conditions. 


Levinthal’s paradox supports the hypothesis that proteins fold by 
concerted mechanisms. 


Folding of globular proteins typically involves a hydrophobic col- 
lapse and transient protein states, such as molten globules. 


The largest contribution to the stability of folded proteins is the 
entropy change for the waters associated with its nonpolar residues. 


Marginal stability contributes to flexibility and motion in proteins. 
The types of motion in proteins and the time scale of each. 


Polypeptide chains made from L-amino acids tend to undergo right- 
handed twists. 


Most globular proteins can be conceptualized as “layer structures.” 


The four classes of globular protein structure: all œ proteins, all 
b proteins, a/B proteins, and a+ proteins. 


Unstructured proteins can adapt to bind to multiple ligands, thus 
exerting multiple functions. 


The weak forces that modulate subunit interactions in multimeric 
proteins are similar to those that stabilize protein monomers. 


Open quaternary proteins structures can polymerize. 


The structural and functional advantages of subunit association 
include stability, genetic economy and efficiency, juxtaposition of 
catalytic sites, and cooperativity. 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. 


Determining the Length of a Keratin Molecule The central rod domain 
of a keratin protein is approximately 312 residues in length. What is 
the length (in A) of the keratin rod domain? If this same peptide 
segment were a true a-helix, how long would it be? If the same 
segment were a B-sheet, what would its length be? 


. Calculating the Rate of Collagen Helix Growth A teenager can grow 4 


inches in a year during a growth spurt. Assuming that the increase in 
height is due to vertical growth of collagen fibers (in bone), calculate 
the number of collagen helix turns synthesized per minute. 


. Assessing the Roles of Weak Forces Between Amino Acids in Proteins 


Discuss the potential contributions to hydrophobic and van der Waals 
interactions and ionic and hydrogen bonds for the side chains of Asp, 
Leu, Tyr, and His in a protein. 


. Secondary Structures Give a possible explanation as to why so many 


proteins, independent of their primary structure, contain a-helices or 
B-sheets in their tertiary structure. 


. Conformation of Collagen Collagen protein forms triple helical, 


quaternary structures (see Figure 6.16) and is rich in modified proline 
and lysine amino acids called hydroxyproline or hydroxylysine. 
Explain why the presence of these modified proline or lysine amino 
acids is crucial to the conformation of collagen. 


. Hydrophobicity/Hydrophilicity Balance of Amino Acids Consider the 


amino acid composition of three different proteins. Which of these 
three proteins has the appropriate balance of hydrophobic, 
hydrophilic, and amphiphilic amino acids in the context of the folding 


of proteins discussed in section 6.4? (Revisit Section 4.1d for the 
classification of amino acids.) 


YA!) 2/Q) 3) m)o;a|> 


Protein A Protein B Protein C 
% % % % % % 
4.3 M 2.5 A 8.9 M 30 A 1.7 M 1.6 
4.8 N 3.7 G 35 N 2.7 C 8.1 N 62 
3.5 P 50 D 2.6 P 63 D 9.8 P 21 
6.0 Q 63 E 2.6 Q 27 E 83 Q 89 
5.0 R 48 F 8.7 R 29 F 2.9 R 68 
4.2 S 82 G 5.1 S 5.0 G 34 S 10.2 
4.1 T 47 H 2.9 T 20 H 9.0 T 93 
4.5 V 45 I 8.0 V 10.9 T. 2.1) V 21 
9.0 W 5.6 K 2.9 WwW 6.2 K 14 W 17 
3.9 Y 54 L 7.1 Y 60 L 24 Y 2.0 


. Amino Acid Structure in Proteins Amino acids typically consist of a 


central carbon atom. Attached to this central carbon are an amino 
group, a carboxylic acid group, a side chain (the R group), and a 
hydrogen. If the a-hydrogens of amino acids were replaced by a 
second side chain (second R group), amino acids could still form 
proteins. What may be the effect of amino acids with two side chains 
on protein structure? 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


. The Role of Proline Residues in B-Turns Proline is the amino acid least 


commonly found in a-helices but most commonly found in B-turns. 
Discuss the reasons for this behavior. 


. Assessing the Cross-Overs of Flavodoxin For flavodoxin (PDB id = 


SNLL), identify the right-handed cross-overs and the left-handed 
cross-overs in the parallel B-sheet. 


Assessing the Range of ¢ and ¢ Angles in Proteins Choose any three 
regions in the Ramachandran plot and discuss the likelihood of 
observing that combination of ¢ and y in a peptide or protein. 
Defend your answer using suitable molecular models of a peptide. 


Protein Structure Evaluation Based on Gel Filtration Data A new protein 
of unknown structure has been purified. Gel filtration chromatography 
reveals that the physiological protein has a molecular weight of 
240,000. Chromatography in the presence of 6 M guanidine hydrochlo- 
ride yields a single peak corresponding to a protein of M, 60,000. 
Chromatography in the presence of 6 M guanidine hydrochloride and 
10 mM B-mercaptoethanol yields peaks for proteins of M, 34,000 and 
26,000. Explain what can be determined about the structure of this 
protein from these data. 


Understanding the Role of Amino Acids in Oligomerization Behavior 
Two polypeptides, A and B, have similar tertiary structures, but A 
normally exists as a monomer, whereas B exists as a tetramer, By. 
What differences might be expected in the amino acid composition of 
A versus B? 


Evaluation of a-Helices in Proteins The hemagglutinin protein in 
influenza virus contains a remarkably long a-helix, with 53 residues. 


a. How long is this a-helix (in nm)? 
b. How many turns does this helix have? 


c. The typical residue in an a-helix is involved in two H bonds. How 
many H bonds are present in this helix? 


Understanding the Role of Gly Residues in Protein Secondary and 
Tertiary Structure It is often observed that Gly residues are conserved 
in proteins to a greater degree than other amino acids. From what you 
have learned in this chapter, suggest a reason for this observation. 


Understanding H-Bond Formation in Proteins Which amino acids 
would be capable of forming H bonds with a lysine residue in a 
protein? 

Assessing the pH Dependence of Poly-t-Glutamate Structure Poly-L- 


glutamate adopts an a-helical structure at low pH but becomes a 
random coil above pH 5. Explain this behavior. 


Exploring the Dimensions of the a-Helix and Coiled Coils Imagine that 
the dimensions of the «æ helix were such that there were exactly 3.5 
amino acids per turn instead of 3.6. What would be the consequences 
for coiled-coil structures? 


(Research Problem) The Nature and Roles of Linear Motifs in Proteins 
In addition to domains and modules, there are other significant 
sequence patterns in proteins—known as linear motifs—that are 
associated with a particular function. Consult the biochemical 
literature to answer the following questions: 


° What are linear motifs? 

° How are they different from domains? 

s What are their functions? 

bd How can they be characterized? 

There are several papers that are good starting points for this problem: 
Neduva, V., and Russell, R., 2005. Linear motifs: evolutionary inter- 
action switches. FEBS Letters 579:3342-3345. 

Gibson, T., 2009. Cell regulation: determined to signal discrete coop- 
eration. Trends in Biochemical Sciences 34:471—482. 

Diella, F., Haslam, N., Chica, C. et al., 2009. Understanding eu- 
karyotic linear motifs and their role in cell signaling and regulation. 
Frontiers of Bioscience 13:6580-6603. 


19. 


20. 


21. 


22. 
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Problems 


(Research Problem) The Nature of Protein-Protein Interactions How do 

proteins interact? When one protein binds to another, one or both 

changes conformation. Two hypotheses have been proposed to describe 

such binding: In the induced fit model, the interaction between a protein 

and a ligand induces a conformation change (in the protein or ligand) 

through a stepwise process. In the conformational selection model, the 

unliganded protein (in the absence of the ligand) exists as an ensemble 

of conformations in a dynamic equilibrium. The binding ligand 

interacts preferentially with one among many of these conformations 

and shifts the equilibrium in favor of the selected conformation. Three 

recent papers shed light on this question: 

Boehr, D., and Wright, P. E., 2008. How do proteins interact? Science 

320:1429-1430. 

Gsponer, J., et al., 2008. A coupled equilibrium shift mechanism in 

calmodulin-mediated signal transduction. Structure 16:736—-746. 

Lange, O., et al., 2008. Recognition dynamics up to microseconds re- 

vealed from an RDC-derived ubiquitin ensemble in solution. Science 

320:1471-1475. 

Consult these papers and answer the following questions: 

e What proteins were studied in these papers? 

e What techniques were used, and what time scales of protein 
motion were studied? 

e What were the conclusions of these papers, and how do these 
results illuminate the choice between induced fit and conforma- 
tional selection in protein-protein interactions? 


(Research Problem) Conformational Transitions in Proteins How do 
proteins accomplish conformational changes? How is it that proteins 
convert precisely and efficiently from one conformation to another? 
Recall from Figure 6.34 that any folding/unfolding transition must 
involve movement across a free-energy landscape, and try to imagine 
the nature of a conformational transition. Are bonds formed and 
broken along the way? What kinds of bonds and interactions might 
be involved? Suggest how such conformational transitions might 
occur. One reference that will be useful in this regard is: 

Boehr, D., 2009. During transitions proteins make fleeting bonds. Cell 
139:1049-1051. 


(Historical Context) The Third Person of the a-Helix Publication Who 
was Herman Branson? What was his role in the elucidation of the 
structure of the a-helix? Did he receive sufficient credit and recogni- 
tion for his contributions? And how did the rest of his career unfold? 
Do a Google search on Herman Branson to learn about his life, and 
read the article by David Eisenberg under Further Reading. You may 
also wish to examine the original paper by Pauling, Corey, and 
Branson as well as the following web site: 
http://www.pnas.org/sitelmisclclassics1.shtml 
Pauling, L., Corey, R. B., and Branson, H. R., 1951. The structure 
of proteins: two hydrogen-bonded helical configurations of the 
polypeptide chain. Proceedings of the National Academy of Sciences, 
USA 37:235-240. 
Predicting Secondary Structure on the Basis of Sequence Consider the 
following peptide sequences: 
EANQIDEMLYNVQCSLTTLEDTVPW 
LGVHLDITVPLSWTWTLYVKL 
QQNWGGLVVILTLVWFLM 
CNMKHGDSQCDERTYP 
YTREQSDGHIPKMNCDS 
AGPFGPDGPTIGPK 
Which of the preceding sequences would be likely to be found in each 
of the following: 
a. A parallel B-sheet 
b. An antiparallel B-sheet 
c. A tropocollagen molecule 
d. The helical portions of a protein found in your hair 
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23. Drawing Secondary Structures To fully appreciate the elements of 
secondary structure in proteins, it is useful to have a practical sense of 
their structures. On a piece of paper, draw a simple but large zigzag 
pattern to represent a B-strand. Then fill in the structure, drawing the 
locations of the atoms of the chain on this zigzag pattern. Then draw 
a simple, large coil on a piece of paper to represent an a-helix. Then 
fill in the structure, drawing the backbone atoms in the correction 
locations along the coil and indicating the locations of the R groups 
in your drawing. 


24. Calculating AG for a Monomer-to-Dimer Transition The dissociation 
constant for a particular protein dimer is 1 uM. Calculate the free 
energy difference for the monomer-to-dimer transition. 


25. Calculating AG for a State Change Consider a protein that can exist in 
two forms: folded and unfolded. Calculate the free energy difference 
at 298 K between a state in which 80% of the protein is folded and a 
state in which 80% of the protein is unfolded. 


26. Analyzing a Small Element of Tertiary Structure Consider a protein 
structure consisting of a Ba loop, with B-bends linking the ends of 
the a-helix to the B-strands. 

If the length of the a-helix and the lengths of each B-strand are all 

the same (4.2 nm): 

a. How many amino acid residues does the a-helical segment 
contain? 

b. How many turns does this a-helix have? 

c. How many amino acid residues does each B-strand contain? 
What is the total number of amino acid residues in the complete 
Ba structure? 

e. Approximately how many hydrogen bonds do you think this BaB 
structure has? 

27. Determining the Charge of a Peptide Consider the peptide 
EYSTKKHTR. Assuming that the pK, values for the ionizable 
groups in this peptide are the same as in the free amino acids, what is 
the charge on this peptide at pH 9.8? Explain your answer in detail. 


28. Determining the Charge of a Peptide Consider the peptide 
WDSTRKHCR. Assuming that the pK, values for the ionizable 
groups in this peptide are the same as in the free amino acids, what is 
the charge on this peptide at pH 6? Explain your answer in detail. 


29. Assessing the Structural Consequences of Protein Mutations The 
B.1.1.7 strain of the novel SARS-CoV-2 coronavirus has several 
mutations in its spike protein, including: 

N501Y AS570D P681H D1118H 
Consider each of these mutated positions in the spike protein and 


describe the possible consequences for the spike protein structure and 
for binding to potential ligands, such as human cell surface proteins. 


Think-Pair-Share question 


Secondary Structures as Modules The figure shows crude representations 
of amino acid segments that could be a-helices, parts of a B-sheet or have 
no specific structure. 


S 


amorphous 
segment 


a-helix 
segment 


segment for 
B-sheets 


Challenge a fellow student (and receive a similar challenge) to construct 
crude protein models following specific requirements with increasing 
complexity. 

A couple of examples are suggested: 


Example 1: Construct a protein model consisting of five segments 
built from one B-sheet with at least two segments, one a-helix and one 
amorphous segment. 

Example 2: Construct a protein model containing 15 amino acid 
segments, at least two a-helices, at least three B-sheets containing a 
minimum of three amino acid segments, not more than three amor- 
phous segments and no two a-helices or amorphous segments should 
be connected to each other. 
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Carbohydrates 
and the Glycoconjugates 
of Cell Surfaces 


Burstein Collection/Corbis 


Essential Question 


Carbohydrates are a versatile class of molecules of the formula (CH2O),. They are a major 
form of stored energy in organisms, and they are the metabolic precursors of virtually all 
other biomolecules. Conjugates of carbohydrates with proteins and lipids perform a variety 
of functions, including recognition events that are important in cell growth, transformation, 
and other processes. What are the structure, chemistry, and biological function of 
carbohydrates? 


Co are the single most abundant class of organic molecules found in 


nature. Energy from the sun captured by green plants, algae, and some bacteria 


during photosynthesis converts more than 250 billion kilograms of carbon dioxide 
into carbohydrates every day on earth. In turn, carbohydrates are the metabolic 
precursors of virtually all other biomolecules. Breakdown of carbohydrates provides 
the energy that sustains animal life. Carbohydrates can be covalently linked with a 
variety of other molecules. These glycoconjugates are important components of cell 
walls and extracellular structures in plants, animals, and bacteria. In addition to the 
structural roles such molecules play, they serve in a variety of processes involving 
recognition between cell types or recognition of cellular structures by other molecules. 
Recognition events are important in normal cell growth, fertilization, transformation 
of cells, and other processes. 


All of these functions are made possible by the characteristic chemical features of 


carbohydrates: 


The existence of at least one and often two or more asymmetric centers 

The ability to exist either in linear or ring structures 

The capacity to form polymeric structures via glycosidic bonds 

The potential to form multiple hydrogen bonds with water or other molecules in 
their environment 


Sugar in the gourd and honey in the horn, 
I never was so happy since the hour I was 


born. 


Turkey in the Straw, stanza 6 
(classic American folk tune) 


“The Discovery of Honey’—Piero di Cosimo (1492). 
Honey is mostly carbohydrates. 


Key Questions 


71 
7.2 


7.3 


7.4 


75 


7.6 


7.7 


How Are Carbohydrates Named? 


What Are the Structure and Chemistry 
of Monosaccharides? 


What Are the Structure and Chemistry 
of Oligosaccharides? 


What Are the Structure and Chemistry 
of Polysaccharides? 

What Are Glycoproteins, and How Do 
They Function in Cells? 

How Do Proteoglycans Modulate 
Processes in Cells and Organisms? 


How Do Carbohydrates Provide a 
Structural Code? 
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7.1 | How Are Carbohydrates Named? 


The name carbohydrate arises from the basic molecular formula (CH;O),, where 
n = 3 or more. (CH20), can be rewritten (C-H20), to show that these substances are 
hydrates of carbon. 

Carbohydrates are generally classified into three groups: monosaccharides (and their 
derivatives), oligosaccharides, and polysaccharides. The monosaccharides are also called 
simple sugars and have the formula (CH2O),. Monosaccharides cannot be broken down 
into smaller sugars under mild conditions. Oligosaccharides derive their name from the 
Greek word oligo, meaning “few,” and consist of two to ten simple sugar residues. Di- 
saccharides are common in nature, and trisaccharides also occur frequently. Four- to 
six-sugar-unit oligosaccharides are usually bound covalently to other molecules, includ- 
ing glycoproteins. As their name suggests, polysaccharides are polymers of the simple 
sugars and their derivatives. They may be either linear or branched polymers and may 
contain hundreds or even thousands of monosaccharide units. Their molecular weights 
range up to | million or more. 


Careers in Chemistry 


Dr. Shannon Timmons 


Dr. Shannon Timmons was born in Nova Scotia, 
Canada where she was inspired to pursue a career in 
science based on her admiration of a woman in her 
community who was a leader in public health. After 
encouragement from her introductory chemistry 
professor, she earned her Bachelor of Science in 
Chemistry from Acadia University and her Ph.D. 
in Medical Chemistry from Dalhousie University. 
Her Ph.D. research involved the development 
of new chemical and enzymatic methods to 
synthesize sugar nucleotides, the building blocks of 
carbohydrate polymers. 


In 2008, Dr. Timmons joined the faculty of 
Lawrence Technological University in Southfield, 
Michigan, where she teaches organic chemistry, 
biochemistry, and serves as chair of the Department of 
Natural Sciences. As a professor, she values inclusivity 
and takes care to highlight the contributions of 
women and people of color in chemistry and biology 
so that more students may feel emboldened to pursue 
careers in STEM fields. In her words, “The field 
of education is a calling that changes lives, and I 
feel privileged to have the opportunity to teach and 
mentor so many next-generation scientists.” 


(a) 


CHOH 
HO— C—H or H— C— OH 
| | 
CHOH CHOH CHOH 
L-Isomer D-Isomer 
Glyceraldehyde Dihydroxy- 
acetone 
(b) 
CHO CHO 
HO=—C—H H=C—OH 
CH,OH CH,OH 
t-Glyceraldehyde p-Glyceraldehyde 


Figure 7.1 The D,L system of nomenclature is based 
on the structures of D- and L-glyceraldehyde. 

(a) Structure of a simple aldose (glyceraldehyde) 
and a simple ketose (dihydroxyacetone). (b) The 
configurations of D- and L-glyceraldehyde. 


7.2 | What Are the Structure and Chemistry 


of Monosaccharides? 


Monosaccharides are ubiquitous in nature. This section describes their nomencla- 
ture, structures, and chemistry. 


7.2a Monosaccharides Are Classified as Aldoses and Ketoses 


Monosaccharides consist typically of three to seven carbon atoms and are described 
either as aldoses or ketoses, depending on whether the molecule contains an aldehyde 
function or a ketone group. The simplest aldose is glyceraldehyde, and the simplest 
ketose is dihydroxyacetone (Figure 7.1). These two simple sugars are termed trioses 
because they each contain three carbon atoms. The structures and names of a family 
of aldoses and ketoses with three, four, five, and six carbons are shown in Figures 7.2 
and 7.3. Hexoses are the most abundant sugars in nature. Nevertheless, sugars from 
all these classes are important in metabolism. 

Monosaccharides, either aldoses or ketoses, are often given more detailed generic 
names to describe both the important functional groups and the total number of car- 
bon atoms. Thus, one can refer to aldotetroses and ketotetroses, aldopentoses and keto- 
pentoses, aldohexoses and ketohexoses, and so on. Sometimes the ketone-containing 
monosaccharides are named simply by inserting the letters -ul- into the simple generic 
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ALDOTRIOSE 


Carbon 
number 


D-Glyceraldehyde 


ALDOTETROSES 


CHO CHO 
sel e 
con ALDOPENTOSES Hodi 
a TUBE 
Lab ae 


p-Arabinose (Ara) 


/ N 


C 
HCOH 
HCOH 
HOCH 
HCOH 
CHOH 
D-Mannose D-Gulose 
(Glc) (Man) 
ALDOHEXOSES 


D-Xylose (Xyl) 
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HOCH 


HCOH 


HOCH 


HCOH 


CH,OH 


D-Idose 


Figure 7.2 The structure and stereochemical relationships of D-aldoses with three to six carbons. The 
configuration in each case is determined by the highest numbered asymmetric carbon (shown in pink). 
In each row, the new asymmetric carbon is the lowest numbered and is shown in yellow. Blue 
highlights indicate the most common aldoses, D-glyceraldehyde, D-erythrose, D-ribose, D-glucose, 


D-mannose and D-galactose. 


terms, such as fetruloses, pentuloses, hexuloses, heptuloses, and so on. The simplest 
monosaccharides are water soluble, and most taste sweet. 


7.2b Stereochemistry Is a Prominent Feature of Monosaccharides 


Aldoses with at least three carbons and ketoses with at least four carbons contain 
chiral centers (see Chapter 4). The nomenclature for such molecules must specify the 
configuration about each asymmetric center, and drawings of these molecules must be 
based on a system that clearly specifies these configurations. As noted in Chapter 4, 
the Fischer projection system is used almost universally for this purpose today. The 
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Figure 7.3 The structure and stereochemical 
relationships of D-ketoses with three to six carbons. 
The configuration in each case is determined by the 
highest numbered asymmetric carbon (shown in 
pink). In each row, the new asymmetric carbon is the 
lowest numbered and is shown in yellow. Blue 
highlights indicate the most common ketoses, 
dihydroxyacetone, D-ribulose, and D-fructose. 


il CHOH 
Carbon 
number O KETOTRIOSE 
3) CHOH 
Dihydroxyacetone 
1 CHOH 
P (0) 
Car Pon KETOTETROSE 
number > MICOH 
4 CHOH 
D-Erythrulose 
i CHOH CHOH 
| 
2 C=O C=O 
Carbon | 
om) HCOH HOCH KETOPENTOSES 
number | | 
4 HCOH HCOH 
| 
5 CHOH CHOH 
D-Ribulose D-Xylulose 
1 CH,OH CHOH CHOH CHOH 
l 
2 C=O C—O C=0 C=O 
l 
Cah 3 HCOH HOCH HCOH HOCH 
te | KETOHEXOSES 
4 HCOH HCOH HOCH HOCH 
l 
5 HCOH HCOH HCOH HCOH 
l 
6 CHOH CHOH CHOH CHOH 
D-Psicose D-Fructose D-Sorbose D-Tagatose 


structures shown in Figures 7.2 and 7.3 are Fischer projections. For monosaccharides 
with two or more asymmetric carbons, the prefix D or L refers to the configuration of 
the highest numbered asymmetric carbon (the asymmetric carbon farthest from the 
carbonyl carbon). A monosaccharide is designated D if the hydroxyl group on the 
highest numbered asymmetric carbon is drawn to the right in a Fischer projection, as 
in D-glyceraldehyde (Figure 7.1). Note that the designation D or L merely relates the 
configuration of a given molecule to that of glyceraldehyde and does not specify the 
sign ofrotation of plane-polarized light. If the sign of optical rotation is to be specified 
in the name, the convention of D or L designations may be used along with a + (plus) 
or — (minus) sign. Thus, D-glucose (Figure 7.2) may also be called D(+)-glucose 
because it is dextrorotatory, whereas D-fructose (Figure 7.3), which is levorotatory, 
can also be named D(—)-fructose. 

All of the structures shown in Figures 7.2 and 7.3 are D-configurations, and the 
D-forms of monosaccharides predominate in nature, just as L-amino acids do. These 
preferences, established by apparently random choices early in evolution, persist uni- 
formly in nature because of the stereospecificity of the enzymes that synthesize and 
metabolize these small molecules. L-Monosaccharides do exist in nature, serving a few 
relatively specialized roles. L-Galactose is a constituent of certain polysaccharides, and 
L-arabinose is a constituent of bacterial cell walls. 
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According to convention, the D- and L-forms of a monosaccharide are mirror images 
of each other, as shown in Figure 7.4 for fructose. Stereoisomers that are nonsuper- 
imposable mirror images of each other are called enantiomers, or sometimes enantiomeric 
pairs. For molecules that possess two or more chiral centers, more than two stereoisomers 
can exist. Pairs of isomers that have opposite configurations at one or more of the 
chiral centers but that are not mirror images of each other are called diastereomers 
or diastereomeric pairs. Any two structures in a given row in Figures 7.2 and 7.3 are 
diastereomeric pairs. Two sugars that differ in configuration at only one chiral center 
are described as epimers. For example, D-mannose and D-talose are epimers and D-glucose 
and D-mannose are epimers, whereas D-glucose and D-talose are not epimers but merely 
diastereomers. 


7.2c Monosaccharides Exist in Cyclic and Anomeric Forms 


Although Fischer projections are useful for presenting the structures of particular 
monosaccharides and their stereoisomers, they discount one of the most interesting 
facets of sugar structure—the ability to form cyclic structures with formation of an 
additional asymmetric center. Alcohols react readily with aldehydes to form hemiacetals 
(Figure 7.5). The British carbohydrate chemist Sir Norman Haworth showed that the 
linear form of glucose (and other aldohexoses) could undergo a similar intramolecular 
reaction to form a cyclic hemiacetal. The resulting six-membered, oxygen-containing 
ring is similar to pyran and is designated a pyranose. The reaction is catalyzed by acid 
(H*) or base (OH _) and is readily reversible. 

In a similar manner, ketones can react with alcohols to form hemiketals. The analo- 
gous intramolecular reaction of a ketose sugar such as fructose yields a cyclic hemiketal 
(Figure 7.6). The five-membered ring thus formed is reminiscent of furan and is referred 
to as a furanose. The cyclic pyranose and furanose forms are the preferred structures for 
monosaccharides in aqueous solution. At equilibrium, the linear aldehyde or ketone 
structure is only a minor component of the mixture (generally much less than 1%). 
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Ti 1 OH H \ y 
H— G—0OH HO 13 2 | 
. C m C CHOH 
H—C—OH H OH 
H Ç O OH 
6 | H NU 
CHOH Cc C 
O 1 Mg H | 
p-Glucose A HO No È H 
) | | 
/ H OH 
Pyran B-p-Glucopyranose 


HAWORTH PROJECTION FORMULAS 


Figure 7.5 Glucose can undergo an intramolecular reaction to form a cyclic hemiacetal. (a) Formation 
of a hemiacetal. (b) The linear form of D-glucose undergoes an intramolecular reaction to form a 
cyclic hemiacetal, shown as Haworth projections. (c) Cyclic pyranose forms, shown as Fischer 
projections. 
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Figure 7.4 D-Fructose and L-fructose, an 
enantiomeric pair. Note that changing the 
configuration only at C; would change D-fructose 
to L-sorbose. 
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Figure 7.6 Fructose can undergo an intramolecular reaction to form a cyclic hemiketal. (a) Formation of 
a hemiketal. (b) The linear form of D-fructose undergoes an intramolecular reaction to form a cyclic 
hemiketal, shown as Haworth projections. (c) Cyclic furanose forms shown as Fischer projections. 


When hemiacetals and hemiketals are formed, the carbon atom that carried the 
carbonyl function becomes an asymmetric carbon atom. Isomers of monosaccharides 
that differ only in their configuration about that carbon atom are called anomers, 
designated as a or 8, as shown in Figure 7.5, and the carbonyl carbon is thus called the 
anomeric carbon. When the hydroxyl group at the anomeric carbon is on the same side 
of a Fischer projection as the oxygen atom at the highest numbered asymmetric carbon, 
the configuration at the anomeric carbon is a, as in a-D-glucose. When the anomeric 
hydroxyl is on the opposite side of the Fischer projection, the configuration is B, as in 
B-D-glucopyranose (Figure 7.5). 

Chiral molecules rotate plane-polarized light either clockwise or counter-clockwise, 
a property termed optical rotation. The addition of a new asymmetric center upon 
hemiacetal and hemiketal formation alters the optical rotation properties of monosac- 
charides, and the original assignment of the a and £ notations arose from studies of 
these properties. Early carbohydrate chemists frequently observed that the optical rota- 
tion of glucose (and other sugar) solutions could change with time, a process called 
mutarotation. This indicated that a structural change was occurring. It was eventually 
found that a-p-glucose has a specific optical rotation, [a], of 112.2°, and that B-p- 
glucose has a specific optical rotation of 18.7°. Mutarotation involves interconversion 
of a- and B-forms of the monosaccharide with intermediate formation of the linear 
aldehyde or ketone, as shown in Figures 7.5 and 7.6. 


7.2d Haworth Projections Are a Convenient Device for Drawing Sugars 


Another of Haworth’s lasting contributions to the field of carbohydrate chemistry was 
his proposal to represent pyranose and furanose structures as hexagonal and pentago- 
nal rings lying perpendicular to the plane of the paper, with thickened lines indicating 
the side of the ring closest to the reader. Such Haworth projections, which are now 
widely used to represent saccharide structures (Figures 7.5 and 7.6), show substituent 
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groups extending either above or below the ring. (A substituent is an atom or group of 
atoms taking the place of another atom or group or occupying a specified position in 
a molecule.) Substituents drawn to the left in a Fischer projection are drawn above the 
ring in the corresponding Haworth projection. Substituents drawn to the right in a 
Fischer projection are below the ring in a Haworth projection. Exceptions to these 
rules occur in the formation of furanose forms of pentoses and the formation of fura- 
nose or pyranose forms of hexoses. In these cases, the structure must be redrawn with 
a rotation about the carbon whose hydroxyl group is involved in the formation of the 
cyclic form (Figure 7.7) in order to orient the appropriate hydroxyl group for ring for- 
mation. This involves no change in configuration of the saccharide molecule. 

The rules previously mentioned for assignment of œ- and B-configurations can be 
readily applied to Haworth projection formulas. For the D-sugars, the anomeric 
hydroxyl group is below the ring in the a-anomer and above the ring in the B-anomer. 
For L-sugars, the opposite relationship holds. 

As Figure 7.7 implies, in most monosaccharides there are two or more hydroxyl groups 
that can react with an aldehyde or ketone at the other end of the molecule to form a hemi- 
acetal or hemiketal. Consider the possibilities for glucose, as shown in Figure 7.7. If the C-4 
hydroxyl group reacts with the aldehyde of glucose, a five-membered ring is formed, whereas 
if the C-5 hydroxyl reacts, a six-membered ring is formed. The C-6 hydroxyl does not react 
effectively because a seven-membered ring is too strained to form a stable hemiacetal. The 
same is true for the C-2 and C-3 hydroxyls, and thus five- and six-membered rings are by 
far the most likely to be formed from six-membered monosaccharides. D-Ribose, with five 
carbons, readily forms either five-membered rings (a@- or B-D-ribofuranose) or six- 
membered rings (a- or B-D-ribopyranose) (Figure 7.7). In general, aldoses and ketoses with 
five or more carbons can form either furanose or pyranose rings, and the more stable form 
depends on structural factors. The nature of the substituent groups on the carbonyl 
and hydroxyl groups and the configuration about the asymmetric carbon will determine 
whether a given monosaccharide prefers the pyranose or furanose structure. In general, the 
pyranose form is favored over the furanose ring for aldohexose sugars, although, as we shall 
see, furanose structures tend to be more stable for ketohexoses. 

Although Haworth projections are convenient for displaying monosaccharide 
structures, they do not accurately portray the conformations of pyranose and furanose 
rings. Given C—C—C tetrahedral bond angles of 109° and C—O—C angles of 111°, 
neither pyranose nor furanose rings can adopt true planar structures. Instead, they take on 
puckered conformations, and in the case of pyranose rings, the two favored structures are 
the chair conformation and the boat conformation, as shown in Figure 7.8. Note that the 
ring substituents in these structures can be equatorial, which means approximately coplanar 
with the ring, or axial, that is, parallel to an axis drawn through the ring as shown. 


CHOH 
O oH 
OH 
HO CH,OH 
Ori <<. | ge 
HC— OH CH),—OH 
anose form 
Pyr a N H a N H 
/ 7 
C Cc 
CHOH OH \ S 
| oO 
CHOH o a on San OH ee 
OH p-Glucose p-Ribose 
OH 
OH 


Furanose form 


Figure 7.7 D-Glucose, D-ribose, and other simple sugars can cyclize in two ways, forming either 
furanose or pyranose structures. 
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Figure 7.8 Conformations of simple sugars may be 
described in chair and boat representations. 

(a) Chair and boat conformations of a pyranose 
sugar. (b) Two possible chair conformations of 
B-D-glucose. 


a = axial bond 
e = equatorial bond 


Chair Boat 


(b) 


Two general rules dictate the conformation to be adopted by a given saccharide unit. First, 
bulky substituent groups on such rings are more stable when they occupy equatorial 
positions rather than axial positions, and second, chair conformations are slightly more 
stable than boat conformations. For a typical pyranose, such as B-D-glucose, there are two 
possible chair conformations (Figure 7.8). Of all the D-aldohexoses, B-D-glucose is the 
only one that can adopt a conformation with all its bulky groups in an equatorial position. 
With this advantage of stability, it may come as no surprise that B-D-glucose is the most 
widely occurring organic group in nature and the central hexose in carbohydrate 
metabolism. 


7.2e Monosaccharides Can Be Converted to Several Derivative Forms 


A variety of chemical and enzymatic reactions produce derivatives of the simple sug- 
ars. These modifications produce a diverse array of saccharide derivatives. Some of 
the most common derivations are discussed here. 


Sugar Acids Sugars with free anomeric carbon atoms are reasonably good reducing 
agents and will reduce hydrogen peroxide, ferricyanide, certain metals (Cu** and 
Ag*), and other oxidizing agents. Such reactions convert the sugar to a sugar acid. For 
example, addition of alkaline CuSO, (called Fehling’s solution) to an aldose sugar 
produces a red cuprous oxide (Cu,O) precipitate: 


ji 1 
RC—H + 2 Cu?* + 5 OHT —> RC—O™ + CwO} + 3 HO 
Aldehyde Carboxylate 


and converts the aldose to an aldonic acid, such as gluconic acid (Figure 7.9). Formation 
of a precipitate of red CuO constitutes a positive test for an aldehyde. Carbohydrates 
that can reduce oxidizing agents in this way are referred to as reducing sugars. By 
quantifying the amount of oxidizing agent reduced by a sugar solution, one can 
accurately determine the concentration of the sugar. Diabetes mellitus is characterized 
by high levels of glucose in urine and blood and frequent analysis of reducing sugars 
in diabetic patients is an important part of the diagnosis and treatment of this disease. 
Over-the-counter kits for the easy and rapid determination of reducing sugars have 
made this procedure a simple one for diabetic persons. 

Monosaccharides can be oxidized enzymatically at C-6, yielding uronic acids, such as 
D-glucuronic and L-iduronic acids (Figure 7.9). L-Iduronic acid is similar to D-glucuronic 
acid, except it has an opposite configuration at C-5. Oxidation at both C-1 and C-6 
produces aldaric acids, such as D-glucaric acid. 
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H—C— OH Figure 7.9 Oxidation of D-glucose to sugar acids. 
COOH Oxidized positions are highlighted. D-Gluconic acid and 
other aldonic acids exist in equilibrium with lactone 
p-Glucaric acid structures. 
CHOH CHOH 
HSCS OH HO- C-H CHOH CHOH 
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CH OH CH OH CH OH CHOH H OH CHOH 
D-Glucitol D-Mannitol D-Xylitol D-Glycerol myo-Inositol p-Ribitol 
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Figure 7.10 Structures of some sugar alcohols. (Note that myo-inositol is a polyhydroxy cyclohexane, 
not a sugar alcohol.) 


Sugar Alcohols Sugar alcohols, another class of sugar derivative, can be prepared by 
the mild reduction (with NaBH; or similar agents) of the carbonyl groups of aldoses 
and ketoses. Sugar alcohols, or alditols, are designated by the addition of -itol to the 
name of the parent sugar (Figure 7.10). The alditols are linear molecules that cannot 
cyclize in the manner of aldoses. Nonetheless, alditols are characteristically sweet 
tasting, and sorbitol, mannitol, and xylitol are widely used to sweeten sugarless gum 
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Figure 7.11 Sugar alcohols such as sorbitol, 
mannitol, and xylitol sweeten many sugarless gums 
and candies. 


A Deeper Look 
Honey—An Ancestral Carbohydrate Treat 


Honey, the first sweet known to humankind, is the only sweetening 
agent that can be stored and used exactly as produced in nature. Bees 
process the nectar of flowers so that their final product is able to sur- 
vive long-term storage at ambient temperature. Honey has been used 
as ceremonial material, medicinal agent, and food sweetener since at 
least 2500 BCE, when Ancient Egyptians developed beekeeping. Only 
in modern times have cane and beet sugar surpassed honey as the most 
frequently used sweetener. What is the chemical nature of this magical, 
viscous substance? 

The bees’ processing of honey consists of (1) reducing the water 
content of the nectar (30% to 60%) to the self-preserving range of 
15% to 19%, (2) hydrolyzing the significant amount of sucrose in 
nectar to glucose and fructose by the action of the enzyme invertase, 
and (3) producing small amounts of gluconic acid from glucose 
by the action of the enzyme glucose oxidase. Most of the sugar 
in the final product is glucose and fructose, and the final product 
is supersaturated with respect to these monosaccharides. Honey 
actually consists of an emulsion of microscopic glucose hydrate and 
fructose hydrate crystals in a thick syrup. Sucrose accounts for only 
about 1% of the sugar in the final product, with fructose at about 38% 
and glucose at 31% by weight. 

Figure 1 shows a °C nuclear magnetic resonance spectrum of 
honey from a mixture of wildflowers in southeastern Pennsylvania. 
Interestingly, five major hexose species contribute to this spectrum. 
Although it is often assumed that fructose exists exclusively in its 
furanose form, the predominant form of fructose in honey (67% of 
total fructose) is B-D-fructopyranose, with the B- and a-fructofuranose 
forms accounting for 27% and 6% of the fructose, respectively 


istockphoto.com/florintt 


and mints (Figure 7.11). Sorbitol buildup in the eyes of diabetic persons is implicated 
in cataract formation. Glycerol and myo-inositol, a cyclic alcohol, are components of 
lipids (see Chapter 8). There are nine different stereoisomers of inositol; the one 
shown in Figure 7.10 was first isolated from heart muscle and thus has the prefix myo- 
for muscle. Ribitol is a constituent of flavin coenzymes (see Chapter 19). 


Deoxy Sugars The deoxy sugars are monosaccharides with one or more hydroxyl 
groups replaced by hydrogens. 2-Deoxy-D-ribose (Figure 7.12), whose systematic 
name is 2-deoxy-D-erythropentose, is a constituent of DNA in all living things. 


(Figure 2). In polysaccharides, fructose invariably prefers the furanose 
form, but free fructose (and crystalline fructose) is predominantly 
B-fructopyranose, which is sweeter than the furanose form. 
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Figure 1 A °C nuclear magnetic resonance spectrum of honey 
from a mixture of wildflowers in southeastern Pennsylvania, 
showing five major hexose species. 
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Figure 2 The principal forms of fructose found in honey. 
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Figure 7.12 Several deoxy sugars. Hydrogen and carbon atoms highlighted in red circles are “deoxy” 
positions. 


H OH OH H 


Figure 7.13 Three sugar esters that play an important role in metabolism (phosphate groups 
highlighted in orange.) 


Deoxy sugars also occur frequently in glycoproteins and polysaccharides. L-Fucose 
and L-rhamnose, both 6-deoxy sugars, are components of some cell walls, and rham- 
nose is a component of ouabain, a highly toxic cardiac glycoside found in the bark and 
root of the ouabaio tree. Ouabain is used by the east African Somalis as an arrow 
poison. The sugar moiety is not the toxic part of the molecule (see Chapter 9). 


Sugar Esters The phosphate esters of glucose, fructose, and other monosaccharides 
are important metabolic intermediates, and the ribose moiety of nucleotides such as 
ATP and GTP is phosphorylated at the 5'-position (Figure 7.13). 


Amino Sugars The amino sugars, including D-glucosamine and D-galactosamine 
(Figure 7.14), contain an amino group instead of a hydroxyl group at the C-2 position. 
They are found in many oligosaccharides and polysaccharides, including chitin, a 
polysaccharide in the exoskeletons of crustaceans and insects. 


Muramic acid and neuraminic acid Muramic acid and neuraminic acid, which are com- 
ponents of the polysaccharides of cell membranes of higher organisms and also bacte- 
rial cell walls, are glycosamines linked to three-carbon acids at the C-1 or C-3 posi- 
tions. In muramic acid (thus named as an amine isolated from bacterial cell wall 
polysaccharides; murus is Latin for “wall”), the hydroxyl group of a lactic acid moiety 
makes an ether linkage to the C-3 of glucosamine. Neuraminic acid (an amine isolated 
from neural tissue) forms a C—C bond between the C-1 of N-acetylmannosamine and 
the C-3 of pyruvic acid (Figure 7.15). The N-acetyl and N-glycolyl derivatives of 
neuraminic acid are collectively known as sialic acids and are distributed widely in 
bacteria and animal systems. 


Acetals, Ketals, and Glycosides Hemiacetals and hemiketals can react with alcohols in 
the presence of acid to form acetals and ketals, as shown in Figure 7.16. This reaction 
is another example of a dehydration synthesis and is similar in this respect to the 
reactions undergone by amino acids to form peptides and nucleotides to form nucleic 
acids. The pyranose and furanose forms of monosaccharides react with alcohols in 
this way to form glycosides with retention of the œ- or B-configuration at the C-1 
carbon. The new bond between the anomeric carbon atom and the oxygen atom of 
the alcohol is called a glycosidic bond. Glycosides are named according to the parent 
monosaccharide. For example, methyl-B-p-glucoside (Figure 7.17) can be considered 
a derivative of B-D-glucose. 


OH OH 
a-p-Glucose-1-phosphate a-b-Fructose-1,6-bisphosphate a-D-Ribose-5-phosphate 


CHOH CHOH 
O (0) 
H OH HO OH 
H H 
OH H OH H 
H NHo H NH» 
B-p-Glucosamine B-p-Galactosamine 


Figure 7.14 Structures of D-glucosamine and 
D-galactosamine. 
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Figure 7.17 The anomeric forms of methyl-p- 
glucoside. 
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Figure 7.15 Structures of (a) muramic acid and (b) several depictions of a sialic acid. 
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Figure 7.16 Acetals and ketals can be formed from hemiacetals and hemiketals, respectively. 


7.3 | What Are the Structure and Chemistry 
of Oligosaccharides? 


Given the relative complexity of oligosaccharides and polysaccharides in higher 
organisms, it is perhaps surprising that these molecules are formed from relatively 
few different monosaccharide units. (In this respect, the oligosaccharides and 
polysaccharides are similar to proteins; both form complicated structures based on a 
small number of different building blocks.) Monosaccharide units include the hexoses 
glucose, fructose, mannose, and galactose and the pentoses ribose and xylose. 


7.3a Disaccharides Are the Simplest Oligosaccharides 


The simplest oligosaccharides are the disaccharides, which consist of two monosaccha- 
ride units linked by a glycosidic bond. As in proteins and nucleic acids, each individual 
unit in an oligosaccharide is termed a residue. The disaccharides shown in Figure 7.18 
are all commonly found in nature, with sucrose, maltose, and lactose being the most 
common. Each is a mixed acetal, with one hydroxyl group provided intramolecularly 
and one hydroxyl from the other monosaccharide. Except for sucrose, each of these 
structures possesses one free unsubstituted anomeric carbon atom, and thus each of 
these disaccharides is a reducing sugar. The end of the molecule containing the free 
anomeric carbon is called the reducing end, and the other end is called the nonreducing 
end. In the case of sucrose, both of the anomeric carbon atoms are substituted; that is, 
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OH 


Isomaltose (glucose-a-1,6-glucose) 


Figure 7.18 The structures of several important disaccharides. Note that the notation “HOH” means 
that the configuration can be either a or B. If the —OH group is above the ring, the configuration is 
termed B. The configuration is a if the —OH group is below the ring. Also note that sucrose has no 


free anomeric carbon atom. 


A Deeper Look 


Trehalose—A Natural Protectant for Bugs 


Insects use an open circulatory system to circulate hemolymph (insect 
blood). The “blood sugar” is not glucose but rather trehalose, an 
unusual, nonreducing disaccharide (see figure). Trehalose is found 
typically in organisms that are naturally subject to temperature 
variations and other environmental stresses—bacterial spores, 
fungi, yeast, and many insects. (Interestingly, honeybees do not 
have trehalose in their hemolymph, perhaps because they practice 
a colonial, rather than solitary, lifestyle. Bee colonies maintain a 
rather constant temperature of 18 °C, protecting the residents from 
large temperature changes.) 

What might explain this correlation between trehalose 
utilization and environmentally stressful lifestyles? Konrad Bloch 
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suggests that trehalose may act as a natural cryoprotectant. 
Freezing and thawing of biological tissues frequently causes 
irreversible structural changes, destroying biological activity. High 
concentrations of polyhydroxy compounds, such as sucrose and 
glycerol, can protect biological materials from such damage. 
Trehalose is particularly well suited for this purpose and has 
been shown to be superior to other polyhydroxy compounds, 
especially at low concentrations. Support for this novel idea comes 
from studies by Paul Attfield, which show that trehalose levels in 
the yeast Saccharomyces cerevisiae increase significantly during 
exposure to high salt and high growth temperatures. 


HO 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


206 


Chapter 7 Carbohydrates and the Glycoconjugates of Cell Surfaces 


neither has a free —OH group. The substituted anomeric carbons cannot be converted 
to the aldehyde configuration and thus cannot participate in the oxidation—reduction 
reactions characteristic of reducing sugars. Thus, sucrose is not a reducing sugar. 

Maltose, isomaltose, and cellobiose are all homodisaccharides because they each 
contain only one kind of monosaccharide, namely, glucose. Maltose is produced from 
starch (a polymer of a-D-glucose produced by plants) by the action of amylase 
enzymes and is a component of malt, a substance obtained by allowing grain 
(particularly barley) to soften in water and germinate. The enzyme diastase, produced 
during the germination process, catalyzes the hydrolysis of starch to maltose. Maltose 
is used in beverages (malted milk, for example). Because it is fermented readily by 
yeast, it is important in the brewing of beer. In both maltose and cellobiose, the 
glucose units are 1—— 4 linked, meaning that the C-1 of one glucose is linked by a 
glycosidic bond to the C-4 oxygen of the other glucose. The only difference between 
them isin the configuration at the glycosidic bond. Maltose exists in the a-configuration, 
whereas cellobiose is a B-configuration. Isomaltose is obtained in the hydrolysis of 
some polysaccharides (such as dextran), and cellobiose is obtained from the acid 
hydrolysis of cellulose. Isomaltose also consists of two glucose units in a glycosidic 
bond, but in this case, C-1 of one glucose is linked to C-6 of the other, and the 
configuration is a. 

The complete structures of these disaccharides can be specified in shorthand notation 
by using abbreviations for each monosaccharide, a or B, to denote configuration, and 
appropriate numbers to indicate the nature of the linkage. Thus, cellobiose is GlcB1-— 
4Glc, whereas isomaltose is Glca1—6Glc. Often the glycosidic linkage is written with an 
arrow so that cellobiose and isomaltose would be Glc81—+4Glc and Glcal—+6Glc, 
respectively. Because the linkage carbon on the first sugar is always C-1, a newer trend is 
to drop the 1— or 1—> and describe these simply as GlcB4Glc and Glca6Glc, respec- 
tively. More complete names can also be used, however; for example, maltose would be 
O-a-D-glucopyranosyl-(1—>4)-p-glucopyranose. Cellobiose, because of its B-glycosidic 
linkage, is formally O-B-D-glucopyranosyl-(1 —>4)-D-glucopyranose. 

B-p-Lactose (O-8-D-galactopyranosyl-(1 —>4)-p-glucopyranose) (Figure 7.18) is 
the principal carbohydrate in milk and is of critical nutritional importance to 
mammals in the early stages of their lives. It is formed from D-galactose and D-glucose 
via a B(1—~>4) link, and because it has a free anomeric carbon, it is capable of 
mutarotation and is a reducing sugar. It is an interesting quirk of nature that lactose 
cannot be absorbed directly into the bloodstream. It must first be broken down into 
galactose and glucose by lactase, an intestinal enzyme that exists in young, nursing 
mammals but is not produced in significant quantities in the mature mammal. Most 
adult humans, with the exception of certain groups in Africa and northern Europe, 
produce only low levels of lactase. For most individuals, this is not a problem, but 
some cannot tolerate lactose and experience intestinal pain and diarrhea upon 
consumption of milk. 

Sucrose, in contrast, is a disaccharide of almost universal appeal and tolerance. 
Produced by many higher plants and commonly known as table sugar, it is one of the 
products of photosynthesis and is composed of fructose and glucose. Sucrose has a 
specific optical rotation, [a], of +66.5°, but an equimolar mixture of its component 
monosaccharides has a net negative rotation ([a],~° of glucose is +52.5° and of fructose 
is —92°). Sucrose is hydrolyzed by the enzyme invertase, so named for the inversion of 
optical rotation accompanying this reaction. Sucrose is also easily hydrolyzed by dilute 
acid. Although sucrose and maltose are important to the human diet, they are not taken 
up directly in the body. In a manner similar to lactose, they are first hydrolyzed by 
sucrase and maltase, respectively, in the human intestine. 


7.3b A Variety of Higher Oligosaccharides Occur in Nature 


In addition to the simple disaccharides, many other oligosaccharides are found in 
both prokaryotic and eukaryotic organisms, either as naturally occurring substances 
or as hydrolysis products of natural materials. 
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Alpha-Gal and Red Meat Allergy 


Many people in the southern United States and other parts of the 
world have become allergic to red meat in recent years. The cause 
turns out to be bites from certain ticks, including the lone star tick 
(Amblyomma americanum—see Figure 1). These tick bites trigger an 
immune response against the disaccharide galactose-a-1,3-galactose, 
known colloquially as “alpha-gal.” This moiety is synthesized in 
mammals by the enzyme a-1,3-galactosyltransferase and displayed 
on the extracellular surface of cells. The gene for this enzyme is 
inactivated in humans and other Old World primates, and for this 
reason alpha-gal decorations are not found on human cells. Ticks 
that have fed on other mammals can introduce alpha-gal into the 
human bloodstream via tick bites. Because alpha-gal is foreign to 
human cells, it is antigenic (that is, it provokes an immune response) 
and the human immune system produces anti-alpha-gal 
immunoglobulin E (IgE) antibodies. Once bitten, persons who eat red 
meat (such as beef, pork, and lamb) may experience a severe reaction, 
including swelling, hives, and anaphylaxis, within a few hours. 

The link between tick bites and meat allergy was first identified in 
2007 by Professor Sheryl van Nunen in Sydney, Australia. In 2008, 
Professor Thomas Platts-Mills and his colleagues at the University 
of Virginia found that patients administered cetuximab, a monoclo- 
nal antibody from mouse cells to treat certain cancers, later experi- 
enced a similar red meat allergy. They showed that the allergic 
responses to cetuximab and to tick bites were both due to alpha-gal. 
Ironically, while he was involved in the early stages of this work, 
Platts-Mills himself was bitten by lone star ticks while hiking one 
day, and he developed a red meat allergy. 

It is interesting to note the similarity between alpha-gal and the 
human blood group A and B antigens, all three of which contain 
the galactose-a-1,3-galactose linkage. The blood group antigens 
both contain an L-fucose in B(1,2) linkage to the central galactose. 
Humans have galactosyltransferase enzymes that can add a second 


Scott Camazine/Alamy 


Figure 1 A lone-star tick. 


galactose to the fucosyl-a1,2-galactose precursor of the blood group 
antigens, but (as noted earlier), they lack the enzyme activity that 
would add a second galactose in a(1,3) linkage to a bare galactose 
to form alpha-gal (Figure 2). 


CH,OH 


S. 


Blood group A antigen 


= = D-galactose 
=> = L-fucose 


Figure 2 Alpha-gal is galactose-a-1,3-galactose. 
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Figure 7.19 Some antibiotics are oligosaccharides or contain oligosaccharide groups. 


Oligosaccharides also occur widely as components (via glycosidic bonds) of antibiot- 
ics derived from various sources. Figure 7.19 shows the structures of two representative 
carbohydrate-containing antibiotics. 


7.4 | What Are the Structure and Chemistry 
| of Polysaccharides? 


Polysaccharides are polymers of the simple sugars and their derivatives. They may be 
either linear or branched polymers and may contain hundreds or even thousands of 
monosaccharide units. Polysaccharides mediate and provide a vast variety of func- 
tions in nature, including energy storage, recognition and signaling between cells, 
and support for cells and tissues. 


7.4a Nomenclature for Polysaccharides Is Based 
on Their Composition and Structure 


By far the majority of carbohydrate material in nature occurs in the form of 
polysaccharides. Polysaccharides include not only those substances composed only 
of glycosidically linked sugar residues, but also molecules that contain polymeric 
saccharide structures linked via covalent bonds to amino acids, peptides, proteins, 
lipids, and other structures. 

Polysaccharides, also called glycans, are assembled from monosaccharides and their 
derivatives. If a polysaccharide contains only one kind of monosaccharide molecule, it 
is a homopolysaccharide, or homoglycan, whereas those containing more than one kind 
of monosaccharide are heteropolysaccharides. The most common constituent of 
polysaccharides is D-glucose, but D-fructose, D-galactose, L-galactose, D-mannose, 
L-arabinose, and D-xylose are also common. Common monosaccharide derivatives in 
polysaccharides include the amino sugars (D-glucosamine and D-galactosamine), their 
derivatives (N-acetylneuraminic acid and N-acetylmuramic acid), and simple sugar 
acids (glucuronic and iduronic acids). Polysaccharides differ not only in the nature of 
their component monosaccharides, but also in the length of their chains and in the 
amount of chain branching that occurs. Although a given sugar residue has only one 
anomeric carbon and thus can form only one glycosidic linkage with hydroxyl groups 
on other molecules, each sugar residue carries several hydroxyls, one or more of which 
may be an acceptor of glycosyl substituents (Figure 7.20). This ability to form branched 
structures distinguishes polysaccharides from proteins and nucleic acids, which occur 
only as linear polymers. 
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Figure 7.20 Amylose and amylopectin are the two forms of starch. Note that the linear linkages are 
a(1—4) but the branches in amylopectin are a(1—>6). Branches in polysaccharides can involve any 
of the hydroxyl groups on the monosaccharide components. Amylopectin is a highly branched 
structure, with branches occurring every 12 to 30 residues. 


7.4b Polysaccharides Serve Energy Storage, Structure, 
and Protection Functions 


Polysaccharides function as storage materials, structural components, or protective sub- 
stances. Thus, starch and glycogen, as readily metabolizable sources of glucose, provide 
energy reserves for cells. Chitin and cellulose provide strong support for the skeletons of 
arthropods and green plants, respectively. Mucopolysaccharides, such as the hyaluronic 
acids, form protective coats on animal cells. In each of these cases, the relevant polysac- 
charide is either a homopolymer or a polymer of small repeating units. Recent research 
indicates, however, that oligosaccharides and polysaccharides with varied structures 
may also be involved in much more sophisticated tasks in cells, including a variety of 
cellular recognition and intercellular communication events, as discussed later. 


7.4c Polysaccharides Provide Stores of Energy 


Organisms store carbohydrates in the form of polysaccharides rather than as 
monosaccharides to lower the osmotic pressure of the sugar reserves. Because osmotic 
pressures depend only on numbers of molecules, the osmotic pressure is greatly 
reduced by formation of a few polysaccharide molecules out of thousands (or even 
millions) of monosaccharide units. 


Starch By far the most common storage polysaccharide in plants is starch, which exists 
in two forms: a-amylose and amylopectin (Figure 7.20). Most forms of starch in nature 
are 10% to 30% a-amylose and 70% to 90% amylopectin. a-Amylose is composed of 
linear chains of D-glucose in a(1—~+4) linkages. The chains are of varying length, 
having molecular weights from several thousand to half a million. As can be seen from 
the structure in Figure 7.20, the chain has a reducing end and a nonreducing end. 
Although poorly soluble in water, a-amylose forms micelles in which the polysaccha- 
ride chain adopts a helical conformation (Figure 7.21). Iodine reacts with a-amylose 
to give a characteristic blue color (the well-known iodine test for starch), which arises 
from the insertion of iodine into the hydrophobic middle of the amylose helix. 
Amylopectin is a highly branched chain of glucose units (one branch is shown in 
Figure 7.20). Branches occur in these chains every 12 to 30 residues. The average branch 
length is between 24 and 30 residues, and molecular weights of amylopectin molecules 
can range up to 100 million. The linear linkages in amylopectin are a(1—>4), whereas 
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Figure 7.21 Suspensions of amylose in water adopt 
a helical conformation. Iodine (I,) can insert into 
the middle of the amylose helix to give a blue color 
that is characteristic and diagnostic for starch. 
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Figure 7.22 The starch phosphorylase reaction 
cleaves glucose residues from amylose, producing 
a-D-glucose-1-phosphate. 
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the branch linkages are a(1—~+6). As is the case for a-amylose, amylopectin forms 
micellar suspensions in water; iodine reacts with such suspensions to produce a red- 
violet color. 

Starch is stored in plant cells in the form of granules in the stroma of plastids (plant 
cell organelles). When starch is to be mobilized and used by the plant that stored it, it is 
split into its monosaccharide elements by stepwise phosphorolytic cleavage of glucose 
units, a reaction catalyzed by starch phosphorylase (Figure 7.22). The products are one 
molecule of glucose-1-phosphate and a starch molecule with one less glucose unit. In 
a-amylose, this process continues all along the chain until the end is reached. 

In animals, digestion and use of plant starches begin in the mouth with salivary 
a-amylase (a(1—>4)-glucan 4-glucanohydrolase), the major enzyme secreted by the 
salivary glands. Although the capability of making and secreting salivary a-amylases is 
widespread in the animal world, some animals (such as cats, dogs, birds, and horses) do not 
secrete them. Salivary a-amylase is an endoamylase that splits a(1 —>4) glycosidic linkages 
only within the chain. Raw starch is not very susceptible to salivary endoamylase. However, 
when suspensions of starch granules are heated, the granules swell, taking up water and 
causing the polymers to become more accessible to enzymes. This is why cooked starch is 
more digestible. Most starch digestion occurs in the small intestine via glycohydrolases. 


Glycogen The major form of storage polysaccharide in animals is glycogen. Glycogen 
is found mainly in the liver (where it may amount to as much as 10% of liver mass) 
and skeletal muscle (where it accounts for 1% to 2% of muscle mass). Liver glycogen 
consists of granules containing highly branched molecules, with a(1—>6) branches 
occurring every 8 to 12 glucose units. Like amylopectin, glycogen yields a red-violet 
color with iodine. Glycogen can be hydrolyzed by both a- and B-amylases, yielding 
glucose and maltose, respectively, as products and can also be hydrolyzed by glycogen 
phosphorylase, an enzyme present in liver and muscle tissue, to release glucose-l- 
phosphate for glycolysis. 


Dextran Another important family of storage polysaccharides is the dextrans, which 
are a(1—-+6)-linked polysaccharides of D-glucose with branched chains found in 
yeast and bacteria. Because the main polymer chain is a(1 —>6) linked, the repeating 
unit is isomaltose, Glcal —>6Glc. The branch points may be 1—2, 1—>3, or 1—>4 
in various species. The degree of branching and the average chain length between 
branches depend on the species and strain of the organism. Bacteria growing on the 
surfaces of teeth produce extracellular accumulations of dextrans, an important 
component of dental plaque. 


7.4d Polysaccharides Provide Physical Structure and Strength to Organisms 


Cellulose The structural polysaccharides have properties that are dramatically dif- 
ferent from those of the storage polysaccharides, even though the compositions of 
these two classes are similar. The structural polysaccharide cellulose is the most 
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abundant natural polymer in the world. Found in the cell walls of nearly all plants, 
cellulose is one of the principal components providing physical structure and 
strength. The wood and bark of trees are insoluble, highly organized structures 
formed from cellulose and also from lignin. It is awe inspiring to look at a large tree 
and realize the amount of weight supported by polymeric structures derived from 
sugars and organic alcohols. Cellulose also has its delicate side, however. Cotton, 
whose woven fibers make some of our most comfortable clothing fabrics, is almost 
pure cellulose. Derivatives of cellulose have found wide use in our society. Cellulose 
acetates are produced by the action of acetic anhydride on cellulose in the presence 
of sulfuric acid and can be spun into a variety of fabrics with particular properties, 
such as softness and sheen. Referred to simply as acetates, they have a silky appear- 
ance, a luxuriously soft feel, and a deep luster and are used in dresses, lingerie, 
linings, and blouses. 

Cellulose is a linear homopolymer of D-glucose units, just as in a-amylose. The 
structural difference, which completely alters the properties of the polymer, is that in 
cellulose the glucose units are linked by B(1—~+4)-glycosidic bonds, whereas in 
a-amylose, the linkage is a(1—>4). The conformational difference between these two 
structures is shown in Figure 7.23. The a(1—~>4)-linkage sites of amylose are naturally 
bent, conferring a gradual turn to the polymer chain, which results in the helical 
conformation already described (Figure 7.21). The most stable conformation about the 
BCU —-4) linkage involves alternating 180° flips of the glucose units along the chain so 
that the chain adopts a fully extended conformation, referred to as an extended ribbon. 
Juxtaposition of several such chains permits efficient interchain hydrogen bonding, the 
basis of much of the strength of cellulose. 

The structure of one form of cellulose, determined by X-ray and electron diffraction 
data, is shown in Figure 7.24. The flattened sheets of the chains lie side by side and are 
joined by hydrogen bonds. These sheets are laid on top of one another in a way that 
staggers the chains, just as bricks are staggered to give strength and stability to a wall. 
Cellulose is extremely resistant to hydrolysis, whether by acid or by the digestive tract 
amylases described earlier. As a result, most animals (including humans) cannot digest 
cellulose to any significant degree. Ruminant animals, such as cattle, deer, giraffes, and 
camels, are an exception because bacteria that live in the rumen (Figure 7.25) secrete the 
enzyme cellulase, a B-glucosidase effective in the hydrolysis of cellulose. The resulting 
glucose is then metabolized in a fermentation process to the benefit of the host animal. 
Termites and shipworms (Teredo navalis) similarly digest cellulose because their diges- 
tive tracts also contain bacteria that secrete cellulase. 


Chitin A polysaccharide that is similar to cellulose, both in its biological function 
and its primary, secondary, and tertiary structure, is chitin. Chitin is present in the 
cell walls of fungi and is the fundamental material in the exoskeletons of crustaceans, 
insects, and spiders. The structure of chitin, an extended ribbon, is identical to that 
of cellulose, except that the —OH group on each C-2 is replaced by —NHCOCHs, 
so the repeating units are N-acetyl-p-glucosamines in B(1—~>4) linkage. Like 
cellulose (Figure 7.24), the chains of chitin form extended ribbons (Figure 7.26) and 
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Figure 7.23 The structures of amylose and cellulose. (a) Amylose, composed exclusively of the relatively 
bent a(1—4) linkages, prefers to adopt a helical conformation, whereas (b) cellulose, with B(1—>4)- 
glycosidic linkages, can adopt a fully extended conformation with alternating 180° flips of the glucose 
units. The hydrogen bonding inherent in such extended structures is responsible for the great strength of 
tree trunks and other cellulose-based materials. 
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Figure 7.25 Giraffes, cattle, deer, and camels are 
ruminant animals that are able to metabolize 
cellulose, thanks to bacterial cellulase in the rumen, 
a large first compartment in the stomach of a 
ruminant. 
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Figure 7.24 The structure of cellulose, showing the hydrogen bonds (blue) between the sheets, which 
strengthen the structure. Intrachain hydrogen bonds are in red, and interchain hydrogen bonds are in 
green. 


Cellulose 
30" pF 
aor MA o ai 
HO HO 
HG Bo, H9 hoy 


| N-Acetylglucosamine units T 
CH; CH3 


Mannan 


m HO ao" 
So A ae 
HO Hop 


Mannose units 


Figure 7.26 Cellulose, chitin and mannan form extended ribbons and pack together efficiently, taking 
advantage of multiple hydrogen bonds. 


pack side by side in a crystalline, strongly hydrogen-bonded form. One significant 
difference between cellulose and chitin is whether the chains are arranged in parallel 
(all the reducing ends together at one end of a packed bundle and all the nonreducing 
ends together at the other end) or antiparallel (each sheet of chains with the chains 
arranged oppositely from the sheets above and below). Natural cellulose seems to 
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Ruth Benerito and Wrinkle-Free Cotton Fabrics 


Cotton, which is usually 94% cellulose, is the natural fiber 
most commonly used for clothing. Ordinary cotton fabrics are 
comfortable and easily dyed, but they wrinkle easily. Permanent 
press fabrics made from a combination of synthetic resin and cotton 
fibers appeared in 1932, but these fabrics were less comfortable 
than pure cotton fabrics. 

In the 1950s, chemist Ruth Benerito at the U.S. Agricultural 
Research Service in New Orleans began experimenting with 
crosslinking reactions that joined cellulose chains covalently (see 
figure). 

Benerito’s work revolutionized the cotton fabric industry, and 
today this reaction is the basis for more than 80% of “wrinkle-free” 


and “permanent press” cotton fabrics. Fabrics made with this pro- 
cess release small amounts of formaldehyde over time, and concerns 
about safety have prompted the development of new crosslinking 
processes that produce formaldehyde-free products. 
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Strands of cellulose can be cross-linked with reagents like Dimethylol dihydroxy ethylene urea (DMDHEU). 


occur only in parallel arrangements. Chitin, however, can occur in three forms, 
sometimes all in the same organism. a-Chitin is an all-parallel arrangement of the 
chains, whereas -chitin is an antiparallel arrangement. In 6-chitin, the structure is 
thought to involve pairs of parallel sheets separated by single antiparallel sheets. 

Chitin is the Earth’s second most abundant carbohydrate polymer (after cellulose), 
and its ready availability and abundance offer opportunities for industrial and commercial 
applications. Chitin-based coatings can extend the shelf life of fruits, and a chitin 
derivative that binds to iron atoms in meat has been found to slow the reactions that 
cause rancidity and flavor loss. Without such a coating, the iron in meats activates oxygen 
from the air, forming reactive free radicals that attack and oxidize polyunsaturated lipids, 
causing most of the flavor loss associated with rancidity. Chitin-based coatings coordinate 
the iron atoms, preventing their interaction with oxygen. 


Agarose An important polysaccharide mixture isolated from marine red algae 
(Rhodophyceae) is agar, which consists of two components: agarose and agaropectin. 
Agarose (Figure 7.27) is a chain of alternating D-galactose and 3,6-anhydro-L- 
galactose, with side chains of 6-methyl-p-galactose. Agaropectin is similar, but in 
addition, it contains sulfate ester side chains and p-glucuronic acid. The three- 
dimensional structure of agarose is a double helix with a threefold screw axis, as 
shown in Figure 7.27. The central cavity is large enough to accommodate water 
molecules. Agarose and agaropectin readily form gels containing large amounts (up 
to 99.5%) of water. Agarose gels are used in laboratories to separate macromolecules 
on the basis of size. 
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Figure 7.27 The favored conformation of agarose in 
water is a double helix with a left-handed threefold 
screw axis (the structure is identical at every 120 


degrees of rotation). 


Glycosaminoglycans A class of polysaccharides known as glycosaminoglycans is 
involved in a variety of extracellular (and sometimes intracellular) functions. 
Glycosaminoglycans consist of linear chains of repeating disaccharides in which one 
of the monosaccharide units is an amino sugar and one (or both) of the monosac- 
charide units contains at least one negatively charged sulfate or carboxylate group. 
The repeating disaccharide structures found commonly in glycosaminoglycans are 
shown in Figure 7.28. Glycosaminoglycans are components of proteoglycans. Heparin, 
with the highest net negative charge of the disaccharides shown, is a natural antico- 
agulant substance. It binds strongly to antithrombin III (a protein involved in termi- 
nating the clotting process) and inhibits blood clotting. Hyaluronate molecules may 
consist of as many as 25,000 disaccharide units, with molecular weights of up to 10’. 
Hyaluronates are important components of the vitreous humor in the eye and of 
synovial fluid, the lubricant fluid of joints in the body. Chondroitin and keratan sulfate 
are found in tendons, cartilage, and other connective tissue; dermatan sulfate, as its 
name implies, is a component of the extracellular matrix of skin. Glycosaminoglycans 
are fundamental constituents of proteoglycans (discussed later). 


7.4e Polysaccharides Provide Strength and Rigidity to Bacterial Cell Walls 


Some of nature’s most interesting polysaccharide structures are found in bacterial cell 
walls. Given the strength and rigidity provided by polysaccharide structures, it is not 
surprising that bacteria use such structures to provide protection for their cellular 
contents. Bacteria normally exhibit high internal osmotic pressures and frequently 
encounter variable, often hypotonic, exterior conditions. The rigid cell walls synthesized 
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Figure 7.28 Glycosaminoglycans are formed from repeating disaccharide arrays. 
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A Complex Polysaccharide in Red Wine—The Strange Story of Rhamnogalacturonan II 


For many years, cotton and grape growers and other farmers have 
known that boron is an essential trace element for their crops. Un- 
til recently, however, the role or roles of boron in sustaining plant 
growth were unknown. Recent reports show that at least one role for 
boron in plants is that of cross-linking an unusual polysaccharide 
called rhamnogalacturonan II (RGII). RGII is a low-molecular- 
weight (5 to 10 kDa) polysaccharide, but it is thought to be the most 
complex polysaccharide on Earth, composed as it is of 11 different 
sugar monomers. It can be released from plant cell walls by treat- 
ment with a galacturonase, and it is also present in red wine. A liter 
of red wine typically contains about 100-150 mg of RG II. Part of 
the structure of RGII is shown in the accompanying figure. The 
nature of the borate ester cross-links (also indicated in the figure) 
was elucidated by Malcolm O’Neill and his colleagues, who used a 
combination of chemical methods and boron-11 NMR. 

Why is rhamnogalacturonan II essential for the structure and 
growth of plant walls? Plant walls are extremely sophisticated 
composite materials, composed of networks of protein, poly- 
saccharides, and phenolic compounds. Cellulose microfibrils as 
strong as steel provide a load-bearing framework for the plant. 
These microfibrils are tiny wires made of crystalline arrays of 
B(1—4)-linked chains of glucose residues, which are extruded 
from hexameric spinnerets in the plasma membrane of the plant 
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cell, surrounding the growing plant cell like hoops around a 
barrel. These microfibrils thus constrain the directions of cell 
expansion and determine the shapes of the plant cells and the 
plant as well. The separation of the barrel hoops is controlled 
by hemicelluloses, such as xyloglucans, which form H-bonded 
crosslinks with the cellulose microfibrils. The hemicellulose net- 
work is embedded in a hydrated gel inside the plant wall. This 
gel consists of complex galacturonic acid—rich polysaccharides, 
including RGII. The gel provides a dynamic operating environ- 
ment for cell wall processes. 

It is interesting to note that the tiny spinnerets of plant cells are 
nature’s version of the viscose process, developed in 1910, for the 
production of rayon fibers. In this process, viscose—liiterally a vis- 
cous solution of cellulose—is forced through a spinneret (a device 
resembling a shower head with many tiny holes). Each hole pro- 
duces a fine filament of viscose. The fibers precipitate in an acid bath 
and are stretched, resulting in the formation of interchain H bonds 
that give the filaments the properties essential for use as textile fibers. 
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Rhamnogalacturonan II (RGII) forms dimers linked by boron atoms to sustain plant growth. 
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by bacteria maintain cell shape and size and prevent swelling or shrinkage that would 
inevitably accompany variations in solution osmotic strength. 

Bacteria are conveniently classified as either gram-positive or gram-negative, depending 
on their response to Gram staining, which identifies multilayered peptidoglycan struc- 
tures. Despite substantial differences in the various structures surrounding these two types 
of cells, nearly all bacterial cell walls have a strong, protective peptide—polysaccharide 
layer called peptidoglycan. Gram-positive bacteria have a thick (approximately 25 nm) cell 
wall consisting of multiple layers of peptidoglycan. This thick cell wall surrounds the 
bacterial plasma membrane. Gram-negative bacteria, in contrast, have a much thinner 
(2 nm to 3 nm) cell wall consisting of a single layer of peptidoglycan sandwiched between 
the inner and outer lipid bilayer membranes. In either case, peptidoglycan, sometimes 
called murein (from the Latin murus, meaning “wall”), is a continuous crosslinked 
structure—in essence, a single molecule—built around the cell. The structure is shown in 
Figure 7.29. The backbone is a B(1—~4)-linked polymer of N-acetylglucosamine and 
N-acetylmuramic acid units. This part of the structure is similar to that of chitin, but it is 
joined to a tetrapeptide, usually L-Ala - D-Glu - L-Lys - D-Ala, in which the L-lysine is 
linked to the y-COOH of p-glutamate. The peptide is linked to the N-acetylmuramic acid 
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Figure 7.29 The structure of peptidoglycan. (a) The tetrapeptides linking adjacent backbone mae 

chains contain an unusual y-carboxyl linkage. (b) The crosslink in gram-positive cell walls is a L-Lys 

pentaglycine bridge. (c) In gram-negative cell walls, the linkage between the tetrapeptides of p-Ala 

adjacent carbohydrate chains in peptidoglycan involves a direct amide bond between the lysine eee " 
Crossin. 


side chain of one tetrapeptide and D-alanine of the other. 
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units via its D-lactate moiety. The e-amino group of lysine in this peptide is linked to the 
—COOH of b-alanine of an adjacent tetrapeptide. In gram-negative cell walls, the lysine 
€-amino group forms a direct amide bond with this D-alanine carboxyl (Figure 7.29). 
In gram-positive cell walls, a pentaglycine chain bridges the lysine e-amino group and the 
p-Ala carboxyl group. Gram-negative cell walls are also covered with highly complex 
lipopolysaccharides (Figure 7.30). 


Cell Walls of Gram-Negative Bacteria In gram-negative bacteria, the peptidoglycan 
wall is the rigid framework with an elaborate membrane structure around it (Figure 
7.30). The peptidoglycan layer encloses the periplasmic space and is attached to the 
outer membrane via a group of hydrophobic proteins. 

As shown in Figure 7.31, the outer membrane of gram-negative bacteria is coated with a 
highly complex lipopolysaccharide, which consists of a lipid group (anchored in the outer 
membrane) joined to a polysaccharide made up of long chains with characteristic repeating 
structures (Figure 7.31). Its many different unique units determine the antigenicity of the 
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Figure 7.30 The structures of the cell wall and membrane(s) in gram-positive and gram-negative 
bacteria. The gram-positive cell wall is thicker than that in gram-negative bacteria, compensating for 
the absence of a second (outer) bilayer membrane. 
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Figure 7.31 Lipopolysaccharide (LPS) coats the outer 


membrane of gram-negative bacteria. The lipid 
portion of the LPS is embedded in the outer 


membrane and is linked to a complex polysaccharide. 
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bacteria; that is, animal immune systems recognize them as foreign substances and raise 
antibodies against them. As a group, these antigenic determinants are called the O antigens, 
and there are thousands of different ones. The Salmonella bacteria alone have well over a 
thousand known O antigens that have been organized into 17 different groups. The great 
variation in these O antigen structures apparently plays a role in the recognition of one type 
of cell by another and in evasion of the host immune system. 


Cell Walls of Gram-Positive Bacteria In gram-positive bacteria, the cell exterior is less 
complex than for gram-negative cells. Having no outer membrane, gram-positive cells 
compensate with a thicker wall. Covalently attached to the peptidoglycan layer are teichoic 
acids, which often account for 50% of the dry weight of the cell wall. The teichoic acids are 
polymers of ribitol phosphate or glycerol phosphate linked by phosphodiester bonds. 


7.4f Animals Display a Variety of Cell Surface Polysaccharides 


Compared to bacterial cells, which are identical within a given cell type (except for 
O antigen variations), animal cells display a wondrous diversity of structure, 
constitution, and function. Although each animal cell contains in its genetic material 
the instructions to replicate the entire organism, each differentiated animal cell 
carefully controls its composition and behavior within the organism. A great part of 
each cell’s uniqueness begins at the cell surface. This surface uniqueness is critical to 
each animal cell because cells spend their entire life span in intimate contact with other 
cells and must therefore communicate with one another. That cells are able to pass 
information among themselves is evidenced by numerous experiments. For example, 
heart myocytes, when grown in culture (in glass dishes), establish synchrony when they 
make contact, so that they beat or contract in unison. If they are removed from the 
culture and separated, they lose their synchronous behavior, but if allowed to reestablish 
cell-to-cell contact, they spontaneously restore their synchronous contractions. 

As these and many other related phenomena show, it is clear that molecular struc- 
tures on one cell are recognizing and responding to molecules on the adjacent cell or to 
molecules in the extracellular matrix, the complex mix of connective proteins and other 
molecules that exists outside of and among cells. Many of these interactions involve 
glycoproteins on the cell surface and proteoglycans in the extracellular matrix. The 
information held in these special carbohydrate-containing molecules is not encoded di- 
rectly in the genes (as with proteins), but is determined instead by expression of the 
appropriate enzymes that assemble carbohydrate units in a characteristic way on these 
molecules. Also, by virtue of the several hydroxyl linkages that can be formed with each 
carbohydrate monomer, these structures are more information-rich than proteins and 
nucleic acids, which can form only linear polymers (see Section 7.7). A few of these 
glycoproteins and their unique properties are described in the following sections. 


7.5 | What Are Glycoproteins, and How Do They Function 
in Cells? 


Many proteins found in nature are glycoproteins because they contain covalently 
linked oligosaccharide and polysaccharide groups. The list of known glycoproteins 
includes structural proteins, enzymes, membrane receptors, transport proteins, and 
immunoglobulins, among others. In most cases, the precise function of the bound 
carbohydrate moiety is not understood. 


7.5a Carbohydrates on Proteins Can Be O-Linked or N-Linked 


Carbohydrate groups are linked to polypeptide chains in only a few ways: either via the 
hydroxyl groups of serine, threonine, or hydroxylysine residues (in O-linked saccharides) 
(Figure 7.32a) or via the amide nitrogen of an asparagine residue (in N-linked saccharides) 
(Figure 7.32b). The carbohydrate residue linked to the protein in O-linked saccharides is 
usually an N-acetylgalactosamine, but mannose, galactose, and xylose residues linked to 
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Figure 7.32 The carbohydrate 
moieties of glycoproteins may 
be linked to the protein via 

(a) serine or threonine residues 
(in the O-linked saccharides) or 
(b) asparagine residues (in the 
N-linked saccharides). 

(c) N-linked glycoproteins are of 
three types: high mannose, 
complex, and hybrid, the latter 
of which combines structures 
found in the high mannose and 
complex saccharides. 
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protein hydroxyls are also found (Figure 7.32a). Oligosaccharides O-linked to glycophorin 
involve N-acetylgalactosamine linkages and are rich in sialic acid residues. N-linked 
saccharides always have a unique core structure composed of two N-acetylglucosamine 
residues linked to a branched mannose triad (Figure 7.32b, c). Many other sugar units 
may be linked to each of the mannose residues of this branched core. 


7.5b O-GIcNAc Signaling Is Altered in Diabetes and Cancer 


O-linked B-N-acetylglucosamine (O-GlcNAc) linked to serine or threonine (Figure 7.32a) 
on nuclear and cytoplasmic proteins serves as a cellular signal, much like phosphoryla- 
tion at Ser and Thr. O-GlcNAcylation is an abundant, ubiquitous, and dynamic post- 
translational modification. Proteins modified with O-GlcNAc groups serve as nutrient 
sensors, both independently and in concert with other post-translational modifications, 
to regulate transcription, translation, cytoskeletal assembly, signal transduction, and 
cellular metabolism. Interestingly, O-GlcNAc signaling and its interaction with phos- 
phorylation are both altered in the major diseases of aging, including diabetes, cancer, 
Alzheimer’s, and cardiovascular disease. 


7.5c O-Linked Saccharides Form Rigid Extended 
Extracellular Structures 


O-linked saccharides are often found in cell surface glycoproteins and in mucins, the 
large glycoproteins that coat and protect mucous membranes in the respiratory and 
gastrointestinal tracts in the body. Certain viral glycoproteins also contain O-linked 
sugars. O-linked saccharides in glycoproteins are often found clustered in richly 
glycosylated domains of the polypeptide chain. Physical studies on mucins show that 
they adopt rigid, extended structures. An individual mucin molecule (M, = 10’) may 
extend over a distance of 150 nm to 200 nm in solution. Inherent steric interactions 
between the sugar residues and the protein residues in these cluster regions cause the 
peptide core to adopt an extended and relatively rigid conformation. This interesting 
effect may be related to the function of O-linked saccharides in glycoproteins. It 
allows aggregates of mucin molecules to form extensive, intertwined networks, even 
at low concentrations. These viscous networks protect the mucosal surface of the 
respiratory and gastrointestinal tracts from harmful environmental agents. 

There appear to be two structural motifs on the extracellular surface of membrane 
glycoproteins containing O-linked saccharides. Certain glycoproteins, such as 
leukosialin, are O-glycosylated throughout much or most of their extracellular domain 
(Figure 7.33). Leukosialin, like mucin, adopts a highly extended conformation, allowing 
it to project great distances above the membrane surface, perhaps protecting the cell 
from unwanted interactions with macromolecules or other cells. The second structural 
motif is exemplified by the low-density lipoprotein (LDL) receptor and by decay- 
accelerating factor (DAF). These proteins contain a highly O-glycosylated stem region 
that separates the transmembrane domain from the globular, functional extracellular 
domain. The O-glycosylated stem serves to raise the functional domain of the protein 
far enough above the highly glycosylated membrane surface (a region termed the 
glycocalyx) to make it accessible to the extracellular macromolecules with which it 
interacts. 


7.5d Polar Fish Depend on Antifreeze Glycoproteins 


A unique family of O-linked glycoproteins permits fish to live in the icy seawater of 
the Arctic and Antarctic regions, where water temperature may reach as low as 
—1.9°C. Antifreeze glycoproteins (AFGPs) are found in the blood of nearly all Antarctic 
fish and at least five Arctic fish. These glycoproteins have the peptide structure 


[Ala-Ala-Thr],,-Ala-Ala 
where n can be 4, 5, 6, 12, 17, 28, 35, 45, or 50. Each of the threonine residues is 


glycosylated with the disaccharide B-galactosyl-(1—->3)-a-N-acetylgalactosamine 
(Figure 7.34). This glycoprotein adopts a flexible rod conformation with regions of 
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threefold left-handed helix. The evidence suggests that antifreeze glycoproteins may 
inhibit the formation of ice in the fish by binding specifically to the growth sites of ice 
crystals, inhibiting further growth of the crystals. 


7.5e N-Linked Oligosaccharides Can Affect the Physical 
Properties and Functions of a Protein 


N-linked oligosaccharides are found in many different proteins, including immuno- 
globulins G and M, ribonuclease B, ovalbumin, and peptide hormones. Many different 
functions are known or suspected for N-glycosylation of proteins. N-Glycosylation 
can affect the physical and chemical properties of proteins, altering solubility, mass, 


Figure 7.33 The O-linked saccharides of 
glycoproteins appear in many cases to adopt 
extended conformations that serve to extend the 
functional domains of these proteins above the 
membrane surface. (Adapted from Jentoft, N., 1990. 
Why are proteins O-glycosylated? Trends in 
Biochemical Sciences 15:291-294.) 


Figure 7.34 The structure of the repeating unit of 
antifreeze glycoproteins, a disaccharide consisting of 
B-galactosyl-(1—+3)-a-N-acetylgalactosamine in 
glycosidic linkage to a threonine residue. 
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Human Biochemistry 


Drug Research Finds a Sweet Spot 


A variety of diseases are being successfully treated with sugar-based either on the market or at various stages of clinical trials. Some of 
therapies. As this table shows, several carbohydrate-based drugs are these drugs are enzymes, whereas others are glycoconjugates. 


Drug Description Manufacturer 
Tamiflu and These “transition state analogs” (see Chapter 14) inhibit neuraminidase, GlaxoSmithKline (Relenza) 
Relenza an influenza enzyme that enables new virus particles to bud from a host cell. and Genentech (Tamiflu) 
Cerezyme This enzyme degrades glycolipids, compensating for an enzyme deficiency that causes Genzyme 
(imiglucerase) Gaucher’s disease. Cambridge, MA 
Vancocin A very potent glycopeptide antibiotic that is typically used against antibiotic- Eli Lilly 
(vancomycin) resistant infections. It inhibits synthesis of peptidoglycan in the bacterial cell wall. Indianapolis, IN 
GMK A vaccine containing ganglioside GM2; it triggers an immune response against Progenics Pharmaceuticals 
cancer cells carrying GM2. Tarrytown, NY 
Bimosiamose A sugar analog that inhibits selectin-based inflammation in blood vessels. Revotar 
(TBC1269) PharmAG, Berlin 
GCS-100 A sugar that blocks action of galectin-3 (Gal-3), a sugar-binding protein on tumors. GlycoGenesys 
Boston 
PI-88 A sugar that inhibits growth factor-dependent angiogenesis and enzymes that Progen 
promote metastasis. Darra, Australia 
StaphVAX A vaccine that is a protein with a linked bacterial sugar; it is intended to prevent NABI Pharmaceuticals 
Staphylococcus infection. Boca Raton, FL 


Ernst, B., and Magnani, J. L., 2009. From carbohydrate leads to glycomimetic drugs. Nat Rev Drug Discovery 8:661-676. 
O'Flaherty, R., Trbojevic-Akmacic, l., Greville, G, Rudd, P. M., and Lauc, G., 2018. The sweet spot for biologics: recent advances in characterization of biotherapeutic glycoproteins. 
Expert Rev. Proteomics 15:13-29. 


Licensed carbohydrate-based vaccines 


Organism Targeted Vaccine Manufacturer (Trade Name) 
Haemophilus influenzae type b Glycoconjugate (polysaccharide with Sanofi Pasteur (ActHIB) 
tetanus toxin) GlaxoSmithKline (Hiberix) 
Neisseria meningitidis Glycoconjugate (meningococcal Sanofi Pasteur (Menactra) 
polysaccharide with DT) 
Salmonella typhi Vi capsular polysaccharide Sanofi Pasteur (TYPHIM Vi) 
Streptococcus pneumoniae Pneumococcal polysaccharide-protein conjugate Wyeth Pharmaceuticals (Prevnar) 
Streptococcus pneumoniae Pneumococcal polysaccharide Merck and Co. (Pneumovax 23) 
Reference 


Khatun, F., Toth, 1., and Stephenson, R. J., 2020. Immunology of carbohydrate-based vaccines. Advanced Drug Delivery Reviews 165-166:117-126. 


and electrical charge. Carbohydrate moieties have been shown to stabilize protein 
conformations and protect proteins against proteolysis. Eukaryotic organisms use 
post-translational additions of N-linked oligosaccharides to direct selected proteins to 
various membrane compartments. Recent evidence indicates that N-linked 
oligosaccharides promote the proper folding of newly synthesized polypeptides in the 
endoplasmic reticulum. 


7.5f Sialic Acid Terminates the Oligosaccharides 
of Glycoproteins and Glycolipids 
The oligosaccharide chains of glycoproteins and glycolipids serve as ligands for 


mammalian sugar-binding proteins and also as receptors for pathogens such as 
influenza and polyoma virus, for bacteria, and for bacterial toxins when they bind to 
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the host cell. The terminal sugars of most such oligosaccharide chains are sialic 
acids, the most common being NeuSAc (Figure 7.15), and the sialic acid—capped 
oligosaccharides themselves are termed sialosides. Sialosides on glycoproteins and 
glycolipids mediate aspects of immune responses and leukocyte trafficking to lymph 
nodes and inflammation sites, among other functions. The biosynthesis of sialosides 
is carried out by 20 sialyltransferase enzymes that are highly conserved in mammals. 


7.5g Sialic Acid Cleavage Can Serve as a Timing 
Device for Protein Degradation 


The slow cleavage of sialic acid residues from N-linked glycoproteins circulating in 
the blood targets these proteins for normal degradation by the organism. The liver 
contains specific receptor proteins that recognize and bind glycoproteins that are 
ready to be degraded and recycled. Newly synthesized serum glycoproteins contain 
N-linked triantennary (three-chain) oligosaccharides with structures similar to those 
in Figure 7.35, in which sialic acid residues cap galactose residues. As these 
glycoproteins circulate, enzymes on the blood vessel walls remove the sialic acid 
groups, exposing the galactose residues. In the liver, the asialoglycoprotein receptor 
binds the exposed galactose residues of these glycoproteins with very high affinity 
(K, = 10°? M to 1078 M). The complex of receptor and glycoprotein is then taken into 
the cell by endocytosis, and the glycoprotein is degraded in cellular lysosomes. Highest 
affinity binding of glycoprotein to the asialoglycoprotein receptor requires three free 
galactose residues. Oligosaccharides with only one or two exposed galactose residues 
bind less tightly. This is an elegant way for the body to keep track of how long 
glycoproteins have been in circulation. Over a period of time—anywhere from 
a few hours to weeks—the sialic acid groups are cleaved one by one. The longer 
the glycoprotein circulates and the more sialic acid residues are removed, the more 
galactose residues become exposed so that the glycoprotein is eventually bound to the 
liver receptor. 
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Figure 7.35 Progressive cleavage of sialic acid residues exposes 
galactose residues. Binding to the asialoglycoprotein receptor in the 
liver becomes progressively more likely as more Gal residues are exposed. 
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7.6 | How Do Proteoglycans Modulate Processes 
in Cells and Organisms? 


Proteoglycans are a family of glycoproteins whose carbohydrate moieties are 
predominantly glycosaminoglycans. The structures of only a few proteoglycans are 
known, and even these few display considerable diversity (Figure 7.36). Those known 
range in size from serglycin, with 104 amino acid residues (10.2 kD), to versican, with 
2409 residues (265 kD). Each of these proteoglycans contains one or two types of cova- 
lently linked glycosaminoglycans. In the known proteoglycans, the glycosaminoglycan 
units are O-linked to serine residues of Ser-Gly dipeptide sequences. Serglycin is named 
for a unique central domain of 49 amino acids composed of alternating serine and 
glycine residues. The cartilage matrix proteoglycan contains 117 Ser-Gly pairs to which 
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Figure 7.36 The known proteoglycans include a variety of structures. The carbohydrate 
groups of proteoglycans are predominantly glycosaminoglycans O-linked to serine 
residues. Proteoglycans include both soluble proteins and integral transmembrane proteins. 
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chondroitin sulfates attach. Decorin, a small proteoglycan secreted by fibroblasts and 
found in the extracellular matrix of connective tissues, contains only three Ser-Gly pairs, 
only one of which is normally glycosylated. In addition to glycosaminoglycan units, 
proteoglycans may contain other N-linked and O-linked oligosaccharide groups. 


7.6a Functions of Proteoglycans Involve Binding to Other Proteins 


Proteoglycans may be soluble and located in the extracellular matrix, as is the case for 
serglycin, versican, and the cartilage matrix proteoglycan, or they may be integral 
transmembrane proteins, such as syndecan. Both types of proteoglycan appear to func- 
tion by interacting with a variety of other molecules through their glycosaminoglycan 
components and through specific receptor domains in the polypeptide itself. For ex- 
ample, syndecan (from the Greek syndein, meaning “to bind together”) is a transmem- 
brane proteoglycan that associates intracellularly with the actin cytoskeleton. Outside 
the cell, it interacts with fibronectin, an extracellular protein that binds to several cell 
surface proteins and to components of the extracellular matrix. The ability of syndecan 
to participate in multiple interactions with these target molecules allows them to act as 
a sort of glue in the extracellular space, linking components of the extracellular matrix, 
facilitating the binding of cells to the matrix, and mediating the binding of growth fac- 
tors and other soluble molecules to the matrix and to cell surfaces (Figure 7.37). 

Many of the functions of proteoglycans involve the binding of specific proteins to 
the glycosaminoglycan groups of the proteoglycan. The glycosaminoglycan-binding 
sites on these specific proteins contain multiple basic amino acid residues. The amino 
acid sequences BBXB and BBBXXB (where B is a basic amino acid and X is any 
amino acid) recur repeatedly in these binding domains. Basic amino acids such as 
lysine and arginine provide charge neutralization for the negative charges of 
glycosaminoglycan residues, and in many cases, the binding of extracellular matrix 
proteins to glycosaminoglycans is primarily charge dependent. For example, more 
highly sulfated glycosaminoglycans bind more tightly to fibronectin. However, certain 
protein—glycosaminoglycan interactions require a specific carbohydrate sequence. A 
particular pentasaccharide sequence in heparin, for example, binds tightly to 
antithrombin III (Figure 7.38), accounting for the anticoagulant properties of heparin. 
Other glycosaminoglycans interact much more weakly. 
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Figure 7.37 Proteoglycans serve a variety of 
Cytoskeleton functions on the cytoplasmic and extracellular 
(actin) surfaces of the plasma membrane. Many of these 
functions appear to involve the binding of specific 
proteins to the glycosaminoglycan groups. 
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Figure 7.38 A portion of the structure of heparin, a carbohydrate having anticoagulant properties. 
It is used by blood banks to prevent the clotting of blood during donation and storage and also by 
physicians to prevent the formation of life-threatening blood clots in patients recovering from serious 
injury or surgery. This sulfated pentasaccharide sequence in heparin binds with high affinity to 
antithrombin II, accounting for this anticoagulant activity. The 3-O-sulfate marked by an asterisk is 
essential for high-affinity binding of heparin to antithrombin III. 


7.6b Proteoglycans May Modulate Cell Growth Processes 


Several lines of evidence support the possibility of modulation or regulation of cell 
growth processes by proteoglycans. First, heparin and heparan sulfate are known 
to inhibit cell proliferation in a process involving internalization of the glycosamino- 
glycan moiety and its migration to the cell nucleus. Second, fibroblast growth factor 
binds tightly to heparin and other glycosaminoglycans, and the heparin—growth 
factor complex protects the growth factor from degradative enzymes, thus enhancing 
its activity. Binding of fibroblast growth factors by proteoglycans and glycosamino- 
glycans in the extracellular matrix creates a reservoir of growth factors for cells to 
use. Third, transforming growth factor B has been shown to stimulate the synthesis 
and secretion of proteoglycans in certain cells. Fourth, several proteoglycan core 
proteins, including versican and /ymphocyte homing receptor, have domains similar in 
sequence to those of epidermal growth factor and complement regulatory factor. These 
growth factor domains may interact specifically with growth factor receptors in the 
cell membrane in processes that are not yet understood. 


7.6c Proteoglycans Make Cartilage Flexible and Resilient 


Cartilage matrix proteoglycan is responsible for the flexibility and resilience of 
cartilage tissue in the body. In cartilage, long filaments of hyaluronic acid are studded 
or coated with proteoglycan molecules, as shown in Figure 7.39. The hyaluronate 
chains can be as long as 4 wm and can coordinate 100 or more proteoglycan units. 
Cartilage proteoglycan possesses a hyaluronic acid-binding domain on the NH,- 
terminal portion of the polypeptide, which binds to hyaluronate with the assistance 
of a link protein. The proteoglycan—hyaluronate aggregates can have molecular 
weights of 2 million or more. 

The proteoglycan—hyaluronate aggregates are highly hydrated by virtue of strong 
interactions between water molecules and the polyanionic complex. When cartilage is 
compressed (such as when joints absorb the impact of walking or running), water is briefly 
squeezed out of the cartilage tissue and then reabsorbed when the stress is diminished. 
This reversible hydration gives cartilage its flexible, shock-absorbing qualities and cushions 
the joints during physical activities that might otherwise injure the involved tissues. 


7.7 | How Do Carbohydrates Provide a Structural Code? 


The surprisingly low number of genes in the genomes of complex multicellular organisms 
has led biochemists to consider other explanations for biological complexity and diversity. 
Oligosaccharides and polysaccharides, endowed with an unsurpassed variability of 
structures, are information carriers. Glycoconjugates—complexes of proteins and lipids 
with oligosaccharides and polysaccharides—are the mediators of information transfer 
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Carboxylate Figure 7.39 The human knee joint is a marvel 
group of nature’s engineering. Proteoglycans in 
structures in the joint allow it to absorb shock 
during movements such as walking and 
running. Articular cartilage slides with 
extremely low friction against the cushioning 
meniscus when the knee bends. Articular 
Core protein aa cartilage is 68-85% water, adsorbed in a 


od network of collagen fibers (10-20%) and 
Link protein \ i i 


Proteoglycan 


proteoglycans (5—10%). The meniscus is also 
mostly water (70%). The meniscus’s dry weight 
is 75% collagen and 25% proteoglycan. 
Hyaluronate (see Figure 7.28) forms the 
backbone of proteoglycan structures, such as 
those found in cartilage. The proteoglycan 
subunits consist of a core protein containing 
numerous O-linked and N-linked 
glycosaminoglycans. The glycosaminoglycans 
in the meniscus proteoglycan include 
chondroitin sulfate (50-60%), dermatan sulfate 
(20-30%), and keratan sulfate (15%). 

(See Figure 7.28 for these structures.) In 
cartilage, these highly hydrated proteoglycan 
structures are enmeshed in a network of 
collagen fibers. Release (and subsequent 
reabsorption) of water by these structures 
during compression accounts for the shock- 
absorbing qualities of cartilaginous tissue. 
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by these carbohydrate structures. Individual sugar units are the letters of the sugar code, 
and the words and sentences of this code are synthesized by glycosyltransferases, 
glycosidases, and other enzymes. These sentences are read by the binding of proteins to 
these sugar groups. The total number of permutations for a six-unit polymer formed 
from an alphabet of 20 hexose monosaccharides is a staggering 1.44 X 10!5, whereas only 
6.4 X 10’ hexamers can be formed from 20 amino acids and only 4096 hexanucleotides 
can be formed from the four nucleotides of DNA. 
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Studies of glycomics have generated vast amounts of 
information, which is stored and managed in multiple 
publicly available carbohydrate databases: 
http://www.glycosciences.de/ 
http://www.genome.jp/kegg/glycan/ 
http://functionalglycomics.org/ 
http://icggdb.jp/rcmg/glycodb/Ms_ResultSearch 


https://glytoucan.org/ 


The covalent addition of glycans to proteins and lipids represents not only the most 
abundant post-translational modification of proteins, but also the most structurally 
diverse. Glycoconjugates respond to and control metabolic states and developmental 
stages of cells, and they are directly involved in almost every biological process and play 
a major role in nearly every human disease. The vast array of possible glycan structures 
adds a glycomic dimension to the genomic complexity achieved by protein expression in 
organisms. The glycome is the complete set of glycans and glycoconjugates that are 
made by a cell or organism under specific conditions, and glycomics refers to studies 
that attempt to define or quantify the glycome of a cell, tissue, or organism. 

Glycan microarray technology, in which dozens or hundreds of complex glycans 
are immobilized in a resolvable pattern on a solid support, has made it possible to char- 
acterize the interactions between particular glycans and their protein- and enzyme- 
binding partners. The specificities of glycan-binding proteins (GBPs) are determined by 
comparing binding to all glycans presented on the microarray. More than a thousand 
GBPs and glycan-processing enzymes are known, and the human glycome is estimated 
to encompass hundreds of thousands of structures. 


7.7a Lectins Translate the Sugar Code 


The processes of cell migration, cell-cell interaction, immune response, and blood 
clotting, along with many other biological processes, depend on information transfer 
modulated by glycoconjugates. Many of the proteins involved in glycoconjugate 
formation belong to the lectins—a class of proteins that bind carbohydrates with high 
specificity and affinity. Lectins are the translators of the sugar code. Table 7.1 describes 
a few of the many known lectins, their carbohydrate affinities, and their functions. A 
few examples of lectin—carbohydrate complexes and their roles in biological information 
transfer will illustrate the nature of these important and complex interactions. 


7.7b Selectins, Rolling Leukocytes, and the Inflammatory Response 


Human bodies are constantly exposed to a plethora of bacteria, viruses, and other 
inflammatory substances. To combat these infectious and toxic agents, the body has 
developed a carefully regulated inflammatory response system. Part of that response 
is the orderly migration of leukocytes (a class of white blood cells) to sites of inflam- 
mation. Leukocytes roll along the vascular wall and into the tissue site of inflamma- 
tion. This rolling movement is mediated by reversible adhesive interactions between 
the leukocytes and the vascular surface. 

These interactions involve adhesion proteins called selectins, which are found both 
on the rolling leukocytes and on the endothelial cells of the vascular walls. Selectins 
have a characteristic domain structure, consisting of an N-terminal extracellular lectin 
(LEC) domain, a single epidermal growth factor (E) domain, a series of two to nine 


| Table 7.1 | Specificities and Functions of Some Animal Lectins 


Lectin Family Carbohydrate Specificity Function 

Calnexins Glucose Ligand-selective molecular chaperones in ER 

C-type lectins Variable Cell-type specific endocytosis and other 
functions 

ERGIC-53 Mannose Intracellular routing of glycoproteins and 
vesicles 

Galectins Galactose/lactose Cellular growth regulation and cell—matrix 
interactions 

Pentraxins Variable Anti-inflammatory action 

Selectins Variable Cell migration and routing 
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short consensus repeat (SCR) domains, a single transmembrane segment, and a short 
cytoplasmic domain. The lectin domains bind carbohydrates with high affinity and 
specificity. Selectins of three types are known: E-selectins, L-selectins, and P-selectins. 
L-Selectin is found on the surfaces of leukocytes, including neutrophils and lymphocytes, 
and binds to carbohydrate ligands on endothelial cells (Figure 7.40). The presence of 
L-selectin is a necessary component of leukocyte rolling. P-Selectin and E-selectin are 
located on the vascular endothelium and bind with carbohydrate ligands on leukocytes. 
Typical neutrophil cells possess 10,000 to 20,000 P-selectin—binding sites. Selectins are 
expressed on the surfaces of their respective cells by exposure to inflammatory signal 
molecules, such as histamine, hydrogen peroxide, and bacterial endotoxins. P-Selectins, 
for example, are stored in intracellular granules and are transported to the cell membrane 
within seconds to minutes of exposure to a triggering agent. 

Substantial evidence supports the hypothesis that selectin—-carbohydrate ligand 
interactions modulate the rolling of leukocytes along the vascular wall. Studies with 
L-selectin—deficient and P-selectin—deficient leukocytes show that L-selectins mediate 
weaker adherence of the leukocyte to the vascular wall and promote faster rolling along 
the wall. Conversely, P-selectins promote stronger adherence and slower rolling. Thus, 
leukocyte rolling velocity in the inflammatory response could be modulated by variable 
exposure of P-selectins and L-selectins at the surfaces of endothelial cells and leukocytes, 
respectively. 


7.7c Galectins—Mediators of Inflammation, Immunity, and Cancer 


The galectins are a highly conserved family of proteins with carbohydrate recognition 
domains (CRDs) of about 135 amino acids that bind B-galactosides specifically. 
Galectins occur in both vertebrates and invertebrates, and they participate in 
processes such as cell adhesion, growth regulation, inflammation, immunity, and 
cancer metastasis. In humans, one galectin is associated with increased risk of heart 
attacks and another is implicated in inflammatory bowel disease. Human galectin-l 
is a dimer of antiparallel beta-sandwich subunits (Figure 7.41). Lactose binds at 
opposite ends of the dimer. Structural studies of the protein in the presence and 
absence of ligand reveal that the amino acid residues implicated in galactose binding 
are kept in their proper orientation in the absence of ligand by a hydrogen-bonded 
network of four water molecules. 


(a) 


L-Selectin 
Selectin 


receptors Leukocyte 


(b) 


P-Selectin 


E-Selectin 


Endothelial cell 


Figure 7.40 Selectins. (a) The interactions of selectins with their receptors. (b) The selectin family of 
adhesion proteins. 
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obit 


Figure 7.41 Structure of the human galectin-1 dimer. The lactose-binding sites are at opposite ends of 
the dimer. (PDB id = 1W60) 


(a) (b) 


Figure 7.42 C-reactive protein is a lectin that limits inflammation damage. (a) The C-reactive protein 
pentamer. (b) The phosphocholine-binding site of C-reactive protein contains two bound Ca?* ions 
(orange) and a hydrophobic pocket. Phe® is shown in yellow. (PDB id = 1B09) 


C-reactive protein belongs to a protein family called pentraxins. These lectins adopt an 
unusual quaternary structure in which five identical subunits combine to form a planar 
ring with a central hole (Figure 7.42a). C-reactive protein functions to limit tissue 
damage, acute inflammation, and autoimmune reactions. C-reactive protein acts by 
binding to phosphocholine moieties on damaged membranes. A CRP (C-reactive 
protein) test can be used to determine the risk of developing coronary artery disease. 
Binding of the protein to phosphocholine is apparently mediated through a bound 
calcium ion and a hydrophobic pocket centered on Phe® (Figure 7.42b). 
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Problems 


Foundational Biochemistry Things You Should Know after Reading Chapter 7. 


Carbohydrates have the molecular formula (CH,O),. 

Carbohydrates are the metabolic precursors to virtually all other bio- 
molecules and the breakdown of carbohydrates provides energy that 
sustains animal life. 

Carbohydrates contain at least one, and often two or more, asymmet- 
ric centers and may exist in linear or cyclic forms. 

Monosaccharides may be described as aldoses and ketoses, or as tri- 
oses, tetroses, pentoses, and hexoses. Cyclic forms may be described 
as furanoses and pyranoses. 

Stereoisomers that are nonsuperimposable mirror images of each 
other are enantiomers. 

Stereoisomers that have opposite configurations at one or more 
chiral centers, but that are not mirror images of each other, are 
diastereomers. 

Sugars that differ in configuration at only one chiral center are 
epimers. 

When hemiacetals and hemiketals are formed from simple sugars, the 
carbon atom that carried the carbonyl function becomes an asym- 
metric carbon and is known as the anomeric carbon. 
Monosaccharides can be converted to several derivative forms, in- 
cluding sugar acids, sugar alcohols, deoxy sugars, sugar esters, and 
amino sugars. 

The furanose and pyranose forms of monosaccharides react with 
alcohols to form glycosides. The new bond between the anomeric 
carbon and the oxygen atom of the alcohol is called a glycosidic bond. 
Short polymers of simple sugars are known as oligosaccharides. 

The end of an oligosaccharide (or polysaccharide) with an unsubsti- 
tuted anomeric carbon is known as a reducing end. Ends of oligo- 
and polysaccharides that do not contain an unsubstituted anomeric 
carbon are known as nonreducing ends. 


Problems 


Polysaccharides that contain only one kind of monosaccharide are 
termed homopolysaccharides. Polysaccharides that contain two or 
more kinds of monosaccharides are known as heteropolysaccharides. 
Starch, glycogen, and dextrans are storage polysaccharides. Amyl- 
ose and amylopectin are composed of linear chains of D-glucose 
in a(1—>4) linkages. The branches in amylopectin and glycogen are 
a(1—>6) linkages. The main polymer chain of dextrans consists of 
a(l— 6) linkages, whereas the branches are 1—2, 1—3, or 1—4. 
Cellulose is a structural polysaccharide consisting of chains of D- 
glucose with B(1—4) linkages. 

Glycosaminoglycans consist of linear chains of repeating disaccha- 
rides in which one of the monosaccharide units is an amino sugar and 
one or both contain at least one negative charge. 

Gram-positive bacteria have a thick cell wall consisting of multiple 
layers of peptidoglycan. Gram-negative bacteria have a thin pepti- 
doglycan layer sandwiched between two lipid bilayers. 

O-linked saccharides of glycoproteins typically form rigid extended 
extracellular structures. 

N-linked saccharides of glycoproteins affect the physical and chemi- 
cal properties of proteins, stabilizing protein conformations, pro- 
tecting against proteolysis, and promoting proper folding of newly 
synthesized polypeptides. 

Proteoglycans, glycoproteins whose carbohydrate moieties are mostly 
glycosaminoglycans, modulate a variety of processes in cells and 
organisms. 

Oligosaccharides and polysaccharides are information carriers that 
are directly involved in almost every biological process and play a 
major role in many human diseases. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


Drawing Haworth Structures of Sugars Draw Haworth structures for 
the two possible isomers of D-altrose (Figure 7.2) and D-psicose 
(Figure 7.3). 


. Drawing the Structure of a Glycopeptide (Integrates with Chapters 4 


and 5.) Consider the peptide DGNILSR, where N has a covalently 
linked galactose and S has a covalently linked glucose. Draw the 
structure of this glycopeptide, and also draw titration curves for the 
glycopeptide and for the free peptide that would result from 
hydrolysis of the two sugar residues. 


Draw the structures of all the possible epimers of D-arabinose 
(Figure 7.2). 


Draw the structures of all the possible diastereomers of D-xylose 
(Figure 7.2) that are D-sugars. 


If the configuration at C-5 of D-mannose is changed, the molecule 
becomes an L-sugar, but it is not L-mannose. Instead it becomes the 
L-enantiomer of what sugar? (Compare the aldohexose structures in 
Figure 7.2 to determine the answer.) 


The distinction between configuration and conformation was 

described in Section 5.1d. 

a. Consider D-galactose and D-talose (Figure 7.2). Do these 
molecules represent different configurations or different 
conformations? Explain. 


7. 


10. 


b. Draw the two possible chair structures of B-D-allose. Do these 
structures represent two configurations or two conformations? 
Explain. 


Draw the structure of D-2-deoxyribose. Which other D-2-deoxy- 
aldopentose would have the same structure as D-2-deoxyribose? 
(Compare the aldopentose structures in Figure 7.2 to determine the 
answer. 


The ketone group of fructose cannot be oxidized without decompo- 
sition of the sugar, yet fructose is considered to be a reducing sugar. 
Explain. (Hint: fructose can undergo tautomerizations.) 


Naming and Characterizing a Disaccharide Trehalose, a disaccharide 
produced in fungi, has the following structure: 


CH,OH H OH 
O. 
H H 
OH H 
HO 
H OH 


a. What is the systematic name for this disaccharide? 
b. Is trehalose a reducing sugar? Explain. 


Drawing the Fischer Projection of a Simple Sugar Draw a Fischer 
projection structure for L-sorbose (D-sorbose is shown in Figure 7.3). 
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11. 


12. 


13. 


14. 
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Linked Carbohydrates Consider the structure shown and give the 
proper description of the linkage from A to B. 


OH OH 


Conversion of Fisher Projection into Haworth Projection Consider the 
structure shown. Draw the Haworth projection for the corresponding 
B-pyranose of this carbohydrate. 


N 
Da 


HN —C—H 
HO—C—H 
H—C—OH 


H—C—OH 


CHOH 
Epimers-Anomers-Enantiomers Consider the structure shown and 


draw its anomer, enantiomer, and C3 epimer. 


HOH,C OH 


H H 
H CHOH 


OH OH 


Carbohydrate Classification Consider the structures shown. For each 
structure select all descriptions that apply. Is it a pentose, hexose, or 
heptose? D or L? Ketose or aldose? œ or 8? Pyranose or furanose? 


O H 


HO 


H 


HO 


HO 


Structure A 


15. 


16. 


17. 


—C—OH 


—C—OH 


—C—H 


=—C—H 


Cc 


OH 


CH,OH 


OH NH, 


CH,OH 


Structure B Structure C 
Calculating the Composition of Anomeric Sugar Mixtures a-D- 
Glucose has a specific rotation, [a],”°, of +112.2°, whereas B-D- 
glucose has a specific rotation of +18.7°. What is the composition of 
a mixture of a-D- and B-D-glucose, which has a specific rotation of 
83.0°? 


Naming Sugars in the (R,S) System Use the information in the 
Critical Developments in Biochemistry box titled “Rules for 
Description of Chiral Centers in the (R, S) System” (Chapter 4) to 
name D-galactose using (R, S) nomenclature. Do the same for 
L-altrose. 


Determining the Branch Points and Reducing Ends of Amylopectin 
A 0.2 g sample of amylopectin was analyzed to determine the 
fraction of the total glucose residues that are branch points in the 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


structure. The sample was exhaustively methylated and then digested, 
yielding 50 umol of 2,3-dimethylglucose and 0.4 umol of 1,2,3,6- 
tetramethylglucose. 

a. What fraction of the total residues are branch points? 


b. How many reducing ends does this sample of amylopectin have? 


The Effect of Carbohydrates on Proteolysis of Glycophorin (Integrates 
with Chapters 5, 6, and 9.) Consider the sequence of glycophorin 
(see Figure 9.10), and imagine subjecting glycophorin, and also a 
sample of glycophorin treated to remove all sugars, to treatment with 
trypsin and chymotrypsin. Would the presence of sugars in the native 
glycophorin make any difference to the results? 


Assessing the Caloric Content of Protein and Carbohydrate (Integrates 
with Chapters 4, 5, and 23.) The caloric content of protein and 
carbohydrate are quite similar, at approximately 16 to 17 kJ/g, 
whereas that of fat is much higher, at 38 kJ/g. Discuss the chemical 
basis for the similarity of the values for carbohydrate and for protein. 


Writing a Mechanism for Starch Phosphorylase Write a reasonable 
chemical mechanism for the starch phosphorylase reaction 
(Figure 7.22). 


Assessing the Toxicity of Laetrile Laetrile is a glycoside found in 
bitter almonds and peach pits. Laetrile treatment is offered in some 
countries as a cancer therapy. This procedure is dangerous, and there 
is no valid clinical evidence of its efficacy. Look up the structure of 
laetrile and suggest at least one reason that laetrile treatment could 
be dangerous for human patients. 


Assessing the Efficacy of Glucosamine and Chondroitin for Arthritis 
Pain Treatment with chondroitin and glucosamine is offered as one 
popular remedy for arthritis pain. Suggest an argument for the 
efficacy of this treatment, and then comment on its validity, based on 
what you know of polysaccharide chemistry. 


Assessing a Remedy for Flatulence Certain foods, particularly beans 
and legumes, contain substances that are indigestible (at least in part) 
by the human stomach, but which are metabolized readily by 
intestinal microorganisms, producing flatulence. One of the compo- 
nents of such foods is stachyose. 


CHOH 


HO 


Stachyose 


CHOH 


OH 


Beano is a commercial product that can prevent flatulence. Describe 
the likely breakdown of stachyose in the human stomach and 
intestines and how Beano could contribute to this process. What 
would be an appropriate name for the active ingredient in Beano? 


Determining the Systematic Name for a Trisaccharide Give the 
systematic name for stachyose. 


Assessing the Formation and Composition of Limit Dextrins Pro- 
longed exposure of amylopectin to starch phosphorylase yields a 
substance called a limit dextrin. Describe the chemical composition 
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26. 


27. 


28. 


29. 


30. 


31. 


32. 


of limit dextrins, and draw a mechanism for the enzyme-catalyzed 
reaction that can begin the breakdown of a limit dextrin. 


Assessing the Chemistry and Enzymology of “Light Beer” Biochemist 
Joseph Owades revolutionized the production of beer in the United 
States by developing a simple treatment with an enzyme that 
converted regular beer into “light beer,” which was marketed 
aggressively as a beverage that “tastes great,” even though it is “less 
filling.” What was the enzyme-catalyzed reaction that Owades used 
to modify the fermentation process so cleverly, and how is regular 
beer different from light beer? 


Brewing “Light Beer” on a Budget Amateur brewers of beer, who do 
not have access to the enzyme described in Problem 26, have 
nonetheless managed to brew light beers using a readily available 
commercial product. What is that product, and how does it work? 


Assessing the Growth Potential and Enzymology of a Prolific Plant 
Nuisance Kudzu is a vine that grows prolifically in the southern and 
southeastern United States. A native of Japan, China, and India, 
kudzu was brought to the United States in 1876 at the Centennial 
Exposition in Philadelphia. During the Great Depression of the 
1930s, the Soil Conservation Service promoted kudzu for erosion 
control, and farmers were paid to plant it. Today, however, kudzu is a 
universal nuisance, spreading rapidly, and covering and destroying 
trees in large numbers. Already covering 7 to 10 million acres in the 
United States, kudzu grows at the rate of a foot per day. Assume that 
the kudzu vine consists almost entirely of cellulose fibers, and 
assume that the fibers lie parallel to the vine axis. Calculate the rate 
of the cellulose synthase reaction that adds glucose units to the grow- 
ing cellulose molecules. Use the structures in your text to make a 
reasonable estimate of the unit length of a cellulose molecule (from 
one glucose monomer to the next). 


Lactaid is a commercial product that mitigates intestinal distress for 
individuals who are lactose intolerant. What is the active ingredient 
in Lactaid that accounts for its beneficial effects? 


Lactaid milk is available in most grocery stores. It is regular milk 
from which the lactose sugar has been removed. How is Lactaid milk 
prepared commercially? Is the lactose sugar separated from the milk, 
or could there be another way to produce it? How could a clever 
biochemistry student accomplish this at home? Consult appropriate 
online resources to learn more: 

https://www.fodmapeveryday.com /diy-lactose-free-dairy/ 

and 

https: //www.lacteeze.com.au/about-lacteeze/how-does-lacteeze-help/. 


It is estimated that one-third of the proteins encoded by the 
mammalian genome are made at and then exported from the 
endoplasmic reticulum, and nearly all of these are glycosylated at 
asparagine residues. The N-linked oligosaccharides on these proteins 
are said to enhance solubility and folding via what are termed 
intrinsic and extrinsic effects. Consult a suitable reference (for 
example, Hebert, D. N., Lamriben, L., Powers, E. T., and Kelly, J. W., 
2014. The intrinsic and extrinsic effects of N-linked glycans on 
glycoproteostasis. Nature Chemical Biology 10:902-910 ) and 
describe the detailed nature of these intrinsic and extrinsic effects on 
protein solubility and folding. 


The plant family Cucurbitaceae includes melons, cucumbers, and 
pumpkins, all of which produce particularly large fruits. Growth of 
such large fruits depends on a transport system that can deliver the 


33. 


34. 


35. 


36. 


37. 


38. 
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products of photosynthesis—sugars and other nutrients—from the 
leaves through the stem and into the sink tissues, such as the fruits 
and roots, where they are used in growth and storage. The phloem is 
the living tissue that is responsible for this transport. Consult suitable 
references (for example, Turgeon, R., and Oparka, K., 2010. The 
secret phloem of pumpkins. Proceedings of the National Academy of 
Sciences 107:13201-13202) to learn: 


a. Which part of the phloem is primarily responsible for transport 
of nutrients, 

b. what is the concentration of sugars in this phloem that enables 
the growth of large fruits like pumpkins, and 

c. what is the polysaccharide that is deposited at the site of tissue 
damage in such plants? 

d. Draw the two kinds of sugar—sugar linkages found in this 
polysaccharide. 


Lewis-a trisaccharide is a fucosyloligosaccharide present in human 
milk and colostrum, which is important for the protection of infants 
against infection by pathogenic microorganisms. Lewis-a trisaccharide 
is found on the surface of erythrocytes and is a tumor-related antigen. 
It is an amino trisaccharide consisting of N-acetylglucosamine having 
a fucosyl residue attached at the 4-position via an alpha-linkage and a 
galactosyl residue attached at the 3-position via a B-linkage. Draw the 
structure of this trisaccharide. 


The human mannose receptor is expressed on macrophages and on 
liver endothelial cells. It acts to scavenge lysosomal proteins, 
glycopeptide fragments of collagen, and pathogenic organisms. The 
receptor binds fucose, mannose, and N-acetylglucosamine residues 
on these target proteins, mainly via interactions with a Ca** ion on 
the CRD4 domain of the receptor. Glycan microarray screening has 
been used recently to identify the preferred oligosaccharide ligands 
of the receptor (Feinberg, H., Jegouzo, S., Lasanajak, Y., et al., 2021. 
Structural analysis of carbohydrate binding by the macrophage 
mannose receptor CD206. J. Biol. Chem. 296:100368.) Consult this 
reference, summarize the results of the glycan microarray screening, 
describe the domain structure of the mannose receptor, and describe 
the details of the interaction of mannose and fucose ligands with the 
Ca? ion in the CRD4 domain. 


Heparin has a characteristic pattern of hydroxy and anionic 
functions. Which amino acid side chains on antithrombin III might 
be the basis for the strong interactions between this protein and the 
anticoagulant heparin? 


What properties of hyaluronate, chondroitin sulfate, and keratan 
sulfate make them ideal components of cartilage? 


Use a suitable molecular graphics software package (PyMOL, JSmol, 
or ICM-Browser — all available as free downloads) to study and 
describe the C-reactive protein complexed with phosphocholine 
(Figure 7.42, PDB id = 1B09). What are the residues that coordinate 
the phosphocholine? What are the interactions that facilitate the 
formation of the pentamer structure? 


Use a suitable molecular graphics software package (PyMOL, JSmol, 
or ICM-Browser — all available as free downloads) to study and 
describe the galectin-1 dimer (Figure 7.41, PDB id = 1W60). What 
are the hydrophobic residues at the dimer interface that facilitate 
dimer formation? What are the residues that coordinate the bound 
lactose? 
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Lipids 


8 


A feast of fat things, a feast of wines on the lees. 


Isaiah 25:6 


< Whales have a layer of superficial blubber that 
provides insulation, buoyancy, and an energy reserve. 
(Humpback whale [Megaptera novaeangliae] 
breaching, Cape Cod, MA) 


Essential Question 


Key Questions 


Lipids are a class of biological molecules defined by low solubility in water and high solubil- 
ity in nonpolar solvents. Lipid molecules are key components of membranes and also serve 
a myriad of roles as signal molecules in biological systems. What are the structure, chemis- 
try, and biological function of lipids? 


| ees are largely hydrocarbon in nature. They consist of highly reduced forms of 
carbon and, upon oxidation in metabolism, yield large amounts of energy. Lipids are 
thus the molecules of choice for metabolic energy storage. Lipids are integrators of cel- 
lular function and intercellular communication. Lipid—lipid and lipid-protein 
interactions regulate cellular physiology. The lipids found in biological systems are either 
hydrophobic (containing only nonpolar groups) or amphipathic (possessing both polar 
and nonpolar groups). The hydrophobic nature of lipid molecules allows membranes to 
act as effective barriers to more polar molecules. In this chapter, we discuss the chemical 
and physical properties of the various classes of lipid molecules. The following chapter 
considers membranes, whose properties depend intimately on their lipid constituents. 


8.1 | What Are the Structures and Chemistry of Fatty Acids? 


Fatty acids are lipids that rarely occur as free molecules in nature but are components 
of more complex lipids. A fatty acid is composed of a long hydrocarbon chain (the 
tail) and a terminal carboxyl group (the head). The carboxyl group is normally ion- 
ized under physiological conditions. Fatty acids occur in large amounts in biological 
systems but only rarely in the free, uncomplexed state. They are typically esterified to 
glycerol or other backbone structures. Most of the fatty acids found in nature have an 
even number of carbon atoms (usually 14 to 24). Certain marine organisms, however, 
contain substantial amounts of fatty acids with odd numbers of carbon atoms. Fatty 
acids are either saturated (all carbon—carbon bonds are single bonds) or unsaturated 


8.1 What Are the Structures and 
Chemistry of Fatty Acids? 


8.2 What Are the Structures and 
Chemistry of Triacylglycerols? 

8.3 What Are the Structures and 
Chemistry of Glycerophospholipids? 

8.4 What Are Sphingolipids, and How Are 
They Important for Higher Animals? 

8.5 What Are Waxes, and How Are They 
Used? 

8.6 What Are Terpenes, and What Is Their 
Relevance to Biological Systems? 

8.7 What Are Steroids, and What Are 
Their Cellular Functions? 

8.8 How Do Lipids and Their Metabolites 
Act as Biological Signals? 


8.9 What Can Lipidomics Tell Us about 
Cell, Tissue, and Organ Physiology? 
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| Table 81 | 8.1 Common Biological Fatty Acids 


Number of Carbons Common Name 


Saturated fatty acids 


12 Lauric acid 

14 Mpyristic acid 
16 Palmitic acid 
18 Stearic acid 

20 Arachidic acid 
22 Behenic acid 
24 Lignoceric acid 


Unsaturated fatty acids (all double bonds are cis) 


16 Palmitoleic acid 
18 Oleic acid 

18 Linoleic acid 

18 a-Linolenic acid 
18 y-Linolenic acid 
20 Arachidonic acid 
24 Nervonic acid 


Systematic Name Symbol Structure 

Dodecanoic acid 12:0 CH;(CH2) COOH 

Tetradecanoic acid 14:0 CH;3(CH3);,COOH 

Hexadecanoic acid 16:0 CH;(CH3),4COOH 

Octadecanoic acid 18:0 CH;3(CH3);6 COOH 

Eicosanoic acid 20:0 CH;(CH2)ı COOH 

Docosanoic acid 22:0 CH3(CH2)29 COOH 

Tetracosanoic acid 24:0 CH;(CH32)2.>COOH 

9-Hexadecenoic acid 16:1* CH;(CH2);CH=CH(CH32);COOH 
9-Octadecenoic acid 18:1 CH3(CH2);CH=CH(CH2);COOH 
9,12-Octadecadienoic acid 18:2 CH;3(CH32)4(CH=CHCH3)2(CH2)6 COOH 
9,12,15-Octadecatrienoic acid 18:3 CH3CH,(CH=CHCH);)3;(CH2)sCOOH 
6,9,12-Octadecatrienoic acid 18:3 CH;(CH32)4(CH=CHCH;)3(CH2); COOH 
5,8,11,14-Eicosatetraenoic acid 20:4 CH;(CH)2)a4(CH=CHCH>)4(CH2)»COOH 
15-Tetracosenoic acid 24:1 CH;3(CH3);CH=CH(CH;),;COOH 


*Palmitoleic acid can also be described as 16:149, in a convention used to indicate the position of the double bond. 


Luis Carlos Jimenez del rio/Shutterstock.com 


Olive oil varieties exhibit different fatty acid composi- 
tions, with additional variation depending on matura- 
tion stage of the fruit and the conditions of growth. For 
example, Koroneiki olives from Greece contain 12.9% 
palmitic acid and 74.4% oleic acid, whereas Chemlali 
olives from Tunisia contain 19.7% palmitic acid and 
only 54.9% oleic acid. 


(with one or more double bonds in the hydrocarbon chain). If a fatty acid has a single 
double bond, it is said to be monounsaturated, and if it has more than one, polyunsatu- 
rated. Fatty acids can be named or described at least three ways, as shown in Table 8.1. 
For example, a fatty acid composed of an 18-carbon chain with no double bonds can 
be called by its systematic name (octadecanoic acid), its common name (stearic acid), 
or its shorthand notation, in which the number of carbons is followed by a colon and 
the number of double bonds in the molecule (18:0 for stearic acid). The structures of 
several common fatty acids are given in Figure 8.1. Stearic acid (18:0) and palmitic acid 
(16:0) are the most common saturated fatty acids in nature. 

Free rotation around each of the C-C single bonds makes saturated fatty acids 
extremely flexible molecules. Due to steric constraints, however, the fully extended con- 
formation (Figure 8.1) is the most stable for saturated fatty acids. Nonetheless, the 
degree of stabilization is slight, and (as will be seen) saturated fatty acid chains adopt a 
variety of conformations. 

Unsaturated fatty acids are slightly more abundant in nature than saturated fatty 
acids, especially in higher plants. The most common unsaturated fatty acid is oleic acid, 
or 18:14°, with the number following the superscript A indicating that the double bond 
is between carbons 9 and 10. The number of double bonds in an unsaturated fatty acid 
typically varies from one to four, but in the fatty acids found in most bacteria, this num- 
ber rarely exceeds one. 

The double bonds found in fatty acids are nearly always in the cis configuration. As 
shown in Figure 8.1, this causes a bend or kink in the fatty acid chain. This bend has very 
important consequences for the structure of biological membranes. Saturated fatty acid 
chains can pack closely together to form ordered, rigid arrays under certain conditions, but 
unsaturated fatty acids prevent such close packing and produce flexible, fluid aggregates. 

Some fatty acids are not synthesized by mammals and yet are necessary for normal 
growth and life. These essential fatty acids include linoleic and y-linolenic acids. These 
must be obtained by mammals in their diet (specifically from plant sources). Arachidonic 
acid, which is not found in plants, can be synthesized by mammals only from linoleic 
acid. At least one function of the essential fatty acids is to serve as a precursor for the 
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8.1 What Are the Structures and Chemistry of Fatty Acids? 
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Figure 8.1 The structures of some typical fatty acids. Note that most natural fatty acids contain an 
even number of carbon atoms and that the double bonds are nearly always cis and rarely conjugated. 
Fatty acids are shown in their most protonated forms. (a) palmitic acid, (b) stearic acid, (c) oleic acid, 
(d) linoleic acid, (e) a-linolenic acid, and (f) arachidonic acid. 


synthesis of eicosanoids, such as prostaglandins, a class of compounds that exert hor- 
monelike effects in many physiological processes. 

Fats in the modern human diet vary widely in their fatty acid composition 
(Table 8.2) but play an important role in human health. The incidence of cardiovascular 
disease is correlated with diets high in saturated fatty acids. By contrast, a diet relatively 
higher in unsaturated fatty acids (especially polyunsaturated fatty acids) may reduce the 
risk of heart attacks and strokes. Although vegetable oils usually contain a higher pro- 
portion of unsaturated fatty acids than do animal oils and fats, several plant oils are 
actually high in saturated fats. Palm oil is low in polyunsaturated fatty acids and 
particularly high in (saturated) palmitic acid (hence the name palmitic). Coconut oil is 
particularly high in lauric and myristic acids (both saturated) and contains little unsatu- 
rated fatty acid. 

Canola oil has been promoted as a healthy dietary oil because it consists primarily 
of oleic acid (60%), linoleic acid (20%), and a-linolenic acid (9%) with a very low satu- 
rated fat content (7%). Canola oil is actually rapeseed oil, from the seeds of the rape 
plant Brassica rapa (from the Latin rapa, meaning “turnip”), a close relative of mus- 
tard, kale, cabbage, and broccoli. Asians and Europeans used rapeseed oil in lamps for 
hundreds of years, but it was not usually considered edible because of its high erucic 


Arachidonic acid 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


240 Chapter 8 Lipids 


H ou 


Elaidic acid 
trans double bond at 9,10 position 


H OH 


Vaccenic acid 
trans double bond at 11,12 position 


Figure 8.2 Structure of elaidic acid and vaccenic 
acid, two trans fatty acids. 


Table 8.2 Fatty Acid Compositions of Some Dietary Lipids* 
Lauric and 

Source Myristic Palmitic Stearic Oleic Linoleic 
Beef 5 24-32 20-25 37-43 2-3 
Milk 25 12 33 3 
Coconut 74 10 2 7 

Corn 8-12 3-4 19-49 34-62 
Olive 9 2 84 4 
Palm 39 4 40 8 
Safflower 6 3 13 78 
Soybean 9 6 20 52 
Sunflower 6 1 21 66 


Data from Merck Index, 10th ed. Rahway, NJ: Merck and Co.; and Wilson, E. D., et al., 1979, Principles of Nutrition, 
4th ed. New York: Wiley. 
*Values are percentages of total fatty acids. 


acid content, a 22:14'? monounsaturated fatty acid (often 20% to 60%). In the first half 
of the twentieth century, it was used as a steam engine lubricant (especially in World 
War IT). Conventional breeding techniques have reduced the erucic acid content to less 
than 1%, producing the “canola oil” (the name is derived from Canadian oil, low acid) 
now used so commonly for cooking and baking. 

Although most unsaturated fatty acids in nature are cis fatty acids, trans fatty acids 
are formed by some bacteria via double-bond migration and isomerization. These bacte- 
rial reactions produce trans fats in ruminant animals (which carry essential bacteria in 
their rumen), and butter, milk, cheese, and the meat of these animals contain modest 
quantities of trans fats (typically 2% to 8% by weight), such as those in Figure 8.2. Mar- 
garine and other processed fats, made by partial hydrogenation of polyunsaturated oils 
(for example, corn, safflower, and sunflower) contain substantial levels of various trans 
fats, and clinical research has shown that chronic consumption of processed foods con- 
taining partially hydrogenated vegetable oils can contribute to cardiovascular disease. 
Diets high in trans fatty acids raise plasma low-density lipoprotein (LDL) cholesterol 
and triglyceride levels while lowering high-density lipoprotein (HDL) cholesterol levels. 
The effects of trans fatty acids on LDL, HDL, and cholesterol levels are similar to those 


Chemistry in History 


Alice Ball 


Alice Ball (1892-1916) was both the first African 
American and the first woman to earn a Master of 
Chemistry from the University of Hawaiʻi in 1915. 
A surgeon at the U.S. Public Health Service in 
Hawai'i, learning of her Master’s research on plants, 
invited Alice to study the oil of the chaulmoogra 
tree, which had been used for hundreds of years as 
a treatment for leprosy. Taken by mouth, the oil was 
an effective remedy, but caused unbearable nausea 
in patients. Ball learned to convert the fatty acids in 
the oil to ethyl esters, which were water-soluble and 
could be injected, without the nauseous side effects. 
The “Ball Method” was used until the 1940s when it 
was replaced by synthetic antibiotics. (More modern 
research showed that the therapeutic fatty acids in 
chaulmoogra oil contained novel cyclopenteny] rings.) 


After finishing her studies, Ball accepted a 
teaching position at the University of Hawaiʻi as 
a chemistry instructor. Alice died unexpectedly in 
1916 at the age of 24 from complications related to 
chlorine gas inhalation during a lab accident. The 
president of the college at the time continued her 
research after her death and claimed the discovery 
for himself, until a local doctor published a paper 
on the subject that gave Ball the proper credit she 
deserved. In 2000, the governor of Hawaiʻi declared 
February 29 “Alice Ball Day”, and a plaque was 
erected at a chaulmoogra tree on the university 
campus. Alice Ball was posthumously awarded 
the Regents Medal of Distinction for her scientific 
contributions and a scholarship was created in 
her name. 
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8.2 What Are the Structures and Chemistry of Triacylglycerols? 


of saturated fatty acids. Diets aimed at reducing the risk of coronary heart disease should 
be low in both trans and saturated fatty acids. The U.S. Food and Drug Administration 
(USFDA) has banned trans fats in foods sold in the United States as of May 2019. 


8.2 | What Are the Structures and Chemistry 
of Triacylglycerols? 


A significant number of the fatty acids in plants and animals exist in the form of 
triacylglycerols (also called triglycerides). Triacylglycerols are a major energy reserve 
and the principal neutral derivatives of glycerol found in animals. These molecules 
consist of a glycerol esterified with three fatty acids (Figure 8.3). If all three fatty acid 
groups are the same, the molecule is called a simple triacylglycerol. Examples include 
tristearoylglycerol (common name tristearin) and trioleoylglycerol (triolein). Mixed 
triacylglycerols contain two or three different fatty acids. Triacylglycerols in animals 
are found primarily in the adipose tissue (body fat), where it serves as a depot or res- 
ervoir of stored energy. Monoacylglycerols and diacylglycerols are also found in na- 
ture, but they are far less common than the triacylglycerols. Most natural plant and 
animal fat is composed of mixtures of simple and mixed triacylglycerols. 

Acylglycerols can be hydrolyzed by heating with acid or base or by treatment with 
lipases. Hydrolysis with alkali is called saponification and yields salts of free fatty acids 
and glycerol. This is how our ancestors made soap (a metal salt of an acid derived from 
fat). One method used potassium hydroxide (potash) leached from wood ashes to 
hydrolyze animal fat (mostly triacylglycerols). (The tendency of such soaps to be pre- 
cipitated by Mg?* and Ca?* ions in hard water makes them less useful than modern 
detergents.) 

When the fatty acids esterified at the first and third carbons of glycerol are different, 
the second carbon is asymmetric. The various acylglycerols are normally soluble in ben- 
zene, chloroform, ether, and hot ethanol. Although triacylglycerols are insoluble in water, 
monoacylglycerols and diacylglycerols readily form organized structures in water (see 
Chapter 9), due to the polarity of their free hydroxyl groups. 

Triacylglycerols are rich in highly reduced carbons and thus yield large amounts of 
energy in the oxidative reactions of metabolism. Complete oxidation of 1 g of triacyl- 
glycerols yields about 38 kJ of energy, whereas proteins and carbohydrates yield only 
about 17 kJ/g. Also, their hydrophobic nature allows them to aggregate in highly 


(a) (b) (c) 


H,C — CH — CH, HC — CH — CH, H,C — CH — CH, 

| | | / | \ / | \ 
HO OH OH O O O oO (0) O 

| | | | | | 

Glycerol o=¢ Ç=0 Ç= o=c <=o C=0 
Myristic Palmitoleic 
Stearic 
Tristearin 

(a simple triacylglycerol) A mixed triacylglycerol 


Figure 8.3 Triacylglycerols are formed from glycerol and fatty acids. (a) Glycerol, (b) tristearin, and 
(c) a mixed triacylglycerol. 
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A Deeper Look 


Polar Bears Prefer Nonpolar Food 


The polar bear is magnificently adapted to thrive in its harsh Arctic 
environment. Research by Malcolm Ramsay at the University of 
Saskatchewan in Canada and others has shown that polar bears gorge 
only during a few weeks out of the year and then fast for periods of 
8 months or more, consuming no food or water during that time. 
The adult polar bear feeds almost exclusively on seal blubber (largely 
composed of triacylglycerols), thus building up its own triacylglyc- 
erol reserves. Through the Arctic summer, the polar bear maintains 
normal physical activity, roaming over long distances, relying almost 
entirely on its body fat for sustenance, burning as much as 1 kg to 
1.5 kg of fat per day. It neither urinates nor defecates for extended 
periods. All the water needed to sustain life is provided from the 
metabolism of triacylglycerols because oxidation of fatty acids yields 
carbon dioxide and water. 

Ironically, the word Arctic comes from the ancient Greeks, who 
understood that the northernmost part of the earth lay under the 
stars of the constellation Ursa Major, the Great Bear. Although 
unaware of the polar bear, they called this region Arktikés, which 
means “the country of the great bear.” 


© Incredible Arctic/Shutterstock.com 


anhydrous forms, whereas polysaccharides and proteins are highly hydrated. For these 
reasons, triacylglycerols are the molecules of choice for energy storage in animals. Body 
fat (mainly triacylglycerols) also provides good insulation. Whales and Arctic mammals 
rely on body fat for both insulation and energy reserves. 


8.3 | What Are the Structures and Chemistry 
of Glycerophospholipids? 


A 1,2-diacylglycerol that has a phosphate group esterified at C-3 of the glycerol back- 
bone is a glycerophospholipid, also known as a phosphoglyceride or a glycerol phospha- 
tide (Figure 8.4). These lipids form one of the largest and most important classes of 
natural lipids. They are essential components of cell membranes and are found in small 
concentrations in other parts of the cell. It should be noted that all glycerophospholip- 
ids are members of the broader class of lipids known as phospholipids. 

The numbering and nomenclature of glycerophospholipids present a dilemma in 
that C-2 of the glycerol backbone of a phospholipid is asymmetric. It is possible to 
name these molecules either as D- or L-isomers. Thus, glycerol phosphate itself can be 
referred to either as D-glycerol-1-phosphate or as L-glycerol-3-phosphate (Figure 8.5). 
Instead of naming the glycerol phosphatides in this way, biochemists have adopted the 
stereospecific numbering or sn- system. The stereospecific numbering system is based on 
the concept of prochirality. If a tetrahedral center in a molecule has two identical 


Figure 8.4 Phosphatidic acid, the parent compound for glycerophospholipids. 
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(a) 
HOH.C CHOH 
2a yR 
A 
H OH 1-d, 2(S)-Glycerol 
Glycerol (S-configuration at C-2) 
(b) 
pro-S position CHOH CH,OPO;7— 
HO—C—H = H—C—OH 
| 
pro-R position CH,0PO ie CHOH 
L-Glycerol-3-phosphate D-Glycerol-l-phosphate 
X J. 
N 
sn-Glycerol-3-phosphate 


Figure 8.5 (a) The two identical —CH,OH groups on the central carbon of glycerol may be 
distinguished by imagining a slight increase in priority for one of them (by replacement of an H by a D) 
as shown. (b) The absolute configuration of sn-glycerol-3-phosphate is shown. The pro-R and pro-S 
positions of the parent glycerol are also indicated. 


substituents, it is referred to as prochiral because if either of the like substituents is con- 
verted to a different group, the tetrahedral center then becomes chiral. Consider glyc- 
erol (Figure 8.5): The central carbon of glycerol is prochiral because replacing either of 
the —CH OH groups would make the central carbon chiral. Nomenclature for prochi- 
ral centers is based on the (R, S) system (see Chapter 4). To name the otherwise identical 
substituents of a prochiral center, imagine increasing slightly the priority of one of 
them (by substituting a deuterium for a hydrogen, for example) as shown in Figure 8.5. 
The resulting molecule has an (S) configuration about the (now chiral) central carbon 
atom. The group that contains the deuterium (an isotope of hydrogen with an added 
neutron) is thus referred to as the pro-S group. As a useful exercise, you should confirm 
that labeling the other CH.OH group with a deuterium produces the (R) configuration 
at the central carbon so that this latter CHOH group is the pro-R substituent. 

Now consider the two presentations of glycerol phosphate in Figure 8.5. In the 
stereospecific numbering system, the pro-S position of a prochiral atom is denoted as 
the /-position, the prochiral atom as the 2-position, and so on. When this scheme is used, 
the prefix sn- precedes the molecule name (glycerol phosphate in this case) and 
distinguishes this nomenclature from other approaches. In this way, the glycerol phos- 
phate in natural phosphoglycerides is named sn-glycerol-3-phosphate. 


8.3a Glycerophospholipids Are the Most Common Phospholipids 


Phosphatidic acid, the parent compound for the glycerol-based phospholipids 
(Figure 8.4), consists of sn-glycerol-3-phosphate, with fatty acids esterified at the 1- 
and 2-positions. Phosphatidic acid is found in small amounts in most natural systems 
and is an important intermediate in the biosynthesis of the more common glycero- 
phospholipids (Figure 8.6). In these compounds, a variety of polar groups are esteri- 
fied to the phosphoric acid moiety of the molecule. The phosphate, together with such 
esterified entities, is referred to as the head group. Phosphatides with choline or etha- 
nolamine are referred to as phosphatidylcholine (known commonly as lecithin) or 
phosphatidylethanolamine, respectively. These phosphatides are two of the most com- 
mon constituents of biological membranes. Other common head groups found in 
phosphatides include glycerol, serine, and inositol (Figure 8.6). Another kind of glyc- 
erol phosphatide found in many tissues is diphosphatidylglycerol. First observed in 
heart tissue, it is also called cardiolipin. In cardiolipin, a phosphatidylglycerol is 
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(a) 


O ps 
I 
O—P—O— CH,CH,— N+ CH; 
| 


Phosphatidylcholine o~ CH; 


(b) 
GLYCEROLIPIDS WITH OTHER HEAD GROUPS: 


O 


r 
O—P—O—CH,CH,— NH, O—P—O—CH, 


| | 
oT o7 
. H—C— OH 
Phosphatidylethanolamine o | 


O—P—O—CH, 
(0) COO- a 
Il | 9 
O—P—O—CH,—CH 


Diphosphatidylglycerol (Cardiolipin) 


Figure 8.6 Structures of several glycerophospholipids: (a) Phosphatidylcholine, with shading to indicate the 
diacylglycerol backbone. (b) Structures built on the same diacylglycerol core: phosphatidylethanolamine, 
phosphatidylserine, phosphatidylglycerol, diphosphatidylglycerol (cardiolipin) and phosphatidylinositol. 

(c) Space-filling models of phosphatidylcholine, phosphatidylglycerol, and phosphatidylinositol. 


esterified through the C-1 hydroxyl group of the glycerol moiety of the head group to 
the phosphoryl group of another phosphatidic acid molecule. 

Phosphatides exist in many different varieties, depending on the fatty acids esterified 
to the glycerol group. As we shall see, the nature of the fatty acids can greatly affect the 
chemical and physical properties of the phosphatides and the membranes that contain 
them. In most cases, glycerol phosphatides have a saturated fatty acid at position 1 and 
an unsaturated fatty acid at position 2 of the glycerol. Thus, /-stearoyl-2-oleoyl- 
phosphatidylcholine (Figure 8.7) is a common constituent in natural membranes, but 
1-linoleoyl-2-palmitoylphosphatidylcholine is not. 

Both structural and functional strategies govern the natural design of the many differ- 
ent kinds of glycerophospholipid head groups and fatty acids. Glycerophospholipids 
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influence and modulate membrane structure and function. Certain phospholipids, 
including phosphatidylinositol and phosphatidylcholine, participate in complex cellular 
signaling events. 


8.3b Ether Glycerophospholipids Include PAF and Plasmalogens 


Ether glycerophospholipids possess an ether linkage instead of an acyl group at the C-1 
position of glycerol (Figure 8.8a). One of the most versatile biochemical signal mole- 
cules found in mammals is platelet-activating factor, or PAF, a unique ether glycero- 
phospholipid (Figure 8.8b). The alkyl group at C-1 of PAF is typically a 16-carbon 
chain, but the acyl group at C-2 is a 2-carbon acetate unit. By virtue of this acetate 
group, PAF is much more water soluble than other lipids, allowing PAF to function as 
a soluble messenger in signal transduction. 

Plasmalogens are ether glycerophospholipids in which the alkyl moiety is cis-a, 5; gure 6:7’ Aspase-Ailine modeior i:stearyl- 
B-unsaturated (Figure 8.9). Common plasmalogen head groups include choline, etha- 2_oleoyl-phosphatidylcholine. 
nolamine, and serine. These lipids are referred to as phosphatidal choline, phosphatidal 
ethanolamine, and phosphatidal serine. 


(a) (b) 
(0) CH, 
+ = Il P 

O — CH,— CH; — NH; (0) i O — CH,— CHo ; CH, 
(0 CH; 


Ether 
linkage 
Platelet- 
activating factor 


Figure 8.8 (a) The structure of a l-alkyl 2-acyl- 
phosphatidylethanolamine (an ether 
glycerophospholipid). (b) The structure of 1-alkyl 
2-acetyl-phosphatidylcholine, also known as platelet- 
activating factor or PAF. The acetyl group is 
highlighted. 


(0) CH, 
-O —P—O—CH,CH,— N* CH, 
| | 
Choline plasmalogen O CH; 
| 


The ethanolamine 
plasmalogens have 

| l ethanolamine in 
AL ZO O place of choline. 


Figure 8.9 The structure and a space-filling model 
of a choline plasmalogen. 
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Human Biochemistry 


Platelet-Activating Factor: A Potent Ether Glycerophospholipid Mediator 


Platelet-activating factor (PAF) functions in platelet activation and 
aggregation in the heart, liver, bowel, pancreas, and lungs. It was 
first identified by its ability to cause platelet aggregation and dila- 
tion of blood vessels, but it is now known to be a potent mediator 
in inflammation, allergic responses, and shock. PAF effects are 
observed at tissue concentrations as low as 1071? M. PAF causes 
a dramatic inflammation of air passages and induces asthmalike 
symptoms in laboratory animals. Toxic shock syndrome occurs 
when fragments of destroyed bacteria act as toxins and induce the 


synthesis of PAF. PAF causes a drop in blood pressure and a re- 
duced volume of blood pumped by the heart, which leads to shock 
and, in severe cases, death. 

Beneficial effects have also been attributed to PAF. In repro- 
duction, PAF secreted by the fertilized egg is instrumental in the 
implantation of the egg in the uterine wall. PAF is produced in 
significant quantities in the lungs of the fetus late in pregnancy and 
may stimulate the production of fetal lung surfactant, a protein—lipid 
complex that prevents collapse of the lungs in a newborn infant. 


8.4 | What Are Sphingolipids, and How Are 
They Important for Higher Animals? 


Sphingolipids represent another class of lipids frequently found in biological membranes. 
An 18-carbon amino alcohol, sphingosine (Figure 8.10a), forms the backbone of these 
lipids rather than glycerol. Typically, a fatty acid is joined to a sphingosine via an amide 
linkage to form a ceramide (Figure 8.10b). 

Sphingomyelins represent a phosphorus-containing subclass of sphingolipids and are 
especially important in the nervous tissue of higher animals. A sphingomyelin is formed by 
the esterification of a phosphorylcholine or a phosphorylethanolamine to the 1-hydroxy 
group of a ceramide (Figure 8.10c). 

Glycosphingolipids are another class of ceramide-based lipids that, like the sphingo- 
myelins, are important components of muscle and nerve membranes in animals. They 
consist of a ceramide with one or more sugar residues in a B-glycosidic linkage at the 
1-hydroxyl moiety. The neutral glycosphingolipids contain only neutral (uncharged) 
sugar residues. When a single glucose or galactose is bound in this manner, the molecule 
is a cerebroside (Figure 8.10d). (Another class of lipids is formed when a sulfate is esteri- 
fied at the 3-position of the galactose to make a sulfatide.) Gangliosides (Figure 8.10e) 
are more complex glycosphingolipids that consist of a ceramide backbone with three or 
more sugars esterified, one of these being a sialic acid such as N-acetylneuraminic acid. 
These latter compounds are referred to as acidic glycosphingolipids, and they have a net 
negative charge at neutral pH. 

The glycosphingolipids have a number of important cellular functions, despite the 
fact that they are present only in small amounts in most membranes. Glycosphingolip- 
ids at cell surfaces appear to determine, at least in part, certain elements of tissue and 
organ specificity. Cell-cell recognition and tissue immunity depend on specific glyco- 
sphingolipids. Gangliosides are present in nerve endings and are important in nerve 
impulse transmission. A number of genetically transmitted diseases involve the accu- 
mulation of specific glycosphingolipids in the brain due to an absence of the enzymes 
needed for their degradation. Such is the case for ganglioside Gy? in the brains of Tay- 
Sachs disease victims, a rare but fatal childhood disease characterized by a red spot on 
the retina, gradual blindness, and self-mutilation. 


8.5 | What Are Waxes, and How Are They Used? 


Waxes are esters of long-chain alcohols with long-chain fatty acids. The resulting mol- 
ecule has a weakly polar head group (the ester moiety itself) and a long, nonpolar tail 
(the hydrocarbon chains) (Figure 8.11). Fatty acids found in waxes are usually saturated. 
The alcohols found in waxes may be saturated or unsaturated and may include sterols, 
such as cholesterol (see later section). Waxes are water insoluble due to their mostly 
hydrocarbon composition. Asa result, this class of molecules confers a water-repellant 
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Figure 8.10 Sphingolipids are based on the structure of sphingosine (a). The amino group (highlighted) is 
the site of amide linkage with fatty acids to form(b) ceramides (amide linkage highlighted). (c) Addition 
of a phosphocholine produces a choline sphingomyelin. (d) Addition of a sugar to a ceramide yields a 
cerebroside. (e) Adding three or more sugars to a ceramide core produces a ganglioside. Typical 
gangliosides are Gy), Gm2 and Gyy3, as indicated. 


character to animal skin, to the leaves of certain plants, and to bird feathers. The 
glossy surface of a polished apple results from a wax coating. Carnauba wax, obtained 
from the fronds of a species of palm tree in Brazil, is a particularly hard wax used for 
high-gloss finishes, such as in automobile wax, boat wax, floor wax, and shoe polish. 
Lanolin, a wool wax, is used as a base for pharmaceutical and cosmetic products 
because it is rapidly assimilated by human skin. The brand name Oil of Olay 
was coined by Graham Wulff, a South African chemist who developed it. The name refers 


c=o Ganglioside Gy; 
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(a) (b) 
(0) zO 
| o—c/ 
Rı—C—O— Rs 
General formula of a wax 
oO 
I 
CH3(CHg) 14— C — O — CHy— (CHg) 1g, — CH3 
Stearyl palmitate Oleoyl alcohol Stearic acid 
o / 
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(c) 


Figure 8.11 Waxes consist of long-chain alcohols 
esterified to long-chain fatty acids. (a) The general 
formula of a wax and the formulas of stearyl 
palmitate and triacontanol palmitate. Triacontanol 
palmitate is the principal component of beeswax. 
(b) The structure of oleoyl stearate. (c) Waxes are 
components of the waxy coating on the leaves of 
plants, such as jade plants (shown here). Such 
species typically contain dozens of different waxy 
esters. 


Oleoyl stearate 


yuris/Shutterstock.com 


A Deeper Look 


Novel Lipids with Valuable Properties 


Lipids from many natural sources are useful and highly prized 
materials for many purposes in our world. When oil from the head 
of the sperm whale is cooled, spermaceti, a translucent wax with a 
white, pearly luster, crystallizes from the mixture. Spermaceti, which 
makes up 11% of whale oil, is composed mainly of the wax cetyl 
palmitate 


CH3(CH2)14 —COO—(CH2)isCH3 


as well as smaller amounts of cetyl alcohol: 
HO—(CH)2)isCHs 


In the literary classic Moby Dick, Herman Melville describes 
Ishmael’s impressions of spermaceti, when he muses that the waxes 
“discharged all their opulence, like fully ripe grapes their wine; as I 
snuffed that uncontaminated aroma—titerally and truly, like the 
smell of spring violets.” 

Spermaceti and cetyl palmitate have been widely used in the mak- 
ing of cosmetics, fragrant soaps, and candles. In addition, the low 
viscosity of sperm whale oil made it an ideal lubricant for a variety of 


industrial applications until whale populations declined precipitously 
in the twentieth century and commercial whaling was outlawed. 
Since then, other lipids with similar properties have been sought as 
replacements for whale oil. One example is the lipids found in the 
shrub Euonymus alatus (also known as the “burning bush”), which 
makes large amounts of 3-acetyl-1,2-diacyl-sn-glycerols (see figure). 
These unusual glycerols, with two long acyl chains and one short 
(2-carbon) chain, possess a much lower viscosity than triglycerides 
with three long chains. As such, they are appropriate for use in 
biodiesel fuels. 


References 
Durrett, T. P., McClosky, D. D., Tumaney, A. W., Elzinga, D. A., 
Ohlrogge, J., and Pollard, M., 2010. A distinct DGAT with sn-3 
acetyltransferase activity that synthesizes unusual, reduced-viscosity oils in 
Euonymus and transgenic seeds. Proceedings of the National Academy of 
Sciences, USA 107:9464-9469. 
Melville, H., 1984. Moby Dick. London: Octopus Books, p. 205. (Adapted 
from Waddell, T. G., and Sanderlin, R. R., 1986. “Chemistry in Moby 
Dick.” Journal of Chemical Education 63:1019-1020.) 

(Continued) 
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to lanolin, a key ingredient that is a complex mixture of waxes with 33 different alco- 
hols esterified to 36 different fatty acids. 


8.6 | What Are Terpenes, and What Is Their Relevance 
to Biological Systems? 


The terpenes are a class of lipids formed from combinations of two or more molecules 
of 2-methyl-1,3-butadiene, better known as isoprene (a five-carbon unit that is abbrevi- 
ated C;). A monoterpene (Cio) consists of two isoprene units, a sesquiterpene (C45) con- 
sists of three isoprene units, a diterpene (C29) has four isoprene units, and so on. Isoprene 
units can be linked in terpenes to form straight-chain or cyclic molecules, and the usual 
method of linking isoprene units is head to tail (Figure 8.12). Monoterpenes occur in all 


Head-to-tail 
H CH, linkage 


OH Tail-to-tail 
linkage 


HC CH; 


Isoprene Geraniol 


Figure 8.12 The structure of isoprene (2-methyl-1,3-butadiene) and the structure of head-to-tail and 
tail-to-tail linkages. Isoprene itself can be formed by distillation of natural rubber, a linear head-to- 
tail polymer of isoprene units. 
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(a) MONOTERPENES 
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Figure 8.14 Dolichol phosphate is an initiation 
point for the synthesis of carbohydrate polymers in 
animals. The analogous alcohol in bacterial systems, 
undecaprenol, also known as bactoprenol, consists of 
11 isoprene units. Undecaprenyl phosphate delivers 
sugars from the cytoplasm for the synthesis of cell 
wall components such as peptidoglycans, 
lipopolysaccharides, and glycoproteins. 


Lanosterol 


higher plants, whereas sesquiterpenes and diterpenes are less widely known. Several 
examples of these classes of terpenes are shown in Figure 8.13. The triterpenes are C3 
terpenes and include squalene and lanosterol, two of the precursors of cholesterol and 
other steroids (discussed later). Tetraterpenes (C49) are less common but include the 
carotenoids, a class of colorful photosynthetic pigments. B-Carotene is the precursor of 
vitamin A, whereas lycopene, similar to B-carotene, is a pigment found in tomatoes. 

Long-chain polyisoprenoid (also known as polyprenyl) molecules with a terminal 
alcohol moiety are called polyprenols. The dolichols, one class of polyprenols (Figure 8.14), 
consist of 16 to 22 isoprene units and, in the form of dolichyl phosphates, function 
to carry carbohydrate units in the biosynthesis of glycoproteins in animals. Polyprenyl 
groups serve to anchor certain proteins to biological membranes. 
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Figure 8.13 Structures of common terpenes. Highlighting in two cases shows the isoprene building 
blocks of the molecule. (a) Many monoterpenes are readily recognized by their characteristic flavors 
or odors (limonene in lemons; citronellal in roses, geraniums, and some perfumes; and menthol from 
peppermint, used in cough drops and nasal inhalers). (b) Bisabolene and eudesmol are sesquiterpenes. 
(c) The diterpenes, which are C2 terpenes, include retinal (the essential light-absorbing pigment in 
rhodopsin, the photoreceptor protein of the eye), and phytol (a constituent of chlorophyll). (d) The 
triterpene lanosterol is a constituent of wool fat. (e) Lycopene is a carotenoid found in ripe fruit, 
especially tomatoes. 
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A Deeper Look 


Why Do Plants Emit lsoprene? 


The Blue Ridge Mountains of Virginia are so named for the 
misty blue vapor or haze that hangs over them through much 
of the summer season. This haze is composed in part of iso- 
prene that is produced and emitted by the plants and trees of 
the mountains. Global emission of isoprene from vegetation 
is estimated at 3 X 10" g/yr. Plants frequently emit as much 
as 15% of the carbon fixed in photosynthesis as isoprene, and 
Thomas Sharkey, a botanist at the University of Wisconsin, 
has shown that the kudzu plant can emit as much as 67% of 
its fixed carbon as isoprene as the result of water stress. Why 
should plants and trees emit large amounts of isoprene and other 
hydrocarbons? Sharkey has shown that an isoprene atmosphere 
or “blanket” can protect leaves from irreversible damage in- 
duced by high (summerlike) temperatures. He hypothesizes that 
isoprene in the air around plants dissolves into leaf-cell mem- 
branes, altering the lipid bilayer and/or lipid-protein and protein- 
protein interactions within the membrane to increase thermal 
tolerance. Blue Ridge Mountains 


Jo Crebbin/Shutterstock.com 


Human Biochemistry 


Coumadin or Warfarin—Agent of Life or Death 


The isoprene-derived molecule whose structure is shown here is o o 
known alternately as Coumadin and warfarin. Warfarin (as it was O 
first named) was promoted as a rodent poison. Later, it was mar- Ane | 


keted and prescribed as the anticoagulant coumadin, a tradename a 
that distinguished it from its original usage. How can the same 
chemical species be used for such disparate purposes? The key to 
both uses lies in its ability to act as an antagonist of vitamin K in 


the body. Warfarin (Coumadin) 

Vitamin K is necessary for the carboxylation of glutamate Glu y-carboxy-Glu 
residues on certain proteins, including some proteins in the blood- O 
clotting cascade (including prothrombin, factor VII, factor IX, and = oml 
factor X, which undergo a Ca**-dependent conformational change l 
in the course of their biological activity as well as protein C and H | 
protein S, two regulatory proteins in coagulation). Carboxylation of O CH: O 


these coagulation factors is catalyzed by a carboxylase that requires 
the reduced form of vitamin K (vitamin KH;), molecular oxygen, 
and carbon dioxide. Oxidation and recycling of vitamin K is shown 
in the figure. KH; is oxidized to vitamin K epoxide, which is recycled 


to KH, by the enzymes (1) vitamin K epoxide reductase and (2,3) CO» 

vitamin K reductase. Coumadin/warfarin exerts its anticoagulant Warfarin 

effect by inhibiting vitamin K epoxide reductase and possibly also resistant 

vitamin K reductase. This inhibition depletes vitamin KH, and K ~~; *” KH, Oxidation and recycling 
reduces the activity of the carboxylase. of vitamin K 


Coumadin/warfarin, given at a typical dosage of 4 mg/day to 
5 mg/day, prevents the deleterious formation in the bloodstream . 
of small blood clots and thus reduces the risk of heart attacks and 
strokes for individuals whose arteries contain sclerotic plaques. 
Taken in much larger doses, as for example in rodent poisons, 
Coumadin/warfarin can cause massive hemorrhages and death. 
Warfarin 


inhibits 
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Figure 8.15 The structure of caldarchaeol, an isoprene-based lipid found in Archaea. Glycerol groups 
are highlighted. 


The Membranes of Archaea Are Rich in Isoprene-Based Lipids Archaea are ubiquitous, 
but they were first discovered in harsh environments. Some thrive in the high tempera- 
tures of geysers and ocean steam vents, whereas others are found in extremely acidic, 
cold, or salty environments. Archaea also live in extremes of pH in the digestive tracts 
of cows, termites, and humans. Many archaea are well adapted to harsh environments, 
and one such adaptation is found in their cell membranes, which contain isoprene-based 
lipids (Figure 8.15). These isoprene chains are linked at both ends by ether bonds to 
glycerols. Ether bonds are more stable to hydrolysis than the ester linkages of glycero- 
phospholipids (Figure 8.6). With a length twice that of typical glycerophospholipids, 
these molecules can completely span a cell membrane, providing additional stability. 
Interestingly, the glycerols in archaeal lipids are in the (R) configuration, whereas glyc- 
erolipids of animals, plants, and eubacteria are almost always in the (S) configuration. 


8.7 | What Are Steroids, and What Are 
| Their Cellular Functions? 


A large and important class of terpene-based lipids is the steroids. This molecular 
family, whose members affect an amazing array of cellular functions, is based on a 
common structural motif of three 6-membered rings and one 5-membered ring all 
fused together. 


8.7a Cholesterol 


Cholesterol (Figure 8.16) is the most common steroid in animals and the precursor for 
all other animal steroids. The numbering system for cholesterol applies to all such 
molecules. Many steroids contain methyl groups at positions 10 and 13 and an 8- to 
10-carbon alkyl side chain at position 17. The polyprenyl nature of this compound is 
particularly evident in the side chain. Many steroids contain an oxygen at C-3, either a 
hydroxyl group in sterols or a carbonyl group in other steroids. The carbons at posi- 
tions 10 and 13 and the alkyl group at position 17 are nearly always oriented on the 
same side of the steroid nucleus, the B-orientation. Alkyl groups that extend from the 
other side of the steroid backbone are in an a-orientation. 

Cholesterol is a principal component of animal cell plasma membranes, and smaller 
amounts of cholesterol are found in the membranes of intracellular organelles. 
The relatively rigid fused ring system of cholesterol and the weakly polar alcohol group 
at the C-3 position have important consequences for the properties of plasma mem- 
branes. Cholesterol is also a component of lipoprotein complexes in the blood, and it is 
one of the constituents of plaques that form on arterial walls in atherosclerosis. 


8.7b Steroid Hormones Are Derived from Cholesterol 


Steroids derived from cholesterol in animals include five families of hormones (the 
androgens, estrogens, progestins, glucocorticoids, and mineralocorticoids) and bile 
acids (Figure 8.17). Androgens such as testosterone and estrogens such as estradiol 
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Cholesterol 


Figure 8.16 The structure of cholesterol, shown with steroid ring designations and carbon numbering. 
When cholesterol is viewed as shown here, the B-orientation is above the page. A space-filling model 
of cholesterol is shown at right. 
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Figure 8.17 The structures of several important sterols derived from cholesterol. 


mediate the development of sexual characteristics and sexual function in animals. 
The progestins such as progesterone participate in control of the menstrual cycle and 
pregnancy. Glucocorticoids (cortisol, for example) participate in the control of carbo- 
hydrate, protein, and lipid metabolism, whereas the mineralocorticoids regulate salt 
(Na*, K*, and Cl) balances in tissues. The bile acids (including cholic and deoxycholic 
acid) are detergent molecules secreted in bile from the gallbladder that assist in the 
absorption of dietary lipids in the intestine. 
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Human Biochemistry 


Plant Sterols and Stanols—Natural Cholesterol Fighters 


Dietary guidelines for optimal health call for reducing the cho- 
lesterol intake. Plant sterols and stanols are a healthier substitute 
for cholesterol. Despite their structural similarity to cholesterol, 
minor isomeric differences and the presence of methyl and ethyl 
groups in the side chains of these substances result in their poor 
absorption by intestinal mucosal cells. Interestingly, stanols are 
even less well absorbed than their sterol counterparts. (Stanols 
are fully reduced sterols that is, with no double bonds). Both 
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sterols and stanols bind to cholesterol receptors on intestinal cells 
and block the absorption of cholesterol itself. Stanols esterified 
with long-chain fatty acids form micelles that are more effectively 
distributed in the fat phase of digested food and provide the 
most effective blockage of cholesterol uptake. Raisio Group, a 
Finnish company, has developed Benecol, a stanol ester spread 
that can lower LDL cholesterol by up to 14% if consumed daily 
(see graph). 


B-Sitostanol 


Serum cholesterol levels before and after the consumption of margarine with and without sitostanol ester for 12 months. 
Green circles: 0 g/day. Red squares: 2.6 g/day. Blue triangles: 1.8 g/day. Note: The y-axis begins at 200 mg cholesterol/dL. 
(Adapted from Miettinen, T. A., et al., 1995. Reduction of serum cholesterol with sitostanol-ester margarine in a mildly 
hypercholesterolemic population. New England Journal of Medicine 333:1308—1312.) 


8.8 


How Do Lipid Metabolites Act 


as Biological Signals? 


Glycerophospholipids and sphingolipids play important structural roles as the princi- 
pal components of biological membranes. However, their modification and break- 
down also produce an eclectic assortment of substances that act as powerful chemical 
signals (Figures 8.18 and 8.19). In contrast to the steroid hormones (Figure 8.17), which 
travel from tissue to tissue in the blood to exert their effects, these lipid metabolites act 
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Human Biochemistry 


Beating COVID-19: Lipid Nanoparticles Make RNA Vaccines Practical and Effective 


RNA-based vaccines for protection against infectious diseases like 
the coronavirus that causes COVID-19 depend on novel ionizable 
lipid nanoparticles (LNPs). Messenger RNA is sensitive and eas- 
ily degraded by enzymes in the environment and in human tissue. 
LNPs package the messenger RNA of these vaccines and protect 
it on its perilous journey from hypodermic needle to human lymph 
node cells. The four critical components of the LNPs are shown in 
the figure: A special ionizable lipid, cholesterol, a glycerophospho- 
lipid (distearoylphosphatidylcholine - DSPC), and a diacylglycerol 
linked to polyethyleneglycol (PEG). The PEG controls particle size 
and improves stability of the particles. The cholesterol and glycero- 
phospholipid are termed helper lipids and form the impermeable 
outer membrane of the LNP. 

The ionizable lipid has a pK, near 6.5 and is uncharged upon in- 
jection of the vaccine. When the LNPs encounter lymph node cells, 
they are taken up in endosomes, whose pH decreases to about 4.5, 
protonating the ionizable lipid. The acquired positive charge enables 
the lipid to encapsulate the negatively-charged RNA molecules and 
facilitates interaction with the endosome membrane to release the 
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The Pfizer-BioNTech and Moderna vaccines consist of lipid nanoparticles like the one 
at upper right. The ionizable lipids of the Pfizer-BioNTech (ALC-0315) and Moderna 
(SM-102) vaccines are shown. The highlighted nitrogens have pK, values of about 6.5. 
Uptake by endosomes protonates at the N to produce the ionized form of the molecule. 


locally, either within the cell in which they are made or on nearby cells. Signal mole- 
cules typically initiate a cascade of reactions with multiple possible effects, and the 
lifetimes of these signals in or near a cell are usually very short. Thus, the creation and 
breakdown of signal molecules is almost always carefully timed and regulated. 
Enzymes known as phospholipases hydrolyze the ester bonds of glycerophospho- 
lipids, as shown in Figure 8.18. Phospholipases A; and A, remove fatty acid chains 
from the 1- and 2-positions of glycerophospholipids, respectively. Phospholipases C 
and D attack the polar head group of a glycerophospholipid. Hydrolysis of inositol 
phospholipids by phospholipase C produces a diacylglycerol and inositol-1,4,5- 
trisphosphate (IP3) (Figure 8.19), two signal molecules whose combined actions 
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Figure 8.18 Phospholipases are enzymes that cleave and degrade glycerophospholipids. Phospholipases 
are components of the venoms of many poisonous snakes. The pain and physiological consequences of a 
snake bite result partly from breakdown of cell membranes by phospholipases. (a) Phospholipases A, 
and A; cleave fatty acids from a glycerophospholipid, producing lysophospholipids. Phospholipases 


C and D hydrolyze on either side of the phosphate in the polar head group. (b) Diamondback 
rattlesnake. (c) Indian cobra. 
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Figure 8.19 Modification and breakdown of glycerophospholipids produce a variety of signals 
and regulatory effects. (a) Phospholipase A, cleaves a fatty acid from phosphatidic acid to produce 
lysophosphatidic acid (LPA), which can act as an extracellular signal. If the fatty acid released 

is arachidonic acid, it can be the substrate for synthesis of prostaglandins, thromboxanes, and 
leukotrienes. (b) Phospholipase C action on phosphatidylinositol-4,5-bisphosphate produces 
diacylglycerol and inositol-1,4,5-trisphosphate, two signal molecules that work together to 

activate protein kinases—enzymes that phosphorylate other proteins in signaling pathways. 


Human Biochemistry 


The Endocannabinoid Signaling System: A Target for Next-Generation Therapeutics 


Chemical signals called neurotransmitters are the fundamental mode 
of intercellular communication in the nervous system. The classic 
neurotransmitters are small, soluble molecules derived from amino 
acids (or are amino acids themselves). However, recent research has 
shown that lipids known as endocannabinoids are chemical mes- 
sengers that operate by a novel mechanism. This class of chemical 


signals came to light originally from studies of the physiological 
effects of A?-tetrahydrocannabinol and other metabolites from Can- 
nabis sativa, better known as marijuana. Early research showed that 
the effects of Cannabis involved binding of its metabolites to spe- 
cific receptor proteins in the brain. Further research demonstrated 
that these receptors are normally activated by endogenous lipid 


(Continued) 
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molecules. Two of these are anandamide and 2-arachidonoylglycerol 
(2AG). The biosyntheses of these signal molecules are shown in the 
figure. Biosynthesis of anandamide from N-arachidonoylphosphati- 
dylethanolamine (NAPE) appears to proceed by at least three differ- 
ent routes in neurons. 

Endocannabinoids are termed retrograde messengers because 
they traverse the synaptic cleft in the opposite direction of classical 
neurotransmitters. Activated endocannabinoid receptors can act to 
either inhibit or activate neuronal circuits, depending on the type of 
neurons involved. Research on the physical and functional relation- 
ships between endocannabinoid receptors and the proteins with 
which they interact reveals novel targets for drug discovery. 
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Human Biochemistry 


Fingolimod—a Sphingosine-1-P Mimic Is an Oral Drug for Multiple Sclerosis 


Multiple sclerosis (MS) is a chronic disabling autoimmune dis- 
order. In MS, the body’s natural defense system mistakenly at- 
tacks myelin, the lipid-rich sheath that protects the nerves. About 
400,000 U.S. citizens have multiple sclerosis. Earlier treatments for 
MS were all injectable medications, until fingolimod was approved 
by the USFDA for relapsing forms of multiple sclerosis. It was the 
first oral disease-modifying therapy to be approved for multiple 
sclerosis in the United States. 

In multiple sclerosis, lymph nodes release certain types of lym- 
phocytes (themselves a type of white blood cell), which migrate 
to and invade the central nervous system, killing neural cells. 
Lymphocyte release from the lymph nodes is induced by binding 
of sphingosine-1-phosphate (S1P) to receptors on the lymphocyte 
surface. Fingolimod, an analog of sphingosine, is phosphorylated 
by sphingosine kinase enzymes, forming (S)-fingolimod-phosphate 
(FIP). Binding of FIP to lymphocyte SIP receptors prevents 
these lymphocytes from migrating to target tissues, averting further 
lymphocyte-induced neural cell destruction. 


Fingolimod 
Sphingosine-1-P Sphingosine 
phosphatases kinases 


(S)-Fingolimod-phosphate 


trigger signaling cascades that regulate many cell processes. Action of phospholipase 
A» on a phosphatidic acid releases a fatty acid and a lysophosphatidic acid (LPA, 
Figure 8.19). If the fatty acid is arachidonic acid, further chemical modifications can 
produce a family of 20-carbon compounds—that is, eicosanoids. The eicosanoids are 
local hormones produced as a response to injury and inflammation. They exert their 
effects on cells near their sites of synthesis. LPA produced outside the cell is a signal 
that can bind to receptor proteins on nearby cells, thereby regulating a host of pro- 
cesses, including brain development, cell proliferation and survival, and olfaction (the 
sense of smell). 

The venoms of poisonous snakes contain (among other things) a class of enzymes 
known as phospholipases, enzymes that cause the breakdown of phospholipids. For ex- 
ample, the venoms of the eastern diamondback rattlesnake (Crotalus adamanteus) and 
the Indian cobra (Naja naja) both contain phospholipase A», which catalyzes the hydro- 
lysis of fatty acids at the C-2 position of glycerophospholipids (Figure 8.18). The phos- 
pholipid breakdown product of this reaction, /ysolecithin, acts as a detergent and 
dissolves the membranes of red blood cells, causing them to rupture. Indian cobras kill 
several thousand people each year. 

Sphingolipids can also be modified or broken down to produce chemical signals. 
Sphingosine itself can be phosphorylated to produce sphingosine-1-phosphate (S1P) in- 
side cells (Figure 8.20). SIP may either exert a variety of intracellular effects or may be 
secreted from the cell, to bind membrane receptor proteins, either on adjacent cells or 
on the cell from which the S1P was released. Secreted S1P binds to at least five different 
receptor proteins and provokes many different cell and tissue effects, among them in- 
flammation in allergic reactions, cell proliferation, angiogenesis, malignant transforma- 
tion, movement, and migration of certain cells. Sphingolipid signal molecules are 
carefully balanced and regulated in organisms, and chemical agents that disturb this 
balance can be highly toxic. For example, fumonisin is a common fungal contaminant of 
corn and corn-based products that inhibits sphingolipid biosynthesis (Figure 8.20). 
Fumonisin can trigger esophageal cancer in humans and /eucoencephalomalacia, a fatal 
neurological disease in horses. 
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8.9 | What Can Lipidomics Tell Us about Cell, Tissue, 


and Organ Physiology? 


The crucial role of lipids in cells is demonstrated by the large number of human diseases 
that involve the disruption of lipid metabolic enzymes and pathways. Examples of such 
diseases include atherosclerosis, diabetes, cancer, infectious diseases, and neurodegen- 
erative diseases. A new approach - lipidomics - allows scientists to perform global anal- 
yses of lipids and their interacting protein partners in organs, cells, and organelles. A 
typical cell may contain more than a thousand different lipids, each with a polar head 
and a hydrophobic tail or tails. The LIPID MAPS Structure Database (LMSD) at 
http://www. lipidmaps. org/datalstructurelindex.html, annotates more than 40,000 bio- 
logically relevant lipids. There is an increasing awareness of the critical importance of 


lipids in all aspects of life. 
Interestingly, lipid compositions vary in many ways: 


e Between species 
e Between individuals within a given species 
e Within an individual from day to day 


S1P 


transporter B 


Figure 8.20 Signaling by sphingosine-1-phosphate 
(SIP). (a) Sphingosine is derived from ceramides and 
is phosphorylated to give sphingosine-1-phosphate 
(SIP). (b) After transport out of the cell. SIP can 
bind to at least five different receptor proteins. 
Activation of each of these receptors elicits different 
intracellular responses. Fingolimod-1-phosphate (see 
Human Biochemistry box in this section) binds to 
and activates the same receptors, with the exception 
of SIP2. 
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e Between tissues within an organism 

e Between cell types 

e Between cellular compartments 

e Between regions of a cellular compartment 
e Between parts of a membrane 


Complete understanding of lipid function, as well as alteration of such function in 
disease states, will require the determination of which lipids are present and in what 
concentrations in every intracellular location. The same knowledge will be needed 
about each lipid’s interaction partners. Mass spectrometric analyses of rat heart mus- 
cle lipids reveal that the onset of diabetes results in dramatic changes in triglyceride 
levels, an increase in phosphatidylinositol levels, and a decrease in phosphatidyletha- 
nolamine. On the other hand, mass spectrometric analyses of brain white matter lipids 
in the very early stages of Alzheimer’s disease show a dramatic decrease in one type of 
plasmalogen and a threefold increase in ceramide levels. Three-dimensional mass spec- 
tral imaging can provide three-dimensional maps of individual lipid species in biologi- 
cal tissues and organs (Figure 8.21). 

Cellular lipidomics provides a framework for understanding the myriad roles of 
lipids, which include (but are not limited to) membrane transport, metabolic regulation, 
and cell signaling. For example, six different classes of lipids have been shown to 
modulate systems important in the regulation of pain responses. Each of these classes 
of lipids exerts its action by interacting with one or more receptor proteins. True under- 
standing of the molecular basis for diseases and physiologic conditions may require 
comprehensive and simultaneous analyses of many lipid species and their respective 
receptors. 
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Figure 8.21 Identification and three-dimensional mapping of lipids in the crab (Cancer borealis) 
brain. (a) Representative mass spectrometry (MS) spectrum of lipids from crab brain slice with the 
masses and identities of each peak labeled; (b) Comparison of three-dimensional distribution of two 
different species of lipids PC 36:1 (upper) and SM 16:0 (lower). (Adapted from Chen, R., Hui, L., 
Sturm, R. M., and Li, L., 2009. Three dimensional mapping of neuropeptides and lipids in crustacean 
brain by mass spectral imaging. J. Amer. Soc. Mass Spectrom. 20:1068—1077.) 
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Problems 


Foundational Biochemistry Things You Should Know after Reading Chapter 8. 


Lipids found in biological systems are either hydrophobic (containing 
only non-polar groups) or amphipathic (possessing both polar and 
nonpolar groups). 

Fatty acids are composed of a long hydrocarbon chain (the tail) and 
a terminal carboxyl group (the head). 

Fatty acids may be saturated (all C-C bonds are single bonds) or un- 
saturated (with one or more double bonds in the hydrocarbon chain). 
The double bonds of nearly all fatty acids in nature are cis double 
bonds. 

Triacylglycerols (also called triglycerides) consist of a glycerol esteri- 
fied with three fatty acids. 

Triacylglycerols are a major energy reserve and the principal neutral 
derivatives of glycerol found in animals. 

A glycerophospholipid is a 1,2-diacylglycerol that has a phosphate 
group esterified at carbon C-3 of the glycerol backbone. 

Glycerol is a prochiral molecule and, according to the stereospecific 
numbering convention, the pro-S carbon of glycerol is designated as 
C-l. 

A variety of polar groups may be esterified to the phosphate of a glyc- 
erophospholipid, including choline, ethanolamine, glycerol, serine, 
inositol, and even another phosphatide (in diphosphatidylglycerol, 
also known as cardiolipin). 


Platelet activating factor (PAF) and the plasmalogens are ether glyc- 
erophospholipids. 

Sphingolipids are based on the 18-carbon alcohol, sphingosine. A 
fatty acid in amide linkage to sphingosine yields a ceramide, and 
a ceramide with one or more sugars in B-glycosidic linkage at the 
1-hydroxyl moiety of sphingosine is a glycosphingolipid. 

Common sphingolipids include sphingomyelins, cerebrosides, and 
gangliosides. 

Terpenes are formed from combinations of two or more molecules of 
2-methyl-1,3-butadiene, also known as isoprene. Terpene-based lipids 
include monoterpenes, sesquiterpenes, diterpenes, and triterpenes 
(including squalene, lanosterol, and cholesterol). 

Steroids are based on a common structural motif of three 6-membered 
rings and one 5-membered ring. 

Cholesterol is a principal component of animal cell plasma mem- 
branes. Cholesterol is the precursor of the androgens (such as 
testosterone), estrogens (such as estradiol), and the glucocorticoids 
(for example, cortisol), the mineralocorticoids, and the bile acids. 
The action of phospholipases produces derivatives of glycerophos- 
pholipids and sphingolipids that can act as biological signals. 
Lipidomics is the global analysis of lipids and their interacting pro- 
tein partners in organs, cell, and organelles. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


Drawing Structures of Triacylglycerols Draw the structures of (a) all 
the possible triacylglycerols that can be formed from glycerol with 
stearic and arachidonic acid and (b) all the phosphatidylserine 
isomers that can be formed from palmitic and linolenic acids. 


. Distinguishing Structures of Glycerolipids, Steroids, and Sphingolipids 


Describe in your own words the structural features of 


a. aceramide and how it differs from a cerebroside. 


b. a phosphatidylethanolamine and how it differs from a phospha- 
tidylcholine. 


c. anether glycerophospholipid and how it differs from a 
plasmalogen. 


d. a ganglioside and how it differs from a cerebroside. 
e. testosterone and how it differs from estradiol. 


. Identifying Glycerophospholipids From your memory of the 


structures, name the glycerophospholipids that 
a. carry a net positive charge at pH 7. 

b. carry a net negative charge at pH 7. 

c. have zero net charge at pH 7. 


. Properties of Phospholipids Glycerophospholipids (shown in 


Figure 8.6.) are the major components of cell membranes. Just like 
phosphates (POx), sulfates (SO4) can make diester forms of molecules. 
Imagine a class of molecules known as glycerosulfolipids: An 
imaginary class of lipids in which the phosphate group of the 
glycerophospholipids is replaced with a sulfate group. Discuss a 
critical difference that would exist between the glycerophospholipids 
and the imaginary glycerosulfolipids. 


. Comparing Health Effects of Cholesterol and Plant Sterols Compare 


and contrast two individuals, one whose diet consists largely of meats 
containing high levels of cholesterol and the other whose diet is rich 


10. 


Tl. 


in plant sterols. Are their risks of cardiovascular disease likely to be 
similar or different? Explain your reasoning. 


Evaluating Organic Secretions of Trees James G. Watt, Secretary of 
the Interior (1981-1983) in Ronald Reagan’s first term, provoked 
substantial controversy by stating publicly that trees cause significant 
amounts of air pollution. Based on your reading of this chapter, 
evaluate Watt’s remarks. 


Evaluating Nutritional Benefits of Foods In a departure from his usual 
and highly popular westerns, author Louis L’Amour wrote a novel in 
1987, Last of the Breed (Bantam Press), in which a military pilot with 
Indigenous heritage is shot down over the former Soviet Union and is 
forced to use the survival skills of his ancestral culture to escape his 
enemies. On the rare occasions when he is able to trap and kill an 
animal for food, he selectively eats the fat, not the meat. Based on 
your reading of this chapter, what is his reasoning for doing so? 


Exploring the Action of Phospholipase A, As you read Section 8.7, 
you might have noticed that phospholipase A>, the enzyme found in 
rattlesnake venom, is also the enzyme that produces essential and 
beneficial lipid signals in most organisms. Explain the differing 
actions of phospholipase A, in these processes. 


Evaluating the Action of Warfarin Pet owners often worry about 
unintended ingestion of rodent poisons. Visit a grocery store near 
you, stop by the rodent poison section, and examine a container of 
warfarin or a related product. From what you can glean from the 
packaging, how much warfarin would a typical dog (40 Ibs) have to 
consume to risk hemorrhages and/or death? 


Understanding Isoprene Structures Refer to Figure 8.13 and draw 
each of the structures shown and try to identify the isoprene units in 
each of the molecules. (Note that there may be more than one correct 
answer for some of these molecules.) 


Evaluating the Caloric Content of Stored Fat (Integrates with Chapter 3.) 
As noted in the Deeper Look box on polar bears, a polar bear may 
burn as much as 1.5 kg of fat resources per day. What weight of seal 
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13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


Chapter 8 Lipids 


blubber would you have to ingest if you were to obtain all your 
calories from this energy source? 


Omega-3 Fatty Acids Omega-3 (or w-3) fatty acids are a class of 
polyunsaturated fatty acids that have a double bond at the third 
carbon counting from the last (omega) carbon of the fatty acid chain. 
Identify the omega-3 fatty acid or acids in Table 8.1. 


Assessing the Fat Composition of Desserts If you are still at the 
grocery store working on problem 9, stop by the cookie shelves and 
choose your three favorite cookies from the shelves. Estimate how 
many calories of fat and how many other calories from other sources 
are contained in 100 g of each of these cookies. Survey the ingredients 
listed on each package, and describe the contents of the package in 
terms of (a) saturated fat, (b) cholesterol, and (c) trans fatty acids. 
(Note that food makers are required to list ingredients in order of 
decreasing amounts in each package.) 


Evaluating the Structures of Sterols Describe all of the structural 
differences between cholesterol and stigmasterol. 


Understanding the Functions of Steroid Hormones Describe the 
functions of androgens, glucocorticoids, and mineralocorticoids. 


Understanding Structures and Properties of Terpenes Look through 
your refrigerator, your medicine cabinet, and your cleaning solutions 
shelf or cabinet, and find at least three commercial products that 
contain fragrant monoterpenes. Identify each one by its scent and 
then draw its structure. 


Terpenes in Lipid Structures Lanosterol (see Figure 8.13d) is an 
important precursor in the biosynthesis of cholesterol. Identify the 
isoprene units (see Figure 8.12) in lanosterol. How many are 
present? 


Understanding the Chemistry of Soaps Our ancestors kept clean 
with homemade soap (see Section 8.2), often called lye soap. Go to 
http://www. wikihow. com/ Make- Your-Own-Soap and read the 
procedure for making lye soap from vegetable oils and lye (sodium 
hydroxide). What chemical process occurs in the making of lye 
soap? Draw reactions to explain. How does this soap work as a 
cleaner? 


Understanding the Chemistry of Common Foods Mayonnaise is 
mostly vegetable oil and vinegar. So what’s the essential difference 
between oil and vinegar salad dressing and mayonnaise? Learn for 
yourself: Combine a half cup of pure vegetable oil (olive oil will 
work) with two tablespoons of vinegar in a bottle, cap it securely, 
and shake the mixture vigorously. What do you see? Now let the 
mixture sit undisturbed for an hour. What do you see now? Add 
one egg yolk to the mixture, and shake vigorously again. Let the 
mixture stand as before. What do you see after an hour? After two 
hours? Egg yolk is rich in phosphatidylcholine. Explain why the egg 
yolk caused the effect you observed. 


Ether or Ester Linkages in Lipids Ether glycerophospholipids contain 
an ether bond between glycerol and an acyl chain. From a chemical or 
biochemical point of view, what is the benefit of having an ether bond 
rather than an ester bond between the glycerol and acyl chains? 


Evaluating the Physiological Benefits of Stanol Esters The cholesterol- 
lowering benefit of stanol-ester margarine is only achieved after 
months of consumption of stanol esters (see graph in the Human 
Biochemistry box “Plant Sterols and Stanols—Natural Cholesterol 
Fighters” in Section 8.7). Suggest why this might be so. Suppose 
dietary sources represent approximately 25% of total serum choles- 
terol. Based on the data in the graph, how effective are stanol esters at 
preventing uptake of dietary cholesterol? 


Comparing Therapies for Cholesterol Lowering Statins are cholesterol- 
lowering drugs that block cholesterol synthesis in the human liver (see 
the Human Biochemistry box “Statins Lower Serum Cholesterol” in 


23. 


24. 


25. 


26. 


27. 


28. 


Section 24.4). Would you expect the beneficial effects of stanol esters 
and statins to be duplicative or additive? Explain. 


Assessing the Sterol and Stanol Content of Food Products If most 
plant-derived food products contain plant sterols and stanols, would 
it be as effective for cholesterol-lowering purposes to simply incorpo- 
rate plant fats in one’s diet as to use a sterol- or stanol-fortified spread 
like Benecol? Consult a suitable reference (for example, http: /llpi 
.oregonstate. edulinfocenter/phytochemicals/sterols/#sources at the 

Linus Pauling Institute) to compose your answer. 


Evaluating the History and Culture of Anabolic Steroids Tetrahydroges- 
trinone is an anabolic steroid. It was banned by the USFDA in 2003, 
but it has been used illegally since then by athletes to increase muscle 
mass and strength. Nicknamed “The Clear,” it has received considerable 
attention in high-profile steroid-abuse cases among athletes such as 
baseball player Barry Bonds and track star Marion Jones. Use your 
favorite web search engine to learn more about this illicit drug. How is it 
synthesized? Who is the father of prohormones who first synthesized it? 
Why did so many prominent athletes use The Clear (and its relative 
“The Cream”) when less expensive and more commonly available 
anabolic steroids are in common use? (Hint: There are at least two 
answers to this last question.) 


Tetrahydrogestrinone 


Make a list of the advantages polar bears enjoy from their nonpolar diet. 
Why wouldn’t juvenile polar bears thrive on an exclusively nonpolar diet? 


Snake venom phospholipase A, causes death by generating membrane- 
soluble anionic fragments from glycerophospholipids. Predict the fatal 
effects of such molecules on membrane proteins and lipids. 


Testosterone, the principal male sex steroid hormone, is synthesized in 
five steps from cholesterol. In the last step, five isozymes catalyze the 
17b-hydroxysteroid dehydrogenase reaction that interconverts 
4-androstenedione and testosterone. Defects in the synthesis or action of 
testosterone can impair the development of the male phenotype during 
embryogenesis and cause the disorders of human sexuality termed male 
pseudohermaphroditism. Consult a suitable reference (for example, 
Anderson, S., Russell, D. W., and Wilson, J. D., 1996. 17b-Hydroxysteroid 
dehydrogenase 3 deficiency. Trends in Endocrinology and Metabolism 
7:121-126; or Yang, S.-Y., He, X.-Y., and Miller, D., 2011. Hydroxysteroid 
(178) dehydrogenase X in human health and disease. Mol. Cell 
Endocrinol. 343:1—6) to learn more about the consequences of alterations 
of steroid biosynthesis. How are these alterations involved in Alzheimer’s 
disease and Down’s syndrome? 


Phosphatidylinositol is sometimes mistakenly taken to be a 
glycophospholipid—specifically a phosphatidyl glycoside. However, the 
head group of phosphatidylinositol is a poly-hydroxy cyclohexane, not a 
monosaccharide. On the other hand, a true glycophospholipid, 
namely phosphatidyl-B-d-glucopyranoside (or simply phosphatidylglu- 
coside), has been identified in several cell types, including HL-60 cells 

(a line of human leukemia cells used frequently for studies of blood cell 
formation). In these cells, phosphatidylglucoside plays a role in the 
formation of plasma membrane signaling microdomains involved in 
cellular differentiation and maturation. Consult a suitable reference (for 
example, Greimel, P., and Ito, Y., 2008. First synthesis of natural 
phosphtidyl-6-D-glucoside. Tetrahedron Letters 49:3562—3566.) to learn 
more about its role in glial cell development and differentiation. 
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29. A signaling pathway begins with a particular membrane glycerophos- 

pholipid and produces several signal molecules by enzymatic reactions. 

a. The first enzyme reaction cleaves this beginning glycerophospho- 
lipid to produce a phosphatidic acid and a phosphatidylglycerol. 
Name the enzyme that carried out this reaction. 

b. Reaction of the phosphatidylglycerol with an enzyme produces 
glycerol and another molecule of phosphatidic acid. Name the 
enzyme that carried out this reaction. 


c. The phosphatidic acids produced in the two reactions described 
above turned out to be identical. They were treated with another 
enzyme to yield arachidonic acid and a second molecule. Name 
the enzyme that carried out this reaction. 


d. Name the second molecule produced in this third reaction. 


30. 2-a-linolenoylglycerol-3-phosphate may be produced by a series of two 
reactions from a phosphatidylcholine that contains linoleic acid. The first 
reaction removes a fatty acid from the PC. Draw and name the structures 
of the starting phospholipid and all the products produced in these two 
reactions. Name both enzymes. 


31. Acknowledging the threat to human health of trans fats, the USFDA has 
banned the addition of partially hydrogenated oils (PHOs) in all foods 
produced and sold in the U.S. However, there are natural sources of 
trans fats in various foods, particularly milk, cheese, and meats from 
ruminant animals. What are the sources of these trans fats in these foods? 
What are the levels of trans fats in these foods? The USFDA actually 
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allows food manufacturers to label their products as having “Zero grams 
trans fat per serving”, even when these foods contain trans fats. How is 
this possible? Consult the USFDA guidelines to learn more and explain 
this apparent inconsistency. 


Think-Pair-Share question 


32. ATP Content from Fatty Acids in Food Items In a fatty acid, every 
pair of carbons (-CH3-CH,-) can generate about 13.5 molecules of 
ATP when it is broken down in metabolism. However, if the pair of 
carbons from an unsaturated bond (-CH=CH-), then about 
12 molecules of ATP can be generated. 


Consider the nutritional information of the two energy bars. 


Energy barı Energy bar 2 
Serving size (g) 68 52 
% (w/w) saturated fat 1.25 1.12 
% (w/w) monounsaturated fat 0.524 1.5 
% (w/w) polyunsaturated fat 2.35 2.8 


Using stearic acid (CjgH36O2), oleic acid (CjgH3402) and a-linolenic 
acid (CjsH3 90>) as your models for saturated, monounsaturated, and 
polyunsaturated fatty acids (see Table 8.1), calculate the number of 
moles of ATP that can be generated from these energy bars. 
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Membranes and Membrane 
Transport 


< Frog eggs are macroscopic facsimiles of 
microscopic cells. All cells are surrounded 
by a thin, delicate yet stable membrane. 


Membranes serve a number of essential cellular functions. They constitute the boundaries 
of cells and intracellular organelles, and they provide a surface where many important bio- 
logical reactions and processes occur. Membranes have proteins that mediate and regulate 
the transport of metabolites, macromolecules, and ions. Photosynthesis, electron trans- 
port, oxidative phosphorylation, muscle contraction, and electrical activity all depend on 
membranes. What are the properties and characteristics of biological membranes that 
account for their broad influence on cellular processes and transport? 


Morn: are key structural and functional elements of cells. All cells have a 
cytoplasmic membrane, or plasma membrane, that functions (in part) to sepa- 
rate the cytoplasm from the surroundings. The plasma membrane is also responsible 
for (1) the exclusion of certain toxic ions and molecules from the cell, (2) the accumula- 
tion of cell nutrients, and (3) energy transduction. It also functions in (4) cell locomotion, 
(5) reproduction, (6) signal transduction processes, and (7) interactions with molecules 
or other cells in the vicinity. Hormones and many other biological signal molecules 
and regulatory agents exert their effects via interactions with membranes and mem- 
brane proteins. As an index of the importance of membranes, an estimated 26% of 
human protein-coding genes encode membrane proteins. 

Even the plasma membranes of prokaryotic cells are complex (Figure 9.1). With no 
intracellular organelles to divide and organize the work, bacteria carry out processes 
either at the plasma membrane or in the cytoplasm itself. Eukaryotic cells, however, 
contain numerous intracellular organelles that perform specialized tasks. Nucleic acid 
biosynthesis is handled in the nucleus; mitochondria are the site of electron transport, 
oxidative phosphorylation, fatty acid oxidation, and the tricarboxylic acid cycle; and 
the secretion of proteins and other substances is handled by the endoplasmic reticulum 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content m: 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional coni 


9 


It takes a membrane to make sense out of 
disorder in biology. 


Lewis Thomas 
The World's Biggest Membrane, 
The Lives of a Cell (1974) 


9.1 What Are the Chemical and Physical 
Properties of Membranes? 


9.2 What Are the Structure and Chemistry 
of Membrane Proteins? 


9.3 How Are Biological Membranes 
Organized? 


9.4 What Are the Dynamic Processes That 
Modulate Membrane Function? 


9.5 How Does Transport Occur across 
Biological Membranes? 


9.6 What Is Passive Diffusion? 
9.7 How Does Facilitated Diffusion Occur? 


9.8 How Does Energy Input Drive Active 
Transport Processes? 


9.9 How Are Certain Transport Processes 
Driven by Light Energy? 


9.10 How Is Secondary Active Transport 
Driven by lon Gradients? 
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Figure 9.1 Electron micrographs of several different membrane structures: (a) Plasma membrane of 
Menoidium, a protozoan. (b) Two plasma membranes from adjacent neurons in the central nervous 
system. (c) Golgi apparatus. (d) Many membrane structures are evident in pancreatic acinar cells. 


(ER) and the Golgi apparatus. This partitioning of labor is not the only contribution of 
the membranes in these cells. Many of the processes occurring in these organelles (or in 
the prokaryotic cell) actively involve membranes. Thus, some of the enzymes involved in 
nucleic acid metabolism are membrane associated. The electron transfer chain and its 
associated system for ATP synthesis are embedded in the mitochondrial membrane. 
Many enzymes responsible for aspects of lipid biosynthesis are located in the ER 
membrane. This chapter discusses the composition, structure, and dynamic processes of 
biological membranes. 


9.1| What Are the Chemical and Physical 
Properties of Membranes? 


The interactions of lipids and proteins to form membranes are based on several 
properties of water. These include water’s tendency to form hydrogen bonds, share 
in polar interactions, and the hydrophobic effect, which promotes self-association of 
lipids in water to maximize entropy. These forces drive amphiphilic glycerolipids, 
sphingolipids, and sterols to form membrane structures in water, and they facilitate 
the association of proteins (and thus myriad biological functions) with membranes. 
A symphony of molecular events over a range of times from picoseconds to many 
seconds results in the movement of lipids and proteins across and between mem- 
branes; coordinates reactions at or in the membrane and the transport of ions, 
sugars, and amino acids across membranes; and organizes and directs hundreds of 
cell-signaling events. 


9.la The Composition of Membranes Suits Their Functions 


Biological membranes may contain as much as 75% to 80% protein (and only 20% to 
25% lipid) or as little as 15% to 20% protein. Membranes that carry out many enzyme- 
catalyzed reactions and transport activities (the inner mitochondrial membrane, 
chloroplast membranes, and the plasma membrane of Escherichia coli, for example) 
are typically richer in protein, whereas membranes that carry out fewer protein- 
related functions (myelin sheaths, the protective coating around neurons, for example) 
are richer in lipid. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


D.W. Fawcett/Photo Researchers, Inc. 


9.1 What Are the Chemical and Physical Properties of Membranes? 267 
Phosphatidylcholine : Nuclear 
Pep G 10 KEDE 49 membrane 
Phosphatidylethanolamine 
; MO Golgi 
E Sphingolipids sf) 10 17 42 apparatus 
H Phosphatidylinositol 
r [4] 17 [5 34 | 38 Mitochondria 
Phosphatidylserine l 
E Cardiolipin sH é = Lysosomes 
E] Minor lipids 
Plasma 

[| Cholesterol 16 28 membrane 


Figure 9.2 The lipid composition of rat liver cell membranes, in weight percent. (Adapted from 
Andreoli, T. E., 1987. Membrane Physiology, 2nd ed. Chapter 27, Table II, and Daum, G., 1985. 
Lipids of mitochondria. Biochimica et Biophysica Acta 822:1-42.) 


Cellular mechanisms adjust lipid composition to functional needs. Thus, for exam- 
ple, the lipid makeup of red blood cell membranes is consistent across species, whereas 
the lipid complement of different (specialized) membranes within a particular cell type 
(rat liver, Figure 9.2) reflects differences of function. Plasma membranes are enriched in 
cholesterol but do not contain diphosphatidylglycerol (cardiolipin), whereas mitochon- 
dria contain considerable amounts of cardiolipin (essential for some mitochondrial 
proteins) and no cholesterol. The protein components of membranes vary even more 
greatly than their lipid compositions. 


9.1b Lipids Form Ordered Structures Spontaneously in Water 


Amphipathic lipids spontaneously form a variety of structures when added to aque- 
ous solution. All these structures form in ways that minimize contact between the 
hydrophobic lipid chains and the aqueous milieu. 


Monolayers and Micelles When small amounts of a fatty acid are added to an aqueous 
solution, a monolayer is formed at the air—water interface, with the polar head groups 
in contact with the water surface and the hydrophobic tails in contact with the air 
(Figure 9.3). Few lipid molecules are found as monomers in solution. 
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Figure 9.3 Several spontaneously 
formed lipid structures. Drawings 

of (a) monolayers and bilayers, 

(b) micelles, (c) a unilamellar vesicle, 
(d) a multilamellar vesicle, and (e) an 
electron micrograph of a multilamellar 
Golgi structure. 
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Further addition of fatty acid eventually results in the formation of micelles. 
Micelles formed from an amphipathic lipid in water position the hydrophobic tails in 
the center of the lipid aggregation with the polar head groups facing outward. 
Amphipathic molecules that form micelles are characterized by a unique critical micelle 
concentration, or CMC. Below the CMC, individual lipid molecules predominate. 
Nearly all the lipid added above the CMC, however, spontaneously forms micelles. 
Micelles are the preferred form of aggregation in water for detergents and soaps. Some 
typical CMC values are listed in Figure 9.4. 


Lipid Bilayers The fundamental structure of membranes is the lipid bilayer, which 
consists of back-to-back arrangements of monolayers (Figure 9.3). The nonpolar por- 
tions of the lipids face the middle of the bilayer, with the polar head groups arrayed 
on the bilayer surface. Phospholipid bilayers form rapidly and spontaneously when 
phospholipids are added to water, and they are stable structures in aqueous solution. 
As opposed to micelles, which are small, self-limiting structures of a few hundred 
molecules, bilayers may form spontaneously over large areas (10° nm? or more). 
Because exposure of the edges of the bilayer to solvent is highly unfavorable, exten- 
sive bilayers normally wrap around themselves and form closed vesicles (Figure 9.3). 
The nature and integrity of these vesicle structures are very much dependent on the 
lipid composition. Physicochemical studies in the laboratory with these substances 
reveal that phospholipids can form either unilamellar vesicles (with a single lipid 
bilayer), known as liposomes, or multilamellar vesicles. These latter structures are 
reminiscent of the layered structure of onions. 

Liposomes are highly stable structures, a consequence of the amphipathic nature of the 
phospholipid molecule. Ionic interactions between the polar head groups and water are 
maximized, whereas hydrophobic interactions (see Chapter 2) facilitate the association 
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of hydrocarbon chains in the interior of the bilayer. The formation of vesicles results in a 
favorable increase in the entropy of the solution because the water molecules are not re- 
quired to order themselves around the lipid chains. It is important to consider for a mo- 
ment the physical properties of the bilayer membrane, which is the basis of vesicles and 
also of natural membranes. Bilayers have a polar surface and a nonpolar core. This hy- 
drophobic core provides a substantial barrier to ions and other polar entities. The rates of 
movement of such species across membranes are thus quite slow. However, this same core 
also provides a favorable environment for nonpolar molecules and hydrophobic proteins. 
We will encounter numerous cases of hydrophobic molecules that interact with mem- 
branes and regulate biological functions in some way by binding to or embedding them- 
selves in membranes. 


9.1c The Fluid Mosaic Model Describes Membrane Dynamics 


In 1972, S. J. Singer and G. L. Nicolson proposed the fluid mosaic model for mem- 
brane structure, which suggested that membranes are dynamic structures composed 
of proteins and phospholipids. In this model, the phospholipid bilayer is a fluid 
matrix, in essence, a two-dimensional solvent for proteins. Both lipids and proteins 
are capable of rotational and lateral movement. 

Singer and Nicolson also pointed out that a mosaic of proteins can be associated 
with the surface of this bilayer or embedded in the bilayer to varying degrees (Figure 9.5). 
They defined two classes of membrane proteins. The first, called peripheral proteins (or 
extrinsic proteins), includes those that do not penetrate the bilayer to any significant 
degree and are associated with the membrane by virtue of ionic interactions and hydro- 
gen bonds between the membrane surface and the surface of the protein. Peripheral 
proteins can be dissociated from the membrane by treatment with salt solutions or by 
changes in pH (treatments that disrupt hydrogen bonds and ionic interactions). Integral 
proteins (or intrinsic proteins), in contrast, possess hydrophobic surfaces that can readily 
penetrate the lipid bilayer itself, as well as surfaces that prefer contact with the aqueous 
medium. These proteins can either insert themselves into the membrane or extend all the 
way across the membrane and expose themselves to the aqueous solvent on both sides. 
Singer and Nicolson also suggested that a portion of the bilayer lipid interacts in specific 
ways with integral membrane proteins and that these interactions might be important for 
the function of certain membrane proteins. Because of these intimate associations with 
membrane lipid, integral proteins can be removed from the membrane only by agents 
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Figure 9.4 The structures of some common 
detergents and their physical properties. Micelles 
formed by detergents can be quite large. Triton 
X-100, for example, typically forms micelles 

with a total molecular mass of 90 kDa to 95 kDa. 
This corresponds to approximately 150 molecules of 
Triton X-100 per micelle. 
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Figure 9.5 The fluid mosaic model of membrane structure proposed by S. J. Singer and 
G. L. Nicolson. In this model, the lipids and proteins are assumed to be mobile; they can diffuse 
laterally in the plane of the membrane. Transverse motion may also occur, but it is much slower. 


such as detergents or organic solvents that are capable of breaking up the hydrophobic 
interactions within the lipid bilayer itself. The fluid mosaic model became the paradigm 
for modern studies that have advanced our understanding of membrane structure and 
function. 


The Thickness of a Membrane Depends on Its Components Electron micrographs of 
typical cellular membranes show the thickness of the entire membrane—including 
lipid bilayer and embedded protein—to be 50 Å or more. Electron microscopy, 
NMR, and X-ray and neutron diffraction measurements have shown that membrane 
thickness is influenced by the particular lipids and proteins in the membrane. The 
thickness of a phospholipid bilayer made from dipalmitoyl phosphatidylcholine, 
measured as the phosphorus-to-phosphorus spacing, is about 37 Å, and the hydro- 
phobic phase of such membranes is approximately 26 Å thick. Natural membranes 
are thicker overall than simple lipid bilayers because many membrane proteins 
extend out of the bilayer significantly. 

Among the known membrane protein structures, there is considerable variation in 
the hydrophobic surface perpendicular to the membrane plane. If the hydrophobic sur- 
face of the protein is larger or smaller than the lipid bilayer, the thickness of the lipid 
bilayer must be increased or decreased. The change in bilayer thickness due to mem- 
brane proteins can be as much as 5 Å. 


Lipid Chains May Bend and Tilt in the Membrane The long hydrocarbon chains of 
lipids are typically portrayed as more or less perpendicular to the membrane plane 
(Figure 9.3). In fact, the hydrocarbon tails of phospholipids may tilt and bend and 
adopt a variety of orientations. Typically, the portions of a lipid chain near the membrane 
surface lie most nearly perpendicular to the membrane plane, and lipid chain ordering 
decreases toward the end of the chain (toward the middle of the bilayer). 


Membranes Are Crowded with Many Different Proteins Membranes are crowded places, 
with a large number of proteins either embedded or associated in some way. The 
E. coli genome codes for more than a thousand membrane proteins. Moreover, as 
more membrane protein structures are determined (Figure 9.6), it has become appar- 
ent that many membrane proteins have large structures extending outside the lipid 
bilayer that share steric contacts and other interactions. 
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Figure 9.6 Membrane protein structures solved since the first structure in 1985. As of March, 2022, 
more than 1400 structures had been published. (Data from the Web site Membrane Proteins of 
Known 3D Structure at the laboratory of Stephen H. White, Attp://blanco. biomol.uci.edulmpstruc. 
See also: /ttp://webclu. bio. wzw. tum. delbinfolprojlproamp.) 


9.2 | What Are the Structure and Chemistry 
of Membrane Proteins? 


Although the lipid bilayer constitutes the fundamental structural unit of all biological 
membranes, proteins carry out essentially all of the active functions of membranes. 
Singer and Nicolson defined two classes of membrane proteins. Peripheral proteins are 
globular proteins that interact with the membrane mainly through electrostatic and 
hydrogen-bonding interactions, and integral proteins are those that are strongly asso- 
ciated with the lipid bilayer. Another class of proteins not anticipated by Singer and 
Nicolson, the lipid-anchored proteins, is important in a variety of functions in different 
cells and tissues. These proteins associate with membranes by means of a variety of 
covalently linked lipid anchors. 


9.2a Peripheral Membrane Proteins Associate Loosely 
with the Membrane 


Peripheral proteins can bind to membranes in several ways (Figure 9.7). They may 
form ionic interactions and hydrogen bonds with polar head groups of membrane 
lipids or with other (integral) proteins, or they may interact with the nonpolar mem- 
brane core by inserting a hydrophobic loop or an amphipathic a-helix. Examples of 
each of these interaction types are shown in Figure 9.8. 


9.2b Integral Membrane Proteins Are Firmly Anchored 
in the Membrane 


Hundreds of structures of integral membrane proteins are now available in the Pro- 
tein Data Bank, and the number of membrane protein structures is doubling about 
every 3 years. The known structures show a surprising diversity, but in all cases the 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


271 


272 Chapter 9 Membranes and Membrane Transport 


ai 


huh 


Figure 9.8 Models for membrane association of 
peripheral proteins. (a) Bee venom phospholipase 
A> (PDB id = 1POC), (b) p40 phox PX domain of 
NADH oxidase (PDB id = 1H6H), and (c) PH 
domain of phospholipase Cô (PDB id = IMAI). 
Amphipathic a-helices dominate the membrane 
interaction in (a) and (b), whereas hydrophobic 
loops are shown in (c). 
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Figure 9.7 Four possible modes for the binding of peripheral membrane proteins. 


portions of the protein in contact with the nonpolar core of the lipid bilayer are 
dominated by a-helices or B-sheets because these secondary structures neutralize 
the highly polar N—H and C=O functions of the peptide backbone through 
H-bond formation. There are three classes of integral proteins. Monotopic proteins 
are associated with only one face of the membrane. Bitopic proteins span the bilayer 
once, whereas polytopic proteins span the bilayer multiple times (Figure 9.9). 


Bitopic Proteins Possess a Single Transmembrane Segment In proteins that are an- 
chored by a single hydrophobic segment, that segment typically takes the form of an 
a-helix. One of the best examples is glycophorin. Most of glycophorin’s mass 1s ori- 
ented on the outside surface of the red blood cell, exposed to the aqueous milieu 
(Figure 9.10). Hydrophilic oligosaccharide units are attached to this extracellular 
domain. These oligosaccharide groups constitute the ABO and MNS blood group 
antigenic specificities of the red cell. Glycophorin has a total molecular weight of 
about 31,000 and is approximately 40% protein and 60% carbohydrate. The gly- 
cophorin primary structure consists of a segment of 19 hydrophobic amino acid resi- 
dues with a short hydrophilic sequence on one end and a longer hydrophilic sequence 
on the other end. The 19-residue sequence is just the right length to span the cell 
membrane if it is coiled in the shape of an a-helix. 

Monoamine oxidase from the mitochondrial outer membrane is another typical 
single transmembrane-segment protein (Figure 9.11); this enzyme is the target for 
many antidepressant and neuroprotective drugs. Each monomer of the dimeric protein 
binds to the membrane through a C-terminal transmembrane a-helix. Residues in two 
loops (Pro-109 and Ile-110 in the 99-112 loop and Phe-481, Leu-482, Leu-486, and 
Pro-487 in the 481—488 loop) also provide nonpolar residues that participate in mem- 
brane binding. 
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Figure 9.9 Three classes of membrane proteins. (a) Integral membrane proteins are classified as 
monotopic, bitopic, or polytopic, depending on the topology of their membrane association. (b) PglC 
is a phosphoglycosyltransferase from Campylobacter concisus and is an integral monotopic protein. 
PglC associates with the membrane via three a-helices at the membrane surface and one a-helix 
imbedded in the membrane. The bend in the latter helix occurs at a proline residue (PDB id = 5W7L). 
(Adapted from Figure 1 of Ray, L. C., et al., 2018. Membrane association of monotopic 
phosphoglycosyl transferase underlies function. Nature Chemical Biology 14: 538-541.) 
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Figure 9.10 Glycophorin A spans the membrane of the human erythrocyte via a single a-helical 
transmembrane segment. The C-terminus of the peptide faces the cytosol of the erythrocyte; the N-terminal 
domain is extracellular. Points of attachment of oligosaccharide groups are indicated by triangles. 


The number of TM segments found in membrane proteins varies between | and 33, 
with most proteins predicted to have between | and 12 TM regions. The largest group 
of membrane proteins has only one predicted TM segment. In animals, many of these 
function as cell surface receptors for extracellular signaling molecules or as recogni- 
tion sites that allow the immune system to recognize and distinguish cells of the host 
organism from invading foreign cells or viruses. The proteins that represent the major 
transplantation antigens H2 in mice (Figure 9.11) and human leukocyte antigen (HLA) 
proteins in humans are members of this class. Other such proteins include the surface 
immunoglobulin receptors on B lymphocytes and the spike proteins of many membrane 
viruses. The function of many of these proteins depends primarily on their extracel- 
lular domain; thus, the segment facing the intracellular surface is often a shorter one. 


Polytopic Proteins Have Multiple Transmembrane Segments Many integral transmem- 
brane proteins cross the lipid bilayer more than once. These multi-spanning membrane 
proteins typically have 2 to 12 transmembrane segments, and they carry out a variety 
of cellular functions (Figure 9.12). A well-characterized example of such a protein is 
bacteriorhodopsin (Figure 9.13), which clusters in purple patches in the membrane of 
the archaeon Halobacterium halobium. The name Halobacterium refers to the fact that 
this prokaryote thrives in solutions with high concentrations of sodium chloride, such 
as the salt ponds of San Francisco Bay. H. halobium carries out a light-driven proton 
transport by means of bacteriorhodopsin, named in reference to its spectral similari- 
ties to rhodopsin in the rod outer segments of the mammalian retina. The amino acid 
sequence of bacteriorhodopsin contains seven different segments, each about 20 non- 
polar residues in length—yjust the right size for an a-helix that could span a bilayer 
membrane. (Twenty residues times 1.5 A per residue equals 30 A.) 
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Figure 9.11 Partial structures of proteins with a single 
transmembrane segment. (Complete structures are 
not available.) (a) Major histocompatibility antigen 
HLA-A2 (PDB id = IJF1). The arrow notes the 
presumed connection with the transmembrane 
segment. (b) Monoamine oxidase (PDB id = 1GOS). 
Both are membrane-associated proteins with a single 
transmembrane helical segment. 
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Figure 9.13 Bacteriorhodopsin is composed of seven 
transmembrane a-helical segments connected by 
short loops (PDB id = 1M0M). Nearly all of this 
protein is embedded in the membrane. Only the 
short loops connecting helices are exposed to 
solvent. A retinal chromophore (a light-absorbing 
molecule, shown in blue) lies approximately parallel 
to the membrane and between the helical segments. 
A proline residue (red) induces a kink in one of the 
helical segments (green). 
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Table 9.1 Hydropathy Scale for 
Amino Acid Side Chains in Proteins* 


Side Chain Hydropathy Index 
Isoleucine 4.5 
Valine 4.2 
Leucine 3.8 
Phenylalanine 2.8 
Cysteine 2.5 
Methionine 1.9 
Alanine 1.8 
Glycine —0.4 
Threonine —0.7 
Serine —0.8 
Tryptophan —0.9 
Tyrosine —1.3 
Proline —1.6 
Histidine —3.2 
Glutamic acid —3.5 
Glutamine =3:5 
Aspartic acid =33 
Asparagine =3:5 
Lysine —3.9 
Arginine —4.5 


*From Kyte, J., and Doolittle, R., 1982. A simple method for 
displaying the hydropathic character of a protein. Journal of 
Molecular Biology 157:105-132. 
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Figure 9.12 The vast majority of membrane proteins possess 1 to 12 transmembrane segments, 
though a few percent of known proteins have more. Those involved in transport functions have 


between 6 and 12 transmembrane segments. (Adapted from von Heijne, G., 2006. Membrane-protein 
topology. Nature Reviews Molecular Cell Biology 7:909-918.) 


Bacteriorhodopsin clusters in symmetric, repeating arrays in the purple membrane 
patches of H. halobium, and it was this orderly, repeating arrangement of proteins in 
the membrane that enabled Nigel Unwin and Richard Henderson in 1975 to determine 
the bacteriorhodopsin structure. The polypeptide chain crosses the membrane seven times, 
in seven a-helical segments, with very little of the protein exposed to the aqueous milieu. 
The bacteriorhodopsin structure became a model of globular membrane protein structure. 
Many other integral membrane proteins contain numerous hydrophobic sequences that, 
like those of bacteriorhodopsin, form a-helical transmembrane segments. 


Membrane Protein Topology Can Be Revealed by Hydropathy Plots The topology of a 
membrane protein is a specification of the number of transmembrane segments and 
their orientation across the membrane. The topology of a transmembrane helical pro- 
tein can be revealed by a hydropathy plot based on its amino acid sequence. If a mea- 
sure of hydrophobicity is assigned to each amino acid (Table 9.1), then the overall 
hydrophobicity of a segment of a polypeptide chain can be estimated. The hydropathy 
index for any segment is an average of the hydrophobicity values for its residues. 

The hydropathy index can be calculated at any residue in a sequence by averaging the 
hydrophobicity values for a segment surrounding that residue. Typically, segment sizes 
for such calculations can be 7 to 21 residues. With a 7-residue segment size, the calcula- 
tion of hydropathy index at residue 10 would average the values for residues 7 through 
13. The calculation for a 21-residue segment around residue 100 would include residues 
90 to 110. A polypeptide segment approximately 20 residues long with a high hydropa- 
thy index is likely to be an a-helical transmembrane segment. A hydropathy plot for 
glycophorin (Figure 9.14a) reveals a single region of high hydropathy index between 
residues 73 and 93, the location of the a-helical segment in this transmembrane protein 
(see Figure 9.10). A hydropathy plot for rhodopsin (Figure 9.14b) reveals the locations 
of its seven a-helical transmembrane segments. Rhodopsin, the light-absorbing pig- 
ment protein of the eye, is a member of the G-protein—coupled receptor (GPCR) family 
of membrane proteins. 


Proline Residues Can Bend a Transmembrane a-Helix Transmembrane a-helices often 
contain distortions and kinks—more so than for water-soluble proteins. As more in- 
tegral membrane protein structures have been determined, it has become clear that 
most transmembrane a-helices contain significant distortions from ideal helix geom- 
etry. Helix distortions may have evolved in membrane proteins because (1) helices, 
even distorted ones, are highly stable in the membrane environment, and (2) helix 
distortions may be one way to create structural diversity. 

About 60% of known membrane helix distortions are kinks at proline residues 
(see Figure 9.13). Proline distorts the ideal a-helical geometry because of its fixed p 
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Figure 9.14 Hydropathy plots for (a) glycophorin and (b) rhodopsin. Hydropathy index is plotted 
versus residue number. At each position in the polypeptide chain, the average of hydropathy indices 
for a certain number of adjacent residues (eight, in this case) is calculated and plotted on the y-index, 
and the number of the residue in the middle of this window is shown on the x-axis. 


value, steric conflict with the preceding residue, and because of the loss of a backbone 
H bond. Proline-induced kinks create weak points in the helix, which may facilitate 
movements required for transmembrane transport channels. 


Amino Acids Have Preferred Locations in Transmembrane Helices Transmembrane 
protein sequences and structures are adapted to the transition from water on one 
side of the membrane, to the hydrocarbon core of the membrane, and then to water 
on the other side of the membrane. The amino acids that make up transmembrane 
segments reflect these transitions. Charged residues (Figure 9.15a) occur commonly 
at the lipid—water interface, but positively charged residues are found more often on 
the cytoplasmic face of transmembrane proteins. Hydrophobic amino acids (Ala, Val, 
Leu, Ile, and Phe) are found most often in the hydrocarbon interior, where charged 
and polar amino acids almost never reside (Figure 9.15b). Gunnar von Heijne 
has termed this the positive inside rule. Tryptophan, histidine, and tyrosine are 
special cases (Figure 9.15c). These residues have a mixed character, with nonpolar 
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Figure 9.15 Amino acids have distinct preferences for different parts of the membrane. The graphs 
show relative stabilization energies as a function of location in the membrane for (a) Arg, Asp, Glu, 
Lys, Asn, Gln, and Pro; (b) Ala, Gly, Ile, Leu, Met, Phe, and Val; and (c) His, Tyr, and Trp. In general, 
polar and charged residues are less stable in the membrane interior, whereas nonpolar residues tend to 
be more stable in the membrane interior. The stability profiles for His, Tyr, and Trp are more complex. 
(Adapted from von Heijne, G., 2006. Membrane-protein topology. Nature Reviews Molecular and Cell 
Biology 7:909-918.) 
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Snorkeling 


Antisnorkeling 


Figure 9.16 Snorkeling and antisnorkeling behavior 
in membrane proteins. The SdhC subunit of 
succinate dehydrogenase (PDB id = 1NEK). Lys!!! 
snorkels away from the membrane core and Phe” 
antisnorkels toward the membrane core. Numbers at 
left indicate lipid bilayer dimensions in angstroms. 
(Adapted from Liang, J., Adamian, L., and Jackups, 
R., Jr., 2005. The membrane—water interface region 
of membrane proteins: Structural bias and the anti- 
snorkeling effect. Trends in Biochemical Sciences 
30:355-357.) 


aromatic rings that also contain polar parts (the ring N—H of Trp and the 
substituent —OH of Tyr). As such, Trp and Tyr are found commonly at the lipid— 
water interface of transmembrane proteins. 

The amino acids Lys and Arg frequently behave in novel ways at the lipid—water 
interface. Both of these residues possess long aliphatic side chains with positively 
charged groups at the end. In many membrane proteins, the aliphatic chain of Lys or 
Arg is associated with the hydrophobic portion of the bilayer, with the positively charged 
groups (amino or guanidinium) extending beyond to associate with negatively charged 
phosphate groups. This behavior, with the side chain pointing up out of the membrane 
core, has been termed snorkeling (Figure 9.16). If a Phe residue occurs near the lipid— 
water interface, it is typically arranged with the aromatic ring oriented toward the mem- 
brane core. This is termed antisnorkeling. 


Membrane Protein Structures Show Many Variations on the Classical Themes Although 
it revealed many insights of membrane protein structure, bacteriorhodopsin gave a 
relatively limited view of the structural landscape. Many membrane protein struc- 
tures obtained since bacteriorhodopsin (and a few others) have provided a vastly 
more complex picture to biochemists. For example, the structures of a homodimeric 
chloride ion transport protein and a glutamate transport protein show several novel 
structural features (Figure 9.17). In addition to several transmembrane helices that 
lie perpendicular to the membrane plane (like those of bacteriorhodopsin), these 
structures each contain several Jong, severely tilted helices that span the membrane. 
Both these proteins also contain several reentrant loops, consisting of a pair of short 
a-helices and a connecting loop that together penetrate partway into the membrane 
core. There are also regions of nonhelical polypeptide deep in the membrane core of 
these proteins, with helical segments on either side that extend to the membrane 
surface (Figure 9.17). Some transmembrane helices are hydrophobic throughout 
their length, and these tilt largely to avoid exposing nonpolar residues to the water 
solvent. Other transmembrane helices have a mix of polar and nonpolar residues at 
one or both ends of the helix. In these cases, the helix tilt is the result of specific 
interactions of these polar residues with other parts of the protein. Nonpolar resi- 
dues in these capping segments face the surrounding membrane. 

Finally, most membrane protein structures are relatively stable; that is, transmem- 
brane helices do not flip in and out of the membrane, and they do not flip across the 
lipid bilayer, inverting their orientation. However, a few membrane proteins can in fact 
change their membrane orientation. Aquaporin-! is a protein that functions normally 


Figure 9.17 Not all the embedded segments of membrane proteins are transmembrane and oriented 
perpendicular to the membrane plane. (a) The glutamate transporter homolog (PDB id = 1XFH). 
Reentrant helices (orange) and interrupted helices (red) are shown. Several of the transmembrane 
helices deviate significantly from the perpendicular. (b) The E. coli CIC chloride transporter 

(PDB id = 1KPK). Few of the transmembrane helices are perpendicular to the membrane plane. 
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Figure 9.18 Dynamic insertion of helical segments of membrane proteins. (a) Aquaporin-1. The 
second and fourth transmembrane helices insert properly across the membrane only after reorientation 
of the third transmembrane helix. (b) The large envelope glycoprotein of the hepatitis B virus. The 
N-terminal pre-S domain translocates across the endoplasmic reticulum membrane in a slow process 
in 50% of the molecules. (Adapted from von Heijne, G., 2006. Nature Reviews Molecular and Cell 
Biology 7:909-918.) 


with six transmembrane a-helices. When this protein is first inserted into its mem- 
brane, it has only four transmembrane a-helices (Figure 9.18a). One of these, the third 
transmembrane helix (TM3), reorients across the membrane, pulling helices two and 
four into the membrane. Similarly, a glycoprotein of the hepatitis B virus is initially 
inserted into the viral membrane with its N-terminal domain lying outside. During the 
viral maturation process, about half of these glycoproteins rearrange (Figure 9.18b), 
with the N-terminal segment moving across the membrane as TM4 creates a new trans- 
membrane segment. 


Some Proteins Use -Strands and B-Barrels to Span the Membrane The a-helix is not the 
only structural motif by which a protein can cross a membrane. Some integral trans- 
membrane proteins use structures built from B-strands and B-sheets to diminish the 
polar character of the peptide backbone as it crosses the nonpolar membrane core. 
These -barrel structures (Figure 9.19) maximize hydrogen bonding and are highly 
stable. The barrel interior is large enough to accommodate water molecules and often 
structures as large as peptide chains, and most barrels are literally water filled. 

How does the B-barrel structure tolerate water on one surface (the inside) and the 
nonpolar membrane core on the other? In all transmembrane B-barrels, polar and non- 
polar residues alternate along the B-strands, with polar residues facing the center of the 
barrel and nonpolar residues facing outward, where they can interact with the hydro- 
phobic lipid milieu of the membrane. 

Porin proteins found in the outer membranes (OMs) of gram-negative bacteria such 
as E. coli, and also in the outer mitochondrial membranes of eukaryotic cells, span their 
respective membranes with large B-barrels. A good example is maltoporin, also known 
as LamB protein or lambda receptor, which participates in the entry of maltose and 
maltodextrins into Æ. coli. Maltoporin is active as a trimer. The 421-residue monomer 
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Figure 9.20 The arrangement of the peptide chain 
in maltoporin from E. coli. Porins are located in 
the outer membranes of gram-negative bacteria 
and in the outer membranes of mitochondria. 

The periplasmic space is the region between inner 
and outer membranes. 
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Figure 9.19 Some proteins traverse the membrane with B-barrel structures. 
Several examples are shown, including (a) maltoporin from Salmonella 
typhimurium (PDB id = 2MPR), (b) ferric enterobactin receptor (PDB id = 
1FEP), (c) TolC, an outer membrane protein from E. coli (PDB id = 1EK9), 
(d) the translocator domain of the NalP autotransporter of N. meningitides 
(PDB id = 1UYN), (e) the translocator domain of the Hia autotransporter 
from Haemophilus influenzae (PDB id = 2GR8), (f) the outer membrane 
cobalamin transporter from E. coli, in a complex with the 100 A coiled coil 
of colicin E3 (PDB id = 1UJW), (g) the fatty acid transporter FadL from 
E. coli (PDB id = 1T16). 
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forms an 18-strand 8-barrel with antiparallel B-strands connected to their nearest 
neighbors either by long loops or by B-turns (Figure 9.20; see also Figure 9.19a). The 
long loops are found at the end of the barrel that is exposed to the cell exterior, whereas 
the turns are located on the intracellular face of the barrel. Three of the loops fold into 
the center of the barrel. 

B-barrels can also be constructed from multiple subunits. The a-hemolysin toxin 
(Figure 9.21) forms a 14-stranded B-barrel with seven identical subunits that each 
contribute two antiparallel B-strands connected by a short loop. Staphylococcus aureus 
secretes monomers of this toxin, which bind to the plasma membranes of host blood 
cells. Upon binding, the monomers oligomerize to form the seven-subunit structure. 
The channel thus formed facilitates uncontrolled permeation of water, ions, and small 
molecules, destroying the host cell. 

Why have certain proteins evolved to use -strands instead of a-helices as 
membrane-crossing devices? Among other reasons, there is an advantage of genetic 
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Membrane 


economy in the use of -strands to traverse the membrane instead of a-helices. An 
a-helix requires 21 to 25 amino acid residues to span a typical biological membrane; a 
B-strand can cross the same membrane with 9 to 11 residues. Therefore, a given amount 
of genetic information could encode a larger number of membrane-spanning segments 
using a B-strand motif instead of a-helical arrays. 


Transmembrane Barrels Can Also Be Formed with a-Helices Many bacteria, including 
E. coli, produce extracellular polysaccharides, some of which form a discrete struc- 
tural layer—the capsule, which shields the cell, allowing it to evade or counteract host 
immune systems. In E. coli, the components of this polysaccharide capsule are syn- 
thesized inside the cell and then transported outward through an octameric outer 
membrane protein called Wza. To cross the outer membrane, Wza uses a distinctive 
a-helical barrel (Figure 9.22). Wza is composed of three novel domains that, with the 
a-helical barrel, form a large central cavity that accommodates the transported 
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Figure 9.22 The structure of Wza, an octameric membrane protein that anchors the peptidoglycan 
layer and the outer membrane of gram-negative bacteria. The structure contains a central barrel 
(black lines) constructed from a-helical segments (PDB id = 2J58). Axial view is from bottom of the 
structure in the side view. 


Figure 9.21 The structure of the heptameric 
channel formed by Staphylococcus aureus 
a-hemolysin. Each of the seven subunits 
contributes a B-sheet hairpin to the 
transmembrane channel (PDB id = 7AHL). 
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Farnesyltransferase complexed with 
farnesylpyrophosphate (green) and TKCVIM 
substrate peptide (red) (PDB id = 3DPY). 


Geranylgeranyltransferase complexed with a 
geranylgeranylpyrophosphate analog (green) 
and KKKSKTKCVIL peptide (red) (PDB id = 
1N4Q). 


Prenyl groups are linked to target proteins by the 
actions of farnesyl and geranylgeranyl transferases, 
which play crucial roles in multiple cell-signaling 
pathways and are thus promising therapeutic targets 
for a variety of cancers and other diseases. 


polysaccharides. The transmembrane a-helices of Wza are amphiphilic, with hydro- 
phobic outer surfaces that face the lipid bilayer and hydrophilic inner surfaces that 
face the water-filled pore. 


9.2c Lipid-Anchored Membrane Proteins Can Act as Switching Devices 


Certain proteins are found to be covalently linked to lipid molecules. For many of 
these proteins, covalent attachment of lipid is required for association with a membrane. 
The lipid moieties can insert into the membrane bilayer, effectively anchoring their 
linked proteins to the membrane. Some proteins with covalently linked lipid normally 
behave as soluble proteins; others are integral membrane proteins and remain 
membrane associated even when the lipid is removed. Covalently bound lipid in these 
latter proteins can play a role distinct from membrane anchoring. In many cases, 
attachment to the membrane via the lipid anchor serves to modulate the activity of the 
protein. 

An interesting facet of lipid anchors is that they are transient. Lipid anchors can be 
reversibly attached to and detached from proteins. This provides a “switching device” 
for altering the affinity of a protein for the membrane. Reversible lipid anchoring is one 
factor in the control of signal transduction pathways in eukaryotic cells. 

Four different types of lipid-anchoring motifs have been found to date. These are 
amide-linked myristoyl anchors, thioester-linked fatty acyl anchors, thioether-linked prenyl 
anchors, and amide-linked glycosyl phosphatidylinositol anchors. Each of these anchoring 
motifs is used by a variety of membrane proteins, but each nonetheless exhibits a charac- 
teristic pattern of structural requirements. 


Amide-Linked Myristoyl Anchors Myristic acid may be linked via an amide bond to the 
a-amino group of the N-terminal glycine residue of selected proteins (Figure 9.23a). 
The reaction is referred to as N-myristoylation and is catalyzed by myristoyl— 
CoA:protein N-myristoyltransferase, known simply as NMT. N-Myristoyl—anchored 
proteins include the catalytic subunit of cAMP-dependent protein kinase, the pp60°"° 
tyrosine kinase, the phosphatase known as calcineurin B, the a-subunit of G proteins 
(involved in GTP-dependent transmembrane signaling events), and the gag proteins of 
certain retroviruses (including the HIV-1 virus that causes AIDS). 


Thioester-Linked Fatty Acyl Anchors A variety of cellular and viral proteins contain fatty 
acids covalently bound via ester linkages to the side chains of cysteine and sometimes to 
serine or threonine residues within a polypeptide chain (Figure 9.23b). This type of fatty 
acyl chain linkage has a broader fatty acid specificity than N-myristoylation. Myristate, 
palmitate, stearate, and oleate can all be esterified in this way, with the Cj, and Cig chain 
lengths being most commonly found. Proteins anchored to membranes via fatty acyl 
thioesters include G-protein—coupled receptors, the surface glycoproteins of several 
viruses, the reggie proteins of nerve axons, and the transferrin receptor protein. 


Thioether-Linked Prenyl Anchors As noted in Chapter 8, polyprenyl (or simply prenyl) 
groups are long-chain polyisoprenoid groups derived from isoprene units. Prenylation 
of proteins destined for membrane anchoring can involve either farnesyl or geranyl- 
geranyl groups (Figure 9.23c and d). The addition of a prenyl group typically occurs at 
the cysteine residue of a carboxy-terminal CAAX sequence of the target protein, 
where C is cysteine, A is any aliphatic residue, and X can be any amino acid. As shown 
in Figure 9.23c and d, the result is a thioether-linked farnesyl or geranylgeranyl group. 
Once the prenylation reaction has occurred, a specific protease cleaves the three 
carboxy-terminal residues, and the carboxyl group of the now terminal Cys is methyl- 
ated to produce an ester. All of these modifications appear to be important for subse- 
quent activity of the prenyl-anchored protein. Proteins anchored to membranes via 
prenyl groups include yeast mating factors, the p21" protein (the protein product of the 
ras oncogene) and the nuclear lamins, structural components of the lamina of the inner 
nuclear membrane. 
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Figure 9.23 Certain proteins are anchored to 
biological membranes by lipid anchors. Shown are 
(a) the N-myristoyl motif, (b) the S-palmitoyl motif, 
(c) the farnesyl and geranylgeranyl motifs, (d) the 
prenylation and subsequent processing of prenyl- 
anchored proteins, (PPSEP = preny protein-specific 
endoprotease and PPSMT = prenyl protein-specific 
methyltransferase) and (e) several cases of the 
glycosyl phosphatidylinositol (GPI) motif. 
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Glycosyl Phosphatidylinositol Anchors Glycosyl phosphatidylinositol, or GPI, groups 
are structurally more elaborate membrane anchors than fatty acyl or prenyl groups. 
GPI groups modify the carboxy-terminal amino acid of a target protein via an etha- 
nolamine residue linked to an oligosaccharide, which is linked in turn to the inositol 
moiety of a phosphatidylinositol (Figure 9.23e). The oligosaccharide typically consists 
of a conserved tetrasaccharide core of three mannose residues and a glucosamine, 
which can be altered by addition of galactosyl side chains of various sizes and extra 
phosphoethanolamines, N-acetylgalactose, or mannosyl residues (Figure 9.23e). The 
inositol moiety can also be modified by an additional fatty acid, and a variety of fatty 
acyl groups are found linked to the glycerol group. GPI groups anchor a wide variety 
of surface antigens, adhesion molecules, and cell surface hydrolases to plasma mem- 
branes in various eukaryotic organisms. GPI anchors have not yet been observed in 
prokaryotic organisms or plants. 


9.3 | How Are Biological Membranes Organized? 


Biological membranes are asymmetric and heterogeneous structures. The two mono- 
layers of the lipid bilayer have different lipid compositions and different complements 
of proteins. The membrane composition is also different from place to place across the 
plane of the membrane. There are clusters of particular kinds of lipids, particular kinds 
of proteins, and a variety of specific lipid—protein associations and aggregates, all of 
which serve the functional needs of the cell. We say that both the lipids and the proteins 
of membranes exhibit lateral heterogeneity and transverse asymmetry. Lateral hetero- 
geneity arises when lipids or proteins of particular types cluster in the plane of the 
membrane, as discussed in Section 9.4. Transverse asymmetry refers to different lipid or 
protein compositions in the two leaflets or monolayers of a bilayer membrane. 

Many properties of a membrane depend on its two-sided nature. Properties that are 
a consequence of membrane “sidedness” include membrane transport that is driven in 
one direction only, the effects of hormones at the outsides of cells, and the recognition 
reactions that occur between cells (necessarily involving only the outside surfaces of the 
cells). The proteins involved in these and other interactions must be arranged asym- 
metrically in the membrane. 

Lipid transverse asymmetry can be seen in the typical animal cell, where the acidic and 
amine-containing phospholipids, phosphatidylserine (PS), phosphatidic acid (PA), phos- 
phatidylglycerol (PG), and phosphatidylinositol (PI), are enriched in the cytoplasmic leaf- 
let of the plasma membrane, and the choline-containing phospholipids and sphingolipids 
are enriched in the outer leaflet (Figure 9.24). In the erythrocyte, for example, phosphati- 
dylcholine (PC) comprises about 29% of the total phospholipid in the membrane. Of this 
amount, 76% is found in the outer monolayer and 24% is found in the inner monolayer. 

The inner and outer leaflets of most membranes also differ in degree of unsaturation. 
In erythrocytes, 35% of outer leaflet phospholipids are saturated and most have less 
than two unsaturations per lipid. By contrast, the cytoplasmic leaflet is highly enriched 
in phospholipids with polyunsaturated acyl chains, with the majority of lipids contain- 
ing four or more unsaturations. Phosphatidylcholine, however, is an exception. Inner 
leaflet PC is mostly saturated, whereas outer leaflet PC is largely polyunsaturated. Un- 
saturated lipid chains are much more flexible and fluid than saturated chains, and mem- 
branes rich in unsaturation have very different properties, as will be seen in Section 9.4. 

Asymmetric lipid distributions are important to cells in several ways. The carbohy- 
drate groups of glycolipids (and of glycoproteins) always face the outside of plasma 
membranes, where they participate in cell recognition phenomena. Asymmetric lipid 
distributions may also be important to various integral membrane proteins, which may 
prefer particular lipid classes in the inner and outer monolayers. 

Many proteins contain lipid-binding domains as well as positively charged regions 
that can interact with the negatively charged headgroups of acidic phospholipids. The 
resulting ionic protein—lipid interactions can direct cytoplasmic proteins to the plasma 
membrane, induce membrane protein clustering, and regulate protein conformations 
and functions. 
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Figure 9.24 Phospholipids are distributed asymmetrically in most membranes, including the human 
erythrocyte membrane, as shown here. (a) The distribution of phospholipids across the inner and 
outer leaflets of human erythrocytes. The x-axis values show, for each lipid type, its percentage of the 
total phospholipid in the membrane. (b) The phospholipid compositions of the inner and outer 
leaflets. Note that the bar chart and pie charts provide complementary information. All percentages in 
(a) and (b) are weight percentages. (Adapted from Zachowski, A., 1993. Phospholipids in animal 
eukaryotic membranes: Transverse asymmetry and movement. Biochemical Journal 294:1-14; and 
from Andreoli, T. E., 1987. Membrane Physiology, 2nd ed. Chapter 27, Table I. New York: Springer.) 


Loss of transverse lipid asymmetry has dramatic (and often severe) consequences for 
cells and organisms. For example, appearance of PS in the outer leaflet of the plasma 
membrane triggers apoptosis, the programmed death of the cell. Similarly, aging eryth- 
rocytes and platelets slowly externalize PS, culminating in engulfment by macrophages. 
Many disease states, including diabetes and malaria, involve microvascular occlusions 
that may result in part from alterations of transverse lipid asymmetry. 


9.4 | What Are the Dynamic Processes 
That Modulate Membrane Function? 


The motions of lipids and proteins in membranes underlie many cell functions. This 
section describes the nature of these motions and how they affect membrane structure. 


9.4a Lipids and Proteins Undergo a Variety 
of Movements in Membranes 


Lipid movements in membranes (Figure 9.25) range from bond vibrations (at 10!? per 
sec), to bilayer undulations (1 to 10° per sec), to transverse motion—called flip-flop 
(roughly one per day to one per sec). Lateral movement of lipids (in the plane of the 
membrane) is rapid. Adjacent lipids can change places with each other on the order of 
10’/sec. Thus, a typical phospholipid can diffuse laterally in a membrane at a linear 
rate of several microns per second. At that rate, a phospholipid molecule travels from 
one end of a bacterial cell to the other in less than a second or traverses a typical ani- 
mal cell in a few minutes. 

Many membrane proteins move laterally (through the plane of the membrane) at a 
rate of a few microns per minute. On the other hand, some integral membrane proteins 
are more restricted in their lateral movement, with diffusion rates of about 10 nm per 
sec or even slower. Slower motion is likely for proteins that associate and bind with each 
other and for proteins that are anchored to the cytoskeleton, a complex latticelike struc- 
ture that maintains the cell’s shape and assists in the controlled movement of various 
substances through the cell. 
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Figure 9.25 Lipid motions in the membrane and their characteristic frequencies. (Adapted from 
Gawrisch, K., 2005. The dynamics of membrane lipids. In The Structure of Biological Membranes, 
Chapter 4, Figure 4.1, Yeagle, P. L., ed., 2005. Boca Raton: CRC Press.) 


Flippases, Floppases, and Scramblases: Proteins That Redistribute Lipids across the 
Membrane Proteins that can flip and flop phospholipids from one side of a bilayer to 
the other have also been identified in several tissues (Figure 9.26). Three classes of such 
proteins are known: 


1. ATP-dependent flippases that transport PS, and to a lesser extent PE, from the 
outer leaflet to the inner leaflet of the plasma membrane 

2. ATP-dependent floppases that transport a variety of amphiphilic lipids, especially 
cholesterol, PC, and sphingomyelin, from the inner leaflet to the outer leaflet 

3. Bidirectional, Ca?*-activated (but ATP-independent) scramblases that function to 
randomize lipids and thus degrade transverse asymmetry 


The ATP-dependent flippases and floppases belong to two large and unrelated active 
transport systems: the ATP-binding cassette (ABC) transporters (see Section 9.10) 
and the P4-ATPases, a subclass of the P-type ATPases (see Section 9.8). Both families 
hydrolyze ATP to drive transport, although they employ different mechanisms to do 
so. These proteins reduce the half-time for phospholipid movement across a mem- 
brane from days to a few minutes or less. Approximately one ATP is consumed per 
lipid transported by flippases and floppases. Energy-dependent lipid flippase activity 
is essential for the creation and maintenance of transverse lipid asymmetries. A num- 
ber of diseases have been linked to defects in flippases and floppases. Tangier disease 
causes accumulation of high concentrations of cholesterol in various tissues and leads 
to cardiovascular problems. Infants with respiratory distress syndrome produce low 
amounts of lung surfactant (a mix of lipids) and typically die a few days after birth. 
Both of these diseases involve flippase or floppase defects. 


9.4b Membrane Lipids Can Be Ordered to Different Extents 


The phospholipids and sterols of membranes can adopt different structures depend- 
ing on the exact lipid and protein composition of the membrane and on the tem- 
perature. At low temperatures, bilayer lipids are highly ordered, forming a gel phase 
with the acyl chains nearly perpendicular to the plane of the membrane plane 
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Figure 9.26 The actions of flippases, floppases, and scramblases. (a) Phospholipids can be flipped, 
flopped, or scrambled across a bilayer membrane by the action of flippase, floppase, and scramblase 
proteins. (b) When, by normal diffusion through the bilayer, the lipid encounters one of these proteins, 
it can be moved quickly to the other face of the bilayer. (c) Structure of a flippase (PDB id = 7K YC). 
(d) Structure of a floppase (PDB id = 6BPP). (e) Structure of a scramblase (PDB id = 7JLP). 


(Figure 9.27). In this state—called the solid-ordered state (or S, state)—the lipid 
chains are tightly packed and undergo relatively little motion. The lipid chains are in 
their fully extended conformation, the surface area per lipid is minimal, and the 
bilayer thickness is maximal. At higher temperatures, the acyl chains undergo much 
more motion, with rotations around the acyl chain C—C bonds and significant 
large-scale disordering of the acyl chains. The membrane is then said to be in a liquid 
crystalline phase or liquid-disordered state (Lystate) (Figure 9.27). In this less ordered 
state, the surface area per lipid increases and the bilayer thickness decreases by 10% 
to 15%. Under most conditions, the transition from the gel phase to the liquid 
crystalline phase is a true phase transition, and the temperature at which this change 
occurs is referred to as a transition temperature or melting temperature (Tn). 

The sharpness of the transition in pure lipid preparations shows that the phase 
change is a cooperative behavior. This is to say that the behavior of one or a few 
molecules affects the behavior of many other molecules in the vicinity. The sharpness of 
the transition then reflects the number of molecules that are acting in concert. Sharp 
transitions involve large numbers of molecules all melting together. 

Phase transitions have been characterized in a number of different pure and mixed 
lipid systems. Table 9.2 shows a comparison of the transition temperatures observed for 
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Figure 9.27 The gel-to-liquid crystalline phase transition, which occurs when a membrane is warmed 
through the transition temperature, Tm. In the transition, the surface area increases, the membrane 
thickness decreases, and the mobility of the lipid chains increases dramatically. Membrane phase 
transitions can be characterized by measuring the rate of heat absorption by a membrane sample in a 
calorimeter. Pure, homogeneous bilayers (containing only a single lipid component) give sharp 
calorimetric peaks. As membrane heterogeneity increases, the calorimetric peaks broaden. Below 
phase transitions, lipid chains primarily adopt the anti conformation. Above the phase transition, 
lipid chains have absorbed a substantial amount of heat. This is reflected in the adoption of higher- 
energy conformations, including the gauche conformations shown. 


Table 9.2 


Transition Temperature 


Phospholipid (Tm), °C 
Dimyristoyl phosphatidylcholine (Di 14:0) 23.6 
Dipalmitoyl phosphatidylcholine (Di 16:0) 41.4 
Distearoyl phosphatidylcholine (Di 18:0) 58 
1-Stearoyl-2-oleoyl-phosphatidylcholine 3 
(1-18:0, 2-18: 1 PC) 

Dioleoyl phosphatidylcholine (Di 18:1 PC) —22 
Egg phosphatidylcholine (Egg PC) =15 
Dipalmitoyl phosphatidic acid (Di 16:0 PA) 67 
Dipalmitoyl phosphatidylethanolamine (Di 16:0 PE) 63.8 
Dipalmitoy! phosphatidylglycerol (Di 16:0 PG) 41.0 


Adapted from Jain, M., and Wagner, R. C., 1980. Introduction to Biological Membranes. New York: John Wiley and Sons; 
and Martonosi, A., ed., 1982. Membranes and Transport, Vol. 1. New York: Plenum Press. 
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several different phosphatidylcholines with different fatty acyl chain compositions. 
General characteristics of bilayer phase transitions include the following: 


1. The transitions are always endothermic; heat is absorbed as the temperature 
increases through the transition (Figure 9.27). 

2. Particular phospholipids display characteristic transition temperatures (Tm). As 
shown in Table 9.2, Tm increases with chain length, decreases with unsaturation, 
and depends on the nature of the polar head group. 

3. For pure phospholipid bilayers, the transition occurs over a narrow temperature 
range. The phase transition for dimyristoyl lecithin has a peak width of about 0.2°C. 

4. Native biological membranes also display characteristic phase transitions, but 
these are broad and strongly dependent on the lipid and protein composition of 
the membrane. 

5. With certain lipid bilayers, a change of physical state referred to as a pretransition 
occurs 5° to 15°C below the phase transition itself. These pretransitions involve a 
tilting of the hydrocarbon chains. 

6. A volume change is usually associated with phase transitions in lipid bilayers. 

7. Bilayer phase transitions are sensitive to the presence of solutes that interact with 
lipids, including multivalent cations, lipid-soluble agents, peptides, and proteins. 


Cells adjust the lipid composition of their membranes to maintain proper fluidity as 
environmental conditions change. 

In addition to the solid-ordered (S,) and liquid-disordered (Lg) states, model lipid 
bilayers can exhibit a third structural phase if the membrane contains sufficient sphin- 
golipid and cholesterol. The liquid-ordered (L,) state (Figure 9.28) is characterized by a 
high degree of acyl chain ordering (like the S, state) but has the translational disorder 
characteristic of the Ly state. Lipid diffusion in the L, phase is about twofold to three- 
fold slower than in the L4 phase. Biological membranes are hypothesized to contain re- 
gions equivalent to all three phases of model membranes: So, La, and L,. 


Dynamic Exchange of Lipids and Proteins with Membrane Domains The membrane 
heterogeneity described in Section 9.3 is the result of lateral organization. The plasma 
membranes of mammalian cells are organized into distinct domains of differing pro- 
tein and lipid composition. There is a continuous and highly dynamic exchange 
between lipid and protein monomers, nanoscale lipid and protein clusters, and 
microscale higher-order structures that orchestrates protein function (Figure 9.29a). 
Nanoscale domains (5-100 nm diameter) typically consist of a few dozen to a few 
hundred lipid molecules and just a few proteins. Microscale domains may extend over 
several micrometers. 


Liquid disordered (Lg) Liquid ordered (L,) 


Figure 9.28 Image from a molecular dynamics simulation of a model bilayer composed of 
dipalmitoylphosphatidylcholine and cholesterol, showing a phase separation of liquid disordered (La) 
and liquid ordered (L,) nanodomains. (Adapted with permission from Figure | of Javanainen, M., 
Martinez-Seara, H., and Vattulainen, I., 2017. Nanoscale membrane domain formation driven by 
cholesterol. Scientific Reports 7:1143. Image provided by Matti Javanainen.) 
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The maintenance of these domains, and the exchange of lipids and proteins between 
them, is regulated by factors both intrinsic and extrinsic to the membrane. Lipids (glyc- 
erophospholipids, sphingolipids, and cholesterol) may self-associate in clusters 
(Figure 9.29b), and membrane proteins may act to organize domain components and 
structures. Tetraspanin proteins, for example (Figure 9.29c) act as scaffolding proteins to 
cluster and organize other proteins. 

Tetraspanins are evolutionarily conserved proteins (there are 33 in the human body), 
involved in adhesion, migration, and signaling. Tetraspanins consist of four transmem- 
brane a-helical domains, two small intracellular tails, a small intracellular loop, and two 
extracellular loops (Figure 9.29d). Tetraspanin membrane domains are between 
100 nM and 200 nM in diameter and contain fewer than ten tetraspanin proteins. Tet- 
raspanin monomers and dimers exchange with tetraspanin nanodomains. Partner pro- 
tein interactions occur within or at the border of these nanodomains. Loss of these 
domains leads to cellular defects and the deregulation of signal pathways. 

Cortical actin filaments (F-actin and actin-binding proteins, Figure 9.29e) of the cy- 
toskeleton attach to the inner leaflet of the plasma membrane, and energy-driven pro- 
cesses organize these actin filaments that affect local membrane composition and shape. 
In turn, membrane composition can regulate cytoskeletal dynamics—a dynamic inter- 
play between membrane organization and the actin cytoskeleton that provides the cell 
with a stable yet flexible cell surface that can continuously adapt to its environment. The 
actin cytoskeleton is a rigid extrinsic organizer that forms fence-like structures under- 
neath the membrane that account for hop diffusion (see following section). At the same 
time, the continuous remodeling of cortical actin affects the dynamic exchange of mem- 
brane organization. Like instruments in a symphony, playing together but each chang- 
ing leads during a piece, all these lipid—protein interactions play different structural and 
functional roles at different times in biological membranes. 


(a) (b) 
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Figure 9.29 Dynamic exchange of domains in membranes involves lipid clustering, protein clusters 
organized by tetraspanins, as well as lipid and protein clusters assembled by cortical actin. 

(a) Nanodomains and microdomains of lipids and proteins in the plasma membrane are in a 
dynamic equilibrium with lipid and protein monomers. (b) Lipids of various types can form 
clusters in the plane of the membrane. (c) Tetraspanins (blue) can induce clustering of multiple 
proteins (gray objects) in membrane nanodomains. (d) Structure of tetraspanin CD81 (PDB id = 
5TCX). (e) Actin components on the cytoplasmic face of a membrane (termed cortical actin) 

can organize lipid and protein clusters in the membrane. 
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Lateral Membrane Diffusion Is Restricted by Barriers and Fences A variety of studies of 
lateral diffusion rates in membranes have shown that membrane proteins and lipids in 
plasma membranes diffuse laterally at a rate 5 to 50 times slower than those of artificial 
lipid membranes. Why should this be? Part of the answer has come from single particle 
tracking experiments (Figure 9.30a), which reveal that lipids and at least some membrane 
proteins tend to undergo hop diffusion, such that they can diffuse freely within a mem- 
brane “compartment” for a time and then hop to an adjacent compartment, where the 
process repeats. Akihiro Kusumi and colleagues have proposed the membrane-skeleton 
fence model to explain this behavior, suggesting that certain proteins that comprise the 
cytoskeleton—a network of proteins on the cytoplasmic face of the plasma membrane— 
restrict the lateral diffusion of other membrane proteins (Figure 9.30b). The fence 
proteins may include spectrin, a filamentous cytoskeletal protein in red blood cells, and 
actin, a cytoskeletal protein that is ubiquitous in eukaryotic cells. The single particle 
tracking experiments show that lipid molecules are typically confined within fenced 
compartments for approximately 13 ms to 15 ms, whereas transmembrane proteins are 
typically confined for 45 ms to 65 ms. Even lipids in the outer leaflet of the plasma 
membrane undergo hop diffusion, leading Kusumi and colleagues to postulate that 
transmembrane proteins act as rows of pickets extending across both monolayers in these 
membrane fences. Fences thus define regions of relatively unrestricted lipid diffusion. 


Lipids and Proteins Direct Dynamic Membrane Remodeling and Curvature The complex 
shapes of cells and organelles are the result of forces that operate on their membranes, 
and these forces in turn are orchestrated by lipids and proteins. Membranes change 
their shapes in special ways during movement, cell division, and other cellular events. 
This dynamic membrane remodeling is also accomplished by the interplay of lipids 
and proteins. The various membrane subdomains with particular curvatures have 
precise and specialized biological properties and functions. 


(b) 


Figure 9.30 Motions of a single (fluorescently labeled) lipid molecule on the surface of a cell can be 
measured by video fluorescence microscopy. (a) Video recording at 40,000 frames per second yields a 
time resolution of 25 us. Data collected over 62 ms (a total of 2500 frames) show that a lipid diffuses 
rapidly within small domains (defined by colors) and occasionally jumps or hops to an adjacent 
region (shown as a color change). (Adapted from Kusumi, A., et al., 2005. Paradigm shift of the 
plasma membrane concept from the two-dimensional continuum fluid to the partitioned fluid: High- 
speed single-molecule tracking of membrane molecules. Annual Review of Biophysics and Biomolecular 
Structure 34:351—378.) (b) Video microscopy studies of single lipid molecule movement in membranes 
are consistent with a compartmentalized model for the membrane, in which lipids and proteins 
undergo short-term diffusion within fenced compartments, with occasional hops to adjacent 
compartments. Akihiro Kusumi has suggested that elements of the cytoskeleton may define the fence 
boundaries at the membrane. 
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Figure 9.31 Membrane curvature can occur by 
several different mechanisms, including (a) changes 
in lipid composition, (b) insertion of membrane 
proteins that have intrinsic curvature or that 
oligomerize, (c) insertion of amphipathic helices into 
one leaflet of the bilayer, (d) interaction of the 
bilayer with molecular scaffolding proteins, or 

(e) changes induced by the cytoskeletal filaments 
inside the cell. (Adapted from McMahon, H. T., 
and Gallop, J. L., 2005. Membrane curvature and 
mechanisms of dynamic cell membrane remodeling. 
Nature 438:590-596.) 
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There are several ways to induce curvature in a membrane. Lipids can influence or ac- 
commodate membrane curvature (Figure 9.3la), either because of lipid molecule geom- 
etry or because of an imbalance in the number of lipids in the inner and outer leaflets of 
the bilayer. (In a liposome of 50 nm diameter, there is 56% more lipid in the outer leaflet 
than in the inner leaflet.) Integral membrane proteins with conical shapes can promote 
membrane curvature (Figure 9.31b). The structure of a voltage-gated K* channel is an 
example of a shape conducive to membrane curvature (see Figure 9.31). Proteins of the 
cytoskeleton, such as actin, typically contact the plasma membrane and can generate cur- 
vature by rearrangements of their own structure. Moreover, motor proteins moving along 
filaments of the cytoskeleton can generate curvature in the membrane. Scaffolding pro- 
teins, which can bind on either side of the plasma membrane, can influence membrane 
curvature in many ways. For example, BAR domains are dimeric, banana-shaped struc- 
tures (Figure 9.32) that bind preferentially to and stabilize curved regions of the plasma 
membrane. Finally, amphipathic a-helices can insert into bilayers, parallel to the mem- 
brane surface, thus forcing curvature on the membrane (Figure 9.31c). N-BAR domains 
are BAR domains that have an N-terminal a-helix preceding the BAR domain. The helix 
typically inserts to induce curvature, and the BAR domain binds to stabilize the curved 
structure. Harvey McMahon and his colleagues have proposed a structure (Figure 9.32b) 
for N-BAR domain-mediated membrane curvature by endophilin-Al1, a protein found at 
synapses and implicated in the formation of synaptic vesicles. Membrane curvature is es- 
sential to a variety of cellular functions, including cell division, viral budding, and the 
processes of endocytosis and exocytosis, described in the next section. 


Caveolins and Caveolae Respond to Plasma Membrane Changes Caveolae are flask- 
shaped indentations in plasma membranes. Caveolae participate in mechanosensation, 
the detection and sensing of mechanical forces at the membrane, and mechanotrans- 
duction, the conversion of mechanical forces into biochemical signals that result in 
cell responses that regulate cell growth, differentiation, cell shape, and cell death. 
Caveolae (Figure 9.33) are rich in cholesterol, sphingolipids, and caveolin, an integral 
membrane protein of 22,000 MW. There are three members of the caveolin family. 
CAVI and CAV? are found in endothelial, fibrous, and adipose (fat) tissue, whereas 
CAV3 is unique to skeletal muscle. Caveolins form homodimers in the plasma mem- 
brane, with both N- and C-termini oriented toward the cytosolic face of the mem- 
brane. The C-terminal domain has several palmitoyl lipid anchors and is separated 
from the N-terminal oligomerization domain by a 33-residue intramembrane hairpin 
domain. A typical caveolar structure consists of about 144 caveolin molecules, with 
up to 20,000 cholesterol molecules. 


Vesicle Formation and Fusion Are Essential Membrane Processes The membranes of 
cells are not static. Normal cell function requires that the various membrane- 
enclosed compartments of the cell constantly reorganize and exchange proteins, lip- 
ids, and other materials. These processes, and others such as cell division, exocytosis, 
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Figure 9.32 Model of BAR domain binding to membranes. (a) The classical model for binding of 
BAR domains to membranes (PDB id = 2C08). (b) A model for membrane binding of endophilin-A1, 
with amphiphilic helices inducing curvature that is stabilized by BAR domain binding. (Adapted from 
Gallop, J. L., et al., 2006. Mechanism of endophilin N-BAR domain-mediated membrane curvature. 
The EMBO Journal 25:2898-2910. Image in (b) kindly provided by Harvey T. McMahon.) 
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Figure 9.33 Formation of caveolae in membranes. (a) Cryo-EM image of 8S Cav1 complex with Transport 
11-fold symmetry. (A single subunit is highlighted in red.) Views are perpendicular to membrane (top) vesicle 
and parallel to membrane (bottom). (b) Proposed association of 8S Cav1 complex with membrane, 
showing two subunits. (c) A single 8S Cav1 complex modeled onto faces of a dodecahedral caveola. 
The N-terminus (NT) and C-terminus (CT) of each monomer both lie on the cytoplasmic face of the 
membrane. Adapted from Porta, J. C. et al., 2022. Molecular architecture of the human caveolin-1 
complex. Science Advances 8:eabn7232. 


Target 
endocytosis, and viral infection, all involve either fusion of one membrane with an- ase 
other or budding and separation of a vesicle from a membrane. Eukaryotic organelles 
communicate with one another by the exchange of trafficking vesicles. Vesicles are 
generated at a precursor membrane, transported to their destination, and then fused 
with the target compartment (Figure 9.34). Although the organelles involved in these 
processes are diverse—endoplasmic reticulum, Golgi, endosomes, and others—the 
basic reactions of budding and fusion are accomplished by protein families and mul- 
tiprotein complexes that have been conserved throughout eukaryotic evolution. The Figure 9.34 Vesicle-mediated transport in cells 
molecular events of exocytotic release of neurotransmitters into the synapses of nerve involves budding of vesicles from a donor membrane, 
cells are good examples of such processes. followed by fusion of the vesicle membrane with the 

Neurons communicate with one another by converting electrical signals into chemical ™¢™brane of a target compartment, a process that 
r ; : i : ‘ ahs transfers the contents of the donor compartment as 
signals and back again. When electrical signals arrive at the synapse, vesicles containing well as selected membrane proteins. (Adapted from 
neurotransmitters (such as acetylcholine) fuse with the plasma membrane, releasing the Alberts, B., 2007. Molecular Biology of the Cell, 
neurotransmitters into the synaptic cleft. The binding of neurotransmitters to receptors 5th ed., New York: Garland Science.) 
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Figure 9.35 The domain structure of the SNARE protein families. (a) A variety of N-terminal 
domains are found in Q, SNARE proteins, including the three-helix bundle of syntaxin-1 (PDB id = 
1BR0); (b) Qh. SNAREs (such as SNAP-25) are anchored in the membrane by palmitic acid lipid 
anchors; (c) Many R SNAREs contain small globular N-terminal domains such as Vam7, a 
PX-homology domain (PDB id = 10CS). (Adapted from Jahn, R., and Scheller, R. H., 2006. 
SNAREs—engines for membrane fusion. Nature Reviews Molecular Cell Biology 7:631-643.) 


on an adjacent neuron generates an electrical signal that is passed along. The fusion of 
vesicles with the plasma membrane is directed by SNAREs—a family of proteins that 
snare vesicles to initiate the fusion process. (The acronym, a somewhat strained effort to 
describe their function cleverly, stands for soluble N-ethylmaleimide-sensitive factor 
attachment protein receptor.) SNAREs are small proteins with a simple domain 
structure (Figure 9.35) that includes a SNARE motif, consisting of 60 to 70 residues of 
classical 7-residue repeats (see Chapter 6). These SNARE domains form a four-helix 
bundle that bridges two membranes, drawing them together in the first stage of mem- 
brane fusion. The N-terminal domains are variable across the SNARE family, but at 
their C-termini most SNAREs have a single transmembrane domain joined to the 
SNARE motif by a short linker. Syntaxin-1 (known as a Qa SNARE, for a conserved 
glutamine) has an N-terminal three-helix bundle, whereas SNAP-25 (known as a Qpe 
SNARE) consists of two SNARE domains joined by a linker with two palmitoyl lipid 
anchors (Figure 9.35). Synaptobrevin (known as an R SNARE, for a conserved arginine 
(R)) consists of a small N-terminal domain, in addition to its SNARE and transmem- 
brane domains. 

The fusion of a vesicle with the plasma membrane is a multistep process 
(Figure 9.36). When a neurotransmitter-laden vesicle approaches the plasma mem- 
brane, syntaxin-1 and SNAP-25 on the plasma membrane join with synaptobrevin on 
the vesicle to form a four-helix bundle that bridges the two membranes, drawing them 
together in the first stage of membrane fusion, termed Priming Stage I. The binding of 
complexin, a small helical protein, clamps the complex so that it is poised for mem- 
brane fusion but is unable to complete the process. Arrival of an action potential (elec- 
trical signal—see Chapter 32) triggers the flow of Ca** ions into the cell through 
channel proteins, and the binding of Ca?* ion to synaptotagmin displaces complexin 
and promotes the joining of the membranes (to form Priming Stage II) and the 
formation of a fusion pore. The complexin clamp (Figure 9.36) ensures that 
neurotransmitter release can occur in an instant following Ca’* influx because the 
slow steps of SNARE assembly have already been completed. 
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Figure 9.36 SNARE complex assembly and its control in plasma membrane fusion. Step 1: 

Q SNARES (syntaxin-1 and SNAP-25), organized in clusters, assemble into acceptor complexes in the 
plasma membrane. Step 2: Acceptor complexes interact with synaptobrevin in an approaching vesicle, 
forming a four-helical Priming Stage I. Step 3: Membrane fusion and pore formation are prevented by 
binding of complexin. Step 4: Arrival of an action potential (nerve impulse) triggers displacement of 
complexin by synaptotagmin, initiating fusion and pore formation. Step 5: Upon completion of the 
fusion process, the trans complex relaxes. Step 6: SNAREs are redistributed to their respective 
membrane domains and vesicles are reformed. (Adapted from Jahn, R., and Scheller, R. H., 2006. 
SNAREs—engines for membrane fusion. Nature Reviews Molecular Cell Biology 7:631—643.) 


9.5 | How Does Transport Occur across 
Biological Membranes? 


Transport processes are vitally important to all life forms, because all cells must 
exchange materials with their environment. Cells obviously must have ways to bring 
nutrient molecules into the cell and ways to send waste products and toxic substances 
out. Also, inorganic electrolytes must be able to pass in and out of cells and across 
organelle membranes. All cells maintain concentration gradients of various metabo- 
lites across their plasma membranes and also across the membranes of intracellular 
organelles. By their very nature, cells maintain a very large amount of potential 
energy in the form of such concentration gradients. Sodium and potassium ion gradi- 
ents across the plasma membrane mediate the transmission of nerve impulses and the 
normal functions of the brain, heart, kidneys, and liver, among other organs. Storage 
and release of calcium from cellular compartments controls muscle contraction as 
well as the response of many cells to hormonal signals. High acid concentrations in 
the stomach are required for the digestion of food. Extremely high hydrogen ion gra- 
dients are maintained across the plasma membranes of the mucosal cells lining the 
stomach in order to maintain high acid levels in the stomach. 

We shall consider the molecules and mechanisms that mediate these transport activi- 
ties. In nearly every case, the molecule or ion transported is water soluble, yet moves 
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Figure 9.37 Passive diffusion of an uncharged 
species across a membrane depends only on the 
concentrations (C, and C2) on the two sides of the 
membrane. The concentration ratio shown is for 
movement from side | (initial state) to side 2 (final 
state). For movement in the opposite direction, the 
concentration ratio would be the inverse. 
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Figure 9.38 The passive diffusion of a charged 
species across a membrane depends on the 


concentration and also on the charge of the particle, 


Z, and the electrical potential difference across the 
membrane, Ay for movement from side | (initial 
state) to side 2 (final state). 


across the hydrophobic, impermeable lipid membrane at a rate high enough to serve the 
metabolic and physiological needs of the cell. This perplexing problem is solved in each 
case by a specific transport protein. The transported species either diffuses through 
a channel-forming protein or is carried by a carrier protein. Transport proteins are all 
classed as integral membrane proteins. 

From a thermodynamic and kinetic perspective, there are only three types of membrane 
transport processes: passive diffusion, facilitated diffusion, and active transport. To be thor- 
oughly appreciated, membrane transport phenomena must be considered in terms of 
thermodynamics. Some of the important kinetic considerations also will be discussed. 


9.6 | What Is Passive Diffusion? 


Passive diffusion is the simplest transport process. In passive diffusion, the trans- 
ported species moves across the membrane in the thermodynamically favored direc- 
tion without the help of any specific transport system/molecule. For an uncharged 
molecule, passive diffusion is an entropic process, in which movement of molecules 
across the membrane proceeds until the concentration of the substance on both sides 
of the membrane is the same. For an uncharged molecule, the free energy difference 
between side | and side 2 of a membrane (Figure 9.37) is given by 


[C] 


AG = G — Gi = RT ln (a) (9.1) 


The difference in concentrations, [Cy] — [Ci], is termed the concentration gradient, and 
AG here is the chemical potential difference for movement from side 1 to side 2. 

For a charged species, the situation is slightly more complicated. In this case, the 
movement of a molecule across a membrane depends on its electrochemical potential. 
This is given by 


AG=G,- Gi = RT In (a) + ZFAW (9.2) 
1 

where Z is the charge on the transported species, ¥ is Faraday’s constant (the charge 
on | mole of electrons = 96,485 coulombs/mol = 96,485 joules/volt-mol, since 1 volt = 
1 joule/coulomb), and Aw is the electric potential difference (that is, voltage differ- 
ence) across the membrane. The second term in the expression thus accounts for the 
movement of a charge across a potential difference. Note that the effect of this second 
term on AG depends on the magnitude and the sign of both Z and Aw. The potential 
difference term should always be stated as final state potential minus initial state 
potential. 

For example, as shown in Figure 9.38, if side 2 has a higher potential than side 1 (so 
that Aw is positive), for a negatively charged ion the term Z#AwW makes a negative con- 
tribution to AG. In other words, the negative charge is spontaneously attracted to the 
more positive potential—and AG is negative. In any case, if the sum of the two terms on 
the right side of Equation 9.2 is a negative number, transport of the ion in question 
from side 1 to side 2 would occur spontaneously. The driving force for passive transport 
is the AG term for the transported species itself. 


9.7 | How Does Facilitated Diffusion Occur? 


The transport of many substances across simple lipid bilayer membranes via passive 
diffusion is far too slow to sustain life processes. On the other hand, the transport 
rates for many ions and small molecules across actual biological membranes are much 
higher than anticipated from passive diffusion alone. This difference is due to specific 
proteins in the cell membranes that facilitate the transport of these species across the 
membrane. Proteins capable of effecting facilitated diffusion of a variety of solutes are 
present in essentially all natural membranes. Such proteins have two features in 
common: (1) They facilitate net movement of solutes only in the thermodynamically 
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favored direction (that is, AG < 0), and (2) they display a measurable affinity and 
specificity for the transported solute. Consequently, facilitated diffusion rates display 
saturation behavior, like that seen for ligand binding to a protein (see Figure 5.26). 
Such behavior provides a simple means for distinguishing between passive diffusion 
and facilitated diffusion experimentally. The dependence of transport rate on solute 
concentration takes the form of a rectangular hyperbola (Figure 9.39), so the trans- 
port rate approaches a limiting value, Vmax, at a very high solute concentration. 
Figure 9.39 also shows the graphical behavior exhibited by simple passive diffusion. 
Because passive diffusion does not involve the formation of a specific solute—protein 
complex, the plot of rate versus concentration is linear, not hyperbolic. 


9.7a Membrane Channel Proteins Facilitate Diffusion 


Membrane transport channels carry out facilitated diffusion. In contrast to active 
transport systems (or pumps) like Na*, K*-ATPase and Ca?*-ATPase, channels sim- 
ply enable the (energetically passive) downhill movement of ions and other molecules. 
However, active pumps and most channels share one fundamental property: The abil- 
ity to transport species in a selective manner. Molecular selectivity is crucial to the 
operation of both pumps and channels. The human proteome lists 400 ion channels. 

The membrane channel structures determined to date have revealed some of nature’s 
strategies for moving ions and molecules across biological membranes. Channel compo- 
sition can take several forms. A single-channel pore can be formed from dimers, trimers, 
tetramers, or pentamers of protein subunits (for example, channels for Na*, K*, Mg?*, 
and glutamate; see Table 9.3). On the other hand, multimeric assemblies in which each 
subunit has its own pore have been described (in channels for C17, NH3, water, and 
glycerol). Several recurring themes are apparent from the channel structures studied to 
date: 


e Some channels are selective for a particular ion or molecule. Others select either for 
cations or for anions, and some are nonselective. 

e Channels can be gated—that is, they can open or close in response to a triggering 
signal. Some channels are voltage gated and open and close in response to a 
change in membrane electrical potential (that is, voltage). Other channels are li- 
gand gated, opening or closing in response to binding of an ion, a small organic 
molecule, or even a protein. 

e Channels that are ion selective usually have pores lined with (oppositely) charged 
amino acids. Cation channels are lined with negatively charged residues; anion 
channels are lined with positively charged residues. 

e Tons may flow through channels either in a hydrated or unhydrated state, depend- 
ing on the width of the channel. Wider pores are often unselective and accept ions 
in their hydrated states. Narrower pores lined with charged amino acids are more 
likely to be highly selective. 
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Figure 9.39 Passive diffusion and facilitated 
diffusion may be distinguished graphically. The 
plots for facilitated diffusion are similar to plots of 
enzyme-catalyzed processes and they display 
saturation behavior. 


Table 9.3 Subunit Composition and Pore Structure of Membrane Channels 
KcsA 
NaK 
Channel Glutamate 
MgE ASIC 
7 aw 
Kt 
Mg Na* Na* NH; 
Transported Species Glutamate Glycerol 
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The key questions for any channel protein are: 


e How does the channel open and close? 

e How does gating occur—that is, how is a voltage change sensed or how does a trig- 
gering ligand bind? 

e How is voltage sensing or ligand binding coupled to channel opening and closing? 


These recurring themes and key questions are illustrated particularly well by the 
K* channels characterized largely by Roderick MacKinnon and his colleagues. 


Potassium Channels Combine High Selectivity with High Conduction Rates Potassium 
transport (that is, conduction) is essential for many cell processes, including regula- 
tion of cell volume, electrical impulse formation (in electrically excitable cells, such as 
neurons), and secretion of hormones; all cells thus conduct K* ions across the cell 
membrane. Potassium channels are facilitated diffusion devices, conducting K* down 
the electrochemical gradient for K*. Whether found in bacteria, archaea, plants, or 
animals, all known potassium channels are members of a single protein family. Potas- 
sium channels have two important characteristics: They are highly selective for 
K* ions over Na* ions, and they conduct K* ions at very high rates (almost as fast as 
any entity can diffuse in water—the so-called diffusion limit). 

The structure of Kes A, from Streptococcus lividans, is typical of the simpler prokary- 
otic K* channels (Figure 9.40). The structure consists of four identical subunits, and, 
facing the cytosol is a water-filled pore that traverses more than half of the membrane 
bilayer, ending at the selectivity filter. A hydrated K* ion is suspended in the center of 
the pore. Each subunit consists of two transmembrane helices (S1 and S2, the outer and 
inner helices, respectively), with an intervening reentrant loop consisting of a 10-residue 
pore helix and a 5-residue 8-strand that is one quarter of the selectivity filter. The inner 
(S2) helices form an inverted “teepee” with the shorter pore helices arranged within. 

In contrast to the simple prokaryotic K* channels, the mammalian voltage-gated 
potassium channels involved in nerve and muscle function are more complex, with each 
tetramer subunit consisting of six transmembrane helical segments (S1—S6). Helices 
S1-S4 form the voltage sensor domain, whereas the S5-S6 domain is analogous to the 
transmembrane helices of KcsA and forms the transport channel. 

The selectivity filter in KcsA (Figure 9.40) consists of four pentapeptides, one from 
each subunit, with the sequence TVGYG. The backbone carbonyls of the first four resi- 
dues and the threonine side-chain oxygen—evenly spaced—face the center of the pore. 
These oxygens create four possible K *-binding sites. In each site, a bound, dehydrated K* 
is surrounded by eight oxygens from the protein: four above and four below. The arrange- 
ment of protein oxygens at each site is very similar to the arrangement of water molecules 
around a hydrated K*. This simple structure is strikingly selective for K*. The physical 
basis for selection between K* and Na* is the atomic radius—1.33 A for K* and 0.95 A 
for Na*. Still, K* channels select for K* over Na* by a factor of more than a thousand! 
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Figure 9.40 Structure of the KcsA potassium channel from Streptococcus lividans (PDB id = 1K4C). 
(a) The four identical subunits of the channel, which surround a central pore, are shown in different 
colors. (b) Each subunit contributes three a-helices (blue, green, red) to the tetramer structure. (c) The 
selectivity filter is made from loops from each of the subunits, two of which are shown here. (d) The 
tetrameric channel, as viewed through the pore. In (a), (b), and (c), the cytosolic side is at the top. 
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Figure 9.41 Model for inward and outward transport 
through the KcsA potassium channel. The selectivity 
filter in the channel contains four K*-binding sites, 
only two of which are filled at any time. 


As K* moves through the KesA channel, there are, on average, two K* ions bound 
in the selectivity filter at any given time, either in positions | and 3 or positions 2 and 4 
(with water molecules occupying the other positions). Ions can move in either direction 
across the channel, depending on the existing electrochemical gradient. One K* enters 
the channel from one side as a different ion exits on the other side. The cycle of steps for 
inward or outward movement is shown in Figure 9.41. 

High selectivity, along with high conduction rates, seems at first paradoxical. If K* 
ions bind too tightly in the filter, they could not move quickly through the pore. Two 
factors keep the binding just tight enough, but not too tight: (1) repulsion between the 
closely spaced K* ions at their two sites and (2) a conformational change induced by 
K* binding. 

The KcsA channel is gated by intracellular pH. It is closed at neutral pH and 
above, and it opens at acidic pH. What is the conformational change that opens this 
and other K* channels? After comparing the closed-pore conformation of KcsA 
with the open-pore conformation of the related MthK channel (Figure 9.42), 
MacKinnon has proposed that helix bending and rearrangement deep in the mem- 
brane opens K* channels. 

The mammalian K* channel is voltage gated. In this more complex structure 
(Figure 9.43), helix S4 contains a series of five positively charged Arg residues, spaced 
every three residues. These five residues all face the outside surface of the membrane in 
the open state of the channel. MacKinnon and coworkers have postulated that channel 
closing involves movement of the S4 helix inward, which pushes down on the short 
S4-S5 linker helix, which induces a conformation change in S5 and S6, thus closing the 
channel. 


9.7b The Bacillus cereus NaK Channel Uses a Variation 
on the K* Selectivity Filter 


Could the K* channel selectivity filter be modified to accommodate other ions, for 
instance Na‘? A comparison of amino acid sequences from a variety of ion channels 
(Figure 9.44) shows that this is indeed the case. Variations on the TVGYG filter 


Figure 9.42 Comparison of the closed (PDB id = 
1K4C) and open (PDB id = 1LNQ) states of the 
MthK potassium channel. The inner helices obstruct 
the central pore in the closed conformation. 
However, bending at a glycine residue near the center 
of the membrane splays the inner helices outward 
from the channel center, allowing free access for ions 
between the cytosol and the selectivity filter. This 
critical Gly residue is conserved in most K* channel 
sequences, making this a likely gating mechanism for 
Closed Open most K* channels. 
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Sequence of selectivity filter Selective for 


T GYGDLYP Kt 
T G GNFSP Nat, Kt 
LTG DWNSV Cad 


Figure 9.44 Ion selectivity in cation channels is a 
function of peptide sequence of the selectivity filter. 
Conserved glycines in the TVGYG motif of the 
KesA potassium channel are shown in red. Amino 
acids that are chemically similar are yellow. From 
top to bottom, the selectivity for K* over Na* 
decreases and the selectivity for Ca?* increases. 
(Adapted from Zagotta, W. N., 2006. Permutations 


of permeability. Nature 440:427-428.) 
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Figure 9.45 Structure of the channel from Bacillus 
cereus (PDB id = 2AHY), which has equal 
preference for Na* and K+. (a) One subunit of the 
tetramer has been removed to reveal the five ion 
binding sites in the center of the channel. A Ca?* ion 
is bound at the extracellular entrance to the channel 
(aqua, top), and K* is bound to the other four sites. 
(b) The tetrameric channel, as viewed through the 
pore. (c) Substitution of D for Y in the selectivity 
filter eliminates binding sites 1 and 2, leaving a pore 
vestibule that binds a K* with low affinity. Sites 3 
and 4 are preserved, but bind Na* and K* equally 
well. The bottom site contains a fully hydrated K*, in 
a manner similar to the KcsA channel (Figure 9.40). 


(b) hypothetical closed 


Figure 9.43 Model for voltage-dependent gating in mammalian K* channels. See PDB id = 1J4C. 
Parts (a) and (b) show a single subunit of the channel; (c) and (d) show the native tetrameric channel. 
Parts (a) and (c) represent the open channel conformation; (b) and (d) show the closed state. Photo 
courtesy of Roderick MacKinnon. 


sequence are found in ion channels with a range of selectivities for K*, Na*, and even 
Ca. Bacillus cereus contains an ion channel with equal preference for Nat and Kt 
that is similar to the transient receptor potential (TRP) channels found widely in 
eukaryotes. The structure of this channel (Figure 9.45) is similar in many ways to the 
K* channels, but the selectivity filter sequence of this NaK channel is TVGDG. The 
substitution of D for Y changes the filter in several ways. Binding sites | and 2, 
the sites most selective for K*, are eliminated, leaving a pore vestibule that can accom- 
modate an ion but not bind it tightly. The remaining sites, binding sites 3 and 4, bind 
Nat and K* equally well. In addition, the D for Y substitution creates a Ca?*-binding 
site at the extracellular entrance to the selectivity filter (see Figure 9.45c). It appears 
likely that variations of the selectivity filter sequence can tune it to accommodate and 
select for a variety of transported cations (Figure 9.45). 


9.7¢ CorA Is a Pentameric Mg?* Channel 


The transport of Mg” in bacteria and archaea is accomplished primarily by the CorA 


family of membrane channels. Its pentameric structure (Figure 9.46) contrasts with the 
tetrameric K* and NaK channels in several ways. With a large N-terminal cytosolic 
domain and C-terminal transmembrane domain, it resembles a funnel or cone. One of 
the two transmembrane a-helices extends 100 A into the cytosol and is the longest 
continuous a-helix in any known protein. Remarkable features of the structure include 
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Figure 9.46 The structure of the pentameric CorA Mg?* channel from Thermus maritima (PDB 

id = 2HN2). The transmembrane pore is formed from five short a-helices (red) and stabilized by four 
longer a-helices (aqua). The pore entrance from the periplasm is gated closed by a ring of five Asn 
residues (gold), and a ring of five Leu (orange) and five Met (beige) residues narrows the pore to 2.5 A. 
The large cytosolic domain includes a basic sphincter of 20 Lys residues (blue), and a ring of 50 Asp 
and Glu residues (green) at the tips of the helices. Asp*? and Asp”? (purple) participate in Mg?* 
binding. 


a ring of 20 lysine residues around the outside of the structure near the negatively 
charged membrane-—cytosol interface and a cluster of 50 aspartate and glutamate resi- 
dues (on helices) adjacent to the lysines and extending into the cytosol. The available 
structures of CorA are closed-pore structures. A ring of five Asn?" residues blocks the 
opening to the pore on the periplasmic face, a ring of Met?! residues narrows 
the pore to 3.3 A at the center of the membrane, and a ring of Leu? residues reduce 
the pore diameter to 2.5 A at the cytosolic face of the membrane. Gating of the pore 
must overcome these obstacles as well as the repulsive ring of positive Lys side chains. 
It has been hypothesized that the long a-helix and the negatively charged helices could 
act as a lever to pry apart the repulsive ring of lysines and open the Mg”* pore. 


9.7d Chloride, Water, Glycerol, and Ammonia 
Flow through Single-Subunit Pores 


Membrane channels can also be formed within a single subunit of a protein. The 
CIC channels (which transport chloride ions and are ubiquitous in cells from bacteria 
to animals) are homodimeric, with each subunit having two similar halves, with op- 
posite orientation in the membrane. The CIC pore is hourglass shaped, with a 15 A 
selectivity filter in the middle (Figure 9.47a). The filter contains three Cl -binding 
sites, with coordination from Tyr and Ser hydroxyls and several peptide backbone 
NH groups. The Cl” binding site nearest the extracellular solution can be occupied 
either by a Cl" ion or by a glutamate carboxyl group. With the glutamate carboxyl in 
place, the pore is closed, but an increase in Cl” concentration can displace the Glu 
side chain, with Cl” binding to this position and opening the pore. Thus, this Cl~ 
channel is chloride gated. 

Channel proteins often solve chemical and thermodynamic problems in innovative 
ways. Ion selectivity, for example, requires that ions be dehydrated in the channel, and 
dehydration is energetically expensive. Binding sites have to compensate for the ener- 
getic cost of dehydration by providing favorable compensatory interactions between the 
ion and the binding amino acid residues. The ammonia transport channel solves a 
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Tetramer 


Figure 9.47 Structures of channels for 

(a) chloride, (b) ammonia, and (c) glycerol. All 
structures are axial views. (a) The CIC chloride 
channel from E. coli (PDB id = 1OTS). (b) The 
AmtB ammonia channel from Æ. coli with four 
bound NH,* (PDB id = 1U7G). (c) The GlpF 
glycerol channel from E. coli, with bound glycerol 
(PDB id = 1FX8). 
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different problem. Ammonia is a gas, but the protonated ammonium ion, NH,", is the 
species that diffuses to the channel opening. The transport channel in this case is a 
hydrophobic pore 20 A in length (Figure 9.47b). The hydrophobic character of the 
channel lowers the pK, of ammonium from its normal 9.25 to less than 6, facilitating 
the transport of NH; but not the monovalent cation, NH,*. In the narrow hydrophobic 
channel, His! and His?!’ line the pore and stabilize three NH, molecules through 
hydrogen bonding. On either side of the narrow pore, broad vestibules contain NH; in 
equilibrium with NH,*. 

Another example of selectivity for the transported species is the tetrameric glycerol 
channel GIpF from E. coli, with a transport pore in each monomer. Six transmembrane 
helices and two half-membrane-spanning helices form a right-handed helical bundle 
around each channel (Figure 9.47c). Glycerol molecules taken up by an £. coli cell first 
enter a 15 A vestibule in the transport protein, becoming progressively dehydrated be- 
fore entry into a 28 A amphipathic channel and selectivity filter. The channel accom- 
modates three glycerol molecules, lined up in a single file, with their alkyl backbones 
wedged against a hydrophobic corner and their hydroxyl groups forming hydrogen 
bonds with the side chains of channel residues. 

The aquaporin water channels are closely related to the GlpF glycerol channel, with 
tetrameric structures and similar right-handed helical bundles forming transport 
channels. Selection for water or glycerol in these proteins is based on subtle differences 
in the selectivity filters within the transport channels. For example, the Phe and Trp resi- 
dues that comprise the hydrophobic corner surrounding the alkyl moiety of the middle 
glycerol site in GIpF are replaced by a His residue in the corresponding water-binding 
site in aquaporin Apql. 


9.8 | How Does Energy Input Drive Active 
Transport Processes? 


Passive and facilitated diffusion systems are relatively simple, in the sense that the 
transported species flow downhill energetically, that is, from high concentration to 
low concentration. However, other transport processes in biological systems must 
be driven in an energetic sense. In these cases, the transported species move from 
low concentration to high concentration, and thus the transport requires energy 
input. As such, it is considered active transport. The most common energy input is 
ATP hydrolysis, with hydrolysis being tightly coupled to the transport event. Other 
energy sources also drive active transport processes, including /ight energy and the 
energy stored in ion gradients. The original ion gradient is said to arise from a 
primary active transport process, and the transport that depends on the ion gradient 
for its energy input is referred to as a secondary active transport process (see later 
discussion of the E. coli proton—drug exchanger). When transport results in a net 
movement of electric charge across the membrane, it is referred to as electrogenic 
transport. If no net movement of charge occurs during transport, the process is 
electrically neutral. 


9.8a All Active Transport Systems Are Energy-Coupling Devices 


Hydrolysis of ATP is essentially a chemical process, whereas movement of species 
across a membrane is a mechanical process (that is, movement). An active transport 
process that depends on ATP hydrolysis thus couples chemical free energy to mechan- 
ical (translational) free energy. The bacteriorhodopsin protein in H. halobium couples 
light energy and mechanical energy. Oxidative phosphorylation involves coupling 
between electron transport, proton translocation, and the capture of chemical energy 
in the form of ATP synthesis. Similarly, the overall process of photosynthesis amounts 
to a coupling between captured light energy, proton translocation, and chemical 
energy stored in ATP. 
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9.8b Many Active Transport Processes Are Driven by ATP 


Active transport processes use the energy of hydrolysis of ATP to drive the transport 
of ions, sugars, and amino acids. This section describes some of those processes. 


Monovalent Cation Transport: Na*,K*-ATPase All animal cells actively extrude Na* 
ions and accumulate K* ions. These two transport processes are driven by Nat,Kt- 
ATPase, also known as the sodium pump, an integral protein of the plasma mem- 
brane. Most animal cells maintain cytosolic concentrations of Na* and K* of 10 mM 
and 100 mM, respectively. The extracellular milieu typically contains about 100 mM 
to 140 mM Na* and 5 mM to 10 mM K*. Potassium is required within the cell to 
activate a variety of processes, whereas high intracellular sodium concentrations are 
inhibitory. The transmembrane gradients of Na* and K* and the resulting gradients 
of Cl” and other ions provide the means by which neurons communicate. They also 
serve to regulate cellular volume and shape. Animal cells also depend upon these 
Nat and K* gradients to drive transport processes involving amino acids, sugars, 
nucleotides, and other substances. In fact, maintenance of these Na* and K* gradi- 
ents consumes large amounts of energy in animal cells—20% to 40% of total meta- 
bolic energy in many cases and up to 70% in neural tissue. 

The Na*- and K*-dependent ATPase comprises three subunits: an a-subunit of 
1016 residues (120 kD), a 35 kDa B-subunit, and a 6.5 kDa y-subunit. The sodium 
pump actively pumps three Na* ions out of the cell and two K* ions into the cell per 
ATP hydrolyzed: 


ATP* + HO + 3 Na*(inside) + 2 K*(outside) —» ADP?” + H PO + 
3 Na‘ (outside) + 2 K*(inside) (9.3) 


ATP hydrolysis occurs on the cytoplasmic side of the membrane (Figure 9.48), and the 
net movement of one positive charge outward per cycle makes the sodium pump elec- 
trogenic in nature. 
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Figure 9.48 The structure of the Na*,K*-ATPase (a) A schematic diagram of the Na*,K*-ATPase of the mammalian 


Aspartyl phosphate (Asp 


Ouabain 


369 
) 


plasma membrane. ATP hydrolysis occurs on the cytoplasmic side of the membrane, Na* ions are transported out of the 


cell, and K* ions are transported in. The transport stoichiometry is three Na* out and two K* in per ATP hydrolyzed. 


Ouabain and other cardiac glycosides inhibit Na*,K*-ATPase by binding on the extracellular surface of the pump protein. 


(b) Structure of the Na*,K*-ATPase, showing the a-subunit, residues 28-73 of the B-subunit (gray) and the transmembrane 


helix (residues 23-51, yellow) of the y-subunit (PDB id = 3N23). Ouabain and Asp*®-P are shown as space-filling spheres. 
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The a-subunit of Na*,K*-ATPase consists of 10 transmembrane a-helices, with 
three cytoplasmic domains, denoted A, P, and N. A large cytoplasmic loop between 
transmembrane helices four and five forms the P (phosphorylation) and N (nucleotide- 
binding) domains. The crystal structure of the enzyme reveals the inhibitor ouabain 
bound on the extracellular surface of the protein (Figure 9.48). 

A minimal mechanism for Na*,K*-ATPase postulates that the enzyme cycles 
between two principal conformations, denoted E; and E, (Figure 9.49). E; has a high 
affinity for Na* and ATP and is rapidly phosphorylated at Asp*® during ATP hydro- 
lysis in the presence of Mg’* to form E,-P, a state that contains three occluded Na* 
ions (occluded in the sense that they are tightly bound and not easily dissociated from 
the enzyme in this conformation). A conformation change yields E-P, a form of the 
enzyme with relatively low affinity for Na* but a high affinity for K*. This state 
releases three Na* ions and binds two K* ions on the outside of the cell. Dephos- 
phorylation leaves E,K>, a form of the enzyme with two occluded K* ions. A confor- 
mation change, which appears to be accelerated by the binding of ATP (with a 
relatively low affinity), releases the bound K* inside the cell and returns the enzyme 
to the E; state. Enzyme forms with occluded cations represent states of the enzyme 
with cations bound in the transport channel. The alternation between high and low 
affinities for Nat, K*, and ATP serves to tightly couple the hydrolysis of ATP and 
ion binding and transport. 


Na*,Kt-ATPase Is Inhibited by Cardiotonic Steroids Certain plant and animal steroids 
such as ouabain (Figure 9.50) specifically inhibit Nat,K*-ATPase and ion transport. 
These substances are traditionally referred to as cardiac glycosides or cardiotonic 
steroids, both names derived from the potent effects of these molecules on the heart. 
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D At x, = @ ate = @ at si, Da é-n—B dn 
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o- 
Aspartyl-phosphate (E-P) intermediate 


Figure 9.49 A mechanism for Na*,K*-ATPase. (a) The model assumes two principal conformations, 
E; and E). Binding of Na* ions to E; is followed by phosphorylation and release of ADP. Na* ions 
are transported and released, and K* ions are bound before dephosphorylation of the enzyme. 
Transport and release of K* ions complete the cycle. (b) The structure of the E-P intermediate formed 
in P-type ATPases. Residue Asp*” in the sodium pump attacks the y-P of 5’-ATP to form an aspartyl 
phosphate (more appropriately described as a phosphoryl aspartate) in the mechanism of the 
Na*,K*-ATPase. 


OH H 


Strophanthidin Digitoxigenin Ouabain 


Figure 9.50 The structures of several cardiotonic steroids. The lactone rings are yellow. 
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These molecules all possess a cis-configuration of the C-D ring junction, an unsatu- 
rated lactone ring (five or six membered) in the B-configuration at C-17, and a B-OH 
at C-14. There may be one or more sugar residues at C-3. The sugars are not required 
for inhibition, but do contribute to water solubility of the molecule. Cardiotonic 
steroids bind exclusively to the extracellular surface of Nat,K*-ATPase when it is in 
the E,-P state, forming a very stable E,-P (cardiotonic steroid) complex. 

Medical researchers studying high blood pressure have consistently found that 
people with hypertension have high blood levels of an endogenous Na*,K*-ATPase 
inhibitor. In such patients, inhibition of the sodium pump in the cells lining the blood 
vessel wall results in accumulation of sodium and calcium in these cells and the narrow- 
ing of the vessels to create hypertension. An 8-year study aimed at the isolation and 
identification of the agent responsible for these effects by researchers at the University 
of Maryland Medical School and the Upjohn Laboratories in Michigan yielded a sur- 
prising result. Mass spectrometric analysis of compounds isolated from many hundreds 
of gallons of blood plasma has revealed that the hypertensive agent is ouabain itself! 
About half of patients with hypertension have elevated levels of endogenous ouabain 
(EO), even with normal salt intake, and short-term periods of high dietary salt raise 
EO in these individuals. On the other hand, normal individuals suppress the high EO 
transient triggered by high-salt diets and avoid hypertension. Elevated EO and hyper- 
tension are related to poor modulation of EO biosynthesis (mainly in the adrenal 
glands), altered clearance from the blood, or both. 


Calcium Transport: Ca2+-ATPase Calcium, an ion acting as a cellular signal in virtually 
all cells, plays a special role in muscles. It is the signal that stimulates muscles to con- 
tract. In the resting state, the levels of Ca** near the muscle fibers are very low 
(approximately 0.1 uM), and nearly all of the calcium ion in muscles is sequestered 
inside a complex network of vesicles called the sarcoplasmic reticulum, or SR. Nerve 
impulses induce the SR membrane to quickly release large amounts of Ca?*, with 
cytosolic levels rising to approximately 10 uM. At these levels, Ca** stimulates con- 
traction. Relaxation of the muscle requires that cytosolic Ca** levels be reduced to 
their resting values. This is accomplished by an ATP-driven Ca** transport protein 
known as the Ca?*-ATPase, which bears many similarities to the Na*,K*t-ATPase. It 
has an a-subunit of the same approximate size, it forms a covalent E-P intermediate 
(at Asp**!) during ATP hydrolysis, and its mechanism of ATP hydrolysis and ion trans- 
port is similar in many ways to that of the sodium pump. 

The structure of the Ca?*-ATPase includes a transmembrane (M) domain consisting 
of 10 a-helical segments and a large cytoplasmic domain that itself consists of a 
nucleotide-binding (N) domain, a phosphorylation (P) domain, and an actuator (A) 
domain (Figure 9.51). 

The calcium transport cycle begins with binding of two Ca?* ions. Subsequent ATP 
binding causes a 90° rotation of N and a 30° rotation of A, thus joining all three cyto- 
plasmic domains (N, A, and P), and pulling a transmembrane helix partly out of the 
membrane. Phosphorylation of Asp**!, dissociation of ADP, and conversion of the 
E,-P state to E)-P induce a 110° rotation of A and a rearrangement of the transmem- 
brane domain, which acts like a piston to release Ca?* inside the SR. A TGES sequence 
in A (residues 181 to 184) then guides nucleophilic attack of water on E;-P, releasing 
phosphate and restoring the original structures of both the transmembrane and the 
cytoplasmic domains of the enzyme. 


The Gastric H*,Kt-ATPase Production of protons is a fundamental activity of cel- 
lular metabolism, and proton production plays a special role in the stomach. The 
highly acidic environment of the stomach is essential for the digestion of food in 
all animals. The pH of the stomach fluid is normally 0.8 to 1. The pH of the pari- 
etal cells of the gastric mucosa in mammals is approximately 7.4. This represents 
a pH gradient across the mucosal cell membrane of 6.6, the largest known trans- 
membrane gradient in eukaryotic cells. This enormous gradient must be main- 
tained constantly so that food can be digested in the stomach without damage to 
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Figure 9.51 The transport cycle of the sarcoplasmic reticulum Ca?*-ATPase involves at least five 
different conformations of the protein. The states shown here are E,-2Ca** (PDB id = 3B9B); 
E,-ATP:2Ca?* (PDB id = 1SU4); E,-P-ADP-2Ca** (PDB id = 1T5C); E,-Pi (PDB id = 2ZBD); and 
E» (PDB id = 2EAR). Blue-shaded states in the reaction sequence correspond to adjacent structures. 


the cells and organs adjacent to the stomach. The gradient of H* is maintained by 
an Ht,K*-ATPase, which uses the energy of hydrolysis of ATP to pump H* out of 
the mucosal cells and into the stomach interior in exchange for K* ions. This 
transport is electrically neutral, and the K* that is transported into the mucosal 
cell is subsequently pumped back out of the cell together with Cl” in a second 
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A Deeper Look 


Cardiac Glycosides: Potent Drugs from Ancient Times 


The cardiac glycosides have a long and colorful history. Many species 
of plants producing these agents grow in tropical regions and have been 
used by South American and African indigenous groups to prepare 
poisoned arrows used in fighting and hunting. Zulus in South Africa, 
for example, have used spears tipped with cardiac glycoside poisons. 
The sea onion, found commonly in southern Europe and northern 
Africa, was used by the Romans and the Egyptians as a cardiac stimu- 
lant, diuretic, and expectorant. The Chinese have long used a medicine 
made from the skins of certain toads for similar purposes. Cardiac gly- 
cosides are also found in several species of domestic plants, including 
the foxglove, lily of the valley, oleander (figure, part a), and milkweed 
plants. Monarch butterflies (figure, part b) acquire these compounds 
by feeding on milkweed and then storing the cardiac glycosides in their 
exoskeletons. Cardiac glycosides deter predation of monarch butterflies 
by birds, which learn by experience not to feed on monarchs. 


In 1785, the physician and botanist William Withering described 
the medicinal uses for agents derived from the foxglove plant. In 
modern times, digitalis (a preparation of dried leaves prepared from 
the foxglove, Digitalis purpurea) and other purified cardiotonic 
steroids have been used to increase the contractile force of heart 
muscle, to slow the rate of beating, and to restore normal function 
in hearts undergoing fibrillation (a condition in which heart valves 
do not open and close rhythmically but rather remain partially 
open, fluttering in an irregular and ineffective way). Inhibition of 
the cardiac sodium pump increases the intracellular Na* concen- 
tration, leading to stimulation of the Na*-Ca** exchanger, which 
extrudes sodium in exchange for inward movement of calcium. 
Increased intracellular Ca** stimulates muscle contraction. Careful 
use of digitalis drugs has substantial therapeutic benefit for patients 
with heart problems. 


(a) Cardiac glycoside inhibitors of Nat,K*-ATPase are 
produced by many plants, including foxglove, lily of the valley, 
milkweed, and oleander (shown here). (b) The monarch 
butterfly avoids predation by birds by concentrating cardiac 
glycosides in its exoskeleton. 


Tony Campbell/Shutterstock.com 


iofoto/Shutterstock.com 


(a) Oleander 


(b) Monarch butterfly 


electroneutral process (Figure 9.52). Thus, the net transport effected by these two 
systems is the movement of HCl into the interior of the stomach. (Only a small 
amount of K* is needed because it is recycled.) The H*,K*-ATPase bears many 
similarities to the plasma membrane Na*t,K*-ATPase and the SR Ca?*-ATPase 
described earlier. It has a similar molecular weight, it forms an E-P intermediate 
(Asp*8>), and many parts of its peptide sequence are homologous with the Na*,K*- 
ATPase and Ca**-ATPase. 


Bone Remodeling by Osteoclast Proton Pumps Other proton-translocating ATPases 
exist in eukaryotic and prokaryotic systems. Vacuolar proton-transporting ATPases 
(V-type ATPases) are found in vacuoles, lysosomes, endosomes, Golgi, chromaffin 
granules, and coated vesicles. Various H*-transporting ATPases occur in yeast and 
bacteria as well. H*-transporting ATPases found in osteoclasts (multinucleate cells 
that break down bone during normal bone remodeling) provide a source of circulat- 
ing calcium for soft tissues such as nerves and muscles. About 5% of bone mass in 
the human body undergoes remodeling at any given time. Once growth is complete, 
the body balances formation of new bone tissue by cells called osteoblasts with re- 
sorption of existing bone matrix by osteoclasts. Osteoclasts possess proton pumps— 
which are in fact V-type ATPases—on the portion of the plasma membrane that 
attaches to the bone. This region of the osteoclast membrane is called the ruffled 
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Figure 9.53 Proton pumps cluster on the ruffled 
border of osteoclast cells and function to pump 
protons into the space between the cell membrane 
and the bone surface. High proton concentration in 
this space dissolves the mineral matrix of the bone. 
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Figure 9.52 The H*,K*-ATPase of gastric mucosal cells mediates proton transport into the stomach. 
(a) Potassium ions are recycled by means of an associated K*/Cl~ cotransport system. The action of 
these two pumps results in net transport of H+ and CI” into the stomach. (b) Structure of the gastric 
H*,K*-ATPase, showing the phosphorylation site (Asp**°) and the A, N, and P domains of the 
a-subunit. The b-subunit is gray. (PDB id = SYLU). (c) The proton-binding ability (i.e., the pKa) of 
the Glu” carboxyl group is lowered substantially because of the close approach of Glu”>, the H 
bond to Asn”, and a salt bridge with Lys”!, enabling proton release, even at pH 1 in the gastric 
lumen. 

border. The osteoclast attaches to the bone in the manner of a cup turned upside 
down on a saucer (Figure 9.53), leaving an extracellular space between the bone 
surface and the cell. The H*-ATPases in the ruffled border pump protons into this 
space, creating an acidic solution that dissolves the bone mineral matrix. Bone min- 
eral consists mainly of poorly crystalline hydroxyapatite [Cajo(PO4)s(OH)2] with 
some carbonate (HCO; ) replacing OH or PO," in the crystal lattice. Transport of 
protons out of the osteoclasts lowers the pH of the extracellular space near the bone 
to about 4, dissolving the hydroxyapatite. 


9.8c ABC Transporters Use ATP to Drive Import and Export 
Functions and Provide Multidrug Resistance 


The word cell is from the Latin cella, meaning a “small room.” These small rooms 
are kept neat and tidy by special membrane transporters known as multidrug resis- 
tance (MDR) efflux pumps, often referred to as molecular vacuum cleaners. MDR 
pumps export cellular waste molecules and toxins as well as drugs that find their 
way into cells in various ways. Bacteria also have influx pumps, which bring essen- 
tial nutrients (for example vitamin B,,) into the cell (Figure 9.54). At least five fami- 
lies of influx and efflux pumps are known, among them the ABC transporters. In 
eukaryotes, ABC transporters are problematic because they export potentially ther- 
apeutic drugs (Figure 9.55) from cancer cells, so chemotherapy regimens must be 
changed often to avoid rejection of the beneficial drugs. 

All ABC transporters consist of two transmembrane domains (TMDs), which form 
the transport pore, and two cytosolic nucleotide-binding domains (NBDs) that bind 
and hydrolyze ATP. The TMDs and NBDs are separate subunits (thus composing a 
tetramer) in bacterial ABC importers (Figure 9.56). Bacterial exporters, on the other 
hand, are homodimers, with each monomer made up of an N-terminal TMD and a 
C-terminal NBD. Eukaryotic ABC exporters are monomeric, with all four necessary 
domains in a single polypeptide chain. 
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Figure 9.54 Influx pumps in the inner membrane of gram-negative bacteria bring nutrients into the 
cell, whereas efflux pumps export cellular waste products and toxins. (Adapted from Garmory, 

H. S., and Titball, R. W., 2004. ATP-binding cassette transporters are targets for the development of 
antibacterial vaccines and therapies. Infection and Immunity 72:6757-6763.) 
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Figure 9.55 Some of the cytotoxic cancer drugs that are transported by the MDR ATPase. 


The NBDs of ABC transporters from nearly all sources are similar in size, sequence, 
and structure. The TMDs, on the other hand, vary considerably in sequence, architec- 
ture, and number of transmembrane helices. ABC exporters contain a conserved core of 
12 transmembrane helices, whereas ABC importers can have between 10 and 20 trans- 
membrane helices. A variety of studies show that human MDR ATPases are similar to 
Sav1866, an exporting ABC transporter from S. aureus, and Sav1866 is considered to be 
a good model for the architecture of all ABC exporters. 
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Figure 9.56 Several ABC transporters are shown in 
different stages of their transport cycles. Left to 
right: PDB id = 1L7V, PDB id = 2QI9, PDB id = 
2NQ2. (a) MBP is a multidrug binding protein, 
which binds molecules to be transported and delivers 
them to the transport channel. (b) MBP has bound 
to the transport channel in the middle structure and 
delivered the transported entity to the channel. 

(c) Transport is complete. 
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The structures of several ABC transporters, in different stages of the transport cycle, 
provide a picture of how ATP binding and hydrolysis by the NBDs might be coupled to 
import and export of molecules (Figure 9.56). The TMDs can cycle from inward-facing 
to outward-facing conformations and back again, whereas the NBDs alternate between 
open and closed states. In all ABC transporters, a short “coupling helix” lies at the 
interface between each NBD and its corresponding TMD. Binding of ATP induces 
closing, or joining of the NBD domains, bringing the coupling helices 10 Å to 15 Å 
closer to each other than in the ATP-free state. The merger of the coupling helices in 
turn triggers a flip-flop of the TMDs from the inward-facing to the outward-facing 
conformation. In this state, ABC exporters release bound drugs to the extracellular en- 
vironment, whereas ABC importers accept substrate molecules from their associated 
substrate-binding proteins. Following ATP hydrolysis, release of ADP and inorganic 
phosphate allows the TMD to revert to its inward-facing conformation, where import- 
ers can release their substrates into the cytosol and exporters can bind new substrates to 
be exported. 


9.9 | How Are Certain Transport Processes 
Driven by Light Energy? 


As noted previously, certain biological transport processes are driven by light en- 
ergy rather than by ATP. Two well-characterized systems are bacteriorhodopsin, the 
light-driven H*-pump, and halorhodopsin, the light-driven CI pump, of H. halo- 
bium, an archaeon that thrives in high-salt media. H. halobium grows optimally at an 
NaClconcentration of4.3 M. It was extensively characterized by Walther Stoeckenius, 
who found it growing prolifically in the salt pools near San Francisco Bay, where salt 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


9.9 How Are Certain Transport Processes Driven by Light Energy? 


is commercially extracted from seawater. H. halobium carries out normal respiration 
if oxygen and metabolic energy sources are plentiful. However, when these substrates 
are lacking, H. halobium survives by using bacteriorhodopsin to capture light en- 
ergy. In oxygen- and nutrient-deficient conditions, purple patches appear on the sur- 
face of H. halobium. These purple patches of membrane are comprised of 75% 
protein—specifically bacteriorhodopsin (bR). The purple color arises from a retinal 
molecule that is covalently bound in a Schiff base linkage with an e-NH) group of 
Lys?! on each bacteriorhodopsin protein (Figure 9.57). Bacteriorhodopsin is a 
26 kDa transmembrane protein that packs so densely in the membrane that it 
naturally forms a two-dimensional crystal in the plane of the membrane. The retinal 
moiety lies parallel to the membrane plane, about 1 nm below the membrane’s outer 
surface (Figure 9.13). 

Light energy drives transport of protons (H*) through bacteriorhodopsin, provid- 
ing energy for the bacterium in the form of a transmembrane proton gradient. Pro- 
tons hop from site to site across bacteriorhodopsin, just as a person crossing a creek 
would jump from one stepping stone to another. The stepping stones in bacteriorho- 
dopsin are the carboxyl groups of Asp™%, Asp**, Glu, Glu!4, and the Schiff base 
nitrogen of the retinal chromophore (Figure 9.58). The numbered sequence of these 
steps (Figure 9.58) involves dramatic changes in pK, values for the stepping 
stone groups, driven by conformation changes in the protein itself. In Step 1, the pK, 
of the Schiff base nitrogen drops from 13.5 to about 8.5, enabling this group to donate 
a proton to Asp®, whose pK, value increases from 2.2 to more than 11, enabling it to 
accept the proton from the Schiff base. Similarly, the pK, of Asp® is above 10 at the 
start of the cycle, ensuring that it will be protonated, but in Step 3, this pK, decreases 
to less than 7, enabling it to donate a proton to the Schiff base, whose pK, has returned 
to its high value above 13. Thus, light-induced protein conformation changes drive 
changes in protonation state of the stepping stone groups, facilitating proton transport 
across the membrane and out of the cell. 
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Figure 9.57 The Schiff base linkage between the 
retinal chromophore and Lys?!6, 


Figure 9.58 The mechanism of proton transport by 
bacteriorhodopsin, showing the locations of key 
residues along the proton-translocation channel and 
the steps of the transport process. Step 1: 

A proton is transferred from the retinal Shiff base 
to Asp**; Step 2: A proton is released to the 
extracellular space from the proton release group 
consisting of Glu!**, Glu, and a water molecule; 
Step 3: Asp” donates a proton to reprotonate the 
retinal Schiff base; Step 4: A proton is taken up 
from the cytoplasm by Asp”, causing the all-trans 
retinal to be isomerized to the 13-cis configuration— 
the structural change that drives the subsequent 
steps; Step 5: Proton transfer from Asp*® converts 
the 13-cis retinal back to the all-trans resting state 
(PDB id = 1M0M). 
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Figure 9.59 A tripartite (three-part) complex of 
proteins comprises the large structure in E. coli that 
exports waste and toxin molecules. The transport 
pump is AcrB, embedded in the bacterial inner 
membrane. The rest of the channel is composed of 
TolC, embedded in the bacterial outer membrane, 
and a ring of AcrA subunits, which links AcrB and 
TolC. (Adapted from Lomovskaya, O., Zgurskaya, 
H. I., et al., 2007. Waltzing transporters and ‘the 
dance macabre’ between humans and bacteria. 
Nature Reviews Drug Discovery 6:56-65.) 


Chapter 9 Membranes and Membrane Transport 


9.10 | How Is Secondary Active Transport Driven 


by lon Gradients? 


The gradients of Ht, Na*, and other cations and anions established by ATPases 
and other energy sources can be used for secondary active transport of various 
substrates. 


9.10a Nat and H+ Drive Secondary Active Transport 


The best-understood secondary active transport systems use Na* or H* gradients to 
transport amino acids and sugars in certain cells. Many of these systems operate as 
symports, with the ion and the transported amino acid or sugar moving in the same 
direction (that is, into the cell). In antiport processes, the ion and the other transported 
species move in opposite directions. (For example, the anion transporter of erythrocytes 
is an antiport.) Proton symport proteins are used by E. coli and other bacteria to 
accumulate lactose, arabinose, ribose, and a variety of amino acids. E. coli also possesses 
Na*-symport systems for melibiose as well as for glutamate and other amino acids. 


9.10b AcrB Is a Secondary Active Transport System 


The ABC transporters described in Section 9.8 are just one of five different families 
of multidrug resistance transporters. AcrB, the major MDR transporter in E. coli, is 
responsible for pumping a variety of molecules including drugs such as erythromycin, 
tetracycline, and the B-lactams (for example, penicillin). AcrB is part of a large tripar- 
tite complex that bridges the E. coli inner and outer membranes and spans the entire 
periplasmic space (Figure 9.59). AcrB works with its partners, AcrA and TolC, to 
transport drugs and other toxins from the cytoplasm across the entire cell envelope 
and into the extracellular medium. 

AcrB is a secondary active transport system and an H*-drug antiporter. As protons 
flow spontaneously inward through AcrB in the E. coli inner membrane, drug molecules 
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Figure 9.60 In the AcrB trimer, the three identical subunits adopt three different conformations. The 
“loose” L state (blue), the “tight” T state (yellow), and the “open” O (dusty rose) state are indicated. 
Possible transport paths of drugs through the tunnels are shown in green. Tunnel 1 is lined with 
hydrophobic residues and is the likely point of entrance for drugs in the membrane bilayer. Tunnel 2 may 
serve either as an entrance port for water-soluble drugs or as an exit channel for nonsubstrates. Tunnel 3 
is the exit pathway. Tunnels 1 and 2 converge at the hydrophobic substrate binding pocket, where 
minocycline (an antibiotic similar to tetracycline) is bound in a hydrophobic pocket defined by 
phenylalanines 136, 178, 610, 615, 617, and 628; valines 139 and 612; isoleucines 277 and 626; and 
tyrosine 327. (Inset—all shown in spacefill. Minocycline is shown in stick and wireframe.) Panels A and B 
represent one step in a L-T-O (or T-L-O) transport cycle. (Image kindly provided by Klaas Martinus Pos.) 
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Figure 9.61 A model for drug transport by AcrB involves three possible conformations—loose (L, blue), 
tight (T, green), and open (O, pink)—for each of the three identical monomer subunits of the complex. 
The lateral grooves in L and T indicate low affinity and high affinity binding of drugs, respectively. The 
circle in the O state indicates that there is no drug binding in this state. Drugs to be transported (such as 
acridine, shown here) bind first to the L state. A conformational change to the T state moves the drug 
deeper into the tunnel, and a second conformation change opens the tunnel to the opposite side of the 
membrane, followed by release of the drug molecule. Binding, transport, and release of H* drives the 
drug transport cycle. (Adapted from Seeger, M., Schiefner, A., et al., 2006. Structural asymmetry of AcrB 
trimer suggests a peristaltic pump mechanism. Science 313:1295—1298; and Eicher, T., Seeger, M. A., 
Anselmi, C., et al., 2014. Coupling of remote alternating-access transport mechanisms for protons and 
substrates in the multidrug efflux pump AcrB. eLife 3:e03145.) 


are driven outward. AcrB is a homotrimer of large, 1100-residue subunits. Remarkably, 
the three identical subunits adopt slightly different conformations, denoted loose (L), 
tight (T), and open (O). Transported drug molecules enter AcrB through a tunnel that 
starts in the periplasmic space, about 15 A above the inner membrane, and ends at the 
trimer center (Figure 9.60). The three conformations of the AcrB monomers are three 
consecutive states of a transport cycle. As each monomer cycles through the L, T, and 
O states, drug molecules enter the tunnel, are bound, and then are exported (Figure 9.61). 
This three-step rotation has been likened to a Viennese waltz, and AcrB has been dubbed 
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a waltzing pump by Olga Lomovskaya and colleagues. 


Foundational Biochemistry Things You Should Know after Reading Chapter 9. 


The four classes of lipid-anchored membrane proteins include 
amide-linked myristoyl anchors, acyl- and thioester-linked fatty acyl 
anchors, thioether-linked prenyl anchors, and amide-linked glycosyl 
phosphatidylinositol anchors. 


Biological membranes are characterized by lateral and transverse 


Phosphatidylcholine and sphingomyelin are enriched in the outer 
leaflet of animal plasma membranes. 


Phosphatidylserine, phosphatidylethanolamine, phosphatidic acid, 
and phosphatidylinositol are found mainly in the inner leaflet of 
animal plasma membranes. 


Lipids and proteins undergo a variety of movements in membranes. 


Flippases transport phosphatidylserine and phosphatidylethanol- 
amine from the outer leaflet to the inner leaflet of animal plasma 


Floppases transport cholesterol, phosphatidylcholine, and sphingo- 
myelin from the inner leaflet to the outer leaflet of animal plasma 


Scramblases randomize lipids across the bilayer and degrade trans- 
verse membrane asymmetry. 


Membrane bilayer lipids form a highly-ordered gel phase (the S, state) 
at low temperatures and a more disordered liquid-crystalline phase 


e Lipid and protein composition of membranes varies between tissues ® 
and subcellular organelles. 
e Lipids spontaneously form a variety of stable structures in water, 
including micelles, vesicles, and bilayers. 
e 
e The critical micelle concentration describes the concentration at asymmetries 
which a detergent molecule begins forming micelles in water. y f 
e 
© The Singer-Nicolson fluid mosaic model describes the motion inher- 
ent in bilayer membranes and defines the characteristics of integral 
and peripheral membrane proteins. s 
e Integral proteins are strongly imbedded in membranes, whereas pe- 
ripheral proteins are loosely imbedded and are easily removed from 
the membrane. d 
e Integral membrane proteins may be monotopic, bitopic, or polytopic. ® 
© The cores of integral membrane proteins are dominated by a-helices enbra 
and B-sheets. : 
e 
e Hydropathy plots can be used to identify putative a-helical trans- 
membrane segments in membrane proteins. 
membranes. 
e Nonpolar amino acids are most often found in the hydrophobic core e 
of membrane proteins. 
e Polar amino acids of membrane proteins are most often found at the e 
lipid—water interface. 
e 


B-Barrel proteins traverse the outer membranes of gram-negative cells 
and mitochondria with antiparallel B-strands. 


(the Lg state) at higher temperatures. 
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The liquid-ordered state of bilayer lipids (Ls) is characterized by the 
high degree of acyl chain ordering of the S, state but the translational 
disorder of the La state. 


Membrane domains undergo dynamic exchange of lipids and proteins. 


Fusion of vesicles with the plasma membrane in animal cells is medi- 
ated by SNARE proteins. 


Passive diffusion, facilitated diffusion, and active transport are three 
modes of membrane transport. 


The free energy change for membrane transport can be calculated 
with terms corresponding to the chemical and electrical potential dif- 
ferences across a membrane. 


Channel proteins carry out voltage-gated and ligand-gated facilitated 
diffusion for ions and small molecules. 


The amino acids lining the transport channel of channel proteins can 
determine the selectivity for ions and small molecules. 


The energetic driving force for active membrane transport may be 
ATP hydrolysis, light energy, or the energy stored in an ion gradient. 


The P-type ATPases, including the plasma membrane Na‘*,K*- 
ATPase, sarcoplasmic reticulum Ca?*-ATPase, and the gastric H*,K*- 
ATPase, form covalent E-P intermediates during their transport cycle. 


ABC transporters use ATP to drive import and export functions in 
prokaryotic and eukaryotic cells. 


Bacteriorhodopsin uses light-induced protein conformation changes 
to facilitate proton transport across the plasma membrane of 
Halobacterium halobium. 


Secondary active transport systems use gradients of protons, cations, 
or anions established by transport ATPases to accomplish active 
transport of amino acids, sugars, and other entities. 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. 


Understanding the Occurrence of Natural Phospholipids In Problem 1 
(b) in Chapter 8, you were asked to draw all the possible phosphati- 
dylserine isomers that can be formed from palmitic and linolenic 
acids. Which of the PS isomers are not likely to be found in biological 
membranes? 


. Calculation of Phospholipid-to-Protein Ratios The purple patches of 


the Halobacterium halobium membrane, which contain the protein 
bacteriorhodopsin, are approximately 75% protein and 25% lipid. If 
the protein molecular weight is 26,000 and an average phospholipid 
has a molecular weight of 800, calculate the phospholipid-to-protein 
mole ratio. 


. Membrane Composition Folk wisdom has it that pregnant women 


should eat an egg every day. A single egg contains 187 mg of 
cholesterol. Based upon what you have learned regarding the 
composition of cell membranes, why does this recommendation for 
pregnant women make good sense? 


. Understanding the Densities of Membrane Components Sucrose 


gradients for separation of membrane proteins must be able to 

separate proteins and protein—lipid complexes with a wide range of 

densities, typically 1.00 g/mL to 1.35 g/mL. 

a. Consult reference books (such as the CRC Handbook of 
Biochemistry) and plot the density of sucrose solutions versus 
percent sucrose by weight (grams of sucrose per 100 grams of 
solution) and versus percent by volume (grams of sucrose per 
100 milliliters of solution). Why is one plot linear and the other 
plot curved? 

b. What would be a suitable range of sucrose concentrations for 
separation of three membrane-derived protein—lipid complexes 
with densities of 1.03 g/mL, 1.07 g/mL, and 1.08 g/mL? 


. Understanding Diffusion of Phospholipids in Membranes Phospholipid 


lateral motion in membranes is characterized by a diffusion coefficient 
of about 1 X 1078 cm?/s. The distance traveled in two dimensions 
(across the membrane) in a given time is r = (4Dr)', where r is the 
distance traveled in centimeters, D is the diffusion coefficient, and f is 
the time during which diffusion occurs. Calculate the distance traveled 
by a phospholipid across a bilayer in 10 ms (milliseconds). 


. Understanding Diffusion of Proteins in Membranes Protein lateral 


motion is much slower than that of lipids because proteins are larger 


10. 


11. 


than lipids. Also, some membrane proteins can diffuse freely through 
the membrane, whereas others are bound or anchored to other 
protein structures in the membrane. The diffusion constant for the 
membrane protein fibronectin is approximately 0.7 X 107 cm’/s, 
whereas that for rhodopsin is about 3 X 10~° cm/s. 


a. Calculate the distance traversed by each of these proteins in 10 ms. 


b. What could you surmise about the interactions of these proteins 
with other membrane components? 


Understanding the Phase Transitions of Membrane Phospholipids 
Discuss the effects on the lipid phase transition of pure dimyristoyl 
phosphatidylcholine vesicles of added (a) divalent cations, 

(b) cholesterol, (c) distearoyl phosphatidylserine, (d) dioleoyl 
phosphatidylcholine, and (e) integral membrane proteins. 


Determining the Free Energy of a Galactose Gradient Calculate the 
free energy difference at 25°C due to a galactose gradient across a 
membrane, if the concentration on side 1 is 2 mM and the concentra- 
tion on side 2 is 10 mM. 


Determining the Electrochemical Potential of a Sodium lon Gradient 
Consider a phospholipid vesicle containing 10 mM Na* ions. The 
vesicle is bathed in a solution that contains 52 mM Na‘ ions, and the 
electrical potential difference across the vesicle membrane Aw = 
Woutside — Winside = —30 mV. What is the electrochemical potential at 
25°C for Na* ions? 


Digoxin Digoxin (in Lanoxin and other formulations) is a commonly 
used drug. Look up the structure of digoxin and compare it to other 
molecules described in this chapter. Based on what you have learned 
in this chapter, what may be the therapeutic applications of this drug 
(diseases or conditions treated) and its mode of action? 


Assessing the Nature of Transmembrane Histidine Transport Trans- 
port of histidine across a cell membrane was measured at several 
histidine concentrations: 


[Histidine], uM Transport, ~mol/min 


25 42.5 

7 119 
16 272 
31 527 
72 1220 


Does this transport operate by passive diffusion or by facilitated 
diffusion? 
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12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


Determining the Concentration Limits of an Active Transport System 
(Integrates with Chapter 3.) Fructose is present outside a cell at 1 yM 
concentration. An active transport system in the plasma membrane 
transports fructose into this cell, using the free energy of ATP 
hydrolysis to drive fructose uptake. What is the highest intracellular 
concentration of fructose that this transport system can generate? 
Assume that one fructose is transported per ATP hydrolyzed; that 
ATP is hydrolyzed on the intracellular surface of the membrane; and 
that the concentrations of ATP, ADP, and P; are 3 mM, 1 mM, and 
0.5 mM, respectively. T = 298 K. (Hint: Refer to Chapter 3 to recall 
the effects of concentration on free energy of ATP hydrolysis.) 


Assessing the Energy Coupling of a Transport Process In this chapter, 
we have examined coupled transport systems that rely on ATP 
hydrolysis or on primary gradients of Na‘ or H*. Suppose you have 
just discovered an unusual strain of bacteria that transports rham- 
nose across its plasma membrane. Suggest experiments that would 
test whether it was linked to any of these other transport systems. 


Characterization of a Myristoyl Lipid Anchor Which of the following 
peptides would be the most likely to acquire an N-terminal myristoyl 
lipid anchor? 


a. VLIHGLEQN 
b. THISISIT 

c. RIGHTHERE 
d. MEMEME 

e. GETREAL 


Characterization of a Prenyl Lipid Anchor Which of the following 
peptides would be the most likely to acquire a prenyl anchor? 


a. RIGHTCALL 

b. PICKME 

c. ICANTICANT 
d. AINTMEPICKA 
e. 


None of the above 


Creating and Analyzing a Hydropathy Plot Online What would the 
hydropathy plot of a soluble protein look like, compared to those in 
Figure 9.14? Find out by creating a hydropathy plot. Go to: https:// 
web.expasy.org/protscale/. Enter the UniProtK B/Swiss-Prot accession 
number P61823 for bovine pancreatic ribonuclease in the accession 
number box and type Return. Scroll to the bottom of the page and 
enter “1” into the N-terminal box and “150” in the C-terminal box. 
Then click Submit. At the bottom of the next page, after a few 
seconds, you should see a hydropathy plot. How does the plot for 
ribonuclease compare to those in Figure 9.14? You should see a large 
positive peak at the left side of the plot. This is the signal sequence 
portion of the polypeptide. You can read about signal sequences in 
Section 31.3. 


Amino Acids in lon Channels Which amino acids would be most likely 
to be found in the interior of a cation channel? An anion channel? 
Explain. (Refer to Section 9.7a if needed.) 


lon Channel Selectively and Hydration Why could the hydration of 
ions make ion channels less selective? 


Assessing the Nature of Proline in Transmembrane @-Helices Proline 
residues are almost never found in short a-helices; nearly all trans- 
membrane a-helices that contain proline are long ones (about 

20 residues). Suggest a reason for this observation. 


Analyzing the Structure of Proline-Containing a-Helices As described 
in this chapter, proline introduces kinks in transmembrane a-helices. 
What are the molecular details of the kink, and why does it form? A 
good reference for this question is von Heijne, G., 1991. Proline kinks 
in transmembrane a-helices. Journal of Molecular Biology 218:499- 
503. Another is Barlow, D. J., and Thornton, J. M., 1988. Helix 
geometry in proteins. Journal of Molecular Biology 201:601-619. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 
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Problems 


Comparing Membrane Barrel Structures Compare the porin proteins, 
which have transmembrane pores constructed from B-barrels, with 
the Wza protein, which has a transmembrane pore constructed from a 
ring of a-helices. How many amino acids are required to form the 
B-barrel of a porin? How many would be required to form the 
same-sized pore from a-helices? 


Assessing the Structural Consequences of the Hop-Diffusion Model 
The hop-diffusion model of Akihiro Kusumi suggests that lipid 
molecules in natural membranes diffuse within fenced areas before 
hopping the molecular fence to an adjacent area. Study Figure 9.30 
and estimate the number of phospholipid molecules that would be 
found in a typical fenced area of local diffusion. For the purpose of 
calculations, you can assume that the surface area of a typical 
phospholipid is about 60 A?. 


Assessing the Energetic Consequences of Snorkeling in Membrane 
Proteins What are the energetic consequences of snorkeling for a 
charged amino acid? Consider the lysine residue shown in Figure 9.16. 
If the lysine side chain was reoriented to extend into the center of the 
membrane, how far from the center would the positive charge of the 
lysine be? The total height of the peak for the lysine plot in Figure 
9.15 is about 4kT, where k is Boltzmann’s constant. If the lysine side 
chain in Figure 9.16 was reoriented to face the membrane center, how 
much would its energy increase? How does this value compare with 
the classical value for the average translational kinetic energy of a 
molecule in an ideal gas (3/2kT)? 


Assessing the Dissociation Behavior of Aspartic Acid Residues in a 
Membrane As described in the text, the pK, values of Asp‘ and 
Asp” of bacteriorhodopsin are shifted to high values (more than 11) 
because of the hydrophobic environment surrounding these residues. 
Why is this so? What would you expect the dissociation behavior of 
aspartate carboxyl groups to be in a hydrophobic environment? 


Assessing the Dissociation Behavior of Lysine and Arginine Residues in 
a Membrane Extending the discussion from Problem 24, how would a 
hydrophobic environment affect the dissociation behavior of the side 
chains of lysine and arginine residues in a protein? Why? 


Analyzing the Nature of Light-Driven Proton Transport In the 
description of the mechanism of proton transport by bacteriorhodop- 
sin, we find that light-driven conformation changes promote 
transmembrane proton transport. Suggest at least one reason for this 
behavior. In molecular terms, how could a conformation change 
facilitate proton transport? 


Assessing How Transmembrane a-helices Are Different. Transmem- 
brane a-helices have a higher content of hydrophobic amino acids and 
a lower content of charged and highly polar residues than a-helices in 
soluble proteins. These differences have consequences for the structure 
of these helices. Consult the following references and summarize the 
novel structural features of transmembrane a-helices. (Cross, T. A., 
Sharma, M., Yi, M., and Zhou, H.-X., 2014. Influence of solubilizing 
environments on membrane protein structures. Trends in Biochemical 
Sciences 36:117—125; Page, R. C., Kim, S., and Cross, T. A., 2008. 
Transmembrane helix uniformity examined by spectral mapping of 
torsion angles. Structure 16:787—797.) 


Assessing the Survival of Monarch Butterflies Feeding on a Toxic 
Diet How are monarch butterflies able to feed on the toxic 
milkweed plant and store cardiac glycosides in their exoskeleton 
without suffering the ill effects of this potent poison? Consult the 
following reference and summarize the biochemical accommoda- 
tion that makes this possible. (Dobler, S., Dalla, S., Wagschal, V., 
and Agrawal, A. A., 2012. Community-wide convergent evolution 
in insect adaptation to toxic cardenolides by substitutions in the 
Na,K-ATPase. Proceedings of the National Academy of Sciences, 
USA 109:13040-13045. 


Exploring the Mechanism of Lipid Transport by a Flippase ATP-dependent 
flippases are closely related in their structures to ion pumps like the 
plasma membrane Na*, K*-ATPase and sarcoplasmic reticulum 
Ca?*-ATPase, but they contend with a novel dilemma not shared by 
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the ion transport systems. Consult the following reference and 
summarize the structural features common to the flippases and P-type 
ATPases, and describe how the flippase solves the giant substrate 
problem. (Vestegaard, A. L., Coleman, J. A., Lemmin, T., et al., 2014. 
Critical roles of isoleucine-364 and adjacent residues in a hydropho- 
bic gate control of phospholipid transport by the mammalian 
P4-ATPase ATP8A2. Proceedings of the National Academy of 
Sciences, USA 111:16629-16638.) 


30. Exploring the Conformational Transitions of an lon Transport Enzyme 
Structures of intermediate states of the sarcoplasmic reticulum 
Ca?*-ATPase have begun to reveal the conformation changes that 
convert ATP hydrolysis free energy into transport of Ca?* ions from 

the cytosol to the SR lumen. Consult the following reference and 

describe the details of piston movement by a critical transmembrane 
a-helix in the transport cycle of this ion pump. (Kanai, R., Ogawa, 

H., Vilsen, B., et al., 2013. Crystal structure of a Nat-bound Na*,K*- 

ATPase preceding the E1P state. Nature 502:201—207; Moller, J. V., 

Olesen, C., Winther, A.-M., and Nissen, P., 2010. The sarcoplasmic 

Ca’*-ATPase: design of a perfect chemi-osmotic pump. Quarterly 

Review of Biophysics 43:501—5S66.) 


31. Exploring Lipopolysaccharide Transport in a Gram-Negative Bacterium 
Gram-negative bacteria have a cell wall which includes an outer 
membrane adorned with a tightly packed layer of lipopolysaccharide 
(LPS) molecules. How are these LPS molecules transported to the 
outer membrane? Consult the following references and describe the 
details of LPS transport and the membrane structures that are 
involved in this transport. (Bishop, R. E., 2014. Lipopolysaccharide 
rolls out the barrel. Nature 511:37-38; Dong, H., Xiang, Q., Gu, Y., 
et al., 2014. Structural basis for outer membrane lipopolysaccharide 
insertion. Nature 511:52—57; Qiao, S., Luo, Q., Zhao, Y., et al., 2014. 


Structural basis for lipopolysaccharide insertion in the bacterial outer 
membrane. Nature 511:108-111.) 


32. Understanding the Fluid Mosaic Model Singer and Nicolson’s fluid 
mosaic model of membrane structure presumed all of the following 
statements to be true EXCEPT: 


a. The phospholipid bilayer is a fluid matrix. 

b. Proteins can be anchored to the membrane by covalently linked 
lipid chains. 

c. Proteins can move laterally across a membrane. 

d. Membranes should be about 5 nm thick. 

e. Transverse motion of lipid molecules can occur occasionally. 


Think-Pair-Share Question 


Drug Diffusion through Patches Transdermal patches are designed to 
deliver a steady amount of drug from the patch through the skin layers 
into the general circulation. The diagram shows schematically this type 
of drug delivery system. 


Protective /adhesive layer 


The drug delivery process is driven by passive diffusion. With a fellow stu- 
dent, discuss as many parameters as you can think of that determine the 
rate of drug movement from its reservoir to the general circulation. 
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Nucleotides and 10 
Nucleic Acids 


We have discovered the secret of life! 


Proclamation by Francis H. C. Crick to patrons of 
The Eagle, a pub in Cambridge, England (1953) 


< Francis Crick (right) and James Watson 
(left) point out features of their model 
for the structure of DNA. 


A. Barrington Brown/Photo Researchers, Inc. 


Nucleotides and nucleic acids are substances that contain nitrogen bases (aromatic cyclic 10.1 What Are the Structure and Chemistry 
groups possessing nitrogen atoms) as part of their structure. Nucleotides are essential to of Nitrogenous Bases? 

cellular metabolism, and nucleic acids are the molecules of genetic information storage and 10.2 What Are Nucleosides? 

expression. What are the structures of the nucleotides? How are nucleotides joined together 10.3 What Are the Structure and Chemistry 
to form nucleic acids? How is information stored in nucleic acids? What are the biological of Nucleotides? 

functions of nucleotides and nucleic acids? 10.4 What Are Nucleic Acids? 


10.5 What Are the Different Classes of 
Nucleic Acids? 


ucleotides are biological molecules that possess a heterocyclic nitrogenous base, a 10.6 Are Nucleic Acids Susceptible to 

five-carbon sugar (pentose), and phosphate as principal components of their struc- Hydrolysis? 
ture. The biochemical roles of nucleotides are numerous; they participate as essential 
intermediates in virtually all aspects of cellular metabolism. Serving an even more cen- 
tral biological purpose are the nucleic acids, the elements of heredity and the agents of 
genetic information transfer. Just as proteins are linear polymers of amino acids, nucleic 
acids are linear polymers of nucleotides. Like the letters in this sentence, the orderly 
sequence of nucleotide residues in a nucleic acid can encode information. The two basic 
kinds of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The 
five-carbon sugar in DNA is 2-deoxyribose; in RNA, it is ribose. DNA is the repository 
of genetic information in cells, whereas RNA serves in the expression of this informa- 
tion through the processes of transcription and translation (Figure 10.1). An interesting 
exception to this rule is that some viruses have their genetic information stored as RNA. 

This chapter describes the chemistry of nucleotides and the major classes of nucleic 

acids. Chapter 11 presents methods for the determination of nucleic acid primary 
structure (nucleic acid sequencing) and describes the higher orders of nucleic acid 
structure. Chapter 12 introduces the molecular biology of genetic engineering: the 
construction and uses of novel DNA molecules assembled by combining segments 
from different DNA molecules. 
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The pyrimidine ring The purine ring system 


Figure 10.2 (a) The pyrimidine ring system; by 
convention, atoms are numbered as indicated. 


(b) The purine ring system, atoms numbered as shown. 


(1) Replication 
DNA replication yields two 
DNA molecules identical to the 
original one, ensuring transmission 
of genetic information to daughter 
cells with exceptional fidelity. 


(2) Transcription 
The sequence of bases in DNA is 
copied as a sequence of 
complementary bases in a single- 
stranded mRNA molecule. 


mRNA © 


tRNAs 
Translation P (3) Translation 
ome D Three-base codons on the mRNA 
mRNA / Rp corresponding to specific amino acids 
direct the sequence of building a 
8 protein. These codons are recognized 
by tRNAs (transfer RNAs) carrying the 
| appropriate amino acids. Ribosomes 
Attached are the machinery for protein 
amino acid synthesis. 
Growing 
peptide chain 


Protein Np 


Figure 10.1 The fundamental process of information transfer in cells. 


10.1 | What Are the Structure and Chemistry 
of Nitrogenous Bases? 


The bases of nucleotides and nucleic acids are derivatives of either pyrimidine or purine. 
Pyrimidines are six-membered heterocyclic aromatic rings containing two nitrogen 
atoms (Figure 10.2a). (In organic chemistry, heterocyclic ring structures contain atoms 
other than carbon atoms.) The atoms are numbered in a clockwise fashion, as shown 
in Figure 10.2. The purine ring system consists of two rings of atoms: one resembling 
the pyrimidine ring and another resembling the imidazole ring (Figure 10.2b). The 
nine atoms in this fused ring system are numbered according to the convention shown. 

The pyrimidine ring system is planar, whereas the purine system deviates somewhat 
from planarity in having a slight pucker between its imidazole and pyrimidine portions. 
Both are relatively insoluble in water, as might be expected from their pronounced 
aromatic character. 


10.la Three Pyrimidines and Two Purines Are Commonly Found in Cells 


The common naturally occurring pyrimidines are cytosine, uracil, and thymine 
(5-methyluracil) (Figure 10.3). Cytosine and thymine are the pyrimidines typically 
found in DNA, whereas cytosine and uracil are common in RNA. Note that the 
5-methyl group of thymine is the only thing that distinguishes it from uracil. Various 
pyrimidine derivatives, such as dihydrouracil, are present as minor constituents in 
certain RNA molecules. 

Adenine (6-amino purine) and guanine (2-amino-6-oxy purine), the two common 
purines, are found in both DNA and RNA (Figure 10.4). Other naturally occurring 
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Figure 10.3 The common pyrimidine bases—cytosine, uracil, and thymine—in the tautomeric forms 
predominant at pH 7. 


purine derivatives include hypoxanthine, xanthine, and uric acid (Figure 10.5). Hypo- 
xanthine and xanthine are found only rarely as constituents of nucleic acids. Uric acid, 
the most oxidized state for a purine derivative, is never found in nucleic acids. 


10.1b The Properties of Pyrimidines and Purines Can Be Traced 
to Their Electron-Rich Nature 


The aromaticity of the pyrimidine and purine ring systems and the electron-rich 
nature of their carbonyl and ring nitrogen substituents endow them with the capacity 
to undergo keto—enol tautomeric shifts. That is, pyrimidines and purines exist as 
tautomeric pairs, as shown in Figure 10.6 for uracil and Figure 10.7 for guanine. 
Tautomers are chemical isomers that interconvert through the relocation of a 
hydrogen atom. Tautomeric pairs differ in their hydrogen-bonding possibilities, a 
property that is key to nucleic acid structure and function. The keto tautomers of 
uracil, thymine, and guanine vastly predominate at neutral pH. In other words, pK, 
values for ring nitrogen atoms 1 and 3 in uracil (Figure 10.6) are greater than 8 (the 
pK, value for N-3 is 9.5). In contrast, the enamine form of cytosine predominates at 
pH 7 and the pK, value for N-3 in this pyrimidine is 4.5. Similarly, for guanine 
(Figure 10.7), the pK, value is 9.4 for N-1 and less than 5 for N-3. These pK, values 
specify whether protons are associated with the various ring nitrogens at neutral pH. 
As such, they are important in determining whether these nitrogens serve as H-bond 
donors or acceptors. Hydrogen bonding between purine and pyrimidine bases is 
fundamental to the biological functions of nucleic acids, as in the formation of the 
double-helix structure of DNA. The important functional groups participating in 
H-bond formation are the amino groups of cytosine, adenine, and guanine; the ring 
nitrogens at position 3 of pyrimidines and position 1 of purines; and the strongly 
electronegative oxygen atoms attached at position 4 of uracil and thymine, position 2 
of cytosine, and position 6 of guanine (see Figure 10.17). 

Another property of pyrimidines and purines is their strong absorbance of ultraviolet 
(UV) light, with wavelengths in the 260 nm region. This property is particularly useful 
in the quantitative and qualitative analysis of substances containing these nitrogenous 
bases, including nucleosides, nucleotides, and nucleic acids. 


10.2 | What Are Nucleosides? 


Nucleosides are compounds formed when a base is linked to a sugar. The sugars of 
nucleosides are pentoses (five-carbon sugars, see Chapter 7). Ribonucleosides contain 
the pentose D-ribose, whereas 2-deoxy-D-ribose is found in deoxyribonucleosides. In 
both instances, the pentose is in the five-membered ring form furanose: D-ribofuranose 
for ribonucleosides and 2-deoxy-D-ribofuranose for deoxyribonucleosides (Figure 10.8). 
In nucleosides, these ribofuranose atoms are numbered 1’, 2’, 3’, and so on to distin- 
guish them from the ring atoms of the nitrogenous bases. The base is linked to the 
sugar via a glycosidic bond. Glycosidic bonds in nucleosides (and nucleotides, see 
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NHo O 
H N 
| — | 
H H 
Adenine Guanine 
(6-amino purine) (2-amino-6-oxy purine) 


Figure 10.4 The common purine bases—adenine 
and guanine—in the tautomeric forms predominant 
at pH 7. 


Figure 10.5 Other naturally occurring purine 
derivatives—hypoxanthine, xanthine, and uric acid. 


Figure 10.6 The keto—enol tautomerization of uracil. 


Figure 10.7 The tautomerization of the purine guanine. 
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B-v-Ribofuranose 


p-2-Deoxyribose B-p-2-Deoxyribofuranose 


Figure 10.8 Furanose structures—ribose and deoxyribose. The 2—OH of ribose and the 2—H of 
2-deoxyribose are highlighted in these structures. 


Human Biochemistry 


Adenosine: A Nucleoside with Physiological Activity 


For the most part, nucleosides have no biological role other than 
to serve as component parts of nucleotides. Adenosine is a rare ex- 
ception. In mammals, adenosine functions as an autacoid, or local 
hormone, and as a neuromodulator. This nucleoside circulates in the 
bloodstream, acting locally on specific cells to influence such diverse 
physiological phenomena as blood vessel dilation, smooth muscle 
contraction, neuronal discharge, neurotransmitter release, and me- 
tabolism of fat. For example, when muscles work hard, they release 
adenosine, causing the surrounding blood vessels to dilate, which in 
turn increases the flow of blood and its delivery of O, and nutrients to 
the muscles. In a different autacoid role, adenosine acts in regulating 
heartbeat. The natural rhythm of the heart is controlled by a pace- 
maker, the sinoatrial node, which cyclically sends a wave of electrical 
excitation to the heart muscles. By blocking the flow of electrical cur- 
rent, adenosine slows the heart rate. Supraventricular tachycardia is a 
heart condition characterized by a rapid heartbeat. Intravenous injec- 
tion of adenosine causes a momentary interruption of the rapid cycle 
of contraction and restores a normal heart rate. Adenosine is licensed 
and marketed as Adenocard to treat supraventricular tachycardia. 

In addition, adenosine is implicated in sleep regulation. During 
periods of extended wakefulness, extracellular adenosine levels 
rise as a result of metabolic activity in the brain, and this increase 


promotes sleepiness. During sleep, adenosine levels fall. Caffeine 
promotes wakefulness by blocking the interaction of extracellular 
adenosine with its neuronal receptors. 
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Caffeine 


Caffeine is an alkaloid, a term used to define naturally occurring 
nitrogenous molecules that have pharmacological effects. Alkaloids are 
classified according to their metabolic precursors, so caffeine is a purine 
alkaloid. 


following discussion) are always of the B-configuration. Nucleosides are named by 
adding the ending -idine to the root name of a pyrimidine or -osine to the root name of 
a purine. The common nucleosides are thus cytidine, uridine, thymidine, adenosine, and 
guanosine (Figure 10.9). Nucleosides are more water soluble than the free bases because 
of the hydrophilicity of the pentose. 


10.3 | What Are the Structure and Chemistry 


of Nucleotides? 


A nucleotide results when phosphoric acid is esterified to a sugar —OH group of a 
nucleoside. The nucleoside ribose ring has three —OH groups available for ester- 
ification, at C-2', C-3’, and C-5’ (although 2’-deoxyribose has only two). The vast 
majority of monomeric nucleotides in the cell are ribonucleotides with 5’-phosphate 
groups. Figure 10.10 shows the structures of the common four ribonucleotides, whose 
formal names are adenosine 5’-monophosphate, guanosine 5’-monophosphate, cytidine 
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Figure 10.9 The common ribonucleosides—cytidine, uridine, adenosine, and guanosine—and the less- 
common inosine. (Purine nucleosides and nucleotides usually adopt the anti conformation, where the 
purine ring is not above the ribose, as it would be in the syn conformation. Pyrimidines are always 
anti, never syn, because the 2-O atom of pyrimidines sterically hinders the ring from a position above 
the ribose.) 
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Figure 10.10 Structures of the four common ribonucleotides—AMP, GMP, CMP, and UMP. Also 
shown is the nucleotide 3’-AMP. 3'-Nucleoside monophosphates are common products of nucleic 
acid hydrolysis. 


5’-monophosphate, and uridine 5’-monophosphate. These compounds are more often 
referred to by their abbreviations: 5'-AMP, 5’-GMP, 5'-CMP, and 5’-UMP, or even 
more simply as AMP, GMP, CMP, and UMP. Because the pK, value for the first 
dissociation of a proton from the phosphoric acid moiety is 1.0 or less, the nucleotides 
have acidic properties. This acidity is implicit in the other names by which these 
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3',5'-Cyclic GMP 


Figure 10.11 The cyclic nucleotides cAMP 
and cGMP. 
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substances are known—adenylic acid, guanylic acid, cytidylic acid, and uridylic acid. 
The pK, value for the second dissociation, pK», is about 6.0, so at neutral pH or above, 
the net charge on a nucleoside monophosphate is —2. Nucleic acids, which are 
polymers of nucleoside monophosphates, derive their name from the acidity of these 
phosphate groups. 


10.3a Cyclic Nucleotides Are Cyclic Phosphodiesters 


Nucleoside monophosphates in which the phosphoric acid is esterified to two of the 
available ribose hydroxyl groups (Figure 10.11) are found in all cells. Forming two 
such ester linkages with one phosphate results in a cyclic phosphodiester structure. 
3’,5’-cyclic AMP, often abbreviated cAMP, and its guanine analog 3’,5’-cyclic GMP, 
or cGMP, are important regulators of cellular metabolism. The cyclic pyridimine 
nucleotides, cCMP and cUMP, also occur in cells and affect many of the same pro- 
cesses controlled by cAMP and cGMP. 


10.3b Nucleoside Diphosphates and Triphosphates Are Nucleotides 
with Two or Three Phosphate Groups 


Additional phosphate groups can be linked to the phosphoryl group of a nucleotide 
through the formation of phosphoric anhydride linkages, as shown in Figure 10.12. 
Addition of a second phosphate to AMP creates adenosine 5’-diphosphate, or ADP, 
and adding a third yields adenosine 5’-triphosphate, or ATP. The respective phosphate 


Human Biochemistry 


cGAMP, a Cyclic Dinucleotide That Triggers a Response to Infection 


Second messengers are regulatory metabolites produced in cells 
in response to some signal. cAMP and cGMP are the best known 
second messengers (Figure 10.11). The first description of cyclic 
dinucleotides as second messengers dates to 1987 when the dicyclic 
nucleotide bis-(3'-5’)-cyclic dimeric GMP (c-di-GMP, see figure) 
was identified as the signal causing bacteria to switch from the free- 
living single-cell state into the formation of biofilms on surfaces. 
When animals sense double-stranded DNA (dsDNA) in the 
cytosol, they produce an anti-pathogen known as type I interferon 
(IFN). IFNs activate the immune system, which acts to protect the 
organism from the infection. (dsDNA is normally found only in 
the cell nucleus or in mitochondria, so its presence in the cytosol is 
indication of an invading bacterium or a virus.) Cytosolic detection 
of dsDNA activates stimulator of interferon genes (STING), insti- 
gating a program of gene expression aimed at halting infection. The 
cytosolic DNA sensor is cyclic GMP-AMP synthase (cGAS), the 
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bis-(3'-5')-cyclic dimeric GMP (c-di-GMP) 


enzyme that synthesizes cyclic GMP-AMP (cGAMP; also known 
as cyclic G(2',5')pA(3',5')p; see figure) from GTP and ATP. Binding 
of dsDNA to cGAS causes a conformational change in the enzyme 
that establishes a catalytically competent nucleotide-binding site 
where the GTP and ATP substrates bind and cGAMP synthesis 
occurs. Rising [GAMP] levels trigger STING expression and an 
immune response. An undesirable consequence of this system is 
that self-nucleic acid recognition (host dsDNA in the cytosol) may 
induce autoimmune diseases such as lupus erythematosus. 
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Figure 10.12 Formation of ADP and ATP by the successive addition of phosphate groups via 
phosphoric anhydride linkages. Note that the reaction is a dehydration synthesis reaction. 
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groups are designated by the Greek letters a, B, and y, starting with the a-phosphate 
as the one linked directly to the pentose. The abbreviations GTP, CTP, and UTP 
represent the other corresponding nucleoside 5’-triphosphates. Like the nucleoside 
5'-monophosphates, the nucleoside 5’-diphosphates and 5'-triphosphates all occur in 
the free state in the cell, as do their deoxyribonucleoside phosphate counterparts, 
represented as dAMP, dADP, and dATP; dGMP, dGDP, and dGTP; dCMP, dCDP, 
and dCTP; dUMP, dUDP, and dUTP; and dTMP, dTDP, and dTTP. 


10.3c NDPs and NTPs Are Polyprotic Acids 


Nucleoside 5’-diphosphates (NDPs) and nucleoside 5’-triphosphates (NTPs) are rela- 
tively strong polyprotic acids in that they dissociate three and four protons, respec- 
tively, from their phosphoric acid groups. The resulting phosphate anions on NDPs 
and NTPs form stable complexes with divalent cations such as Mg?* and Ca”. 
Because Mg?” is present at high concentrations (as much as 40 mM) intracellularly, 


NDPs and NTPs occur primarily as Mg** complexes in the cell. 


10.3d Nucleoside 5’-Triphosphates Are Carriers of Chemical Energy 


Nucleoside 5'-triphosphates are indispensable agents in metabolism because the 
phosphoric anhydride bonds they possess are a prime source of chemical energy to do 


biological work. 


Virtually all of the biochemical reactions of nucleotides involve either phosphoryl 
group transfer, pyrophosphoryl group transfer, or nucleotidyl group transfer: the transfer 
of a phosphoryl, pyrophosphoryl, or NMP (nucleotidyl) group from an NTP to an 
acceptor (Figure 10.13). The pentose and the base are not directly involved in this 
chemistry. The base serves as a recognition symbol that allows the cell to partition its 
chemical energy to the different metabolic pathways: ATP to serve as the primary 
nucleotide in central pathways of energy metabolism, whereas GTP is used to drive 
protein synthesis. CTP is essential for phospholipid synthesis, and UTP forms acti- 
vated intermediates with sugars that go on to serve as substrates in the biosynthesis of 
complex carbohydrates and polysaccharides. Thus, the various nucleotides are chan- 
neled in appropriate metabolic directions through specific recognition of the base of 
the nucleotide. The bases of nucleotides never participate directly in the covalent bond 
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Figure 10.13 The major biochemical reactions of nucleotides — phosphoryl] group transfer, 
pyrophosphory! group transfer, and nucleotidyl group transfer. The group being transferred is 
highlighted. 


chemistry that goes on. To complete the picture, the four NTPs and their dNTP coun- 
terparts are the substrates for synthesis of the remaining class of biomolecules—the 
nucleic acids. 


10.4 | What Are Nucleic Acids? 


Nucleic acids are polynucleotides: linear polymers of nucleotides linked 3’ to 5’ by 
phosphodiester bridges (Figure 10.14). Phosphodiester bridges occur when one phos- 
phate group is esterified to two different —OH groups. Phosphodiester bridges are 
the fundamental linkage creating polynucleotides from nucleotide monomers. They 
are formed as 5’-nucleoside monophosphates units are successively added to the 3’- 
OH group of the preceding nucleotide, a process that gives the polymer a directional 
sense. Polymers of ribonucleotides are named ribonucleic acid, or RNA. Deoxyribo- 
nucleotide polymers are called deoxyribonucleic acid, or DNA. Because C-1l’ and 
C-4' in deoxyribonucleotides are involved in furanose ring formation and because 
there is no 2’-OH, only the 3’- and 5'-hydroxyl groups are available for internucleotide 
phosphodiester bonds. In the case of DNA, a polynucleotide chain may contain hun- 
dreds of millions of nucleotide units. The convention in all notations of nucleic acid 
structure is to read the polynucleotide chain from the 5’-end of the polymer to the 3'-end. 
Note that this reading direction actually passes through each phosphodiester from 3’ 
to 5’ (Figure 10.14). A repetitious uniformity exists in the covalent backbone of 
polynucleotides. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Ribonucleic acid Deoxyribonucleic acid 


(RNA) (DNA) 


NH, 
1 ii HC 


|, 


TO 


Note how each phosphate 
group in the chain is esterified 
to two different — OH groups, 


creating a phosphodiester bridge. Guanine 


Figure 10.14 3', 5'-Phosphodiester bridges link nucleotides together to form polynucleotide chains. 
RNA is shown on the left and DNA on the right. The 5'-ends of the chains are at the top; the 3’-ends 
are at the bottom. The nucleotide sequence of the RNA would be read as ACGU: that of the DNA 
as TGCA 


10.4a The Base Sequence of a Nucleic Acid Is Its Defining Characteristic 


The only significant variation that commonly occurs in the chemical structure of nu- 
cleic acids is the nature of the base at each nucleotide position. These bases are not part 
of the sugar—-phosphate backbone but instead serve as distinctive side chains, much 
like the R groups of amino acids along a polypeptide backbone. They give the polymer 
its unique identity. A simple notation for nucleic acid structures is merely to list the order 
of bases in the polynucleotide using single capital letters—A, G, C, and U (or T). 


O= P— OCH, 


10.4 What Are Nucleic Acids? 325 


Adenine 


Careers in Chemistry 


Philip Nega 


Philip Nega received his Bachelors of Science in 
Chemistry from Haverford College and his Masters 
of Science in Engineering from University of 
Pennsylvania. He currently uses his education to 
further research in solid state battery technology 
for A123 Systems, a manufacturer of lithium iron 
phosphate batteries and energy storage systems. 


Nega worked previously at Lawrence Berkeley 
National Lab with perovskite structures, using 
robotics to perform more than 100 reactions at a 
time. He describes his knowledge of biochemistry as 
essential to understanding and assessing the struc- 
tural and chemical properties of the materials he 
has studied in his career. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


326 


Chapter 10 Nucleotides and Nucleic Acids 


6699 


Occasionally, a lowercase “p” is written between each successive base to indicate the 
phosphodiester bridge, as in GpbApCpGpUpA. 

To distinguish between RNA and DNA sequences, DNA sequences may be preceded 
by a lowercase “d” to denote deoxy, as in d-GACGTA. Or, one can surmise from 
sequences containing U, but not T, that the nucleotide sequence given is for an RNA. 
From a simple string of letters such as this, any biochemistry student should be able to 
draw the unique chemical structure of, for example, a pentanucleotide, even though it 
may contain more than 200 atoms. 


10.5 | What Are the Different Classes of Nucleic Acids? 


The two major classes of nucleic acids are DNA and RNA. DNA has only one bio- 
logical role, but it is the more central one. The information to make all the func- 
tional macromolecules of the cell (even DNA itself) is preserved in DNA and 
accessed through transcription of the information into RNA copies. Coincident 
with DNA’s singular purpose, simple life forms such as viruses or bacteria usually 
contain only a single DNA molecule (or “chromosome”). Such DNA molecules 
must be quite large in order to carry enough information for making the macromol- 
ecules necessary to maintain a living cell. The Escherichia coli chromosome has a 
molecular mass of 2.9 X 10° Da and contains more than 9 million nucleotides. 
Eukaryotic cells have many chromosomes, and DNA is found principally in two 
copies in the diploid chromosomes of the nucleus. DNA is also found in mitochon- 
dria and in chloroplasts, where it encodes some of the proteins and RNAs unique to 
these organelles. 

In contrast, RNA occurs in multiple copies and various forms. Cells typically 
contain about eight times as much RNA as DNA. RNA has a number of important 
biological functions. Its central one is information transfer from DNA to protein. 
RNA molecules playing this role are categorized into several major types: messenger 
RNA, ribosomal RNA, transfer RNA, and small nuclear RNA. Another type, 
small RNAs (RNA 21 to 28 nucleotides in length), consists of important players in 
gene regulation. Beyond various roles in information transfer, RNA participates in 
a number of metabolic functions, including the processing and modification of 
tRNA, rRNA, and mRNA and several maintenance or “housekeeping” functions, 
such as the preservation of telomeres (the special sequences at the ends of chromo- 
somes). Furthermore, like proteins, RNA strands can fold into complex architec- 
tural arrangements that allow them to bind ligands with high specificity. Such 
ligand-binding specificity can even endow these RNAs with catalytic (that is, enzy- 
matic) activity. 

With these basic definitions in mind, let’s now briefly consider the chemical and 
structural nature of DNA and the various RNAs. 


10.5a The Fundamental Structure of DNA Is a Double Helix 


The DNA isolated from different cells and some viruses characteristically consist 
of two polynucleotide strands wound together to form a long, slender, helical mol- 
ecule, the DNA double helix. The strands run in opposite directions; that is, they 
are antiparallel. The two strands are held together in the double-helical structure 
through interchain hydrogen bonds (Figure 10.15). These H bonds pair the bases of 
nucleotides in one chain to complementary bases in the other, a phenomenon called 
base pairing. 


Erwin Chargaff’s Analysis of the Base Composition of Different DNAs Provided a Key 
Clue to DNA Structure A clue to the chemical basis of base pairing in DNA came from 
the analysis of the base composition of various DNAs by Erwin Chargaff in the late 
1940s. His data showed that the four bases commonly found in DNA (A, C, G, and T) 
do not occur in equimolar amounts and that the relative amounts of each vary from 
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Figure 10.15 The antiparallel nature of the DNA double helix. 


species to species (Table 10.1). Nevertheless, Chargaff noted that certain pairs of 
bases, namely, adenine and thymine, and guanine and cytosine, are always found in 
a 1:1 ratio and that the number of pyrimidine residues always equals the number of 
purine residues. These findings are known as Chargaff’s rules: [A] = [T]; [C] = [G]; 
[pyrimidines] = [purines]. 


Watson and Crick’s Postulate of the DNA Double Helix Became the Icon of DNA 
Structure James Watson and Francis Crick, working in the Cavendish Laboratory 
at Cambridge University in 1953, took advantage of Chargaff’s results and the data 
obtained by Rosalind Franklin and Maurice Wilkins in X-ray diffraction studies 
on the structure of DNA to conclude that DNA was a complementary double helix. 
Two strands of deoxyribonucleic acid (sometimes referred to as the Watson strand 
and the Crick strand) are held together by the bonding interactions between 


Table 10.1 S \ à 5 
Adenine Adenine Guanine Purines 

to to to to to 
Source Guanine Cytosine Thymine Cytosine Pyrimidines 
Human 1.56 1.75 1.00 1.00 1.0 
Hen 1.45 1.29 1.06 0.91 0.99 
Salmon 1.43 1.43 1.02 1.02 1.02 
Wheat 122 1.18 1.00 0.97 0.99 
Yeast 1.67 1:92 1.03 1.20 1.0 
Haemophilus influenzae 1.74 1.54 1.07 0.91 1.0 
E. coli K-12 1.05 0.95 1.09 0.99 1.0 
Serratia marcescens 0.7 0.7 0.95 0.86 0.9 


Source: After Chargaff, E., 1951. Structure and function of nucleic acids as cell constituents. Federation Proceedings 
10:654-659. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


328 Chapter 10 Nucleotides and Nucleic Acids 


Old Old 


Old New New Old 
Emerging progeny DNA 


Figure 10.17 The replication of DNA gives identical 
progeny molecules because base pairing is the 
mechanism that determines the nucleotide sequence 
of each newly synthesized strand. 


Figure 10.16 The Watson—Crick base pairs A:T and G:C. 


unique base pairs, always consisting of a purine in one strand and a pyrimidine in 
the other. Base pairing is very specific: If the purine is adenine, the pyrimidine 
must be thymine. Similarly, guanine pairs only with cytosine (Figure 10.16). Thus, 
if an A occurs in one strand of the helix, T must occupy the complementary 
position in the opposing strand. Likewise, a G in one dictates a C in the other. 
Because of this exclusive pairing of A only with T and G only with C, these pairs 
are taken as the standard or accepted law, and the A:T and G:C base pairs are 
often referred to as canonical. As Watson recognized from testing various 
combinations of bases using structurally accurate models, the A:T pair and the 
G:C pair form spatially equivalent units (Figure 10.16). The backbone-to-backbone 
distance of an A:T pair is 1.11 nm, virtually identical to the 1.08-nm chain 
separation in G:C base pairs. 

Base pairing in the DNA molecule not only conforms to Chargaff results and 
Watson and Crick’s rules, but also has a profound property relating to heredity: The 
sequence of bases in one strand has a complementary relationship to the sequence of 
bases in the other strand. That is, the information contained in the sequence of one 
strand is conserved in the sequence of the other. Therefore, separation of the two 
strands and faithful replication of each, through a process in which base pairing 
specifies the nucleotide sequence in the newly synthesized strand, leads to two progeny 
molecules identical in every respect to the parental double helix (Figure 10.17). 
Elucidation of the double-helical structure of DNA represented one of the most 
significant events in the history of science. This discovery, more than any other, marked 
the beginning of molecular biology. Indeed, upon solving the structure of DNA, Crick 
proclaimed in The Eagle, a pub just across from the Cavendish lab, “We have discovered 
the secret of life!” 


The Information in DNA Is Encoded in Digital Form In this digital age, we are accus- 
tomed to electronic information encoded in the form of extremely long arrays of 
just two digits: ones (ls) and zeros (0s). DNA uses four digits to encode biological 
information: A, C, G, and T. A significant feature of the DNA double helix is that 
virtually any base sequence (encoded information) is possible: Other than the base- 
pairing rules, no structural constraints operate to limit the potential sequence of 
bases in DNA. 
DNA contains two kinds of information: 


1. The nucleotide sequences of genes that encode the amino acid sequences of pro- 
teins and the nucleotide sequences of functional RNA molecules such as rRNA 
and tRNA (see the following discussion). 

2. The gene regulatory networks that control the expression of protein-encoding (and 
functional RNA-encoding) genes (see Chapter 29). 
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DNA Is in the Form of Enormously Long, Threadlike Molecules Because of the double 
helical nature of DNA molecules, their size can be represented in terms of the num- 
bers of paired nucleotides (or base pairs) they contain. For example, the E. coli chro- 
mosome consists of 4.64 X 10° base pairs (abbreviated bp) or 4.64 X 10° kilobase 
pairs (kbp). DNA is a threadlike molecule. The diameter of the DNA double helix is 
only 2 nm, but the length of the DNA molecule forming the E. coli chromosome is 
over 1.6 X 10° nm (1.6 mm). The long dimension of an E. coli cell is only 2000 nm 
(0.002 mm), so its chromosome must be highly folded. Because of their long, thread- 
like nature, DNA molecules are easily sheared into shorter fragments during isola- 
tion procedures, and it is technically challenging to obtain intact chromosomes even 
from the simple cells of prokaryotes. 


DNA in Cells Occurs in the Form of Chromosomes DNA occurs in various forms in 
different cells. The single chromosome of prokaryotic cells is typically a circular 
DNA molecule. Proteins are associated with prokaryotic DNA, but unlike eukary- 
otic chromosomes, prokaryotic chromosomes are not uniformly organized into 
ordered nucleoprotein arrays. The DNA molecules of eukaryotic cells, each of 
which defines a chromosome, are linear and richly adorned with proteins (Figure 
10.18). A class of arginine- and lysine-rich basic proteins called histones interact 
electrostatically with the anionic phosphate groups in the DNA backbone to form 
nucleosomes, structures in which the DNA double helix is wound around a protein 
core composed of pairs of four different histone polypeptides. Chromosomes also 
contain a varying mixture of other proteins, so-called nonhistone chromosomal 
proteins, many of which are involved in regulating which genes in DNA are tran- 
scribed at any given moment. The amount of DNA in a diploid mammalian cell is 
typically more than 1000 times that found in an E. coli cell. Some higher plant cells 
contain more than 50,000 times as much as an E. coli cell. 
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Figure 10.18 A diagrammatic representation of a typical eukaryotic chromosome. 

The double-stranded DNA molecule is wound around nucleosomes composed of histone proteins, 
which are condensed into higher-order structural arrangements and decorated with nonhistone 
chromosomal proteins. 
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10.5b Various Forms of RNA Serve Different Roles in Cells 


Unlike DNA, cellular RNA molecules are almost always single stranded. However, 
all of them typically contain double-stranded regions formed when stretches of 
nucleotides with complementary base sequences align in an antiparallel fashion and 
form canonical A:U and G:C base pairs. (Compare Figures 10.3 and 10.16 to convince 
yourself that U would pair with A in the same manner T does.) Such base pairing 
creates secondary structure. 


Messenger RNA Carries the Sequence Information for Synthesis of a Protein Messenger 
RNA (mRNA) serves to carry the information, or message, encoded in genes to the 
sites of protein synthesis in the cell where this information is translated into a 
polypeptide sequence. That is, mRNA molecules are transcribed copies of the 
protein-coding genetic units of DNA. Prokaryotic mRNAs have from 75 to 3000 
nucleotides; mRNA constitutes about 2% of total prokaryotic RNA. 

Messenger RNA is synthesized during transcription, an enzymatic process in which 
an RNA copy is made of the sequence of bases along one strand of DNA. This mRNA 
then directs the synthesis of a polypeptide chain as the information that is contained 
within its nucleotide sequence is translated into an amino acid sequence by the 
protein-synthesizing machinery of the ribosomes. Ribosomal RNA and tRNA mole- 
cules are also synthesized by the transcription of DNA sequences, but unlike mRNA 
molecules, these RNAs are not subsequently translated to form proteins. In prokary- 
otes, a single mRNA may contain information for the synthesis of several polypeptide 
chains within its nucleotide sequence (Figure 10.19). In contrast, eukaryotic mRNAs 
encode only one polypeptide but are more complex—they are synthesized in the 
nucleus in the form of much larger precursor molecules called heterogeneous nuclear 
RNA, or hnRNA. hnRNA molecules contain stretches of nucleotide sequence that 
have no protein-coding capacity. These noncoding regions are called intervening 
sequences or introns because they intervene between coding regions, which are called 
exons. Introns interrupt the continuity of the information specifying the amino acid 
sequence of a protein and must be spliced out before the message can be translated. In 
addition, eukaryotic hnRNA and mRNA molecules have a run of 100 to 200 adenylic 
acid residues attached at their 3’-ends, so-called poly(A) tails. This polyadenylation 
occurs after transcription has been completed and is essential for efficient translation 
and stability of the mRNA. The properties of mRNA molecules as they move through 
transcription and translation in prokaryotic versus eukaryotic cells are summarized in 
Figure 10.19. 


Ribosomal RNA Provides the Structural and Functional Foundation for Ribosomes 
Ribosomes, the ribonucleoprotein complexes where protein synthesis occurs, are 
about 65% RNA of the ribosomal RNA type. Ribosomal RNA (rRNA) molecules fold 
into characteristic secondary structures as a consequence of intramolecular base- 
pairing interactions (Figure 10.20). The different species of rRNA are generally 
referred to according to their sedimentation coefficients!, which are a rough measure 
of their relative size (Figure 10.21). 

Ribosomes are composed of two subunits of different sizes that dissociate from each 
other if the Mg?* concentration is below 107? M. Each subunit is a supramolecular 
assembly of proteins and RNA and has a total mass of 10° Da or more. E. coli ribosomal 
subunits have sedimentation coefficients of 30S (the small subunit) and 50S (the large 
subunit). Eukaryotic ribosomes are somewhat larger than prokaryotic ribosomes, 
consisting of 40S and 60S subunits. More than 80% of total cellular RNA is represented 
by the various forms of rRNA. 


'Sedimentation coefficients are a measure of the velocity with which a particle sediments in a centrifugal 
force field. Sedimentation coefficients are expressed in Svedbergs (symbolized S), named to honor The 
Svedberg, developer of the ultracentrifuge. One S=107! s. 
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Figure 10.19 Transcription and translation of mRNA molecules in prokaryotic versus 
eukaryotic cells. 


Ribosomal RNAs characteristically contain a number of specially modified nucle- 
otides, including pseudouridine residues, ribothymidylic acid, and methylated bases 
(Figure 10.22). The central role of ribosomes is protein biosynthesis: The genetic infor- 
mation in the nucleotide sequence of an mRNA is translated into the amino acid se- 
quence of a polypeptide chain by ribosomes. 


Transfer RNAs Carry Amino Acids to Ribosomes for Use in Protein Synthesis Transfer 
RNAs (tRNAs) serve as the carrier of amino acids for protein synthesis. tRNA molecules 
also fold into a characteristic secondary structure (Figure 10.23). tRNAs are small 
RNA molecules containing 73 to 94 residues, a substantial number of which are meth- 
ylated or otherwise unusually modified. Each of the 20 amino acids in proteins has at 
least one unique tRNA species dedicated to chauffeuring its delivery to ribosomes for 
insertion into growing polypeptide chains, and some amino acids are served by several 
tRNAs. In eukaryotes, there are discrete sets of tRNA molecules for each site of pro- 
tein synthesis—the cytoplasm, the mitochondrion, and in plant cells, the chloroplast. 
All tRNA molecules possess a 3’-terminal nucleotide sequence that reads -CCA, and 
the amino acid is carried to the ribosome attached as an acyl ester to the free 3’-OH 


~~ 


Figure 10.20 Ribosomal RNA has a complex 
secondary structure due to many intrastrand 
hydrogen bonds. The gray line in this figure traces 
a polynucleotide chain consisting of more than 
1000 nucleotides. Aligned regions represent 
H-bonded complementary base sequences. 
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PROKARYOTIC RIBOSOMES EUKARYOTIC RIBOSOMES 
(E. coli) (Rat) 
Ribosome Ribosome 
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21 proteins 
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RNA 16S RNA 23S RNA 18S RNA 28S + 5.85 RNA 
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5S RNA 5S RNA 
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Figure 10.23 Transfer RNA has a complex 
secondary structure due to many intrastrand 
hydrogen bonds. The black lines represent base-paired 
nucleotides in the sequence. 


Figure 10.21 The organization and composition of prokaryotic and eukaryotic ribosomes. 
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Figure 10.22 Unusual bases in RNA. 


of the terminal A residue. These aminoacyl-tRNAs are the substrates of protein 
synthesis, the amino acid being transferred to the carboxyl end of a growing polypep- 
tide. The peptide bond—forming reaction is catalyzed not by a protein but by the large 
(23S or 28S) rRNA within the large ribosomal subunit. 


Small Nuclear RNAs Mediate the Splicing of Eukaryotic Gene Transcripts (hnRNA) into 
mRNA Small nuclear RNAs, or snRNAs, are a class of RNA molecules found in 
eukaryotic cells, principally in the nucleus. They contain from 100 to about 200 
nucleotides, some of which, like tRNA and rRNA, are methylated or otherwise 
modified. No snRNA exists as naked RNA. Instead, snRNA is found in stable 
complexes with specific proteins forming small nuclear ribonucleoprotein particles, or 
snRNPs, which are about 10S in size. Their occurrence in eukaryotes, their location 
in the nucleus, and their relative abundance (1% to 10% of the number of ribosomes) 
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are significant clues to their biological purpose: snRNPs are important in the 
processing of eukaryotic gene transcripts (hnRNA) into mature messenger RNA for 
export from the nucleus to the cytoplasm (Figure 10.19). 


A Variety of RNAs Serve Regulatory Roles The twenty-first century has witnessed a 
significant revision of the dogma of molecular biology as summarized by 
DNA->RNA->protein, which confined RNA to information transfer (mRNA), 
processing (snRNPs), and protein synthesis (RNA and tRNA). This revision has 
been driven by discoveries showing that RNA molecules act in a multitude of ways 
to regulate biological processes as diverse as gene expression, chromatin organization, 
and programmed cell death. The next two sections summarize these newfound roles 
for RNA and the classes of RNA molecules that carry them out. 


Small RNAs Serve a Number of Roles, Including Gene Silencing A class of RNA 
molecules even smaller than tRNAs is the small RNAs, so-called because they are 
only 21 to 28 nucleotides long. Some refer to this class as the noncoding RNAs (or 
ncRNAs). Others refer to small RNAs as regulatory RNAs, because virtually every 
step along the pathway of gene expression can be regulated by one or another small 
RNA. Small RNAs are involved in a number of novel biological functions. These 
small RNAs can target DNA or RNA through complementary base pairing. Base 
pairing of the small RNA with particular nucleotide sequences in the target is 
called direct readout. 

Small RNAs are classified into a number of subclasses on the basis of their func- 
tion. RNA interference (RNAi) is mediated by one subclass, the small interfering 
RNAs (siRNAs). siRNAs disrupt gene expression by blocking specific protein pro- 
duction, even though the mRNA encoding the protein has been synthesized. The 
21- to 23-nucleotide-long siR NAs act by base pairing with complementary sequences 
within a particular mRNA to form regions of double-stranded RNA (dsRNA). 
These dsRNA regions are then specifically degraded, eliminating the mRNA from 
the cell. Thus, RNAi is a mechanism to silence the expression of specific genes, even 
after they have been transcribed, a phenomenon referred to as gene silencing. RNAi 
is also implicated in modifying the structure of chromatin and causing large-scale 
influences in gene expression. 

Another subclass, the micro RNAs (miRNAs) control developmental timing by 
base pairing with and preventing the translation of certain mRNAs, thus blocking 
the synthesis of specific proteins. Thus, miRNAs also act in gene silencing. However, 
unlike siRNAs, miRNAs (22 nucleotides long) do not cause mRNA degradation. 
Hundreds of different miR NAs have been described from human cells and have been 
shown to target most human mRNAs. As such, miRNAs are believed to influence 
essentially all aspects of human development and human disease. 

A third subclass is the small nucleolar RNAs (snoRNAs). snoRNAs (60 to 300 
nucleotides long) are catalysts that, in conjunction with specific snoRNA-binding 
proteins, accomplish some of the chemical modifications found in tRNA, rRNA, 
and even DNA (see Figure 10.22, for example). Small RNAs in bacteria (known by 
the acronym sRNAs) play an important role altering gene expression in response to 
stressful environmental situations. 


Long Noncoding RNAs Are Another Class of Regulatory RNAs Long noncoding RNAs, 
or lineRNAs, are RNA transcripts longer than 200 nucleotides. They are transcribed 
from DNA regions lying between genes and thus are sometimes referred to as long 
intergenic noncoding RNAs or long intervening noncoding RNAs. Those lincRNAs 
that remain in the nucleus play roles in the regulation of chromatin structure and 
gene repression in the nucleus. On the other hand, despite being classified as non- 
coding, lincRNAs that gain access to the cytoplasm may be translated into 
micropeptides (40 or so amino acid residues), with diverse functions. Another class 
of long noncoding RNAs, the IncRNAs, bind proteins or nucleic acids in a sequence- 
specific manner and are implicated in several chronic diseases. 
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A Deeper Look 


The RNA World and Early Evolution 


Proteins are encoded by nucleotide sequences in DNA. DNA 
replication depends on the activity of protein enzymes. These two 
statements form a chicken and egg paradox: Which came first in 
evolution—DNA or protein? Neither, it seems. The 1989 Nobel 
Prize in Chemistry was awarded to Thomas Cech and Sidney Alt- 
man for their discovery that RNA molecules are not only informa- 
tional, but also may be catalytic. Three basic assumptions about 
the prebiotic RNA world are (1) RNA replication maintained 
information-carrying RNAs, (2) Watson—Crick base pairing was 
essential to RNA replication, and (3) genetically encoded proteins 
were unavailable as catalysts. The challenge shifts to explaining the 
origin of nucleotides and their polymerization to form RNA. 

Adenine exists in outer space and is found in comets and mete- 
orites. A likely source is conversion of aminoimidazolecarbonitrile 
to adenine. (Aminoimidazolecarbonitrile is a tetramer of HCN; 
adenine is a pentamer.) 


NH, 

N a N 

\ ot \ 
I>? LR 
H H 


Aminoimidazolecarbonitrile 


N 
Sc 
HoN 


Adenine 


Structures of aminoimidazolecarbonitrile and adenine. Adapted 
from Glaser, R., et al., 2007. Adenine synthesis in interstellar space: 
Mechanisms of prebiotic pyrimidine-ring formation of monocyclic 
HCN-pentamers. Astrobiology 7:455—470. 


Glycolaldehyde can combine with other simple compounds to form 
ribose (and glucose). Glycolaldehyde has been detected in a gas 
cloud at the center of the Milky Way, our galaxy. (Acetic acid and 
methyl formate have the same eight atoms as glycolaldehyde; these 
two useful precursor molecules have also been detected in inter- 
galactic clouds.) Inorganic phosphate, the remaining ingredient 


in nucleotides, is a common component in naturally occur- 
ring aqueous solutions. Its negative charge allows it to interact 
readily with positively charged mineral surfaces, upon which 
the first nucleotides may have spontaneously assembled. Cyclic 
nucleotides such as cAMP (3’, 5’-cyclic AMP) and cGMP (3’, 5’- 
cyclic GMP) are formed spontaneously and accumulate under 
mild conditions in the presence of formamide. Recent studies 
show that the chemical formation of short RNA polynucleotides 
from such cyclic nucleotides is thermodynamically favorable in the 
absence of enzymes. These tantalizing facts are bright spots along 
the dim thread that connects us to our distant past. The RNA 
world is an attractive hypothesis. 
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10.5c The Chemical Differences between DNA and RNA 
Have Biological Significance 


Two fundamental chemical differences distinguish DNA from RNA: 


1. DNA contains 2-deoxyribose instead of ribose. 
2. DNA contains thymine instead of uracil. 
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Figure 10.24 The deamination of cytosine forms 
uracil. 


What are the consequences of these differences, and do they hold any significance in 
common? An argument can be made that, because of these differences, DNA is 
chemically more stable than RNA. The greater stability of DNA over RNA is consistent 
with the respective roles these macromolecules have assumed in heredity and information 
transfer. 

Consider first why DNA contains thymine instead of uracil. The key observation is 
that cytosine deaminates to form uracil at a finite rate in vivo (Figure 10.24). Because 
C in one DNA strand pairs with G in the other strand, whereas U would pair with A, 
conversion of a C to a U could potentially result in a heritable change of a C:G pair 
to a U:A pair. Such a change in nucleotide sequence would constitute a mutation in 
the DNA. To prevent this C deamination from leading to permanent changes in 
nucleotide sequence, a cellular repair mechanism proofreads DNA, and when a U 
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arising from C deamination is encountered, it is treated as inappropriate and is 
replaced by a C. If DNA normally contained U rather than T, this repair system could 
not readily distinguish U formed by C deamination from U correctly paired with A. 
However, the U in DNA is 5-methyl-U or, as it is conventionally known, thymine 
(Figure 10.25). That is, the 5-methyl group on T labels it as if to say “this U belongs; 
do not replace it.” 

The other chemical difference between RNA and DNA is that the ribose 2'-OH 
group on each nucleotide in RNA is absent in DNA. Consequently, the ubiquitous 3’-O 
of polynucleotide backbones lacks a hydroxyl neighbor in DNA. This difference leads 
to a greater resistance of DNA to alkaline hydrolysis, examined in detail in the following 
section. To view it another way, RNA is less stable than DNA because its 2'-OH group 
makes the 3’-phosphodiester bond susceptible to nucleophilic cleavage (Figure 10.26). 
For just this reason, it is selectively advantageous for the heritable form of genetic 
information to be DNA rather than RNA. 


10.6 | Are Nucleic Acids Susceptible to Hydrolysis? 


Most reactions of nucleic acid hydrolysis break phosphodiester bonds in the polynu- 
cleotide backbone even though such bonds are among the most stable chemical bonds 
found in biological molecules. In the laboratory, hydrolysis of polynucleotides is used 
to generate smaller fragments that are easier to manipulate and study. 


10.6a RNA Is Susceptible to Hydrolysis by Base, but DNA Is Not 


RNA is relatively resistant to the effects of dilute acid, but gentle treatment of DNA with 
1 mM HCI leads to hydrolysis of purine glycosidic bonds and the loss of purine bases 
from the DNA. The glycosidic bonds between pyrimidine bases and 2’-deoxyribose are 
not affected, and in this case, the polynucleotide’s sugar-phosphate backbone remains 
intact. The purine-free polynucleotide product is called apurinic acid. 

DNA is not susceptible to alkaline hydrolysis. On the other hand, RNA is alkali 
labile and is readily hydrolyzed by hydroxide ions (Figure 10.26). DNA has no 2'-OH; 
therefore, DNA is alkali stable. 


10.6b The Enzymes That Hydrolyze Nucleic Acids 
Are Phosphodiesterases 


Enzymes that hydrolyze nucleic acids are called nucleases. Virtually all cells con- 
tain various nucleases that serve important housekeeping roles in the normal course 
of nucleic acid metabolism. Organs that provide digestive fluids, such as the pan- 
creas, are rich in nucleases and secrete substantial amounts to hydrolyze ingested 
nucleic acids. Fungi and snake venom are often good sources of nucleases. As a 
class, nucleases are phosphodiesterases because they catalyze the cleavage of phos- 
phodiester bonds by H-O. Because each internal phosphate in a polynucleotide 
backbone is involved in two phosphoester linkages, cleavage can potentially occur 
on either side of the phosphorus (Figure 10.27). Convention labels the 3’-side as a 
and the 5’-side as b. Enzymes or reactions that hydrolyze nucleic acids are charac- 
terized as acting at either a or b. A second convention denotes whether the nucleic 
acid chain was cleaved at some internal location, endo, or whether a terminal nucle- 
otide residue was hydrolytically removed, exo. Note that exo a cleavage occurs at 
the 3’-end of the polymer, whereas exo b cleavage involves attack at the 5’-terminus 
(Figure 10.27). 


10.6c Nucleases Differ in Their Specificity for Different Forms 
of Nucleic Acid 
Nucleases play an indispensable role in the cellular breakdown of nucleic acids and 


salvage of their constituent parts. Nucleases also participate in many other cellular 
functions, including (1) aspects of DNA metabolism, such as replication and repair; 


Figure 10.25 The 5-methyl group, highlighted in 
blue, on thymine labels it as a special kind of uracil. 
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A nucleophile such as OH™ can abstract the H of the 2'-OH, 
generating 2'—O7, which attacks the 6*P of the phosphodiester bridge: 
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Complete hydrolysis of RNA by alkali yields a random 
mixture of 2'-NMPs and 3'-NMPs. 


Figure 10.26 Alkaline hydrolysis of RNA. The defining 2'-OH of RNA is highlighted in yellow. 
The vertical lines represent ribose; the diagonals the phosphodiester linkages joining successive 
nucleotides. Nucleophilic attack by OH- on the P atom leads to 5'-phosphoester cleavage and 
random hydrolysis of the cyclic 2’,3'-phosphodiester intermediate to give a mixture of 2'- and 
3’-nucleoside monophosphate products. 
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Convention: The 3'-side of each phosphodiester 
is termed @; the 5'-side is termed ®. 
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Hydrolysis of the @bond yields 5'-PO, products: 
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Figure 10.27 Cleavage in polynucleotide chains. (a) Diagram showing the definition of the a and b sides 
of the phosphodiester bridge. (b) Cleavage on the a side leaves the phosphate attached to the 5’-position 
of the adjacent nucleotide, while (c) b-side hydrolysis yields 3’-phosphate products. 


(2) aspects of RNA metabolism, such as splicing the primary gene transcript, 
processing of mRNA, and RNAi; (3) rearrangements of genetic material, such as 
recombination and transposition; (4) host defense mechanisms against foreign nucleic 
acid molecules; and (5) the immune response, through assembly of immunoglobulin 
genes. Some nucleases are not proteins but instead are catalytic RNA molecules. 

Like most enzymes, nucleases exhibit selectivity or specificity for the nature of the 
substance on which they act. That is, some nucleases act only on DNA (DNases), 
whereas others are specific for RNA (the RNases). Still others are nonspecific and are 
referred to simply as nucleases. Nucleases may also show specificity for only single- 
stranded nucleic acids or may act only on double helices. Some display a decided 
preference for acting only at certain bases in a polynucleotide, or as we shall see for 
restriction endonucleases, act only at a particular nucleotide sequence four to eight 
nucleotides (or more) in length. To the molecular biologist, nucleases are the surgical 
tools for the dissection and manipulation of nucleic acids in the laboratory. 


10.6d Restriction Enzymes Are Nucleases That Cleave 
Double-Stranded DNA Molecules 


Restriction endonucleases are enzymes, isolated chiefly from bacteria, that have the 
ability to cleave double-stranded DNA. The term restriction comes from the capacity 
of prokaryotes to defend against or restrict the possibility of takeover by foreign DNA 
that might gain entry into their cells. Prokaryotes degrade foreign DNA by using their 
unique restriction enzymes to chop it into relatively large but noninfective fragments. 
Restriction enzymes are classified into three types: I, I, or III. Types I and 
III require ATP to hydrolyze DNA and can also catalyze the chemical modification 
of DNA through the addition of methyl groups to specific bases. Type I restriction 
endonucleases cleave DNA randomly, whereas type III recognize specific nucleotide 
sequences within dsDNA and cut the DNA at or near these sites. 


10.6e Type II Restriction Endonucleases Are Useful 
for Manipulating DNA in the Lab 
Type II restriction enzymes have received widespread application in the cloning and 


sequencing of DNA molecules. Their hydrolytic activity is not ATP dependent, and 
they do not modify DNA by methylation or other means. Most important, they cut 
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5! 


DNA within or near particular nucleotide sequences that they specifically recognize. 
These recognition sequences are typically four or six nucleotides in length and have a 
twofold axis of symmetry. For example, E. coli has a restriction enzyme, EcoRI, that 
recognizes the hexanucleotide sequence GAATTC: 


5'—N—N—N—N—G—A—A—T—T—C—N—N—N—-N—3' 
3’ —_N-N—-N-N—-G—T-T-A—A_ 6 -N-N—N-N—5' 


Note the twofold symmetry: The sequence read 5'—3' is the same in both strands. 
When EcoRI encounters this sequence in dsDNA, it causes a staggered, double- 
stranded break by hydrolyzing each chain between the G and A residues: 


5’ N-N-N-N-G A-A-T-T-C-N-N-N N35! 


3'—_N—N—N—N—C—T—T—A—A G—N—N—N—N—'’ 


Staggered cleavage results in fragments with protruding single-stranded 5’-ends: 


ai 


N—N—N—G 5' A—A—T—T—C—N—N—N—N—3 
N—N—N—C —T—T—A—A 5’ G—N—N—N—N—5' 


Because the protruding termini of EcoRI fragments have complementary base 
sequences, they can form base pairs with one another. 


5'—N—N—N—N—G A~A—T—T—C—N—N—N—-N— 3) 
3’ —_N-NON-N-G—T-T-A-A G-—N-NCNON——5' 


Therefore, DNA restriction fragments having such sticky ends can be joined together 
to create new combinations of DNA sequence. If fragments derived from DNA mole- 
cules of different origins are combined, novel recombinant forms of DNA are created. 

EcoRI leaves staggered 5'-termini. Other restriction enzymes, such as PstI, which 
recognizes the sequence 5’-CTGCAG-3’ and cleaves between A and G, produce 
cohesive staggered 3'-ends. Still others, such as Ball, act at the center of the twofold 
symmetry axis of their recognition site and generate blunt ends that are noncohesive 
Ball recognizes 5'-TGGCCA-3’ and cuts between G and C. 

Table 10.2 lists some of the commonly used restriction endonucleases and their 
recognition sites. Different restriction enzymes sometimes recognize and cleave within 
identical target sequences. Such enzymes are called isoschizomers, meaning that they cut 
at the same site; for example, MboI and Sau3A are isoschizomers. 


Restriction Fragment Size Assuming random distribution and equimolar proportions 
for the four nucleotides in DNA, a particular tetranucleotide sequence should occur 
every 4* nucleotides, or every 256 bases. Therefore, the fragments generated by a 
restriction enzyme that acts at a four-nucleotide sequence should average about 250 bp 
in length. Six-cutters, enzymes such as EcoRI or BamHI, will find their unique hexa- 
nucleotide sequences, on average, once in every 4096 (4°) bp of length. Because the 
genetic code is a triplet code with three successive bases in a DNA strand specifying 
one amino acid in a polypeptide sequence, and because polypeptides typically con- 
tain at most 1000 amino acid residues, the fragments generated by six-cutters are 
approximately the size of prokaryotic genes. This property makes these enzymes 
useful in the construction and cloning of genetically useful recombinant DNA mol- 
ecules. For the isolation of even larger nucleotide sequences, such as those of genes 
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Table 10.2 


mon Restriction Endonucleases and Their Restriction Sites 


Over 3000 restriction enzymes have been characterized. They are named by italicized three-letter codes; the first is a capital letter denoting the 
genus of the organism of origin, and the next two letters are an abbreviation of the particular species. Because prokaryotes often contain more 
than one restriction enzyme, the various representatives are assigned letter and number codes as they are identified. Thus, EcoRI is the initial 
restriction endonuclease isolated from Escherichia coli, strain R. With one exception (Ncil), all known type II restriction endonucleases generate 


fragments with 5'-PO, and 3’-OH ends. 


Enzyme Common Recognition Compatible 
Isoschizomers Sequence Cohesive Ends 
Alul AG|CT Blunt 
Apyl Atul, EcoRII CCl G(4)GG 
Aval G|PyCGPuG Sall, Xhol, Xmal 
BamHI G|JGATCC Bell, Bgl, Mbol, Sau3A, XhoI 
Bell T)GATCA BamHI, BglII, Mbol, Sau3A, Xholl 
BstX1 CCANNNNN|NTGG 
Ddel C|TNAG 
EcoRI G|AATTC 
Hael #)GG|CC(A) Blunt 
Hindi AJAGCTT 
Hpall CCGG Accl, Acyl, Asull, Clal, Tag! 
Kpni GGTAC|C 
Mbol Sau3A {GATC BamHI, Bci, Bell, Xholl 
NotI GC|GGCCGC 
Pstl CTGCA|G 
Sacl Sstl GAGCT|C 
Sall G|TCGAC Aval, Xhol 
Sau3A {GATC BamHI, Bcll, Bgl, Mbol, Xholl 
Sfil GGCCNNNN|NGGCC 
Smal Xmal CCCGGG Blunt 
Xhol C|TCGAG Aval, Sall 
Xholl GATCE) BamHI, Bcil, Bgill, Mbol, Sau3A 


encoding large polypeptides (or those of eukaryotic genes that are disrupted by large 
introns), partial or limited digestion of DNA by restriction enzymes can be employed. 
However, restriction endonucleases that cut only at specific nucleotide sequences 
8 or even 13 nucleotides in length are also available, such as NotI and Sfil. 


10.6f Restriction Endonucleases Can Be Used to Map 
the Structure of a DNA Fragment 


The application of these sequence-specific nucleases to problems in molecular biol- 
ogy is considered in detail in Chapter 12, but one prominent application is described 
here. Because restriction endonucleases cut dsDNA at unique sites to generate large 
fragments, they provide a means for mapping DNA molecules that are many 
kilobase pairs in length. Restriction digestion of a DNA molecule is in many ways 
analogous to proteolytic digestion of a protein by an enzyme such as trypsin: The 
restriction endonuclease acts only at its specific sites so that a discrete set of nucleic 
acid fragments is generated. This action is analogous to trypsin cleavage only at 
Arg and Lys residues to yield a particular set of tryptic peptides from a given pro- 
tein. The restriction fragments represent a unique collection of different-sized 
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A B A+B 
kbp Longer DNA A Treatment with restriction endonuclease A gave two fragments: one 7 kbp in size and 
fragments one 3 kbp in size, as judged by gel electrophoresis. 
= 
B Treatment of another sample of the 10 kbp DNA with restriction endonuclease B 
7] gave three fragments: 8.5 kbp, 1.0 kbp, and 0.5 kbp. 
————= 
A+B Treatment of a third sample with both restriction endonucleases A and B yielded 
5-4 fragments 6.5 kbp, 2 kbp, 1 kbp, and 0.5 kbp. 
3, ——=—= 
S=> 
14 ~] Shorter DNA 
SS em fragments 
The observed electrophoretic 
pattern (1) (2) 
3 7 7 3 
Enzyme A = = 


8 (4) (5 
g5 051 8.5 10.5 1 85 99 


0.51 8.5 0.5 8.5 1 10.5 85 
(6) O © 
| Which arrangements are correct? The only combinations giving the observed A + B digests are @+ O and@+@ 


Enzyme B 


B B 
1 8.5 0.5 


e= Digests @ + O 


3 T 


B B 
0.5 8.5 1 


p Digests O +O 


7 A? 


Figure 10.28 Restriction mapping of a DNA molecule as determined by an analysis of the 
electrophoretic pattern obtained for different restriction endonuclease digests. (Keep in mind that a 
dsDNA molecule has a unique nucleotide sequence and therefore a definite polarity; thus, fragments 
from one end are distinctly different from fragments derived from the other end.) 


DNA pieces. Fortunately, this complex mixture can be resolved by electrophoresis 
(see the Chapter 5 A Deeper Look box “Techniques Used in Protein Purification”). 
Electrophoresis of DNA molecules on gels of restricted pore size (as formed in aga- 
rose or polyacrylamide media) separates them according to size, the largest being 
slowed in their migration through the gel pores, whereas the smallest move rela- 
tively unhindered. Figure 10.28 shows a hypothetical electrophoretogram obtained 
for a DNA molecule treated with two different restriction nucleases, alone and in 
combination. Just as cleavage of a protein with different proteases to generate over- 
lapping fragments allows an ordering of the peptides, restriction fragments can be 
ordered or mapped according to their sizes, as deduced from the patterns depicted 
in Figure 10.28. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 10. 


Nucleotides are composed of either a pyrimidine or a purine base, a 
ribose, and phosphate. 


Nucleotides participate in virtually all aspects of metabolism. 
Nucleic acids are polynucleotides. 

The two kinds of nucleic acid are DNA and RNA. 

DNA is the repository of genetic information in cells. 


RNA is involved in the expression of genetic information through the 
processes of transcription and translation. 


Pyrimidines and purines are heterocyclic aromatic substances con- 
taining two (pyrimidines) or four (purines) nitrogen atoms in their 
ring structures. 


The three principal pyrimidines are cytosine, uracil, and thymine. 
The two principal purines are adenine and guanine. 
Purines and pyrimidines undergo keto—enol tautomeric shifts. 


The particular keto or enol form of a nitrogenous base affects its 
H-bonding possibilities. 


The aromaticity of pyrimidines and purines endows them with UV 
absorbance properties. 


Nucleosides are formed when ribose (or deoxyribose) is attached to a 
nitrogenous base via a B—N-glycosidic bond. 


Nucleotides are formed when phosphoric acid is esterified to an 
—OH group of the ribose of a nucleoside. 


Cyclic nucleotides result when the phosphate group forms ester link- 
ages with two ribose —OH groups. 


Nucleoside diphosphates (NDPs) and nucleoside triphosphates (NTPs) 
are nucleotides with two or three phosphoryl groups. 


NDPs and NTPs are polyprotic acids. 
NTPs are the carriers of chemical energy. 


Nucleic acids are polymers of nucleotides linked 3’ to 5' by phospho- 
diester bridges to form a sugar—phosphate backbone from which the 
bases project. 


The sequence of bases along the sugar—phosphate backbone is the 
defining characteristic of a nucleic acid. 


DNA is found in chromosomes as the genetic material. 


RNA has as its central role the transfer of information from DNA 
to protein. 


The fundamental structure of DNA is a double helix that is main- 
tained by base pairing between a purine base in one strand and a 
pyrimidine base in the other. 


The canonical base pairs are A: T and G:C. 
The information of DNA is encoded in digital form. 


Messenger RNA (mRNA) carries the sequence information for syn- 
thesis of a protein. 


Ribosomal RNA (rRNA) provides both the structural framework 
and the protein-synthesizing function of ribosomes. 


Transfer RNAs (tRNAs) deliver amino acids in the form of aminoacyl- 
tRNAs for use in protein synthesis. 


Small nuclear ribonucleoproteins (snaRNPs) mediate the splicing of 
enkaryotic gene transcripts to form mature mRNAs. 


Small RNAs and long noncoding RNAs serve a variety of regulatory 
roles in cells. 


The chemical differences between DNA and RNA—thymine vs. uracil, 
presence or absence of 2’-OH group—have important biological 
consequences. 


DNA is acid labile; RNA is not. 


RNA is susceptible to hydrolysis by base because of its 2'-OH group; 
DNA lacks this —OH and is not susceptible to alkaline hydrolysis. 


Enzymes that hydrolyze the phosphoester bonds of nucleic acids are 
called nucleases. 


Nucleases differ in their specificity for the different forms of nucleic 
acid, such as DNA versus RNA or single stranded versus double 
stranded. 


Type II restriction endonucleases are nucleases that cleave double- 
stranded DNA at specific base sequences. These enzymes are useful 
for in vitro manipulation of DNA molecules. 


Restriction endonucleases can be used to map the structure of DNA 
molecules. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. The Structure and lonization Properties of Nucleotides Draw the 


principal ionic species of 5'-GMP occurring at pH 2. 


. Oligonucleotide Structure Draw the chemical structure of pACG. 


3. Chemistry of Nucleosides For each part below, select the term that 


applies when describing the chemistry of nucleosides: 
a. pentose, hexose, or heptose 

b DorL 

c. ketose or aldose 

d. aorg 

e. pyranose or furanose 


. Tautomerism Draw all the tautomeric forms of uric acid (see 


Figure 10.5). 


5. 


Chargaff’s Rules for the Base Composition of DNA Chargaff’s results 
(Table 10.1) yielded a molar ratio of 1.56 for A to Gin human DNA, 
1.75 for T to C, 1.00 for A to T, and 1.00 for G to C. Given these values, 
what are the approximate mole fractions of A, C, G, and T in human 
DNA? 


Abundance of the Different Bases in the Human Genome Results on 
the human genome published in Science (Science 291:1304—1350 
[2001]) indicate that the haploid human genome consists of 

2.91 gigabase pairs (2.91 X 10° base pairs) and that 27% of the 
bases in human DNA are A. Calculate the number of A, T, G, 

and C residues in a typical human cell. 


. The Base Sequence in the Two Polynucleotide Chains of a DNA 


Double Helix Is Complementary Adhering to the convention of 
writing nucleotide sequences in the 5'—3' direction, what is the 
nucleotide sequence of the DNA strand that is complementary 
to dd ATCGCAACTGTCACTA? 
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8. Structure of Nucleic Acids A student put together a structure of a 


short fragment of DNA. Identify the serious mistake that the student 
made in this structure. 


9. The Relationship between the Nucleotide Sequence of an mRNA and 


the DNA Strand from Which It Is Transcribed Messenger RNAs are 
synthesized by RNA polymerases that read along a DNA template 
strand in the 3’->5’ direction, polymerizing ribonucleotides in the 
5'—3' direction (see Figure 10.19). Give the nucleotide sequence 
(5'—3') of the DNA template strand from which the following 
mRNA segment was transcribed: 5'-UAGUGACAGUUGCGAU-3’. 


10. An Antisense RNA Is an RNA That Base-Pairs with an mRNA and 


Prevents Its Translation One strategy to thwart the deleterious effects of 
genes activated in disease states (such as cancer) is to generate antisense 
RNAs in affected cells. Suppose the transcription of a cancer-activated 
gene yielded an mRNA whose sequence included the segment 
5'-UACGGUCUAAGCUGA. Which of the following is the correct 
nucleotide sequence (5’—>3’) for the template strand in a DNA duplex 
that would code for an antisense RNA? (Hint: Recall that the DNA 
strand that is not copied by RNA polymerase during transcription has a 
nucleotide sequence identical to that of the RNA being synthesized 
[except T residues are found in the DNA strand at sites where U residues 
occur in the RNA)). 


a. 5'- ATGCCAGATTCGACT 
b. 5'- TACGGTCTAAGCTGA 
c. 5'- TAGCGGTCTAAGCTGA 
d. 5'- TCAGCTTAGACCGCTA 


11. Restriction Endonuclease Mapping of a DNA Fragment A 10 kbp 


DNA fragment digested with the restriction endonuclease EcoRI 
yielded fragments 4 kbp and 6 kbp in size. When digested with 
BamHI, fragments 1, 3.5, and 5.5 kbp were generated. Concomitant 
digestion with both EcoRI and BamHI yielded fragments 0.5, 1, 3, 
and 5.5 kbp in size. Give a possible restriction map for the original 
fragment. 


12. Design of DNA Sequences with Overlapping Restriction Endonuclease 


Sites Based on the information in Table 10.2, describe two different 
20 base-pair nucleotide sequences that have restriction sites for 
BamH 1, Pstl, Sall, and Smal. Give the sequences of the Smal 
cleavage products of each. 


13. Calculate the Free Energy Change for Synthesis of a Polynucleotide 


(Integrates with Chapter 3.) The synthesis of RNA can be summa- 
rized by the reaction: 


n NTP —> (NMP), + n PP; 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


What is the AG?’ oyera for synthesis of an RNA molecule 100 nucleotides 
in length, assuming that the AG” for transfer of an NMP from an NTP 
to the 3’-O of polynucleotide chain is the same as the AG” for transfer 
of an NMP from an NTP to H20? (Use data given in Table 3.1.) 


Protein-DNA Interactions Gene expression is controlled through the 

interaction of proteins with specific nucleotide sequences in double- 

stranded DNA. 

a. List the kinds of noncovalent interactions that might take place 
between a protein and DNA. 

b. How do you suppose a particular protein might specifically interact 
with a particular nucleotide sequence in DNA? That is, how might 
proteins recognize specific base sequences within the double helix? 


The Properties Restriction Endonucleases Must Have Restriction 
endonucleases also recognize specific base sequences and then act to 
cleave the double-stranded DNA at a defined site. Speculate on the 
mechanisms by which this sequence recognition and cleavage reaction 
might occur by listing a set of requirements for the process to take place. 


The Properties That Carbohydrates Contribute to Nucleosides, 
Nucleotides, and Nucleic Acids A carbohydrate group is an integral 
part of a nucleoside. 


a. What advantage does the carbohydrate provide? 

b. Why might ribose be preferable for this backbone instead of 
glucose? 

c. Why might 2-deoxyribose be preferable to ribose in some 
situations? 


Chemical Analysis of DNA A DNA sample was analyzed and found to 
contain 0.1 pg phosphorus. What is the number of base pairs present 
in this DNA sample? 


How Does the Presence of Phosphate Affect the Properties of 

Nucleotides? Phosphate groups are also integral parts of nucleotides, 

with the second and third phosphates of a nucleotide linked through 

phosphoric anhydride bonds, an important distinction in terms of the 

metabolic role of nucleotides. 

a. What property does a phosphate group have that a nucleoside 
lacks? 


b. How are phosphoric anhydride bonds useful in metabolism? 


c. How are phosphate anhydride bonds an advantage to the 
energetics of polynucleotide synthesis? 


Calculate the Frequency of Occurrence of an RNAi Target Sequence 
The RNAs acting in RNAi are about 21 nucleotides long. To judge 
whether it is possible to uniquely target a particular gene with a RNA 
of this size, consider the following calculation: What is the expected 
frequency of occurrence of a specific 21-nucleotide sequence? 


Calculate the Length of a Nucleotide Sequence Whose Expected 
Frequency Is Once per Haploid Human Genome The haploid human 
genome consists of 3 X 10° base pairs. Using the logic in problem 19, 
one can calculate the minimum length of a unique DNA sequence 
expected to occur by chance just once in the human genome. That is, 
what is the length of an oligonucleotide whose expected frequency of 
occurrence is once every 3 X 10° bp? 


Design a Sequencing Strategy for Nucleic Acids Snake venom 
phosphodiesterase is an a-specific exonuclease (Figure 10.27) that acts 
equally well on single-stranded RNA or DNA. Design a protocol 
based on snake venom phosphodiesterase that would allow you to 
determine the base sequence of an oligonucleotide. Hint: Adapt the 
strategy for protein sequencing by Edman degradation, as described 
in Section 5.3 and Figure 4.8a. 


Calculate the Mass of DNA in a Human Cell From the answer to 
problem 6 and the molecular weights of dAMP (331 Da), (CMP 
(307 Da), (GMP (347 Da), and dTMP (322 Da), calculate the mass 
(in daltons) of the DNA in a typical human cell. 
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23. The Bases of Nucleotides Serve as Information Symbols The bases of 
nucleotides and polynucleotides are information symbols. Their 
central role in providing information content to DNA and RNA is 
clear. What advantages might bases as information symbols bring to 
the roles of nucleotides in metabolism? 


24. Structural Complementarity in the DNA Double Helix Structural 
complementarity is the key to molecular recognition, a lesson learned 
in Chapter 1. The quintessential example of structural complemen- 
tarity in all of biology is the DNA double helix. What features of the 
DNA double helix exemplify structural complementarity? 


Think-Pair-Share Question 


Binary Code in dsDNA DNA molecules carry information dictating the 
order in which amino acids need to be linked to make a functional 
protein, the so-called genetic code. Comparisons between this genetic 
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code and computer codes are often made. However, computer codes 
are binary, consisting of a sequence of zeros and ones (0’s and 1’s), 
while the code in DNA, built from the four different nucleotides, is a 
quaternary code. 


Consider the piece of dsDNA shown. 


5’-TCAACTGGTACCTAATCCCGGGATCCGGTAATTCT- 
GAGCCCAGTAACT-3’ 


3’-AGTTGACCATGGATTAGGGCCCTAGGCCATTAAGACTC- 
GGGTCATTGA-S’ 


Devise a system by which it could serve as a source of binary code 
and write down the 0/1 sequence encoded in this dsDNA segment. 
Note that several solutions are possible. (Hint: remember Chargaff’s 
observations.) 
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Reginald H. Garrett 


Structure of Nucleic Acids 


helix adorning the cathedral in Orvieto, 
Italy, some 500 years ago. In 1953, a 
double helix was shown to be the 


Essential Question 


The nucleotide sequence—the primary structure—of DNA not only determines its higher- 
order structure, but it is also the physical representation of genetic information in organ- 
isms. RNA sequences, as copies of specific DNA segments, direct both the higher-order 
structure and the function of RNA molecules in information transfer processes. What are 
the higher-order structures of DNA and RNA, and what methodologies have allowed 
scientists to probe these structures and the functions that derive from them? 


hapter 10 presented the structure and chemistry of nucleotides and how these units 

are joined via phosphodiester bonds to form nucleic acids, the biological polymers 
for information storage and transmission. In this chapter, we investigate biochemical 
methods that reveal the primary, secondary, and tertiary structure of nucleic acids. The 
primary structure—the sequential order of nucleotides in a polynucleotide chain—is the 
embodiment of genetic information. Then, we turn to the higher orders of structure in 
the nucleic acids—the secondary and tertiary levels—that describe how these structures 
exist in nature. Although the focus here is primarily on the structural and chemical prop- 
erties of these macromolecules, it is fruitful to keep in mind the biological roles of these 
remarkable substances. We can anticipate that the cellular mechanisms for accessing 
and expressing genetic information, as well as reproducing it with high fidelity, will be 
illuminated by knowledge of the chemical and structural qualities of these polymers. 


T1.1 | How Do Scientists Determine the Primary Structure 


of Nucleic Acids? 


Determining the primary structure of nucleic acids—the nucleotide sequence—would 
seem to be a more formidable problem than amino acid sequencing of proteins, simply 
because nucleic acids contain only four unique monomeric units (A, C, G, and T), 


< Renaissance masons created this double 


structure that lies at the heart of heredity. 


Ti 


The Structure of DNA: “A melody for the eye of the 
intellect, with not a note wasted.” 


Horace Freeland Judson 
The Eighth Day of Creation 


Key Questions 


111 


11.2 


11.3 


11.6 


11.7 


How Do Scientists Determine the 
Primary Structure of Nucleic Acids? 


What Sorts of Secondary Structures 
Can Double-Stranded DNA 
Molecules Adopt? 


Can the Secondary Structure of 
DNA Be Denatured and Renatured? 
How Does DNA Adopt Structures of 
Higher Complexity? 

What Is the Structure of Eukaryotic 
Chromosomes? 

How Can Nucleic Acids Be 
Synthesized Chemically? 


What Are the Secondary and 
Tertiary Structures of RNA? 
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Old New Old New 
Figure 11.1 DNA replication yields two new DNA 


duplexes identical to the original DNA double helix. 


Single-  5' 
stranded 
DNA 


DNA polymerase 


whereas proteins have 20. With only four, there are apparently fewer specific sites for 
selective cleavage, distinctive sequences are more difficult to recognize, and the likeli- 
hood of ambiguity is greater. The much greater number of monomeric units in most 
polynucleotides compared with polypeptides is a further difficulty. However, two sim- 
ple tools rendered the determination of nucleotide sequence in nucleic acids far easier 
than amino acid sequencing. One of these tools was the set of type II restriction endo- 
nucleases that cleave DNA at specific oligonucleotide sites, generating unique fragments 
of manageable size (see Chapter 10). The second was gel electrophoresis, a method capa- 
ble of separating nucleic acid fragments that differ from one another in length by just a 
single nucleotide. As we shall see, the early successes based on these tools spawned more 
efficient and effective technologies for nucleic acid sequencing, so much so that now, 
such sequencing is extremely easy, and a new problem has emerged: the “big data” prob- 
lem; that is how to interpret the information held within the more than 10" base pairs’ 
worth of novel sequence data. This challenge is addressed by the new field of bioinfor- 
matics, the analysis of these data sets to reveal biologically relevant information. 


Tl.la The Nucleotide Sequence of DNA Can Be Determined 
from the Electrophoretic Migration of a Defined Set 
of Polynucleotide Fragments 


The traditional protocol for nucleic acid sequencing has been the chain termination or 
dideoxy method of Frederick Sanger, which relies on enzymatic replication of the 
DNA to be sequenced. Very sensitive analytical techniques that can detect the newly 
synthesized DNA chains following electrophoretic separation are available, so Sanger 
sequencing could be carried out on as little as 1 attomole (amol, 10~'® mol) of DNA 
contained in less than 0.1 uL volume. (107!8 moles of DNA are roughly equivalent to 
10- grams (pg) of 1-kb sized DNA molecules.) These analytical techniques typically 
rely on fluorescent detection of the DNA products. 


T1.1b Sanger’s Chain Termination or Dideoxy Method Uses DNA 
Replication to Generate a Defined Set of Polynucleotide Fragments 


The rationale of the chain termination or dideoxy method is found in the biochemistry 
of DNA replication. DNA is a double helical molecule. In the course of its replication, 
the sequence of nucleotides in one strand is copied in a complementary fashion to 
form a new second strand by the enzyme DNA polymerase. Each original strand of the 
double helix serves as a template for the biosynthesis that yields two daughter DNA 
duplexes from the parental double helix (Figure 11.1). DNA polymerase will carry out 
this reaction in vitro in the presence of the four deoxynucleotide monomers and copy 
single-stranded DNA, provided a double-stranded region of DNA 1s artificially 
generated by adding a primer. This primer is an oligonucleotide capable of forming a 
short stretch of dsDNA by base pairing with the ssDNA (Figure 11.2). The primer 
must have a free 3’-OH end from which the new polynucleotide chain can grow as the 
first residue is added in the initial step of the polymerization process. DNA 
polymerases synthesize new strands by adding successive nucleotides in the 5’—3’ 
direction. 


3' (dNMP),,-3/OH + dNTP > (dNMP),,+4-3'OH + PP; + H* 


The DNA polymerase reaction: dNMP addition to the 3'-OH 
end of a DNA strand that is base-paired with a template strand 


5! 


Annealing of primer Figure 11.2 Primed synthesis of a DNA template by DNA 
creates a short stretch polymerase, using the four deoxynucleoside triphosphates as the 
of double-stranded DNA substrates. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


11.1 How Do Scientists Determine the Primary Structure of Nucleic Acids? 


The Chain Termination Protocol In the chain termination method of DNA sequencing, 
a DNA fragment of unknown sequence serves as a template in a polymerization 
reaction using DNA polymerase, usually a genetically engineered version that lacks 
all traces of exonuclease activity that might otherwise degrade the DNA. (DNA 
polymerases usually have an intrinsic exonuclease activity that allows proofreading 
and correction of the DNA strand being synthesized; see Chapter 28.) An appropriate 
oligonucleotide is used as a primer (this method is also known as the primed synthesis 
method for this reason). The reaction is run in the presence of all four deoxynucleoside 
triphosphates dATP, dGTP, dCTP, and dTTP, which are the substrates for DNA 
polymerase (Figure 11.3). In addition, the reaction mixture contains the four 
corresponding 2’,3’-dideoxynucleotides (ddATP, ddGTP, ddCTP, and ddTTP); it is 
these dideoxynucleotides that give the method its name. 

Because dideoxynucleotides lack 3'-OH groups, they cannot serve as acceptors for 
5'-nucleotide addition in the polymerization reaction; thus, the chain is terminated 
where they become incorporated. The concentrations of the deoxynucleotides in each 
reaction mixture are significantly greater than the concentrations of the dideoxynucleo- 
tides, so incorporation of a dideoxynucleotide is infrequent. Therefore, premature chain 
termination is only a random, occasional event, and a population of new strands of 
varying length is synthesized. Nevertheless, termination, although random, occurs ev- 
erywhere in the sequence. Thus, the population of newly synthesized DNAs forms a 
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nested set of molecules that differ in length by just one nucleotide. Each newly synthe- 
sized strand has a dideoxynucleotide at its 3’-end, and each of the four dideoxynucleo- 
tides used in Sanger sequencing is distinctive because each bears a fluorescent tag of a 
different color. (These fluorescent tags are attached to the 5-position of pyrimidine di- 
deoxynucleotides or the 7-position of purine dideoxynucleotides, where these tags do 
not impair the ability of DNA polymerase to add them to a growing polynucleotide 
chain.) The color of a particular fluorescence (as in orange for ddA, blue for ddC, green 
for ddG, and red for ddT) reveals which base was specified by the template and incor- 
porated by DNA polymerase at that spot. 


Reading Dideoxy Sequencing Gels The sequencing products are visualized by 
fluorescence spectroscopy following their separation according to size by capillary 
electrophoresis (Figure 11.3). Because the smallest fragments migrate fastest upon 
electrophoresis and because fragments differing by only a single nucleotide in length 
are readily resolved, the sequence of nucleotides in the set of newly synthesized DNA 
fragments is given by the order of the fluorescent colors emerging from the capillary. 
Thus, the gel in Figure 11.3 is read TTGTCGAAGTCAG (5’-3’). Because of the way 
DNA polymerase acts, this observed sequence is complementary to the corresponding 
unknown template sequence. Knowing this, the template sequence now can be 
written CTGACTTCGACAA (5’3’). 

Sanger sequencing has been fully automated. Automation is achieved through the 
use of robotics for preparing the samples, running the DNA sequencing reactions, 
loading the chain-terminated DNA fragments onto capillary electrophoresis tubes, 
performing the electrophoresis, and imaging the results for computer analysis. These 
advances made it feasible to sequence the entire genomes of organisms. 


Careers in Chemistry 


Sabrina Farais 
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a lab director in the medical marijuana industry, let that scare you away from your passion. Experi- 
working in tandem with the Oklahoma Medical ence is strength and challenges become easier as you 
Marijuana Authority (OMMA). become stronger.” 


11.1c Next-Generation Sequencing by Synthesis 


The enormous significance of DNA sequence information to fundamental questions 
in biology, medicine, and personal health has been the driving force for the develop- 
ment of ever more rapid and efficient DNA sequencing technologies, the so-called 
next-generation (“next-gen”) or ultra-high-throughput (UHT) sequencing. None of 
this would have been possible without extraordinary advances in nanotechnology. 
Nanobiotechnology has developed materials and methods allowing manipulation 
and analysis of extremely small amounts of biomolecules, such as DNA. Indeed, in 
some instances, the ultimate goal—studies on single molecules—has been met. 

Most commercially available sequencers have a common attribute: As in the Sanger 
or capillary electrophoresis protocols, they perform sequencing by synthesis (SBS), 
where DNA polymerase catalyzes the synthesis of a complementary strand. The 
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next-gen sequencers have the advantage in that they can directly detect the addition of 
each base and record that information in digital form for computer analysis. The digi- 
tal record also contains information that enables quantification of the amount of each 
DNA molecule within the library. Hundreds of thousands to hundreds of millions of 
sequencing reactions can be run at the same time on these instruments, yielding thou- 
sands of gigabases (>10!* bases) of sequence information per run. This ability is 
referred to as massively parallel sequencing. Unlike capillary sequencers, which can 
read about 600 bases per DNA template, these next-gen sequencers have shorter read 
lengths, on the order of 150 to 250 bases per template. Their capacity for massively 
parallel sequencing means that read length is not a limitation because so many differ- 
ent overlapping DNA strands are sequenced at once. As in amino acid sequencing, the 
complete sequence of a DNA molecule is reconstructed by comparing the sequences 
of overlapping fragments. 

The massively parallel DNA sequencing platforms developed by Illumina, Inc., rely 
on sequencing by synthesis. The Illumina protocol uses fluorescent-tagged deoxynucle- 
otide substrates whose 3’OH terminus is blocked by an easily removed chemical group 
(Figure 11.4a). By using such reversible terminator substrates, this process allows only 
one base to be added at a time to a growing chain. Once the DNA polymerase selects 
the appropriate dNTP through base pairing with the template, it is added it to the grow- 
ing chain. Whether an A, C, G, or T was added is revealed upon laser excitation of the 
fluorescent tag it carries (Figure 11.4b). The fluorescent flash is recorded by a digital 
camera, and the information is sent to a computer. Then, both the 3’-blocking group 
and the bulky fluorescent tag are chemically removed, exposing a free 3’-O to the DNA 
polymerase, which can now select and add the next appropriate dNTP in line. 

In Illumina SBS sequencing, the DNA of interest is first broken into manageable 200 
to 600 bp fragments. Individual fragments are processed to generate hundreds of mil- 
lions of identical single-stranded DNA templates (called “clusters”). Individual DNA 
molecules within the cluster are immobilized by base pairing with primers attached to 
the surface of a glass sheet. The sequencing process begins when reagents for sequenc- 
ing by synthesis are flowed across the sheet. In the reaction cycle, a nucleotide substrate 
(A, C, G, or T) will be added to a growing primer, as chosen by base pairing with the 
template. Let’s suppose the first unpaired base in the primer-immobilized template is G. 
The DNA polymerase will select the dCTP representative among the cleavable 3’-O- 
blocked-dNTP reversible terminator substrate choices and extend the chain. The 
fluorescence of the specific dCTP tag reports its presence at this position, and this 
information is digitally recorded. In the next step, both the 3'-O-blocking group and the 
fluorescent tag are removed. The 3-O-terminal C on the growing “sequence by synthe- 
sis” strand now has a free 3'-OH group that can accept the next incoming nucleotide, as 
determined by base pairing with the template at the next base in line. The process con- 
tinues in this manner until completed. Since each cycle provides a unique fluorescent 
signal identifying the base just incorporated (and thus, the template base that was avail- 
able to the DNA polymerase), information on the template sequence is accumulated. 
Illumina sequencing is currently the most widely used sequencing protocol. The latest 
Illumina sequencing instruments take less than two days and produce in excess of 3000 
Gb of sequence data per run. 


T1.1d High-Throughput DNA Sequencing Based on Proton Detection 


As shown in Figure 11.2, the DNA polymerase reaction produces pyrophosphate and 
an H* with each dNTP addition to the growing polynucleotide strand (see Box, 
Figure 11.2 for the DNA polymerase reaction). Detection of this proton release is the 
basis for a sequencing by synthesis strategy developed by Ion Torrent. Semiconductor 
chips (such as those used in the electronics industry) are the platform for this technol- 
ogy. These chips consist of an array of microwells, placed over dedicated ion sensors. 
Primed DNA strands to be sequenced are captured in individual microwells. Since 
each microwell contains a unique population of identical DNA strands, many differ- 
ent sequences can be determined in parallel on the same chip. Next, either dATP, 
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Figure 11.4 The Illumina SBS sequencing protocol. (a) Generic structure of the fluorescent dye-tagged, 
3'-O-blocked reversible deoxynucleotide substrates used in Illumina sequencing protocols. (Adapted 
from Figure 1 in Chen, F., Dong, M., Ge, M., Zhu, L., et al., 2013. The history and advances of 
reversible terminators used in new generations of sequencing technology. Genomics, Proteomics, and 
Bioinformatics 11:34-40). (b) The Illumina SBS sequencing cycle. Primer extension through DNA 
polymerase selection of a deoxynucleotide substrate through base paring with the template strand adds 
the appropriate 3'-O-blocked fluorescence-tagged deoxynucleotide. The deoxynucleotide added is 
identified by its unique fluorescence emission following laser excitation (Step 2, “read color”), followed 
by cleavage and removal of the fluorescence tag and the 3’-O-blocking group (Step 3, remove color 
and 3’-blocking group”). Wash and repeat. (Adapted from Figure 3 in Chen, F., Dong, M., Ge, M., 
Zhu, L., et al., 2013. The history and advances of reversible terminators used in new generations of 
sequencing technology. Genomics, Proteomics, and Bioinformatics 11:34-40). 
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dCTP, dGTP, or dTTP is introduced into the microwells in the presence of DNA 
polymerase. If the introduced dNTP is complementary to the unpaired base on the 
primed DNA strand being sequenced, DNA polymerase incorporates it, and an H* is 
released, which causes a pH change. This pH change is detected by the semiconductor 
ion sensor, which converts this chemical information into a digital signal in the form 
of a voltage change. As dNTPs are flowed one after the other through the microwells, 
voltage changes are recorded in the various microwells each time DNA polymerase 
adds a nucleotide to the growing chain. The record of voltage changes within each 
microwell reveals the sequence of the DNA strand in that well. The advantages of this 
methodology are rapid sequencing at a lower cost. 


Tl.le Single-Molecule Real-Time Sequencing 


A disadvantage of Illumina and Ion Torrent sequencing systems is the short-read 
length they generate—on the order of only 150 to 200 nucleotides per DNA template. 
This disadvantage can usually be offset by running large numbers of sequence experi- 
ments concurrently with overlapping DNA sets. However, eukaryotic genomes con- 
tain large regions of repetitive DNA where the same sequence units are repeated over 
and over again. Indeed, 30% of the human genomes consists of thousands of copies of 
such tandemly repeated sequences, some about 300 bp, others, thousands of bp in 
length. It is very difficult to pinpoint the beginning and ends of tandemly repeated 
sequences using short-read sequencing technologies. 

Single-molecule real-time (SMRT) sequencing, developed by Pacific BioSciences 
(PacBio), offers an alternative approach to overcome many of these limitations. SMRT 
sequencing yields read lengths that exceed 10 kbp on average (Figure 11.5). It is real- 
time sequencing because it doesn’t require a pause between read steps. SMRT sequenc- 
ing can resolve the precise location and sequence of long repetitive regions, a real ad- 
vantage in genome analysis. 
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Figure 11.5 Single-molecule real-time (SMRT) sequencing. (a) Hairpin loops (green) of DNA are ligated to the double-stranded DNA (yellow/blue) to be 
sequenced. Note the distinct single-stranded regions and the double-stranded ‘primer’ region (orange) of the left-hand loop. A sample of this DNA construct, 
called an SMRTbell, is loaded into a chip called the SMRT cell. A single DNA polymerase molecule (gray) binds to the primer end in the DNA loop. Each 
SMRT cell has from 35,000 to 75,000 wells, each of which contains a different DNA SMRTbell sample with its single DNA polymerase. The four 
deoxynucleotide substrates consist of the usual dNTP structure, each with a distinctive fluorophore triphosphate fluorescent tag attached to the terminal 
phosphate group of the dNTP. The four different phosphoflurophore-dNTP substrates are flooded into the wells containing the various SMRTbell samples. 
The appropriate deoxynucleotide substrate, as determined by base-pairing with the SMRT template, is captured by the DNA polymerase and can be identified 
by its characteristic light pulse upon laser excitation. DNA polymerase action adds this chosen substrate as a 3'-O terminal-dNMP, releasing the 
phosphofluorophore tag, which is washed away. The sequence of the DNA sample is determined by recording the distinctive light pulses over time, as the 
DNA polymerase moves along its template, selecting the right deoxynucleotide base from among the four phosphofluorophore-dNTP substrates. (b) The 
dATP version of the phosphofluorophore-dNTP substrates). (a) Adapted from Figure 1 in Rhoads, A., and Au, F. K., 2015, PacBio sequencing and Its 
applications. Genomics, Proteomics, Bioinformatics 3:278-289. (b) Adapted from Figure S4 in Eid, J., Fehr, A., Fray, J., Luong, K., et al., 2009. Real-time DNA 
sequencing from single polymerase molecules. Science 323:133-138. 
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Human Biochemistry 


The Human Genome Project 


Completed in 2003, the Human Genome Project was a 13-year 
collaborative international, government-, and private-sponsored ef- 
fort to map and sequence the entire human genome, some 3 billion 
base pairs distributed among the two sex chromosomes (X and Y) 
and 22 autosomes (chromosomes that are not sex chromosomes). 
A primary goal was to identify and map at least 3000 genetic 
markers (genes or other recognizable loci on the DNA), which were 
evenly distributed throughout the chromosomes at roughly 100-kb 
intervals. At the same time, determination of the entire nucleotide 
sequence of the human genome was undertaken. J. Craig Venter 
and colleagues working at Celera, a private corporation, took an 
alternative approach based on computer alignment of sequenced 
human DNA fragments. A working draft of the human genome 
was completed in June 2000 and published in February 2001. In 
reality, the human genome sequences reported in 2011 were in- 
complete. About 8% of human DNA is found in heterochromatin 
(regions of highly compacted DNA located principally near the 
centromeres and telomeres of chromosomes). With the advent of 
PacBio SMRT and Oxford Nanopore Technology (ONT) long-read 
sequencing technology (Sections 11.3e and f), accurate nucleotide 
sequence information about these regions was finally obtained and 
the first truly complete human genome was published in 2022. An 
ancillary part of the project focused on sequencing the genomes 
of other species (such as yeast, Drosophila melanogaster [the fruit 
fly], mice, and Arabidopsis thaliana [a plant]) to reveal comparative 
aspects of genetic and sequence organization (Table 11.1). 

The Human Genome Project cost $3 billion, or about $1/base. 
Today, thanks to technological advances, any person’s genome can 
be sequenced for less than $1,000. This success ushers in the excit- 
ing new era of personalized medicine, where genomic information 
can be used to make customized treatment decisions about patients. 
As of 2020, the Human Gene Mutation Database (HGMD) listed 
more than 300,000 mutations in almost 12,000 genes associated with 
human disease. Related developments include the Cancer Genome 
Atlas (information on the genomes of over 11,000 tumors from 
33 of the most common cancers as of 2018), the British 100,000 
Genomes Project (completed in 2018, a compilation of 100,000 ge- 
nomes from around 85,000 patients ill with a rare disease or cancer), 
and the 1,000 Genomes Project of BGI Group (initiated in 2008 
and completed in 2015, a detailed catalog of global human genetic 
variation). In 2018, the Earth BioGenome Project was launched, 


with the goal of sequencing the DNA of the 1.5 million known spe- 
cies of eukaryotes. The 24 September 2021 issue of Science (volume 
373, issue 6562) marks the 20th anniversary of the publication of a 
first-draft human genome sequence and reviews the legacy of this 
accomplishment. 


| Table 11.1 Completed Genome Nucleotide Sequences! 


Genome Year 
Genome Size? Completed 
Bacteriophage 6X174 0.0054 1977 
Bacteriophage A 0.048 1982 
Marchantia? chloroplast genome 0.187 1986 
Vaccinia virus 0.192 1990 
Marchantia mitochondrial genome 0.187 1992 
Variola (smallpox) virus 0.186 1993 
Haemophilus influenzae* 1.830 1995 
(Gram-negative bacterium) 
Mycobacterium genitalium (mycobacterium) 0.58 1995 
Escherichia coli (Gram-negative bacterium) 4.64 1996 
Saccharomyces cerevisiae (yeast) 12.1 1996 
Methanococcus jannaschii (archaeon) 1.66 1998 
Arabidopsis thaliana (green plant) 115 2000 
Caenorhabditis elegans (simple animal: 88 1998 
nematode worm) 
Drosophila melanogaster (fruit fly) 117 2000 
Homo sapiens (human) 3038 2001 
Pan troglodytes (chimpanzee) 3109 2005 
Homo sapiens neanderthalensis (Neanderthal) 3200 2010 


'Data available from the National Center for Biotechnology Information at the National 
Library of Medicine. Website: http://www.ncbi.nlm.nih.gov/ 

2Genome size is given as millions of base pairs (mb). 

3Marchantia is a bryophyte (a nonvascular green plant). 

‘The first complete sequence for the genome of a free-living organism. 


11.1f Oxford Nanopore Technologies (ONT) Real-Time Sequencing 


A record of a DNA molecule’s sequence is carried in each of its strands. It would be 
advantageous to have a platform that could read single-stranded DNA sequences 
rapidly, accurately, and reliably. One new approach is to measure the change in the 
electrical properties of a synthetic membrane as single strands of DNA (or RNA) are 
pulled through nanopores in the membrane. As a polynucleotide chain is pulled 
through such a nanopore, the electrical conductance of the pore is altered in subtly 
different ways, depending on nature of the chain’s nucleotide sequence (Figure 11.6). 
Readout of these electrical changes reveals the sequence of the DNA directly, with- 
out the need for expensive reagents or chemical or enzymatic reactions, as in SBS 
sequencing. ONT technology achieves long reads of DNA sequence, from one to 
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several hundred kb and even up to several Mb in length. Further advantages include 
lower cost for ONT sequencing instruments and their compact size, making them 
useful for sequencing studies in the field, away from sophisticated laboratories. 
ONT technology was instrumental in constructing the first entire nucleotide 
sequence of a human chromosome from end-to-end, the 155-megabasepair 
X chromosome. And combined PacBio SMRT-sequencing and ONT sequencing 
allowed the assembly of the entire 146,259,671 bp-long, telomere-to-telomere 
sequence of human chromosome 8. 


11.1g Bioinformatics—Functional Genomics, Proteomics, and More 


The growing body of information about whole genome sequences of organisms has 
created a new branch of science called bioinformatics: the study of the nature and 
organization of biological information. Bioinformatics includes such approaches as 
functional genomics and proteomics. 
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Figure 11.6 ONT sequencing. (a) In ONT 
sequencing, long double-stranded DNA fragments 
are tagged with sequencing adapters (/ight blue) that 
carry a preloaded motor protein on one or both 
ends (yellow). (b) This DNA construct is combined 
with proteins (not shown) that will tether the DNA 
to the nanopore and then introduced onto the 
sequencing flow cell. The flow cell contains a 
synthetic membrane studded with thousands of 
protein nanopores. Each tethering protein: DNA 
complex attaches to a nanopore. (c) The sequencing 
adapter inserts into the nanopore, and the motor 
protein unwinds the double-stranded DNA, 
threading a single-DNA strand through the 
nanopore. An applied electric current in concert 
with the motor protein drives the negatively charged 
DNA strand through the pore at a rate of about 450 
bases per second. As the DNA moves through the 
pore, it causes disruptions to the current. These 
changes relate to the nature of the specific 
deoxynucleotide sequences passing through the pore 
and can be used to identify the DNA sequence. 
Adapted from Figure 2 in Logsdon, G. A., Vollger, 
M. R., and Eichler, E. E., 2020. Long-read human 
genome sequencing and its applications. Nature 
Reviews Genetics 21:597-614. 
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Figure 11.7 Conformational possibilities along the sugar-phosphate backbone and around the N— 
glycolsidic bond. (a) The six degrees of freedom in the deoxyribose—PO, units of the polynucleotide 
chain. (b) Four puckered conformations of the furanose rings. (c) Free rotation about the Cl’-N 


glycosidic bond. 


Structure of Nucleic Acids 


Functional genomics addresses global issues of gene expression, such as looking at 
all the genes that are activated during major metabolic shifts (as from growth under 
aerobic to growth under anaerobic conditions) or during embryogenesis and develop- 
ment of organisms. Transcriptome is the word used in functional genomics to define the 
entire set of genes expressed (as mRNAs transcribed from DNA) in a particular cell or 
tissue under defined conditions. Functional genomics also provides new insights into 
evolutionary relationships between organisms. 

Proteomics is the study of all the proteins expressed by a certain cell or tissue under 
specified conditions. Typically, this set of proteins, the proteome, is revealed by running 
two-dimensional polyacrylamide gel electrophoresis on a cellular extract or by coupling 
protein separation techniques to mass spectrometric analysis. 


11.2 | What Sorts of Secondary Structures Can 
Double-Stranded DNA Molecules Adopt? 
Single strands of DNA or RNA are conformationally dynamic. Unlike the constraint 


imposed on the conformation of polypeptides by the amide plane, no analogous con- 
straint exists along the sugar-PO, backbone of polynucleotides. 


11.2a Conformational Variation in Polynucleotide Strands 


Polynucleotide strands are inherently flexible. Each deoxyribose—phosphate segment 
of the backbone has six degrees of freedom (Figure 11.7a) as a consequence of the six 
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successive single bonds per segment along the chain. Furanose rings of pentoses are 
not planar but instead adopt puckered conformations, four of which are shown in 
Figure 11.7b. A seventh degree of freedom per nucleotide unit arises because of free 
rotation about the Cl’-N glycosidic bond. This freedom allows the plane of the base 
to rotate relative to the path of the polynucleotide strand (Figure 11.7c). 


11.2b DNA Usually Occurs in the Form of 
Double-Stranded Helices 


Fundamentally, double-stranded DNA is a regular two-chain structure with hydrogen 
bonds formed between opposing bases on the two chains (see Chapter 10). Such H 
bonding is possible only when the two chains are antiparallel. The polar sugar- 
phosphate backbones of the two chains are on the outside. The bases are stacked on 
the inside of the structure; these heterocyclic bases, as a consequence of their 
a-electron clouds, are hydrophobic on their flat sides. One purely hypothetical 
conformational possibility for a two-stranded arrangement would be a ladderlike 
structure (Figure 11.8a) in which the base pairs are fixed at 0.6 nm apart because this 
is the distance between adjacent sugars along a polynucleotide strand. Because H,O 
molecules could fit into the spaces between the hydrophobic surfaces of the bases, this 
conformation is energetically unfavorable. This ladderlike structure converts to a 
double helix when given a simple right-handed twist (Figure 11.8b). Helical twisting 
brings the base-pair rungs of the ladder closer together, stacking them 0.34 nm apart, 
without affecting the sugar—sugar distance of 0.6 nm. Because this helix repeats itself 
approximately every 10 bp, its pitch is 3.4 nm. This is the major conformation of DNA 
in solution, and it is called B-DNA. 


11.2c Watson-Crick Base Pairs Have Virtually 
Identical Dimensions 


As indicated in Chapter 10, the base pairing in DNA is size complementary: Large 
bases (purines) pair with small bases (pyrimidines). Hydrogen bond formation 
between purines and pyrimidines dictates that the purine adenine pairs with the 
pyrimidine thymine; the purine guanine pairs with the pyrimidine cytosine. Size 
complementarity means that the A:T pair and G:C pair have virtually identical 
dimensions (Figure 11.9). Watson and Crick realized that units of such structural 
equivalence could serve as spatially invariant substructures to build a polymer whose 
exterior dimensions would be uniform along its length, regardless of the sequence of 
bases. That is, the pairing of smaller pyrimidines with larger purines everywhere 
across the double-stranded molecule allows the two polynucleotide strands to assume 
essentially identical helical conformations. 


11.2d The DNA Double Helix Is a Stable Structure 


Several factors account for the stability of the double helical structure of DNA. 


H Bonds Although it has long been emphasized that the two strands of DNA are held 
together by H bonds formed between the complementary purines and pyrimidines, 
two in an A:T pair and three in a G:C pair (Figure 11.9), the H bonds between base 
pairs impart little net stability to the double-stranded structure compared with the 
separated strands in solution. When the two strands of the double helix are sepa- 
rated, the H bonds between base pairs are replaced by H bonds between individual 
bases and surrounding water molecules. Polar atoms in the sugar—phosphate back- 
bone do form external H bonds with surrounding water molecules, but these form 
with separated strands as well. 


Electrostatic Interactions A prominent feature of the backbone of a DNA strand is 
the repeating array of negatively charged phosphate groups. These arrays of nega- 
tive charge along the strands repel each other so that their sugar—phosphate 


(a) Ladder 
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Figure 11.8 (a) Double-stranded DNA as an 
imaginary ladderlike structure. (b) A simple right- 
handed twist converts the ladder to a helix. 
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Major groove Major groove 


Minor groove Minor groove 


Figure 11.9 Watson—Crick A:T and G:C base pairs. All H bonds in both base pairs are straight. 


backbones are kept apart and the two strands come together through Watson—Crick 
base pairing. As a consequence, the negative charges are situated on the exterior 
surface of the double helix, such that repulsive effects are minimized. Further these 
charges become electrostatically shielded from one another because divalent cat- 
ions, particularly Mg**, bind strongly to the anionic phosphates. Mg?* ions are 
abundant in cells. 


Van der Waals and Hydrophobic Interactions The core of the helix consists of the 
base pairs that stack together through m, z-electronic interactions (a form of van 
der Waals interaction) and hydrophobic forces. These base-pair stacking interac- 
tions range from —16 to —51 kJ/mol (expressed as the energy of interaction 
between adjacent base pairs), contributing significantly to the overall stabilizing 
energy. 

A consequence of the way A:T and G:C base pairs form is that the sugars of the 
respective nucleotides have opposite orientations. This is why the sugar—phosphate back- 
bones of the two chains run in opposite or “antiparallel” directions. Furthermore, the 
two glycosidic bonds holding the bases in each base pair are not directly across the helix 
from each other (Figure 11.10). Consequently, the sugar-phosphate backbones of the 
helix are not equally spaced along the helix axis and the grooves between them are not 
the same size. Instead, the intertwined chains create a major groove and a minor groove 
(Figure 11.10). The edges of the base pairs have a specific relationship to these grooves. 
The “top” edges of the base pairs (“top” as defined by placing the glycosidic bond at the 
bottom, as shown in Figure 11.10) are exposed along the interior surface or “floor” of 
the major groove; the base-pair edges nearest to the glycosidic bond form the interior 
surface of the minor groove. Some proteins that bind to DNA can actually recognize 
specific nucleotide sequences by “reading” the pattern of H-bonding possibilities 
presented by the edges of the bases in these grooves. Such DNA-protein interactions 
provide one step toward understanding how cells regulate the expression of genetic 
information encoded in DNA. 
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(a) Top view (b) B-DNA 


Radius of 


sugar—phosphate b Figure 11.10 The major and minor grooves 
backbone of B-DNA. 


11.2e Double Helical Structures Can Adopt a Number 
of Stable Conformations 


In solution, DNA ordinarily assumes the familiar structure we have been discussing: 
B-DNA. However, nucleic acids also occur naturally in other double-helical forms. 
The base-pairing arrangement remains the same, but the inherently flexible sugar— 
phosphate backbone adopts different conformations. Base-pair rotations are one 
form of conformational variation. Helical twist is the rotation (around the axis of the 
double helix) of one base pair relative to the next (Figure 11.1la). Successive base pairs 
in B-DNA show a mean rotation of 36° with respect to each other. Propellor twist 
involves rotation around a different axis, namely, an axis perpendicular to the helix 
axis (Figure 11.11b). Propellor twist allows greater overlap between successive bases 
along a strand of DNA and diminishes the area of contact between bases and 
solvent water. 


11.2f A-Form DNA Is an Alternative Form 
of Right-Handed DNA 


An alternative form of the right-handed double helix is A-DNA. A-DNA molecules 
differ from B-DNA molecules in a number of ways. The pitch, or distance required 
to complete one helical turn, is different. In B-DNA, it is 3.4 nm, whereas in 
A-DNA, it is 2.46 nm. One turn in A-DNA requires 11 bp to complete. Depending 
on local sequence, 10 to 10.6 bp define one helical turn in B-form DNA. In A-DNA, 
the base pairs are no longer nearly perpendicular to the helix axis but instead are 
tilted 19° with respect to this axis. Successive base pairs occur every 0.23 nm along 
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Two base pairs with 32° of right-handed 
helical twist: the minor-groove edges are drawn with 
heavy shading. 


(2) 


Propellor twist, as in (2), allows greater overlap _ Propellor-twisted base pairs. Note how the 

of successive bases along the same strand and hydrogen bonds between bases are distorted by this 
reduces the area of contact between the bases motion, yet remain intact. The minor-groove edges 
and water. of the bases are shaded gray. 


Figure 11.11 Helical twist and propellor twist in DNA. (a) Successive base pairs in B-DNA show a 
rotation with respect to each other. (b) Rotation in a different dimension—propellor twist—allows the 
hydrophobic surfaces of bases to overlap better. Dots represent axes perpendicular to the helix axis. 
The view is from the sugar—P backbone. (c) Each of the bases in a base pair shows positive propellor 
twist (a clockwise rotation from the horizontal, as viewed along the N-glycosidic bond, from the 
pentose Cl’ to the base). (Adapted from Figure 3.4 in Callandine, C. R., and Drew, H. R., 1992. 
Understanding DNA: The Molecule and How It Works. London: Academic Press.) 


the axis, as opposed to 0.332 nm in B-DNA. The B-form of DNA is thus longer and 
thinner than the short, squat A-form, which has its base pairs displaced around, 
rather than centered on, the helix axis. Figure 11.12 and Table 11.2 show the relevant 
structural characteristics of the A- and B-forms of DNA. (Z-DNA, another form of 
DNA to be discussed shortly, is also depicted in Figure 11.12 and Table 11.2.) A 
comparison of the structural properties of A-, B-, and Z-DNA is summarized in 
Table 11.2. 

Relatively dehydrated DNA fibers can adopt the A-conformation, and DNA may be 
in the A-form in dehydrated structures, such as bacterial and fungal spores. The pentose 
conformation in A-DNA is 3’-endo, as opposed to 2’-endo in B-DNA. Double helical 
DNA:RNA hybrids, which readily form between sequences that are complementary, 
have an A-like conformation. The 2’-OH in RNA sterically prevents double helical 
regions of RNA chains from adopting the B-form helical arrangement. Importantly, 
double-stranded regions in RNA chains often assume an A-like conformation, with 
their bases strongly tilted with respect to the helix axis. 


11.2g Z-DNA Is a Conformational Variation in the Form 
of a Left-Handed Double Helix 


Z-DNA was first discovered when X-ray analysis of crystals of the synthetic deoxy- 
nucleotide dCpGpCpGpCpG revealed an antiparallel double helix of unexpected 
conformation. The alternating pyrimidine—purine (Py—Pu) sequence of this oligo- 
nucleotide is the key to its unusual properties. The N-glycosyl bonds of G residues 
in this alternating copolymer are rotated 180° with respect to their conformation in 
B-DNA, so that the purine ring is in the syn rather than the anti conformation 
(Figure 11.13). The C residues remain in the anti form. Because the G ring is 
“flipped,” the C ring must also flip to maintain normal Watson—Crick base pairing. 
However, pyrimidine nucleosides do not readily adopt the syn conformation 
because it creates steric interference between the pyrimidine C-2 oxy substituent 
and atoms of the pentose. Because the cytosine ring does not rotate relative to the 
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Z-DNA 


B-DNA 


Figure 11.12 Comparison of the A-, B-, and Z-forms of the DNA double helix. The A- and 
B-structures show 12 bp of DNA; the Z-structures, 6 bp. The middle Z-structure shows just one 
strand of a Z-DNA double helix to illustrate better the left-handed zigzag path of the polynucleotide 
backbones in Z-DNA. (The light blue line was added to show the imaginary zigzag path.) A-DNA: 
PDB id = 2D47, B-DNA: PDB id = 355D, Z-DNA: PDB id = 1DCG. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


359 


360 Chapter 11 Structure of Nucleic Acids 


| Table 11.2 | Comparison of the Structural Properties of A-, B-, and Z-DNA 


Double Helix Type 


A B Z 

Overall proportions Short and broad Longer and thinner Elongated and slim 

Rise per base pair 2.3 A. 3.32A+0.19A 3.8 Å 

Helix packing diameter 25.5 A 23.7 Å 18.4 A 
Helix rotation sense Right-handed Right-handed Left-handed 

Base pairs per helix repeat 1 1 2 

Base pairs per turn of helix =f ~10 12 

Mean rotation per base pair 33.6° 35.9° + 4.2° —60°/2 

Pitch per turn of helix 24.6 A 33.2 Å 45.6 A 
Base-pair tilt from the perpendicular + 19° =1.2° 241° =9° 
Base-pair mean propeller twist + 18° +16° + 7° ~0° 


Helix axis location Major groove Through base pairs 


Extremely narrow but very Wide and with Flattened out on helix 
deep intermediate depth surface 


Minor groove 


Major groove proportions 


Narrow and with 
intermediate depth 


Minor groove proportions Very broad but shallow Extremely narrow but very deep 


Glycosyl bond conformation anti anti anti at C, syn at G 


Adapted from Dickerson, R. L., et al., 1983. Helix geometry and hydration in A-DNA, B-DNA, and Z-DNA. Cold Spring Harbor Symposium on Quantitative Biology 47:13-24. 


Deoxyguanosine in B-DNA (anti position) 


Deoxyguanosine in Z-DNA (syn position) 


Figure 11.13 Comparison of the deoxyguanosine conformation in B- and Z-DNA. 


pentose, the whole C nucleoside (base and sugar) must flip 180° (Figure 11.14). It is 
topologically possible for the G to go syn and the C nucleoside to undergo rotation 
by 180° without breaking and re-forming the G:C hydrogen bonds. In other words, 
the B-to-Z structural transition can take place without disrupting the bonding rela- 
tionships among the atoms involved. 

Because alternate nucleotides assume different conformations, the repeating unit on 
a given strand in the Z-helix is the dinucleotide. That is, for any number of bases, n, 
along one strand, n — 1 dinucleotides must be considered. For example, a GpCpGpC 
subset of sequence along one strand is composed of three successive dinucleotide units: 
GpC, CpG, and GpC. (In A- and B-DNA, the nucleotide conformations are essentially 
uniform and the repeating unit is the mononucleotide.) It follows that the CpG sequence 
is distinct conformationally from the GpC sequence along the alternating copolymer 
chains in the Z-double helix. The conformational alterations going from B to Z realign 
the sugar—phosphate backbone along a zigzag course that has a left-handed orientation 
(Figure 11.12), thus the designation Z-DNA. Note that in any GpCpGp subset, the 
sugar—phosphates of GpC form the horizontal “zig,” while the CpG backbone segment 
forms the vertical “zag.” The mean rotation angle circumscribed around the helix axis is 
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F > 


> B-DNA 


TN 


ANa 


> B-DNA 


—15° for a CpG step and —45° for a GpC step (giving — 60° for the dinucleotide repeat). 
The minus sign denotes a left-handed or counterclockwise rotation about the helix axis. 
Z-DNA is more elongated and slimmer than B-DNA. 


Cytosine Methylation and Z-DNA The Z-form can arise in sequences that are not 
strictly alternating Py—Pu. For example, the hexanucleotide "CGAT™CG, a Py-Pu- 
Pu-Py-Py-Pu sequence containing two 5-methylcytosines (™C), crystallizes as 
Z-DNA. Indeed, the in vivo methylation of C at the 5-position is believed to favor a 
B-to-Z switch because, in B-DNA, these hydrophobic methyl groups would protrude 
into the aqueous environment of the major groove, a destabilizing influence. In 
Z-DNA, the same methyl groups can form a stabilizing hydrophobic patch. It is 
likely that the Z-conformation naturally occurs in specific regions of cellular DNA, 
which otherwise is predominantly in the B-form. Furthermore, because methylation 
is implicated in gene regulation, the occurrence of Z-DNA may affect the expression 
of genetic information. 


11.2h The Double Helix Is Flexible 


The long-range structure of B-DNA in solution is not that of a rigid, linear rod. Instead, 
DNA behaves as a dynamic, flexible molecule. Localized thermal fluctuations tempo- 
rarily distort and deform DNA structure over short regions. Base and backbone ensem- 
bles of atoms undergo elastic motions on a time scale of nanoseconds. To some extent, 
these effects represent changes in rotational angles of the bonds comprising the poly- 
nucleotide backbone. These changes are also influenced by sequence-dependent varia- 
tions in base-pair stacking. The consequence is that the helix bends gently. When these 
variations are summed over the great length of a DNA molecule, these bending influ- 
ences give the double helix a roughly spherical shape, as might be expected for a long, 
semirigid rod undergoing apparently random coiling. It is also worth noting that, on 
close scrutiny, the surface of the double helix is not that of a totally featureless, smooth, 
regular “barber pole” structure. Different base sequences impart their own special sig- 
natures to the molecule by subtle influences on such factors as the groove width, the 
angle between the helix axis and base planes, and the mechanical rigidity. Certain regu- 
latory proteins bind to specific DNA sequences and participate in activating or 
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Figure 11.14 The change in topological relationships 
of base pairs from B- to Z-DNA. A six-base-pair 
GCGCGC segment of B-DNA (1) is converted to 
Z-DNA (2) through rotation of the base pairs, as 
indicated by the curved arrows. The purine rings 
(green) of the deoxyguanosine nucleosides rotate via 
an anti to syn change in the conformation of the 
guanine—deoxyribose glycosidic bond; the 
pyrimidine rings (blue) are rotated by flipping the 
entire deoxycytosine nucleoside (base and 
deoxyribose). 
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Human Biochemistry 


DNA Methylation, CpG Islands, and Epigenetics 


In Chapter 10, we saw that rRNA and tRNA often contain chemi- 
cally modified bases (Figure 10.22). A common form of modifica- 
tion is methylation. Methylation of eukaryotic DNA occurs only at 
cytosine residues, forming 5-methylcytosine. 


NH» 


A | 
I 


5-Methylcytosine 


In mammals, cytosine methylation is essential for normal embry- 
onic development. A general rule of thumb is that cytosine meth- 
ylation switches genes to an “off’ state. Then, the information they 
encode is not expressed. In somatic cells (cells other than gametes), 
most cytosine methylation occurs at cytosine residues next to Gs, 
that is, within CpG dinucleotides. (Note that the complementary 
strand would have a 5'-CpG-3’ as well.) CpG dinucleotides are rare 
in mammalian genomes, appearing at just 20% of their statistically 
expected numbers. 

However, some regions of the genome contain CpG dinucleo- 
tides at expected frequencies. Small regions of DNA (less than 
500 bp or so) that contain expected amounts of CpG dinucleo- 


tides are called CpG islands (CGIs). CGIs are G-C rich (about 
65% G-C content, compared with the average 40% G-C content 
of mammalian genomes). The Cs in CGIs are usually unmethyl- 
ated. Those that are methylated tend to lie in gene regulatory 
regions and shut down gene expression. 

Epigenetics is the study of heritable changes in the genome that 
occur without a change in nucleotide sequence. Epigenetic changes 
are reversible and include modifications that occur in DNA (such as 
methylation) and in histones, the principal proteins found in chro- 
matin, the DNA-protein complex in the eukaryotic nucleus. Epige- 
netic changes can influence expression of the information encoded 
by the genome. 

Hypermethylation of CGIs in tumor suppressor genes can shut 
down the expression of these genes, leading to cancer. 
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suppressing expression of the information encoded therein. These proteins bind at 
unique sites due to their ability to recognize novel structural characteristics imposed on 
the DNA by the local nucleotide sequence. 


Intercalating Agents Distort the Double Helix Aromatic macrocycles, flat hydrophobic 
molecules composed of fused, heterocyclic rings, such as ethidium bromide, acridine 
orange, and actinomycin D (Figure 11.15), can slip between the stacked base pairs of 
DNA. The base pairs move apart to accommodate these so-called intercalating agents, 
causing an unwinding of the helix to a more ladderlike structure. The deoxyribose— 
phosphate backbone is almost fully extended as successive base pairs are displaced 0.7 
nm from one another, and the rotational angle about the helix axis between adjacent 
base pairs is reduced from 36° to 10°. 


Dynamic Nature of the DNA Double Helix in Solution Intercalating substances insert 
with ease into the double helix, indicating that the van der Waals stacking interactions 
that they share with the bases sandwiching them are more favorable than similar 
interactions between the bases themselves. Furthermore, the fact that these agents slip 
in suggests that the double helix must momentarily unwind and present gaps for these 
agents to occupy. That is, the DNA double helix in solution must be represented by a 
set of metastable alternatives to the standard B-conformation. These alternatives 
constitute a flickering repertoire of dynamic structures. 


11.2i Alternative Hydrogen-Bonding Interactions Give Rise to Novel 
DNA Structures: Cruciforms, Triplexes, and Quadruplexes 
Cruciform Structures Arise from Inverted Repeats Inverted repeats (Figure 11.16) are 


duplex DNA sequences showing twofold symmetry (the 5’->3’ sequence is identical in 
both strands). Palindromes are words, phrases, or sentences that read the same 
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Figure 11.15 The structures of ethidium bromide, acridine orange, and actinomycin D, three 
intercalating agents, and their effects on DNA structure. 
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Figure 11.16 Self-complementary inverted repeats can rearrange to form hydrogen-bonded cruciform 
stem-loop structures. 
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backward or forward, such as “radar,” “sex at noon taxes,” “Madam, I’m Adam,” and 
“a man, a plan, a canal, Panama.” Inverted repeats are sometimes referred to as 
palindromes (despite the inaccuracy of this description). Inverted repeats have the 
potential to adopt cruciform (meaning “cross-shaped”) structures if the normal 
interstrand base pairing is replaced by intrastrand pairing. In effect, each strand 
forms a hairpin structure through alignment and pairing of the self-complementary 
sequences along the strand. Cruciforms are never as stable as normal DNA duplexes 
because an unpaired segment must exist in the loop region. Cruciforms potentially 
create novel structures that can serve as distinctive recognition sites for specific 
DNA-binding proteins. 


Hoogsteen Base Pairs and DNA Multiplexes The A:T and G:C base pairs first seen by 
Watson (Figure 11.9) are the canonical building blocks for DNA structures. However, 
Karst Hoogsteen found that adenine and thymine do not pair in this way when 
crystallized from aqueous solution. Instead, they form two H bonds in a different 
arrangement (Figure 11.17). Further, Hoogsteen observed that in mildly acidic solutions, 
guanine and cytosine form base pairs different from Watson—Crick G:C base pairs. 
These Hoogsteen base pairs depend upon protonation of cytosine N-3 (Figure 11.17) and 
have only two H bonds, not three. In both A:T and G:C Hoogsteen base pairs, the 
purine N-7 atom is an H-bond acceptor. The functional groups of adenine and guanine 
that participate in Watson—Crick H bonds remain accessible in Hoogsteen base pairs. 
Thus, base triplets can form, as shown in Figure 11.18, giving rise to TAT and CtGC 
triplets, where each purine interacts with one of its pyrimidine partners through 
Hoogsteen base pairing and the other through Watson—Crick base pairing. 


T:A ct:G 
Figure 11.17 Hoogsteen base pairs: A:T (/eft) and C*:G (right). 
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Figure 11.18 Base triplets formed when a purine interacts with one pyrimidine by Hoogsteen 
base-pairing and another by Watson—Crick base pairing. 
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H-DNA Is Triplex DNA Under certain conditions, triple-stranded DNA structures can 
form. In H-DNA, two of the strands are pyrimidine rich and the third is purine rich. 
One pyrimidine-rich strand is hydrogen bonded to the purine-rich strand via 
Watson—Crick base pairing, and the other pyrimidine-rich strand is hydrogen bonded 
to the purine-rich strand by Hoogsteen base pairing. Such structures were originally 
referred to as H-DNA because protonation of the cytosine N-3 atom was necessary, 
but the name also fits because a hinge is present between double- and triple-stranded 
DNA regions when H-DNA forms. Consider, for example, a long stretch of alternating 
C:T sequence in one strand of a DNA duplex (Figure 11.19). If the C:T bases in half of 
this stretch becomes separated from their G:A partners and the unpaired C:T segment 
is folded back on the C:T half still paired in the C:T/G:A duplex, triplex DNA can 
form through Hoogsteen base pairing. Triple-stranded DNA is implicated in the 
regulation of some eukaryotic genes. 


DNA Quadruplex Structures Four-stranded DNA structures can form between poly- 
nucleotide strands rich in guanine. At the heart of such G-quadruplexes are cyclic 
arrays of four G residues united through Hoogsteen base pairing (Figure 11.20a). The 
presence of metal cations (K*, Nat, Ca?*) favors their assembly. Free-electron pairs 
contributed by the closely spaced O6 carbonyl oxygens of the G-quartet coordinate 
the centrally located cation. A variety of different G-quadruplex structures have been 
reported, with different G-rich sequences leading to variations on a common quadru- 
plex plan. Quadruplexes constructed from dG, strands usually form with all four 
strands in parallel orientation and all bases in the anti conformation (Figure 11.20b). 
Polynucleotides with varying sequence repeats, such as (G;N), or (G)N;),,, form 
G-quadruplexes with variations on the dG, structural theme, such as the (dG4T,Gy), 
structure in which two such strands pair in antiparallel fashion to form the 
G-quadruplex (Figure 11.20c and 11.20d). G-quadruplex structures have biological 
significance because they have been found in telomeres (structures that define the ends 
of chromosomes), in regulatory regions of genes, in immunoglobulin gene regions 
responsible for antibody diversity, and in sequences associated with human diseases. 
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Figure 11.19 H-DNA. (a) The pyrimidine-rich strands of the duplex regions are blue, and the 
purine-rich strands are green. The Hoogsteen base-paired pyrimidine-rich strand in the triplex 
(H-DNA) structure is yellow. (b) Nucleotide sequence representation of H-DNA formation. T:A 
Hoogsteen base pairing leading to triplex formation is shown by dots; C*-G Hoogsteen base pairing 
leading to triplex formation is shown by + signs. (Adapted from Htun, H., and Dahlberg, J. E., 1989. 
Topology and formation of triple-stranded H-DNA. Science 243:1571-1576.) 
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(a) (b) 


Figure 11.20 DNA quadruplex structures. (a) G-quadruplex showing the cyclic array of guanines 
linked by Hoogsteen hydrogen bonding. (b) Four G-rich polynucleotide strands in parallel alignment 
with all bases in anti conformation. (c) Antiparallel dimeric hairpin quadruplex formed from 
d(G4T4G,)s. (d) Structure of d(G4T4G4),K* solved by X-ray crystallography. Two d(G4T4G,) strands 
come together as hairpins to form a G-quadruplex. The backbones of the two strands are traced in 
violet. (Adapted from Keniry, M. A., 2001. Quadruplex structures in nucleic acids. Biopolymers 
56:123-146.) 


11.3 | Can the Secondary Structure of DNA 
Be Denatured and Renatured? 


When duplex DNA molecules are subjected to conditions of pH, temperature, or 
ionic strength that disrupt base-pairing interactions, the strands are no longer held 
together. That is, the double helix is denatured, and the strands separate as individ- 
ual random coils. If temperature is the denaturing agent, the double helix is said to 
melt. 
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11.3 Can the Secondary Structure of DNA Be Denatured and Renatured? 


11.3a Thermal Denaturation of DNA Can Be Observed 
by Changes in UV Absorbance 


DNA denaturation can be followed spectrophotometrically because the relative 
absorbance of the DNA solution at 260 nm increases as much as 40% as the bases 
unstack. This absorbance increase, or hyperchromic shift, is due to the fact that the 
aromatic bases in DNA interact via their 7-electron clouds when stacked together in 
the double helix. Because the UV absorbance of the bases is a consequence of 
a-electron transitions, and because the potential for these transitions is diminished 
when the bases stack, the bases in duplex DNA absorb less 260-nm radiation than 
expected for their numbers. Unstacking alleviates this suppression of UV absorbance. 
The rise in absorbance coincides with strand separation, and the midpoint of the 
absorbance increase is termed the melting temperature, Tm (Figure 11.21). DNAs differ 
in their T, values because they differ in relative G + C content. The higher the G + C 
content of a DNA, the higher its melting temperature because G:C pairs have higher 
base-stacking energies than A:T pairs. Also, Tm is dependent on the ionic strength of 
the solution; the lower the ionic strength, the lower the melting temperature. Because 
cations suppress the electrostatic repulsion between the negatively charged phosphate 
groups in the complementary strands of the double helix, the double-stranded form of 
DNA is more stable in dilute salt solutions. DNA in pure water melts even at room 
temperature. 


11.3b pH Extremes or Strong H-Bonding Solutes 
also Denature DNA Duplexes 


At pH values greater than 10, the bases of DNA lose protons, which destroys their 
base-pairing potential, thus denaturing the DNA duplex. Extensive addition of protons 
to the bases below pH 2.3 also disrupts base pairing. Alkali is the preferred denaturant 
because, unlike acid, it does not hydrolyze the glycosidic bonds linking purine bases to 
the sugar—phosphate backbone. Small solutes that readily form H bonds can also 
denature duplex DNA at temperatures below Tm. If present in sufficiently high 
concentrations, such small solutes will form H bonds with the bases, thereby disrupting 
H-bonding interactions between the base pairs. Examples include formamide and urea. 


11.3¢ Single-Stranded DNA Can Renature to Form DNA Duplexes 


Denatured DNA will renature to re-form the duplex structure if the denaturing condi- 
tions are removed (that is, if the solution is cooled, the pH is returned to neutrality, or 
the denaturants are diluted out). Renaturation requires reassociation of the DNA 


1.2 


Relative absorbance (260 nm) 


1.0 
70 80 90 100 
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Figure 11.21 Heat denaturation of DNA from various sources, so-called melting curves. (Adapted 
from Marmur, J., 1959. Heterogenity in deoxyribonucleic acids. Nature 183:1427-1429.) 
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A Deeper Look 


The Buoyant Density of DNA 


Density gradient ultracentrifugation is a variant of the basic tech- 
nique of ultracentrifugation (discussed in Chapter 5). The densities 
of DNAs are about the same as those of concentrated solutions of 
cesium chloride, CsCl (1.6 to 1.8 g/mL). Centrifugation of CsCl 
solutions at very high rotational speeds, where the centrifugal force 
becomes 10° times stronger than the force of gravity, causes the 
formation of a density gradient within the solution. This gradient is 
the result of a balance that is established between the sedimentation 
of the salt ions toward the bottom of the tube and their diffusion 
upward toward regions of lower concentration. If DNA is present 
in the centrifuged CsCl solution, it moves to a position of equilib- 
rium in the gradient equivalent to its buoyant density (as shown in 
the figure). For this reason, this technique is also called isopycnic 
centrifugation. 

Cesium chloride centrifugation is an excellent means of remov- 
ing RNA and proteins in the purification of DNA. The density 
of dsDNA is typically slightly greater than 1.7 g/cm?, whereas the 
density of RNA is more than 1.8 g/cm. Proteins have densities less 
than 1.3 g/cm?. In CsCl solutions of appropriate density, the DNA 
bands near the center of the tube, RNA pellets to the bottom, and 
the proteins float near the top. Single-stranded DNA is denser than 
double helical DNA. The irregular structure of randomly coiled 
ssDNA allows the atoms to pack together through van der Waals 


The net movement of solute particles in an ultracentrifuge is the 
result of two processes: diffusion (from regions of higher concentra- 
tion to regions of lower concentration) and sedimentation due to 
centrifugal force (in the direction away from the axis of rotation). In 
general, diffusion rates for molecules are inversely proportional to 
their molecular weight—larger molecules diffuse more slowly than 
smaller ones. On the other hand, sedimentation rates increase with 
increasing molecular weight. A macromolecular species that has 
reached its position of density equilibrium in isopycnic centrifuga- 
tion has formed a concentrated band of material. 

Essentially three effects are influencing the movement of the 
molecules in creating this concentration zone: (1) diffusion away 
to regions of lower concentration, (2) sedimentation of molecules 
situated at positions of slightly lower solution density in the density 
gradient, and (3) flotation (buoyancy or “reverse sedimentation”) 
of molecules that have reached positions of slightly greater solution 
density in the gradient. The consequence of the physics of these 
effects is that, at equilibrium, the width of the concentration band 
established by the macromolecular species is inversely proportional to 
the square root of its molecular weight. That is, a population of large 
molecules will form a concentration band that is narrower than 
the band formed by a population of small molecules. For example, 


interactions. These interactions compact the molecule into a smaller 


i; : the bandwidth formed by dsDNA will be less than the bandwidth 
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strands into a double helix, a process termed reannealing. For this to occur, the 
strands must realign themselves so that their complementary bases are once again in 
register and the helix can be zippered up (Figure 11.22). Renaturation is dependent on 
both DNA concentration and time. Many of the realignments are imperfect, and thus 
the strands must dissociate again to allow for proper pairings to be formed. The pro- 
cess occurs more quickly if the temperature is warm enough to promote diffusion of 
the large DNA molecules but not so warm as to cause melting. 


11.3d The Rate of DNA Renaturation Is an Index 
of DNA Sequence Complexity 


The renaturation rate of DNA is an excellent indicator of the sequence complexity 
of DNA. For example, the DNA of bacteriophage T4 contains 2 X 10° base pairs; an 
Escherichia coli chromosome contains more than 10 times as much (4.64 X 10° base 
pairs). E. coli DNA is considerably more complex in that it encodes more information. 
Expressed in another way, for any fixed amount of single-stranded DNA (in grams), 
the nucleotide sequences represented in an E. coli sample will show greater sequence 
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Native DNA Denatured DNA Renatured DNA 
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Figure 11.22 Steps in the thermal denaturation and renaturation of DNA. The nucleation phase of 
the reaction is a second-order process depending on sequence alignment of the two strands (1). This 
process takes place slowly because it takes time for complementary sequences to encounter one 
another in solution and then align themselves in register. Once the sequences are aligned, the strands 
zipper up quickly (2). 


variation than those in an equal weight of phage T4 DNA. Thus, it will take longer 
for the E. coli DNA strands to find their complementary partners and reanneal. 
Because the rate of DNA duplex formation depends on complementary DNA 
sequences encountering one another and beginning the process of sequence 
alignment and reannealing, the time necessary for reconstituting double-stranded 
DNA molecules is an excellent index of the degree of sequence complementarity in 
a DNA sample. 


11.3e Nucleic Acid Hybridization: Different DNA Strands 
of Similar Sequence Can Form Hybrid Duplexes 


If DNA from two different species are mixed, denatured, and allowed to cool slowly 
so that reannealing can occur, hybrid duplexes may form, provided the DNA from one 
species is similar in nucleotide sequence to the DNA of the other. The degree of 
hybridization is a measure of the sequence similarity, or relatedness, between the two 
species. Depending on the conditions of the experiment, about 25% of the DNA from 
a human forms hybrids with mouse DNA, implying that some of the nucleotide 
sequences (genes) in humans are very similar to those in mice (Figure 11.23). Mixed 
RNA:DNA hybrids can be created in vitro if single-stranded DNA is allowed to 
anneal with RNA copies of itself, such as those formed when genes are transcribed 
into mRNA molecules. 

Nucleic acid hybridization is a commonly employed procedure in molecular biology. 
First, it can reveal evolutionary relationships. Second, it gives researchers the power to 
identify specific genes selectively against a vast background of irrelevant genetic mate- 
rial: An appropriately labeled oligonucleotide or polynucleotide, referred to as a probe, 
is constructed so that its sequence is complementary to a target gene. The probe specifi- 
cally base pairs with the target gene, allowing identification and subsequent isolation of 
the gene. Also, the quantitative expression of genes (in terms of the amount of mRNA 
synthesized) can be assayed by hybridization experiments. 


Figure 11.23 Solutions of human DNA (red) and 
mouse DNA (blue) are mixed and denatured, and 
the single strands are allowed to reanneal. About 
25% of the human DNA strands form hybrid 
duplexes (one red and one blue strand) with 
mouse DNA. 
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(a) 
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11.4 | How Does DNA Adopt Structures of 
Higher Complexity? 


Not all DNA molecules are linear; many DNA molecules are circular. Most, but not 
all, bacterial chromosomes are covalently closed, circular DNA duplexes, as are most 
plasmid DNAs. Plasmids are naturally occurring, self-replicating, extrachromosomal 
DNA molecules found in bacteria; plasmids carry genes specifying novel metabolic 
capacities advantageous to the host bacterium. Various animal virus DNAs are circu- 
lar as well. DNA can adopt regular structures of higher complexity in several ways. 


11.4a Supercoils Are One Kind of Structural Complexity in DNA 


In duplex DNA, the two strands are wound about each other once every 10 bp or so, 
that is, once every turn of the helix. Double-stranded circular DNA (or linear DNA 
duplexes whose ends are not free to rotate) form supercoils if the strands are under- 
wound (negatively supercoiled) or overwound (positively supercoiled) (Figure 11.24). 
Underwound duplex DNA has fewer than the normal number of turns, whereas over- 
wound DNA has more. DNA supercoiling is analogous to twisting or untwisting a 
two-stranded rope so that it is torsionally stressed. Negative supercoiling introduces 
a torsional stress that favors unwinding of the right-handed B-DNA double helix, 
whereas positive supercoiling overwinds such a helix. Both forms of supercoiling 
compact the DNA so that it sediments faster upon ultracentrifugation or migrates 
more rapidly in an electrophoretic gel in comparison to relaxed DNA (DNA that is not 
supercoiled). Cellular DNA is almost always negatively supercoiled (underwound). 


11.4b Supercoiled DNA Is Defined by Its Linking Number 


The basic parameter characterizing supercoiled DNA is the linking number (L). This 
is the number of times the two strands are intertwined, and provided both strands 
remain covalently intact, L cannot change. In a relaxed circular DNA duplex of 400 
bp, L is 40 (assuming 10 bp per turn in B-DNA). The linking number for relaxed DNA 
is usually taken as the reference parameter and is written as Lo. L can be equated to 
the twist (7) and writhe (W) of the duplex, where twist is the number of helical turns 
and writhe is the number of supercoils: 


L=T+Ww 
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Figure 11.24 Toroidal and interwound varieties of supercoiling. (a) The DNA is coiled in a spiral 
fashion about an imaginary toroid (yellow circle). (b) The DNA interwinds and wraps about itself. 

(c) Supercoils in long, linear DNA arranged into loops whose ends are restrained—a model for 
chromosomal DNA. (Adapted from Figures 6.2 and 6.3 in Callandine, C. R., and Drew, H. R., 2004. 
Understanding DNA: The Molecule and How It Works, 3rd ed. London: Academic Press.) 
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(a) Positive supercoiling 


T=0 T=+3 T=+2 T=+1 T=0 
W=0 W=0 W=+1 W=+2 W=+3 
L=0 L=+3 

(1) (2) (3) (4) (5) 

(b) Negative supercoiling 

T=0 T=-3 T=-2 T=-1 T=0 
W=0 W=0 W=1 W=-2 W=-3 
L=0 L=-3 


a) (2) (3) (4) (5) 


Figure 11.25 Supercoil topology for a simple circular tube with a single stripe along it. (Adapted from 
Figures 6.4 and 6.5 in Callandine, C. R., and Drew, H. R., 2004. Understanding DNA: The Molecule 
and How It Works, 3rd ed. London: Academic Press.) 


Figure 11.25 shows the values of T and W for a simple striped circular tube in various 
supercoiled forms. In any closed, circular DNA duplex that is relaxed, W=0. A 
relaxed circular DNA of 400 bp has 40 helical turns, T = L = 40. This linking number 
can be changed only by breaking one or both strands of the DNA, winding them 
tighter or looser, and rejoining the ends. Enzymes capable of carrying out such 
reactions are called topoisomerases because they change the topological state of DNA. 
Topoisomerases fall into two basic classes: I and II. Topoisomerases of the I type cut 
one strand of a DNA double helix, pass the other strand through, and then rejoin the 
cut ends. Topoisomerase II enzymes cut both strands of a dsDNA, pass a region of the 
DNA duplex between the cut ends, and then rejoin the ends (Figure 11.26). 
Topoisomerases are important players in DNA replication (see Chapter 28). 


11.4c Enzymes Can Add or Remove Supercoils 


The bacterial enzyme DNA gyrase is a topoisomerase that introduces negative supercoils 
into DNA in the manner shown in Figure 11.26. Suppose DNA gyrase puts four negative 
supercoils into the 400-bp circular duplex, then W = —4, T remains the same, and L = 
36 (Figure 11.27). In actuality, the negative supercoils cause a torsional stress on the 
molecule, so T tends to decrease; that is, the helix becomes a bit unwound, so base pairs 
are separated. The extreme would be that T would decrease by 4 and the supercoiling 
would be removed (T = 36, L = 36, and W = 0). That is, negative supercoiling has the 
potential to cause localized unwinding of the DNA double helix so that single-stranded 


DNA loop 


DNA is cut and a conformational change 
allows the DNA to pass through. Gyrase 
religates the DNA and then releases it. 


Figure 11.26 A model for the action of bacterial 
DNA gyrase (topoisomerase I). The A-subunits 
cut the DNA duplex (1) and then hold onto the cut 
ends (2). Conformational changes in the enzyme 
allow an intact region of the DNA duplex to pass 
between the cut ends. The cut ends are religated (3), 
and the covalently complete DNA duplex is released 
from the enzyme. The circular DNA now contains 
two negative supercoils (4). 
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Figure 11.27 A 400-bp circular DNA molecule in 
different topological states: (a) relaxed, (b) negative 
supercoils distributed over the entire length, and 
(c) negative supercoils creating a localized single- 
stranded region. 


regions (or bubbles) are created (Figure 11.27). Usually, the natural situation is a compro- 
mise in which the negative value of Wis reduced, T decreases slightly, and these changes 
are distributed over the length of the circular duplex so that no localized unwinding of 
the helix ensues. Nevertheless, negative supercoiling makes it easier to separate the DNA 
strands and access the information encoded by the nucleotide sequence. 


Superhelix Density The difference between the linking number of a DNA and the link- 
ing number of its relaxed form is AL = (L — Lo). In our example with four negative su- 
percoils, AL = —4. The superhelix density or specific linking difference is defined as 
AL/Lo and is sometimes termed sigma, o. For our example, o = —4/40, or —0.1. As a 
ratio, ø is a measure of supercoiling that is independent of length. Its sign reflects 
whether the supercoiling tends to unwind (negative a) or overwind (positive o) the helix. 
In other words, the superhelix density states the number of supercoils per 10 bp, which 
also is the same as the number of supercoils per B-DNA repeat. Circular DNA isolated 
from natural sources is always found in the underwound, negatively supercoiled state. 


Toroidal Supercoiled DNA Negatively supercoiled DNA can arrange into a toroidal 
state (Figure 11.28). The toroidal state of negatively supercoiled DNA is stabilized by 
wrapping around proteins that serve as spools for the DNA “ribbon.” This toroidal 
conformation of DNA is found in protein-DNA interactions that are the basis of phe- 
nomena as diverse as chromosome structure (see Figure 11.31) and gene expression. 


11.5 | What Is the Structure of Eukaryotic Chromosomes? 


A typical human cell is 20 um in diameter. Its genetic material consists of 23 pairs of 
dsDNA molecules in the form of chromosomes, the average length of which is 3 x 10° 
bp/23 or 1.3 X 108 nucleotide pairs. At 0.34 nm/bp in B-DNA, this represents a DNA 
molecule 5 cm long. Together, these 46 dsDNA molecules amount to more than 2 m of 
DNA that must be packaged into a nucleus perhaps 10 um in diameter! Clearly, the 
DNA must be condensed by a factor of more than 10°. The mechanisms by which this 
condensation is achieved are not completely understood at the present time, but it is 
clear that the first stage of this condensation is accomplished by neatly wrapping the 
DNA around protein spools called nucleosomes. 


(a) 


Protein spool 


(b) T=0, W=-2 (©) T=0, W=-2 


Figure 11.28 Supercoiled DNA in a toroidal form wraps readily around protein spools. A twisted 
segment of linear DNA with two negative supercoils (a) can collapse into a toroidal conformation if 
its ends are brought closer together (b). Wrapping the DNA toroid around a protein “spool” stabilizes 
this conformation (c). (Adapted from Figure 6.6 in Callandine, C. R., and Drew, H. R., 2004. 
Understanding DNA: The Molecule and How It Works, 3rd ed. London: Academic Press.) 
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11.5 What Is the Structure of Eukaryotic Chromosomes? 


11.5a Nucleosomes Are the Fundamental Structural Unit in Chromatin 


The DNA in a eukaryotic cell nucleus during the interphase between cell divisions 
exists as a nucleoprotein complex called chromatin. The proteins of chromatin fall into 
two classes: histones and nonhistone chromosomal proteins. Histones are abundant and 
play an important role in chromatin structure. In contrast, the nonhistone class is 
defined by a great variety of different proteins, all of which are involved in genetic regu- 
lation; typically, there are only a few molecules of each per cell. Five distinct histones 
are known: H1, H2A, H2B, H3, and H4 (Table 11.3). All five are relatively small, posi- 
tively charged, arginine- or lysine-rich proteins that interact via ionic bonds with the 
negatively charged phosphate groups on the polynucleotide backbone. Pairs of histones 
H2A, H2B, H3, and H4 aggregate to form octameric core structures, and the DNA 
helix is wound about these core octamers, creating nucleosomes. 

If chromatin is treated with mild detergent and prepared for viewing in the electron 
microscope, the nucleosomes are evident as “beads on a string,” dsDNA being the string. 
The structure of the histone octamer core wrapped with DNA has been solved by T. J. 
Richmond and collaborators (Figure 11.29). The core octamer has surface landmarks that 
guide the course of the DNA; 147 bp of B-DNA in a flat, left-handed superhelical confor- 
mation make 1.6 turns around the histone core (Figure 11.29), which itself is a protein su- 
perhelix consisting of a spiral array of the four histone dimers. Histone H1, a three-domain 
protein, organizes an additional 29 to 43 bp of DNA and links consecutive nucleosomes. 
Each complete nucleosome unit contains 176 to 190 bp of DNA. The N-terminal tails of 
histones H3 and H4 are accessible on the surface of the nucleosome. Lysine and serine resi- 
dues in these tails can be covalently modified in myriad ways (lysines may be acetylated, 
methylated, or ubiquitinated; serines may be phosphorylated). These modifications play an 
important role in chromatin dynamics and gene expression. A single nucleosome and its 
associated H1 linker is referred to as a chromatosome. 


| Table 11.3 e| Properties of Histones 


Ratio of 


Histone Lysine to Arginine Size (kD) Copies per Nucleosome 
H1 59/3 21.2 1 (not in core) 
H2A 13/13 14.1 2 

H2B 20/8 13.1 2 

H3 13/17 15.1 2 

H4 11/14 11.4 2 
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Figure 11.29 The nucleosome core particle wrapped 
with 1.65 turns of DNA (147 bp). The DNA is 
shown as a blue and orange double helix. The four 
types of core histones are shown as different colors. 
(a) View down the axis of the nucleosome; (b) View 
perpendicular to the axis (PDB id = 1AOI). 
(Adapted from Luger, K., et al., 1997. Crystal 
structure of the nucleosome core particle at 2.8 A 
resolution. Nature 389:251—260. Photos courtesy of 
T. J. Richmond, ETH-Honggerberg, Zurich, 
Switzerland.) 
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11.5b Higher-Order Structural Organization of Chromatin 
Gives Rise to Chromosomes 


The DNA double helix has a diameter of about 2 nm. And, as indicated earlier, the 
total length of the 46 DNA molecules in a human cell is roughly 2 meters. And all this 
DNA must be contained within the nucleus, which is around 10 mm in diameter. 
Thus, the total length of human DNA is over 5 orders of magnitude greater than the 
diameter of the organelle which contains it. And within this small volume, the DNA 
must remain accessible so that the processes of gene expression and DNA replication 
are possible. The first stage in DNA compaction is wrapping it around nucleosomes. 
The mechanisms involved in compacting it further and the structures that result, 
including the overall structure of chromosomes, are under intensive study. 

It is important to keep in mind that chromosomes are structurally dynamic. In cells 
undergoing mitosis (cell division), the chromatin of each chromosome has been dupli- 
cated and highly condensed in preparation for distribution to the two daughter cells. In 
the nuclei of interphase cells (cells that are not undergoing mitosis), the chromatin of 
each chromosome is dispersed, and despite its flexibility and limited compaction, it is 
not entangled with the chromatin of other chromosomes. Instead, individual chromo- 
somes occupy distinct chromosome territories within the nucleus (Figure 11.30a). At a 
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Figure 11.30 Chromosome structural dynamics. Each chromosome occupies its own discrete territory 
within the nucleus. Individual chromosomes are shown in different colors. (Adapted from Figure 1 
Kempfer, R., and Pombo, A., 2020. Methods for mapping 3D chromosome architecture. Nature 
Reviews Genetics 21:207—226.) (b) SMC protein architecture. SMC proteins range from 115 to 165 kD 
in size. (b) SMC protein function. SMC proteins are reminiscent of motor proteins. Ilustrated in 

(c) is a condensation of DNA into a coiled arrangement through SMC2/SMC4-mediated interactions. 
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smaller scale, chromosomal chromatin in interphase nuclei is organized into topologi- 
cally associated domains (TADs) that have a median size of about 200 kb. Human cells 
have about 10,000 such TADs. TADs seem to define replication units as well as units of 
coordinate gene transcription. 


11.5c SMC Proteins Establish Chromosome Organization and 
Mediate Chromosome Dynamics 


Two different multi-subunit complexes, condensin and cohesion, mediate the changes 
in chromosome compaction that take place over the course of the cell cycle. SMC 
proteins provide two of the five subunits found in both condensin and cohesin. SMC 
stands for structural maintenance of chromosomes. SMC proteins are members of 
the nonhistone chromosomal protein class. SMC proteins form a large superfamily 
of ATPases involved in higher-order chromosome organization and dynamics. SMC 
protein representatives are found in all forms of life—archaea, bacteria, and 
eukaryotes. Chromosomal dynamics includes chromosome condensation, sister 
chromatid cohesion, genetic recombination, and DNA repair, as well as other phe- 
nomena. SMC proteins have a characteristic five-domain organization, consisting 
of an N-terminal globular ATP-binding domain, a rodlike dimerization domain 
involved in coiled coil formation, a flexible hinge region, another rodlike and coiled 
coil-forming region, and finally a C-terminal globular domain termed DA for its 
DNA-binding and ATPase abilities (Figure 11.30b). Five subgroups of SMC pro- 
teins are found in eukaryotes, and functional SMC proteins are heterodimers. 
SMC2/SMC4 heterodimers are essential for chromatin condensation as part of con- 
densin complexes; SMC1/SMC3 heterodimers act in sister chromatid cohesion as 
part of cohesin complexes. SMC proteins form V-shaped heterodimers that bind to 
DNA through their DA domains and mediate chromosomal dynamics in an ATP- 
dependent manner. The flexible hinge region of each SMC subunit is located at the 
point of the V, and hinge-bending motions allow the DNA-binding parts of the two 
globular heads to move closer together, compacting the DNA into a higher-order 
structure (Figure 11.30c). 

The degree of chromosome folding is modulated as cells progress through the cell 
cycle. During mitosis, condensin translocates along the DNA double helix and uses 
ATP energy to extrude loops of DNA. By this action, condensins fold chromosomes 
into the compact helical loop arrays that characterize mitotic chromosomes. In inter- 
phase, the cohesin complex generates loops and the topologically associating domains 
discussed above. A separate cohensin-mediated process drives segregation of transcrip- 
tionally active chromatin from inactive chromatin. 


11.6 | How Can Nucleic Acids Be Synthesized Chemically? 


Laboratory synthesis of nucleic acids frees researchers from a dependence on natu- 
rally occurring sequences for structural and functional studies. Laboratory synthesis 
of oligonucleotide chains of defined sequence presents some of the same problems 
encountered in chemical synthesis of polypeptides (see Chapter 5). First, functional 
groups on the monomeric units (in this case, bases) are reactive under conditions of 
polymerization and therefore must be protected by blocking agents. Second, to gener- 
ate the desired sequence, a phosphodiester bridge must be formed between the 3’-O of 
one nucleotide (B) and the 5'-O of the preceding one (A) in a way that precludes the 
unwanted bridging of the 3’-O of A with the 5’-O of B. Finally, recoveries at each step 
must be high so that overall yields in the multistep process are acceptable. As in pep- 
tide synthesis (see Chapter 5), orthogonal solid-phase methods are used to overcome 
some of these problems. Commercially available automated instruments, called DNA 
synthesizers or “gene machines,” are capable of carrying out the synthesis of oligonu- 
cleotides of 150 bases or more. 
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11.6a Phosphoramidite Chemistry Is Used to Form 
Oligonucleotides from Nucleotides 


Phosphoramidite chemistry is currently the accepted method of oligonucleotide syn- 
thesis. The general strategy involves the sequential addition of nucleotide units as 
nucleoside phosphoramidite derivatives to a nucleoside covalently attached to the 
insoluble resin. Excess reagents, starting materials, and side products are removed 
after each step by filtration. After the desired oligonucleotide has been formed, it is 
freed of all blocking groups, hydrolyzed from the resin, and purified by gel electro- 
phoresis. The four-step cycle is shown in Figure 11.31. Chemical synthesis takes place 
in the 3'—5' direction (the reverse of the biological polymerization direction). 


11.6b Genes and Even Chromosomes Can Be Synthesized Chemically 


Individual single-stranded DNA oligonucleotides (100-200 bp in size) synthesized 
through phosphoramidite chemistry can be assembled to form completely synthetic 
double-stranded DNA. Typically, successive oligonucleotides are designed to contain 
overlapping sequences that encode opposing strands of the desired DNA duplex. 
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Figure 11.31 Solid-phase oligonucleotide synthesis. The four-step cycle starts with the first base in 
nucleoside form attached by its 3’-OH group to an insoluble support. Its 5'-OH is blocked with a 
dimethoxytrityl (DMTr) group (a). If the base has reactive —NH; functions, as in A, G, or C, then 
blocking groups such as N-benzoyl or N-isobutyryl chloride are used to prevent their reaction (b). In 
step 1, the DMTr protecting group is removed by trichloroacetic acid treatment. Step 2 is the coupling 
step: The second base is added in the form of a nucleoside phosphoramidite derivative whose 5'-OH 
bears a DMTr blocking group so it cannot polymerize with itself (c). 
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Figure 11.31 continued In step 2, the presence of a weak acid, such as tetrazole, activates the phos- 
phoramidite, and it rapidly reacts with the free 5'-OH of N-1, forming a dinucleotide linked by a 
phosphite group. Unreacted free 5'-OHs of N-1 are blocked from further participation in the 
polymerization process by acetylation with acetic anhydride in step 3, referred to as capping. In step 4, 
the phosphite linkage between N-1 and N-2 is oxidized by aqueous iodine (I) to form the desired 
more stable phosphate group. Subsequent cycles add successive residues to the resin-immobilized 
chain. When the chain is complete, it is cleaved from the support with NH4OH, which also removes 
the N-benzoyl- and N-isobutyryl-protecting groups from the amino functions on the A, G, and C 
residues. Phosphoramidite chemistry for oligonucleotide synthesis has been miniaturized through the 
development of photo-labile or acid-sensitive blocking agents. Oligonucleotides up to 200 bases in 
length can be synthesized as microarrays on small chips, using inkjet printing to deliver microdroplets 
of phosphoramidite substrates and lasers (or electrogenerated pH changes) to trigger removal of 
blocking groups during cycles of nucleotide addition (see Figure 12.12). 


When assembled together during the gene synthesis process, double-stranded DNA 
molecules up to 2 kbp in length are formed. At this point, in order to make gene- and 
chromosome-sized synthetic DNA constructs capable of encoding metabolic path- 
ways or genomes requires some in vivo system, like yeast. 

To date, such methods have achieved the complete synthesis of an entire yeast chro- 
mosome, entire bacterial genomes, and even synthetic human chromosomes (though 
strictly speaking, it was only the DNA in these structures that was synthesized, not the 
complete DNA:protein complex that a chromosome is). Such technology is referred to 
as DNA-write, in counterpoint to the results of DNA sequencing studies, which can 
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aptly be termed DNA-read. The power of DNA-write is found in the realization that any 
desired nucleotide sequence can be constructed. Thus, creating new DNA molecules in 
which new mutations have been introduced, deleterious or unwanted sequences have 
been omitted, novel gene products such as proteins or RNAs are encoded or recoded, 
are all easily performed. 

DNA is analogous to software. The information encoded in a genome is a program 
for the operation of an organism; indeed, it defines an organism. DNA as code in con- 
junction with DNA-write has opened the door to engineering new biological parts, de- 
vices, systems, and even organisms with features that heretofore never existed in nature. 

DNA-write also opens to door to exploiting the remarkable stability and longevity 
of DNA as a data storage medium. The raw storage capacity of the four-base DNA in- 
formation system is staggering compared with even the most advanced electronic or 
magnetic storage systems. It is theoretically possible to store an exabyte (10!% bytes; 1000 
gigabytes) of information, if it were coded into DNA, in the volume of a grain of sand. 
(An exabyte is roughly equivalent to 200 million DVDs). Equally important, this infor- 
mation can be subsequently retrieved from the DNA archive and used to recreate its 
contents in more conventional binary forms appropriate for computer use. A disadvan- 
tage of conventional digital data-archiving systems is that they degrade and must be 
renewed every decade or so. However, under appropriate conditions, DNA will last for 
thousands of years, preserving the information encoded in it for posterity. 


11.7 | What Are the Secondary and Tertiary 
Structures of RNA? 


RNA molecules (see Chapter 10) are typically single stranded. The course of a single- 
stranded RNA in three-dimensional space conceivably would have six degrees of freedom 
per nucleotide, represented by rotation about each of the six single bonds along the sugar— 
phosphate backbone per nucleotide unit. (Rotation about the B-glycosidic bond creates a 
seventh degree of freedom in terms of the total conformational possibilities at each nucle- 
otide.) Compare this situation with DNA, whose separated strands would obviously enjoy 
the same degrees of freedom. However, the double-stranded nature of DNA imposes 
great constraint on its conformational possibilities. Compared with dsDNA, an RNA 
molecule has a much greater number of conformational possibilities. Intramolecular 
interactions and other stabilizing influences limit these possibilities, but the higher-order 
structure of RNA is both specific and dynamic, bringing with it the kinds of functional 
possibilities once thought possible only in proteins. It turns out that cellular RNA mole- 
cules are far more versatile than DNA. In analogy with the protein structures dictated by 
amino acid sequence, RNA structures are determined by nucleotide sequence. The intri- 
cate three-dimensional architectures they adopt and the conformational dynamics 
displayed by such architectures endow RNAs with functional abilities, such as ligand 
binding and even catalysis. The functional complexity of RNA is the reason why RNA is 
so admirably suited for its role in a wide variety of biological processes. 

RNA molecules are rich in double-stranded regions that form when complementary 
sequences within the chain come together and join via intrastrand base-pairing. These 
interactions create hairpin stem-loop structures, in which the base-paired regions form 
the stem and the unpaired regions between base pairs are the loop, as in Figures 11.32 
and 11.33. Paired regions of RNA cannot form B-DNA-—type double helices because 
the RNA 2’-OH groups are a steric hindrance to this conformation. Instead, these 
paired regions adopt a conformation similar to the A-form of DNA, having about 11 
bp per turn, with the bases strongly tilted from the plane perpendicular to the helix axis 
(see Figure 11.12). A-form double helices are the most prominent secondary structural 
elements in RNA. Both tRNA and rRNA have large amounts of A-form double helix. 
In addition, a number of defined structural motifs recur within the loops of stem-loop 
structures, such as U-turns (a loop motif of consensus sequence UNRN, where N is any 
nucleotide and R is a purine) and tetraloops (another class of four-nucleotide loops 
found at the termini of stem-loop structures). Stems of stem-loop structures may also 
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have bulges (or internal loops) where the RNA strand is forced into a short single- 
stranded loop because one or more bases along one strand in an RNA double helix 
finds no base-pairing partners (Figure 11.32). Regions where several stem-loop 
structures meet are termed junctions (Figure 11.33). Stems, loops, bulges, and junctions 
are the four basic secondary structural elements in RNA. 

The single-stranded loops in RNA stem-loops create base-pairing opportunities be- 
tween distant, complementary, single-stranded loop regions. These interactions, mostly 
based on Watson—Crick base pairing, lead to tertiary structure in RNA. Other tertiary 
structural motifs arise from coaxial stacking, pseudoknot formation, and ribose zippers. 
In coaxial stacking, the blunt, nonloop ends of stem-loops situated next to one another 
in the RNA sequence stack upon each other to create an uninterrupted stack of base 
pairs (Figure 11.33). A good example of coaxial stacking is found in the tertiary struc- 
ture of tRNAs, where the acceptor end of the L-shaped tRNA is formed by coaxial 
stacking of the acceptor stem on the TWC stem-loop and the anticodon end is formed 
by coaxial stacking of the dihydrouracil stem-loop on the anticodon stem-loop 


Figure 11.32 Bulges and loops formed in RNA 
when aligned sequences are not fully complementary. 
(Adapted from Appendix Figure | in Gesteland, 

R. F., Cech, T. R., and Atkins, J. F., eds. The RNA 
World, 2nd ed. New York: Cold Spring Harbor 
Press.) 


Figure 11.33 Junctions and coaxial stacking in 
RNA. Stem junctions (or multibranched loops) 
are another type of RNA secondary structure. 
Coaxial stacking of stems or stem-loops (as in 
stacking of stem | on stem-loop 4) is a tertiary 
structural feature found in many RNAs. (Adapted 
from Figure | in Tyagi, R., and Matthews, D. H., 
2007. Predicting coaxial stacking in multibranch 
loops. RNA 13:1-13.) 
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(Figures 11.35 and 11.37). Pseudoknots occur when bases in the loops of stem-loop 
structures form a short double helix by base pairing with nearby single-stranded regions 
in the RNA (Figure 11.34). Ribose zippers are found when two antiparallel, single- 
stranded regions of RNA align as an H-bonded network forms between the 2'-OH 
groups of the respective strands, the O at the 2'-OH position of one strand serving as 
the H-bond acceptor, while the H on the 2’-OH of the other strand is the H-bond donor. 
A full catalog of RNA secondary and tertiary structural motifs, including the ribose 
zippers and other RNA structures mentioned here, can be found in the NDB (Nucleic 
Acid Database) at http://ndbserver.rutgers.edu/. Predicting the three-dimensional 
structure of an RNA molecule is more challenging than predicting the structure of a 
protein. Machine-learning computer algorithms are having some success in meeting this 
challenge. 


11.7a Transfer RNA Adopts Higher-Order Structure 
Through Intrastrand Base Pairing 


IntRNA molecules, which contain 73 to 94 nucleotides in a single chain, a majority of 
the bases are hydrogen bonded to one another. Figure 11.35 shows the structure that 
typifies tRNAs. Hairpin turns bring complementary stretches of bases in the chain 
into contact so that double helical regions form, creating stem-loop secondary struc- 
tures. Because of the arrangement of the complementary stretches along the chain, 
the overall pattern of base pairing can be represented as a cloverleaf. Each cloverleaf 
consists of four base-paired segments—three loops and the stem where the 3’- and 
5'-ends of the molecule meet. These four segments are designated the acceptor stem, 
the D loop, the anticodon loop, and the TYC loop (the latter two are U-turn motifs). 


tRNA Secondary Structure The acceptor stem is where the amino acid is linked to form 
the aminoacyl-tRNA derivative, which serves as the amino acid—donating species in 
protein synthesis; this is the physiological role of tRNA. The carboxyl group of an 


hTR 
_C 
uU y 
A A 
/ Cc—G 
U cmc 
| A—U 
U C—G 
| u U—A s2 
C 
S A—U 
+ BU 
A—U 
\ 
U 
D ir- AS 
U—A N 
C—G ‘A 
sill fy 12 \ 
G—C C 
G—C | 
' A 
5 d j 


Figure 11.34 RNA pseudoknots are formed when a single-stranded region of RNA folds to base-pair 
with a hairpin loop. Loops L1 and L2, as shown on the sequence representation of human telomerase 
RNA (hTR) on the left, form a pseudoknot. The three-dimensional structure of an hTR pseudoknot 
is shown on the right (PDB id = 1YMO). (Adapted from Figure 2 in Staple, D. W., and Butcher, S. E., 
2005. Pseudoknots: RNA structures with diverse functions. PLoS Biology 3:e213.) 
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amino acid is linked to the 3'-OH of the 3’-terminal A nucleotide, thus forming an 
aminoacyl ester (Figure 11.35). The 3’-end of tRNA is invariantly CCA-3’-OH. 
The D loop is so named because this tRNA loop often contains dihydrouridine, or D, 
residues. In addition to dihydrouridine, tRNAs characteristically contain a number of 
unusual bases, including inosine, thiouridine, pseudouridine, and hypermethylated 
purines (see Figure 10.23). The anticodon stem-loop consists of a double helical seg- 
ment and seven unpaired bases, three of which are the anticodon—a three-nucleotide 
unit that recognizes and base pairs with a particular mRNA codon, a complementary 
three-base unit in mRNA providing the genetic information that specifies an amino 
acid. In the 5'—3' direction beyond the anticodon stem-loop lies a loop that varies 
from tRNA to tRNA in the number of residues that it has, the so-called extra or vari- 
able loop. The last loop in the tRNA, reading 5'—3’, is within the TYC stem-loop. It 
contains seven unpaired bases, including the sequence TYC, where w is the symbol for 
pseudouridine. Most of the invariant residues common to tRNAs lie within the non— 
hydrogen-bonded regions of the cloverleaf structure. 


tRNA Tertiary Structure Tertiary structure in tRNA arises from base-pairing 
interactions between bases in the D loop with bases in the variable and TYC loops, as 
shown for yeast phenylalanine tRNA in Figure 11.36. Note that these base-pairing 
interactions involve the invariant nucleotides of tRNAs. These interactions fold the D 
and TWC arms together and bend the cloverleaf into the stable L-shaped tertiary form 
(Figure 11.37). Many of these base-pairing interactions involve base pairs that are not 
canonical A:T or G:C pairings, as illustrated around the central ribbon diagram of the 
tRNA in Figure 11.37. Note that three of the interactions involve three bases. The 
amino acid acceptor stem (highlighted in green) is at one end of the inverted, backward 
L shape, separated by 7 nm or so from the anticodon at the opposite end of the L. 
The D and TWC loops form the corner of the L. Hydrophobic stacking interactions 
between the flat faces of the bases contributes significantly to L-form stabilization. 


11.7b Messenger RNA Adopts Higher-Order Structure 
Through Intrastrand Base Pairing 
Messenger RNA (mRNA) directs the synthesis of proteins. The defining feature of an 


mRNA is its primary structure—its specific nucleotide sequence. This sequence is 
translated by ribosomes into a unique amino acid sequence. Nevertheless, mRNA 
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Figure 11.35 A general diagram for the structure of 
tRNA. The positions of invariant bases as well as 
bases that seldom vary are shown in color. R = 
purine; Y = pyrimidine. Dotted lines denote sites in 
the D loop and variable loop regions where varying 
numbers of nucleotides are found in different 
tRNAs. Inset: An aminoacyl group can add to the 
3’-OH to create an aminoacyl-tRNA (shown at 
lower right highlighted in pink). 
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Figure 11.36 Tertiary interactions in yeast phenylalanine tRNA. The molecule is presented in the 
conventional cloverleaf secondary structure generated by intrastrand hydrogen bonding. Solid red lines 
connect bases that are hydrogen bonded when this cloverleaf pattern is folded into the characteristic 
tRNA tertiary structure. 


has regions of secondary structure that arise when base pairing takes place between 
intrastrand complementary regions. That is, like other RNAs, mRNA folds back on 
itself, forming stem-loop secondary structures, as well as more complex tertiary 
structures. The stability of these structures depends on the degree of base pairing. 
Since each mRNA has a unique nucleotide sequence, the secondary and tertiary 
structures of each are unique. The 5’- and 3’-untranslated (and intron) regions, whose 
sequences are not constrained by the necessity of encoding a protein, often show 
higher orders of structure. However, the coding region may also have structured ele- 
ments. The stability of these various structures in a particular mRNA can affect its 
translation efficiency, since the mRNA must be unfolded to be read by the ribosome. 
Thus, higher orders of structure in mRNAs can influence gene expression. 


T1.7c Ribosomal RNA Also Adopts Higher-Order Structure 
Through Intrastrand Base Pairing 


Ribosomal rRNA provides the structural framework of the ribosome, 16S rRNA for 
the small ribosomal subunit and 23S and 5S rRNAs for the large ribosomal subunit. 
Further, the 23S rRNA is a ribozyme and it catalyzes synthesis of the peptide bonds 
that link amino acids together to form proteins. 
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Figure 11.37 The three-dimensional structure of yeast phenylalanine tRNA. (a) The tertiary folding is (b) 
illustrated in the center of the diagram with the ribose—phosphate backbone presented as a 
continuous ribbon; H bonds are indicated by crossbars. Unpaired bases are shown as short, 
unconnected rods. The anticodon loop is at the bottom and the -CCA 3’-OH acceptor end is at the 
top right. (b) A space-filling model of the molecule (PDB id = 6TNA). 


rRNA Secondary Structure A large degree of intrastrand sequence complementarity is 
found in all ribosomal RNA strands, and all assume a highly folded pattern that al- 
lows base pairing between these complementary segments, giving rise to multiple 
stem-loop structures. Furthermore, the loop regions of stem-loops contain the char- 
acteristic structural motifs, such as U-turns, tetraloops, and bulges. Figure 11.38 
shows the secondary structure of several 16S rR NAs. These structures were not deter- 
mined by direct analysis; instead they were generated through computer alignment of 
each nucleotide sequence into optimal H-bonding segments. The reliability of these 
alignments is then tested through a comparative analysis of whether very similar sec- 
ondary structures are observed. If so, then such structures are apparently conserved. 
The approach is based on the thesis that because ribosomal RNA species (regardless 
of source) serve common roles in protein synthesis, it may be anticipated that they 
share structural features. These secondary structures resemble one another, even 
though the nucleotide sequences of these 16S rR NAs exhibit little sequence similarity. 
Apparently, evolution is acting at the level of rRNA secondary structure, not rRNA 
nucleotide sequence. Similar conserved folding patterns are seen for the 5S-like 
rRNAs and 23S-like rRNAs that reside in the large ribosomal subunits of various 
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(a) E. coli (a eubacterium) 


(b) H. volcanii (an archaebacterium) 


species. An insightful conclusion may be drawn regarding the persistence of such 
strong secondary structure conservation despite the millennia that have passed since 
these organisms diverged: All ribosomes are similar, and all function in a similar man- 
ner. As usual with RNAs, the single-stranded regions of rRNA create the possibility 
of base-pairing opportunities with distant, complementary, single-stranded regions. 
Such interactions are the driving force for tertiary structure formation in RNAs. 


rRNA Tertiary Structure The detailed structure of ribosomes has been revealed through 
X-ray crystallography and cryoelectron microscopy of ribosomes. These detailed images 
not only disclose the tertiary structure of the rRNAs but also the quaternary interactions 
that must occur when ribosomal proteins combine with rRNAs and when the ensuing 
ribonucleoprotein complexes, the small and large subunits, come together to form the 
complete ribosome. Only the rRNAs of the 50S ribosomal subunit are shown in 
Figure 11.39; no ribosomal proteins are shown. Note that the overall anatomy of the 50S 
ribosomal subunit (shown diagrammatically in Figure 10.22) is essentially the same as 
that of the rRNA molecules within this subunit, even though these rRNAs account for 
only 65% of the mass of this particle. An assortment of tertiary structural features are 
found in the rRNAs, including coaxial stacks, pseudoknots, and ribose zippers. Peptide 
bond formation, the key reaction in protein synthesis, is catalyzed by the large rRNA in 
the large ribosomal subunit (the 23S rRNA in the case of prokaryotic ribosomes and the 
28S rRNA of eukaryotic ribosomes). That is, the ribosome is an RNA-based enzyme. 


11.7d Aptamers Are Oligonucleotides Specifically Selected 
for Their Ligand-Binding Ability 


Aptamers are oligonucleotides, usually RNA, which fold into very specific three- 
dimensional structures that selectively bind ligands with high affinity. Ligand bind- 
ing by aptamers is based on the fundamental principle of structural complementarity. 
The rich array of interactive possibilities presented by the four bases and the sugar— 
phosphate backbone, coupled with the inherent flexibility of polynucleotide chains, 
make nucleic acids very good ligand-binding candidates. The bases project polar 
amino and carbonyl functionalities, and their z-electron density gives them nonpolar 
properties. The sugar—phosphate backbone presents polar —OH groups and regu- 
larly spaced, negatively charged phosphate groups. These phosphate groups can 


(c) S. cerevisiae (yeast, a lower eukaryote) 


Figure 11.38 Comparison of secondary structures of 16S-like rRNAs from (a) a bacterium (E. coli), 
(b) an archaeon (H. volcanii), and (c) a eukaryote (S. cerevisiae, a yeast). 
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(b) 


23S rRNA 5’ end 3' end 


Figure 11.40 Structure of the thiamine 
Figure 11.39 The secondary and tertiary structures of rRNAs in the 50S ribosomal subunit from the pyrophosphate (TPP) riboswitch, a conserved region 
archaeon Haloarcula marismortui (PDB id = 1FFk). (a) Secondary structure of the 23S rRNA, with within the mRNA that encodes enzymes for 
various domains color-coded. (b) Secondary structure of 5S rRNA. (c) Tertiary structure of the synthesis of this coenzyme (PDB id = 2CKY). TTP, 
5S and 23S rRNAs within the 50S ribosomal subunit. The 5S rRNA (red) lies atop the 23S rRNA. a pyrimidine-containing compound, is shown in 
(Adapted from Figure 4 in Ban, N., et al., 2000. The complete atomic structure of the large ribosomal orange. (From Figure 1b in Thore, S., Leibundgdut, 
subunit at 2.4 A resolution. Science 289:905-920.) Although the 5S rRNA of eukaryotes is the same M., and Ban, N., 2006. Structure of the eukaryotic 
size as this archaeon 5S rRNA, the large rRNA of the large ribosomal subunit in eukaryotes has thiamine pyrophosphate riboswitch with its 
2,000 more nucleotides than this 23S rRNA (over 5000 vs. fewer than 3000). regulatory ligand. Science 312:1208-1211.) 
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coordinate cations and thus provide foci of positive charge. Synthetic aptamers 
designed to target a selected protein can be potent inhibitors of protein function; they 
are of interest in drug development. 

Riboswitches, a naturally occurring class of aptamers, are conserved regions of mRNAs 
that reversibly bind specific metabolites and coenzymes and usually act as regulators of 
gene expression. Riboswitches are usually buried within the 5’- or 3'-untranslated regions 
of the mRNAs whose expression they regulate. Binding of the metabolite to the ribo- 
switch typically blocks expression of the mRNA. Figure 11.40 shows the structure of 
the thiamine pyrophosphate riboswitch. 


Foundational Biochemistry Things You Should Know after Reading Chapter 11. 


DNA sequencing: Sanger’s chain termination protocol based on 
dideoxynucleotides uses DNA replication to generate a defined set of 
polynucleotide fragments. 


The nucleotide sequence of DNA can be determined from the elec- 
trophoretic migration of a defined set of polynucleotide fragments. 


Many ultra-high-throughput methodologies for DNA sequencing 
also rely on DNA polymerase-driven primer extension to copy un- 
known DNA sequences. 


Ultra-high-throughput sequencing technologies use light emission, 
fluorescence detection, or pH change to follow DNA polymerase- 
catalyzed nucleotide addition and note which base was added. These 
reactions can be carried out on millions of different DNA strands 
at the same time, all of which are immobilized on beads or glass 
surfaces. 


Single-molecule real-time DNA sequencing methods, based either on 
DNA polymerase action as revealed by fluorescent light pulses or by 
changes in electrical potential as DNA strands are drawn through 
nanopores in a synthetic membrane, yield much longer read lengths. 


Long reads have been used to determine the complete sequence of 
entire human chromosomes. 


DNA structure: The sugar—phosphate backbone of nucleic acids has 
six degrees of freedom. Rotation about the C;'-N glycosidic bond cre- 
ates a seventh degree of freedom. 


DNA usually occurs in the form of double-stranded molecules. 


The Watson—Crick base pairs (A-T; G-C) have virtually identical 
dimensions. 


The DNA double helix is stabilized by H bonds, electrostatic interac- 
tions, and the van der Waals and hydrophobic interactions arising 
from base-pair stacking. 


The DNA double helix has a major groove and a minor groove. 


DNA double helices can occur as A-, B-, or Z-DNA conformations; 
the B-DNA conformation is the canonical form for double-helical 
DNA. 


Z-DNA is a left-handed double helix. 


RNA double helices and hybrid DNA:RNA helices adopt the 
A-conformation due to steric hindrance caused by the RNA 3’-OH 
groups. 


Alternative H-bonding interactions give rise to novel DNA structures, 
such as cruciforms, triplexes, and quadruplexes. 


Hoogsteen base pairs, an alternative mode of base-pair formation, 
arise in triplex DNA structures known as H-DNA. 


Thermal denaturation of DNA can be followed by observing 
increases in UV absorbance. 


Single-stranded DNA can renature (re-anneal) to form DNA duplexes. 


The rate of DNA renaturation is a measure of DNA sequence 
complexity. 

Supercoils are one kind of structural complexity in DNA. Supercoils 
alter the topological state of DNA. 


Supercoils can be positive (overwound DNA) or negative (under- 
wound DNA), relative to B-DNA. 


The linking number, L, of double-helical DNA is given by L = T + W, 
where T = twist (the number of helical turns ina DNA molecule) and 
W = writhe (the number of supercoils). 


Topoisomerases are enzymes that change the topological state of DNA. 


Eukaryotic DNA: Nucleosomes are the fundamental structural unit 
of chromatin. 


Chemical synthesis of DNA has led to the creation of synthetic 
chromosomes. 


The ability to chemically synthesize DNA is termed “DNA-write.” 


DNA as code: The information encoded in DNA is analogous to 
computer code. 


DNA is an excellent data storage system. 


Nucleic acids can be created synthetically, using an orthogonal proce- 
dure and phosphoramidite chemistry. 


RNA molecules are rich in secondary structures formed through 
intrastrand H-bonding. 


RNAs contain a variety of structural motifs, such as stem-loops, 
U-turns, tetraloops, bulges, junctions, and coaxial stacks. 


Transfer RNA adopts higher-order structure through base- 
pairing interactions between the single-stranded regions of stem-loop 
structures. 


Ribosomal RNA also adopts higher-order structure through base- 
pairing interactions between the single-stranded regions of stem-loop 
structures. 


Ribosomal RNAs adopt an evolutionarily conserved secondary 
structure. 


Aptamers are oligonucleotides with highly selective ligand-binding 
ability. 


Riboswitches are naturally occurring aptamers involved in regulating 
the expression of genetic information. 
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Problems 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


10. 


Predicting a Sanger Sequencing Pattern The oligonucleotide d- 
AGATGCCTGACT was subjected to sequencing by Sanger’s dideoxy 
method, using fluorescent-tagged dideoxynucleotides and capillary 
electrophoresis, essentially as shown in Figure 11.3. Draw a diagram 
of the gel-banding pattern within the capillary. 


. Sanger’s Method of DNA Sequencing Consider the DNA fragment 


shown. Using TAGCT as the primer, show all the fragments that 
would be generated by Sanger sequencing. Refer to Figure 11.3 and 
use the following color coding: green for ddG, blue for ddC, orange 
for ddA and red for ddT. Alternatively the following shape coding 
could be used: ll for ddG, A for ddC, @ for ddA, and @ for ddT. 


5'-CCTAGGCATGAACTCTGTAATCGACTATGC-3’ 


. Deducing DNA Sequence from Sanger Sequencing Results The output 


of an automated DNA sequence determination by the Sanger dideoxy 
chain termination method, performed as illustrated in Figure 11.4, was 
recorded as a series of light flashes in the following order (where B = 
blue, G = green, R = red, and Y = yellow): YYGGRRGYR- 
BRRRGGGBRRRB. What is the sequence of the original oligonucle- 
otide? 


. Helical Parameters for a Double-Stranded DNA X-ray diffraction 


studies indicate the existence of a double-stranded DNA helical 
conformation in which AZ (the rise per base pair) = 0.32 nm and 

P (the pitch) = 3.36 nm. What are the other parameters of this novel 
helix: (a) the number of base pairs per turn, (b) A¢ (the mean rotation 
per base pair), and (c) c (the true repeat)? 


. B- and Z-DNA Helical Parameters | A 41.5 nm-long duplex DNA 


molecule in the B-conformation adopts the A-conformation upon 
dehydration. How long is it now? What is its approximate number of 
base pairs? 


. B- and Z-DNA Helical Parameters II If 80% of the base pairs in a 


duplex DNA molecule (12.5 kbp) are in the B-conformation and 20% 
are in the Z-conformation, what is the length of the molecule? 


. DNA Supercoiling Parameters A “relaxed,” circular, double-stranded 


DNA molecule (1600 bp) is in a solution where conditions favor 10 bp 
per turn. What is the value of Lo for this DNA molecule? Suppose 
DNA gyrase introduces 12 negative supercoils into this molecule. What 
are the values of L, W, and T now? What is the superhelical density, o? 


. B- and Z-DNA Supercoiling Parameters Suppose one double helical 


turn of a superhelical DNA molecule changes conformation from B- 
to Z-form. What are the changes in L, W, and T? Why do you 
suppose the transition of DNA from B- to Z-form is favored by 
negative supercoiling? 


. Calculate the Number of Nucleosomes in a Human Diploid Cell 


Assume that there is one nucleosome for every 200 bp of eukaryotic 
DNA. How many nucleosomes are there in a diploid human cell? 
Nucleosomes can be approximated as disks 11 nm in diameter and 

6 nm long. If all the DNA molecules in a diploid human cell are in 
the B-conformation, what is the sum of their lengths? If this DNA is 
now arrayed on nucleosomes in the beads-on-a-string motif, what 
would be the approximate total height of the nucleosome column if 
these disks were stacked atop one another? 


The Linear Arrangement of Complementary Sequences along the tRNA 
Strand The characteristic secondary structures of tRNA and rRNA 
molecules are achieved through intrastrand hydrogen bonding. Even for 
the small tRNAs, remote regions of the nucleotide sequence interact via 
H bonding when the molecule adopts the cloverleaf pattern. Using 
Figure 11.35 as a guide, draw the primary structure of a tRNA and 
label the positions of its various self-complementary regions. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


Using ChargafPs Results to Order DNAs According to Their Tms Using 
the data in Table 10.1, arrange the DNAs from the following sources in 
order of increasing Tm: human, salmon, wheat, yeast, E. coli. 


Calculating Tms and Separating DNA Molecules That Differ in G:C 
Content At 0.2 M Na‘, the melting temperature of double-stranded 
DNA is given by the formula, Tm = 69.3 + 0.41 (% G + C). The 
DNAs from mice and rats have (G + C) contents of 44% and 40%, 
respectively. Calculate the Tms for these DNAs in 0.2 M NaCl. If 
samples of these DNAs were inadvertently mixed, how might they be 
separated from one another? 


Calculating the Density of DNAs That Differ in G:C Content The 
buoyant density of DNA is proportional to its (G + C) content. 
(G:C base pairs have more atoms per volume than A:T base pairs.) 
Calculate the density (p) of Serratia marcescens DNA from the data 
presented in Table 10.1 and the equation p = 1.660 + 0.098(GC), 
where (GC) is the mole fraction of (G + C) in DNA. 


Draw a wW:G Base Pair (Integrates with Chapter 10.) Pseudouridine (y) is 
an invariant base in the TYC loop of tRNA; wis also found in strategic 
places in rRNA. (Figure 10.22 shows the structure of pseudouridine.) 
Draw the structure of the base pair that y might form with G. 


The Dimensions of Closed Circular DNA Molecules The plasmid 
pBR322 is a closed circular dsDNA containing 4363 base pairs. What 
is the length in nm of this DNA (that is, what is its circumference if it 
were laid out as a perfect circle)? The E. coli K12 chromosome is a 
closed circular dsDNA of about 4,639,000 base pairs. What would be 
the circumference of a perfect circle formed from this chromosome? 
What is the diameter of a ds DNA molecule? Calculate the ratio of the 
length of the circular plasmid pBR322 to the diameter of the DNA 
of which it’s made. Do the same for the E. coli chromosome. 


Identifying DNA Structural and Functional Elements from Nucleotide 
Sequence Information Listed below are four DNA sequences. Which 
one contains a type-II restriction endonuclease (“‘six-cutter”) 
hexanucleotide site? Which one is likely to form a cruciform structure? 
Which one is likely to be found in Z-DNA? Which one represents the 
5'-end of a tRNA gene? Which one is most likely to be found in a 
triplex DNA structure? 


a. CGCGCGCCGCGCACGCGCTCGCGCGCCGC 

b. GAACGTCGTATTCCCGTACGACGTTC 

c. CAGGTCTCTCTCTCTCTCTCTC 

d. TGGTGCGAATTCTGTGGAT 

e. ATCGGAATTCATCG 

Interactions between amino acids and DNA. (Integrated with 


Chapter 4) Refer to Figure 4.3. Which amino acids may interact with 
DNA through 


a. hydrogen bonding, 

b. ionic interactions, or 

c. intercalating interactions? Explain your answer 

Draw the Cloverleaf Structure of a tRNA from RNA Sequence Information 
The nucleotide sequence of E. coli tRNAS" is as follows: 


UGGGGUAUCG ;pCCAAGC—GGU »AAGGCACCGG3)AU- 
UCUGAYWWCyCGGCAUUCCG;,AGGT WCGAAU 69 CCUCGUA- 
CCCpCAGCCA7¢6 


From this primary structure information, draw the secondary 
structure (cloverleaf) of this RNA and identify its anticodon. 


Cruciform DNA Consider the dsDNA shown and draw a possible 
cruciform structure for this DNA segment. 


5'-CGATAGTCCATGCCTAGCATGGACTTAATGC-3’ 
3'-GCTATCAGGTACGGATCGTACCTGAATTACG-S' 
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20. 


21. 


22. 


23. 


24. 
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Use the Tools of the Protein Data Bank to Explore Ribosome Structure 
The Protein Data Bank (PDB) is also a repository for nucleic acid 
structures. Go to the PDB at www.rcsb.org and enter pdb id = 1Y12. 
1Y12 is the PDB ID for the structure of the H. marismortui SOS 
ribosomal subunit with erythromycin bound. Erythromycin is an 
antibiotic that acts by inhibiting bacterial protein synthesis. In the list 
of the display options under the image of the 50S subunit, click on 
one of the viewing options to view the structure. Using the tools of 
the viewer, zoom in and locate erythromycin within this structure. If 
the 50S ribosomal subunit can be compared to a mitten, where in the 
mitten is erythromycin? 


Use the Human Genome Database to Explore the Relationship Between 
Genes, Proteins, and Diseases Online resources provide ready access 
to detailed information about the human genome. Go the National 
Center for Biotechnology Information (NCBI) genome database at 
http://www.ncbi.nlm.nih.gov/Genomes/index.html. Under the 
“Custom Resources” page that appears, click on “Human Genome.” 
Click on “Chromosome 6” in the array of displayed chromosomes. In 
the Genome Data Viewer page that comes up, locate the “Search 
assembly” box, type in “TNF,” and click on the arrow to start the 
search. Note the TNF chromosome 6 location information for TNF 
displayed in the box below the “Search assembly” box. This informa- 
tion is given in base pairs. Scroll over the NM_000594.4 transcript 
displayed to the right to reveal further information about TNF. 

a. What does TNF stand for? 

b. What is the chromosomal location of TNF in base pairs? 

c. How many exons does the TNF gene have? 

d. How many amino acid residues does the TNF protein have? 


G:C Content in Bacteria (Integrates with Chapter 10.) Erwin Chargaff 
did not have any DNA samples from thermoacidophilic bacteria such 
as those that thrive in the geothermal springs of Yellowstone National 
Park. (Such bacteria had not been isolated by 1951 when Chargaff 
reported his results.) If he had obtained such a sample, what do you 
think its relative G:C content might have been? Why? 


Structural Features of B-form DNA Think about the structure of DNA 
in its most common B-form double helical conformation and then list 
its most important structural features (deciding what is “important” 
from the biological role of DNA as the material of heredity). Arrange 
your answer with the most significant features first. 
The Dimensions of DNA An average base pair in DNA contributes 
107?! g to the weight and 0.32 nm to the length of a DNA molecule. 
a. What is the total weight and total length of the DNA contained 
in a human cell? 
b. The circumference of the earth at the equator is about 
40,000 km. How many base pairs would a double-stranded DNA 
molecule that circled the earth have? 


c. How much would it weigh? 


The distance from the earth to the sun = 150 million km. 


d. How many base pairs would a double-stranded DNA that 
stretched from the earth to the sun have? 


e. How much would such a DNA molecule weigh? 


25. How Structural Differences Between DNA and RNA Affect Its Properties 
Cellular DNA molecules are double stranded. RNAs are single- 
stranded molecules with double-stranded regions. For each of the 
following properties, describe how DNA and RNA may differ and 
why. 

Hyperchromic shift 

Colligative properties of DNA vs. RNA 
Inverted repeats 

Tm (melting points) 

Buoyant density 

Chemical and physical stability 


26. Modified Nucleotides in DNA or RNA The bases found in DNA or 
RNA are often chemically modified. Methylation (addition of -CH;) 
of Cin DNA (see Box Human Biochemistry DNA methylation, CpG 
Islands, and Epigenetics) or of other bases in RNA are very common 
modifications. When noting methylation, both the base and the exact 
position that is methylated are described. As an example, the presence 
of “m>C” in a sequence would indicate that cytosine is methylated in 
position 5. Revisit Figures 10.2, 10.3, and 10.4 and draw the struc- 
tures of m3C, mC, m’G, m!A, and m°A. 


27. DNA Bending As evident from Problem 24, double-stranded DNA 
molecules are extremely long and extremely thin. Over short 
distances, such molecules behave in solution like a rigid rod, but over 
long distances, they act more like flexible noodles (see Figure 11.30). 
DNA flexibility is essential to its compaction in the nucleus. 
Persistence length is the physical property that defines the bending 
stiffness of polymers. Sections of the polymer shorter than the 
persistence length behave like rigid rods. The path of sections much 
longer than the persistence length is random and can only be 
described statistically. Double-helical DNA has a persistence length 
of about 40 nm. How many helical turns of B-form DNA occur in a 
distance of 40 nm? How many base pairs? Human chromosome 8 has 
a DNA molecule that is 146,259,671 bp-long, What is the ratio of the 
overall length of this DNA to its persistence length? What effect 
might the organization of DNA into chromatin have on its 
persistence length? 


Think-Pair-Share Question 


Gene Sequencing with Procedural Mistakes A segment of DNA was se- 
quenced using the Sanger method (see Section 11.1b and Figure 11.3) but 
mistakes were made during the process: 

The segment was sequenced in the absence of: 


a. ddATP and the following sequence was obtained: 
TCCGTTGGCTTGCTCC 


b. ddCTP and the following sequence was obtained: 
TAGTTAGGTTGAT 


c. ddTTP and the following sequence was obtained: 
ACCGAGGCGCACC 


d. ddGTP and the following sequence was obtained: 
TACCTTACTTCATCC 


What was the sequence of the original segment of DNA? 
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Recombinant DNA, Cloning, 12 
Gene Editing, and Synthetic 
Biology—An Introduction 


...-how many vain chimeras have you created?... 
Go and take your place with the seekers after gold. 


Leonardo da Vinci 
The Notebooks (1508-1518), Volume II, 
Chapter 25 


< The Chimera of Arezzo, of Etruscan origin 
and probably from the fifth century 8.c., was 
found near Arezzo, Italy, in 1553. Chimeric 
animals existed only in the imagination of 


Scala/Art Resource, NY 


the ancients. But the ability to create 
chimeric DNA molecules is a very real 
technology that has opened up many new 
avenues of scientific investigation. 


Essential Questions 


Using techniques for the manipulation of nucleic acids in the laboratory, scientists can 
join together DNA segments from different sources. Such manmade products are called 
recombinant DNA molecules, and the use of such molecules to alter the genetics of 
organisms is termed genetic engineering. What are the methods that scientists use to create 
recombinant DNA molecules; can scientists create genes from recombinant DNA molecules; 
and can scientists modify the heredity of an organism using recombinant DNA? 


echnologies for the laboratory manipulation of nucleic acids emerged in the early 

1970s. In turn, these technologies led to the construction of DNA molecules composed 
of nucleotide sequences taken from different sources. The products of these innovations, 
recombinant DNA molecules,! opened exciting new opportunities in molecular biology and 
genetics, and a new field was born—recombinant DNA technology. Genetic engineering 
is the application of this technology to the manipulation of genes. These advances were 
made possible by methods for amplification of any DNA segment of interest, regardless of 
source, within bacterial host cells and later on, eukaryotic cells, including those of mam- 
mals. In the language of recombinant DNA technology, the cloning of virtually any DNA 
sequence became feasible. Cloning has led to methodologies to identify specific DNA and 
RNA sequences among an overwhelming amount of extraneous nucleic acid, to express 
cloned genes and produce desired proteins in cell cultures, to inactivate gene expression, to 
edit DNA nucleotide sequences, and to alter the metabolism by introducing new enzymes, 
to list just a few of the achievements. Parallel advances in nanotechnology and microfluid- 
ics render these methodologies even more powerful. What follows is an introduction and 
overview of what is possible today. Tomorrow will bring even more dramatic technology 
for DNA, RNA, and protein analysis. 


'The advent of molecular biology, like that of most scientific disciplines, generated a jargon all its own. 
Learning new fields often requires gaining familiarity with a new vocabulary. We will soon see that many 
words—vector, amplification, and insert are but a few examples—have new meanings that describe the 
marvels of molecular biology. 


| l Key Questions 


12.1 What Is DNA Cloning? 
12.2 What Is a DNA Library? 


12.3 Can the Genes in Libraries Be 
Transcribed and Translated? 


12.4 How Is RNA Interference Used to 
Reveal the Function of Genes? 

12.5 Gene Therapy: Is It Possible for 
Scientists to Alter or Rewrite the 
Genome of an Organism? 

12.6 How Does High-Throughput 
Technology Allow Global Study of 
Millions of Genes or Molecules 
at Once? 


12.7 What Is the Field of Synthetic Biology? 
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12.1 | What Is DNA Cloning? 


In classical biology, a clone is a population of identical organisms derived from a single 
parental organism. For example, the members of a colony of bacterial cells that arise 
from a single cell on a petri plate are a clone. Molecular biology has borrowed the term 
to mean a collection of molecules or cells all identical to an original molecule or cell. 
So, if a single bacterial cell harboring a recombinant DNA molecule in the form of a 
plasmid grows and multiplies on a petri plate to form a colony, the plasmids within the 
millions of cells in the bacterial colony represent a clone of the original DNA mole- 
cule, and these molecules can be isolated and studied. Furthermore, if the cloned DNA 
molecule contains a gene (or part of a gene)—that is, it encodes a functional product— 
a new avenue to isolating and studying this product has opened. Desired proteins can 
be produced in abundance, which greatly expands the possibilities for their biochemi- 
cal analysis and manipulation. Using recombinant DNA molecules that carry genes 
encoding proteins of interest, new functionalities could be introduced into host cells. 
Recombinant DNA methodology offers exciting new vistas in biochemistry. 


12.1a Plasmids Are Very Useful in Cloning Genes 


Plasmids are naturally occurring, circular, extrachromosomal DNA molecules (see 
Chapter 11). Natural strains of the common colon bacterium Escherichia coli isolated 
from various sources contain diverse plasmids. Often these plasmids carry genes spec- 
ifying novel metabolic activities that are advantageous to the host bacterium. These 
activities range from catabolism of unusual organic substances to metabolic functions 
that endow the host cells with resistance to antibiotics, heavy metals, or bacteriophages. 
Plasmids that are able to perpetuate themselves in E. coli, the bacterium favored by 
bacterial geneticists and molecular biologists, are the workhorses of recombinant DNA 
technology. Because restriction endonuclease digestion of plasmids can generate frag- 
ments with overlapping or sticky ends, artificial plasmids can be constructed by ligating 
different fragments together. Such artificial plasmids were among the earliest recombi- 
nant DNA molecules. These recombinant molecules can be autonomously replicated, 
and hence propagated, in suitable bacterial host cells, provided they still possess a site 
signaling where DNA replication can begin (an origin of replication or ori sequence). 


Plasmids as Cloning Vectors This technique relies on the idea that foreign DNA se- 
quences could be inserted into artificial plasmids and that these foreign sequences 
would be carried into E. coli and propagated as part of the plasmid. That is, these 
plasmids could serve as cloning vectors to carry genes. (The word vector is used here in 
the sense of “a vehicle or carrier.”) Plasmids useful as cloning vectors possess three 


Chemistry in History 


Tu Youyou 


Tu Youyou (1930-present) is a Nobel Prize 
winning pharmaceutical chemist who discovered 
both artemisinin, the active component of 
sweet wormwood, and its active metabolite, 
dihydroartemisinin, which are used in the treatment 
of malaria. She was born in 1930 and after having her 
first year of high school derailed by a tuberculosis 
infection, she resolved that her goal was to pursue 
medical science. She graduated from Beijing Medical 
University School of Pharmacy in 1955. 


During the Vietnam War, she was appointed head 
of a secret government research group whose goal 
was to create a treatment for chloroquine-resistant 
malaria. Youyou’s study of classical Chinese herbal 
medicine led her to sweet wormwood (Artemisia 
annua), an ancient treatment for intermittent fever, 
a well-known symptom of malaria. Tu Youyou was 
the first Chinese woman to ever receive a Nobel 
Prize and her work is credited for saving millions of 
lives across the world. 
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common features: a replicator, a selectable marker, and a cloning site (Figure 12.1). A 
replicator is an origin of replication, or ori. The selectable marker is typically a gene 
conferring resistance to an antibiotic. Only cells containing the cloning vector will 
grow in the presence of the antibiotic. Therefore, growth on antibiotic-containing 
media selects for plasmid-containing cells. Typically, the cloning site is a sequence of 
nucleotides representing one or more restriction endonuclease cleavage sites. Cloning 
sites are located where the insertion of foreign DNA neither disrupts the plasmid’s 
ability to replicate nor inactivates essential markers. 


Virtually Any DNA Sequence Can Be Cloned Nuclease cleavage at a restriction site opens, 
or /inearizes, the circular plasmid so that a foreign DNA fragment can be 
inserted. The ends of this linearized plasmid are joined to the ends of the fragment 
so that the circle is closed again, creating a recombinant plasmid (Figure 12.2). 
Recombinant plasmids are hybrid DNA molecules consisting of plasmid DNA 
sequences plus inserted DNA elements (called inserts). Such hybrid molecules are also 
called chimeric constructs or chimeric plasmids. (The term chimera is borrowed from 
mythology and refers to a beast composed of the body and head of a lion, the heads of 
a goat and a snake, and the wings of a bat.) The presence of foreign DNA sequences 
does not adversely affect replication of the plasmid, so chimeric plasmids can be propa- 
gated in bacteria just like the original plasmid. Bacteria often harbor several hundred 
copies of common cloning vectors per cell. Hence, large amounts of a cloned DNA 
sequence can be recovered from bacterial cultures. The enormous power of recombinant 
DNA technology stems in part from the fact that virtually any DNA sequence can be 
selectively cloned and amplified in this manner. DNA sequences that are difficult to 
clone include inverted repeats, origins of replication, centromeres, and telomeres. The 
only practical limitation is the size of the foreign DNA segment: Most plasmids with 
inserts larger than about 10 kbp are not replicated efficiently. However, bacteriophages 
such as bacteriophage A can be manipulated so that DNA sequences as large as 40 kbp 
can be inserted into the bacteriophage genome. Such recombinant phage DNA mole- 
cules lack essential A genes and replicate in E. coli as plasmids. 


Construction of Chimeric Plasmids The creation of chimeric plasmids requires joining 
the ends of the foreign DNA insert to the ends of a linearized plasmid. This ligation is 
facilitated if the ends of the plasmid and the insert have complementary, single-stranded 
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Figure 12.1 One of the first widely used cloning 
vectors, the plasmid pBR322. This 4363-bp plasmid 
contains an ori and genes for resistance to the 
drugs ampicillin (amp") and tetracycline (tet"). 

The locations of restriction endonuclease cleavage 
sites are indicated. 
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Figure 12.2 An EcoRI restriction fragment of for- 
eign DNA can be inserted into a plasmid having an 
EcoRI cloning site by (1) cutting the plasmid at this 
site with EcoRI, (2) annealing the linearized plasmid 
with the EcoRI foreign DNA fragment, and (3) seal- 
ing the nicks with DNA ligase. 


|e Cut with EcoRI Cut with EcoRI 


y 


© Anneal ends of vector 
L and foreign DNA 
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Seal gaps in 
chimeric plasmid 
with DNA ligase 


(3) 


— > 


DNA ligase 


overhangs. Then these ends can base-pair with one another, annealing the two molecules 
together. One way to generate such ends is to cleave the DNA with restriction enzymes 
that make staggered cuts; many such restriction endonucleases are available (see Table 
10.2). For example, if the sequence to be inserted was generated by EcoRI digestion 
and the plasmid is cut with EcoRI, the single-stranded sticky ends of the two DNAs 
can anneal (Figure 12.2). The interruptions in the sugar-phosphate backbone of DNA 
can then be sealed with DNA ligase to yield a covalently closed, circular chimeric plas- 
mid. DNA ligase is an enzyme that covalently links adjacent 3’-OH and 5’-PO, groups. 
An inconvenience of this strategy is that any pair of EcoRI sticky ends can anneal with 
each other. So, plasmid molecules can reanneal with themselves, as can the foreign DNA 
restriction fragments. These DNAs can be eliminated by selection schemes designed to 
identify only those bacteria containing chimeric plasmids. 

Blunt-end ligation is an alternative method for joining different DNAs. The most 
widely used DNA ligase, bacteriophage T4 DNA ligase, is an ATP-dependent enzyme that 
can even ligate two DNA fragments whose ends lack overhangs (blunt-ended DNAs). 
Many restriction endonucleases cut double-stranded DNA so that blunt ends are formed. 

An astounding number of variations on these basic themes has emerged. For example, 
short synthetic DNA duplexes whose nucleotide sequence consists of little more than a 
restriction site can be blunt-end ligated onto any DNA. These short DNAs are known as 
linkers. Cleavage of the ligated DNA with the restriction enzyme then leaves tailor-made 
sticky ends useful in cloning reactions (Figure 12.3). Similarly, many vectors contain a 
polylinker cloning site, a short region of DNA sequence bearing numerous restriction sites. 


Promoters and Directional Cloning Note that the strategies discussed thus far create hybrids 
in which the orientation of the DNA insert within the chimera is random. Sometimes it is 
desirable to insert the DNA in a particular orientation. For example, an experimenter might 
wish to insert a particular DNA segment (a gene) in a vector so that its gene product is 
synthesized. To do this, the DNA must be placed downstream from a promoter. A promoter 
is a nucleotide sequence lying upstream of a gene. The promoter controls expression of the 
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(b) A vector cloning site containing multiple restriction sites, 
a so-called polylinker. 
1 2 3 4 5 1 2 3 4 5 6 7 8 9 10 11 12 13 14 6 
Ser Pro Asp Pro Ser Thr Cys Arg Ser Thr Asp Pro Gly Asn = 
TCC CCG GAT CCG TCG ACC TGC AGG TCG ACG GAT CCG GGG AAT 
I RY HY A *{U0 ~~ — 


BamHI Sall Psi Sall BamHI EcoRI 
Acd Acad 
HindI Hind 


gene. RNA polymerase molecules bind specifically at promoters and initiate transcription 
of adjacent genes, copying template DNA into RNA products. One way to insert DNA so 
that it will be properly oriented with respect to the promoter is to create DNA molecules 
whose ends have different overhangs. Ligation of such molecules into the plasmid vector 


can only take place in one orientation to give directional cloning (Figure 12.4). 
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Figure 12.3 The use of linkers to create tailor-made 
ends on cloning fragments. (a) The ligation reaction 
can add multiple linkers on each end of the blunt- 
ended DNA. EcoRI digestion removes all but the ter- 
minal one, leaving the desired 5'-overhangs. 

(b) Cloning vectors often have polylinkers consisting 
of a multiple array of restriction sites at their cloning 
sites, so restriction fragments generated by a variety 
of endonucleases can be incorporated into the vector. 
Note that the polylinker is engineered not only to 
have multiple restriction sites, but also to have an 
uninterrupted sequence of codons, so this region of 
the vector has the potential for translation into pro- 
tein (see Figure 12.15). (Adapted from Figure 1.14.2 
in Greenwich, D., and Brent, R., UNIT 1.14 
Introduction to Vectors Derived from Filamentous 
Phages, in Current Protocols in Molecular Biology, 
Ausubel, F. M., et al., eds. New York: John Wiley and 
Sons, 2003.) 
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Figure 12.4 Directional cloning. DNA molecules whose ends have different overhangs can be used to form chimeric constructs in 
which the foreign DNA can enter the plasmid in only one orientation. The foreign DNA and the plasmid are digested with the 


same two enzymes. pUC stands for universal cloning plasmid. 
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(4363 bases) 
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@ A plasmid with ge 


ampicillin resistance (amp") and 
tetracycline resistance ((et"). 

A BamHI restriction site is 
located within the tet" gene. 


Tetra 


medium 
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Biologically Functional Chimeric Plasmids The first biologically functional chimeric 
DNA molecules constructed in vitro were assembled from parts of different plasmids 
in 1973 by Stanley Cohen, Annie Chang, Herbert Boyer, and Robert Helling. These 
plasmids were used to transform recipient E. coli cells (transformation means the uptake 
and replication of exogenous DNA by a recipient cell). To facilitate transformation, the 
bacterial cells were rendered somewhat permeable to DNA by Ca? treatment and a 
brief 42°C heat shock. Although less than 0.1% of the Ca?*-treated bacteria became 
competent for transformation, transformed bacteria could be selected by their resis- 
tance to certain antibiotics (Figure 12.5). Consequently, the chimeric plasmids must 
have been biologically functional in at least two aspects: They replicated stably within 
their hosts, and they expressed the drug resistance markers they carried. 


© tet" gene is inactivated by the 
insertion of DNA fragment. 
amp" gene remains intact. 


amp" gene 
remains intact. 


ee 


© BamHI restriction fragment of 
DNA to be cloned is inserted 
into the BamHI site of tet’. 


Chimeric plasmid 


@ Suspend 20 ng plasmid DNA + 107 
E.coli cells in CaCl solution. 


nes for 


42°C, 2 min 


@ Plate bacteria on ampicillin media. 


37°C, overnight 


cycline-containing 


Ampicillin- 
containing 
medium 
re < 
37°C, overnight 
®© Only tet" colonies appear; @ Using velvet-covered disc, bacterial @ Only ampicillin-resistant 
tet’ colonies can be colonies are lifted from surface of agar (amp") bacterial colonies 
recovered from amp" plate amp’ plate and pressed briefly to surface grow. 


by comparing two plates. 


of plate containing tetracycline media. 


Figure 12.5 A typical bacterial transformation experiment. Here the plasmid pBR322 is the cloning 
vector. (1) Cleavage of pBR322 with BamHI, followed by (2) annealing and ligation of inserts gener- 
ated by BamHI cleavage of some foreign DNA, (3) creates a chimeric plasmid. (4) The chimeric plas- 
mid is then used to transform Ca?*-treated heat-shocked E. coli cells, and the bacterial sample is 
plated on a petri plate. (5) Following incubation of the petri plate overnight at 37°C. (6) Colonies of 
amp" bacteria are evident. (7) Replica plating of these bacteria on plates of tetracycline-containing 
media (8) reveals which colonies are tet’ and which are tetracycline sensitive (tet). Only the ters 
colonies possess plasmids with foreign DNA inserts. 
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In general, plasmids used as cloning vectors are engineered to be small (2.5 kbp 
to about 10 kbp in size) so that the size of the insert DNA can be maximized. These 
plasmids have only a single origin of replication, so the time necessary for complete rep- 
lication depends on the size of the plasmid. Under selective pressure in a growing culture 
of bacteria, overly large plasmids are prone to delete any nonessential genes, such as any 
foreign inserts. Such deletion would thwart the purpose of most cloning experiments. The 
useful upper limit on cloned inserts in typical plasmids is about 10 kbp. Many eukaryotic 
genes exceed this size. F’, an atypical plasmid, can carry foreign DNA inserts of 350 kbp 
or more. Recombinant F’ constructs, called BACs, for bacterial artificial chromosomes, 
have proven to be very useful in cloning eukaryotic DNA in bacterial hosts. 


12.1b Shuttle Vectors Are Plasmids That Can Propagate 
in Two Different Organisms 


Shuttle vectors are plasmids capable of propagating and transferring (shuttling) genes 
between two different organisms, one of which is typically a prokaryote (E. coli) and 
the other a eukaryote (for example, yeast). Shuttle vectors must have unique origins 
of replication for each cell type and different markers for the selection of transformed 
host cells harboring the vector (Figure 12.6). Shuttle vectors have the advantage that 
eukaryotic genes can be cloned in bacterial hosts, yet the expression of these genes can 
be analyzed in appropriate eukaryotic cells. 


12.1c Artificial Chromosomes Can Be Created from Recombinant DNA 


DNA molecules two megabase pairs in length have been successfully propagated in yeast by 
creating yeast artificial chromosomes, or YACs. Furthermore, such YACs have been trans- 
ferred into transgenic mice for the analysis of large genes or multigenic DNA sequences 
in vivo, that is, within the living animal. For these large DNAs to be replicated in the yeast 
cell, YAC constructs must include not only an origin of replication (known in yeast terminol- 
ogy as an autonomously replicating sequence or ARS) but also a centromere and telomeres. 
Recall that centromeres provide the site for attachment of the chromosome to the spindle 
during mitosis and meiosis, and telomeres are nucleotide sequences defining the ends of 
chromosomes. Telomeres are essential for proper replication of the chromosome. 
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Figure 12.6 A typical shuttle vector. LEU21 is a gene in the yeast pathway for leucine biosynthesis. The 
recipient yeast cells are LEU2~ (defective in this gene) and thus require leucine for growth. LEU2~ yeast 
cells transformed with this shuttle vector can be selected on medium lacking any leucine supplement. 
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Human artificial chromosomes, or HACs, have also been created. HACs typically carry 
5 to 10 x 10? kbp of DNA, including features required for chromosome replication, such 
as centromeres, telomeres, and replication origins (normal human chromosomes have 
50 to 250 X 103 kbp of DNA). HACs are constructed by removing most of the DNA 
from a selected human chromosome and then introducing one or more genes of inter- 
est. HACs offer the advantage of introducing large amounts of recombinant DNA into 
human cells, ideally for the treatment of disease. 


12.2 | What Is a DNA Library? 


A DNA library is a set of cloned fragments that collectively represent the genes of a spe- 
cific organism. Particular genes can be isolated from DNA libraries, much as books can 
be obtained from conventional libraries. The secret is knowing where and how to look. 


12.2a Genomic Libraries Are Prepared from the 
Total DNA in an Organism 


Any particular gene constitutes only a small part of an organism’s genome. For exam- 
ple, if the organism is a mammal whose entire genome exceeds 10° kbp and the gene 
is 10 kbp, then the gene represents less than 0.001% of the total nuclear DNA. It is 
impractical to attempt to recover such rare sequences directly from isolated nuclear 
DNA because of the overwhelming amount of extraneous DNA sequences. Instead, 
a genomic library is prepared by isolating total DNA from the organism, digesting it 


Critical Developments in Biochemistry 


Combinatorial Libraries 


Specific recognition and binding of other molecules is a defining 
characteristic of any protein or nucleic acid. Often, target ligands 
of a particular protein are unknown, or in other instances, a unique 
ligand for a known protein may be sought in the hope of blocking 
the activity of the protein or otherwise perturbing its function. 
Or, the hybridization of nucleic acids with each other according to 
base-pairing rules, as an act of specific recognition, can be exploited 
to isolate or identify pairing partners. Combinatorial libraries 
are the products of strategies to facilitate the identification and 
characterization of macromolecules (proteins, DNA, RNA) that 
interact with small-molecule ligands or with other macromolecules. 
Unlike genomic libraries, combinatorial libraries consist of synthetic 
oligomers. Arrays of synthetic oligonucleotides printed as tiny dots 
on miniature solid supports are known as DNA chips. (See Section 
12.2f DNA Microarrays (Gene Chips) Are Arrays of Different 
Oligonucleotides Immobilized on a Chip.) 

Specifically, combinatorial libraries contain very large numbers 
of chemically synthesized molecules (such as peptides or oligonucle- 
otides) with randomized sequences or structures. Such libraries are 
designed and constructed with the hope that one molecule among 
a vast number will be recognized as a ligand by the protein (or 
nucleic acid) of interest. If so, perhaps that molecule will be useful 
in a pharmaceutical application. For instance, the synthetic oligomer 
may serve as a drug to treat a disease involving the protein to which 
it binds. 

An example of this strategy is the preparation of a synthetic 
combinatorial library of hexapeptides. The maximum number of 
sequence combinations for hexapeptides is 20°, or 64,000,000. One 
approach to simplify preparation and screening possibilities for such 


a library is to specify the first two amino acids in the hexapeptide, and 
the next four are randomly chosen. In this approach, 400 libraries (207) 
are synthesized, each of which is unique in terms of the amino 
acids at positions | and 2 but random at the other four positions 
(as in AAXXXX, ACXXXX, ADXXXX, etc.), so each of the 400 
libraries contains 204, or 160,000, different sequence combinations. 
Screening these libraries with the protein of interest reveals which 
of the 400 libraries contains a ligand with high affinity. Then, this 
library is expanded systematically by specifying the first three amino 
acids (knowing from the chosen 1-of-400 libraries which amino 
acids are best as the first two); only 20 synthetic libraries (each 
containing 20°, or 8000, hexapeptides) are made here (one for each 
third-position possibility, with the remaining three positions being 
randomized). Selection for ligand binding, again with the protein 
of interest, reveals the best of these 20, and this particular library 
is then varied systematically at the fourth position, creating 20 more 
libraries (each containing 20’, or 400, hexapeptides). This cycle of 
synthesis, screening, and selection is repeated until all six positions 
in the hexapeptide are optimized to create the best ligand for the 
protein. A variation on this basic strategy using synthetic oligonucle- 
otides rather than peptides identified a unique 15-mer (sequence 
GGTTGGTGTGGTTGG) with high affinity (Kp = 2.7 nM) toward 
thrombin, a serine protease in the blood coagulation pathway. 
Thrombin is a major target for the pharmacological prevention of 
clot formation in coronary thrombosis. 


Reference 
Cortese, R., 1996. Combinatorial Libraries: Synthesis, Screening and Appli- 
cation Potential. Berlin: Walter de Gruyter. 
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into fragments of suitable size, and cloning the fragments into an appropriate vector. Master plaie ot 
This approach is called shotgun cloning because the strategy has no way of targeting bacterial colonies 
a particular gene but instead seeks to clone all the genes of the organism at one time. 
The intent is that at least one recombinant clone will contain at least part of the gene 
of interest. Usually, the isolated DNA is only partially digested by the chosen restric- 
tion endonuclease so that not every restriction site is cleaved in every DNA molecule. 
Then, even if the gene of interest contains a susceptible restriction site, some intact 
genes might still be found in the digest. Genomic libraries have been prepared from © Replicate onto 
thousands of different species. absorbent disc. 
Many clones must be created to be confident that the genomic library contains the 


gene of interest. The probability, P, that some number of clones, N, contains a particu- 
lar fragment representing a fraction, f, of the genome is 
P=1-(1- f)" 


Thus, 
@ Treat with NaOH; 
In(1 — P) neutralize, dry. 
N= ——_y 
In(1 — f) 
Denatured 

For example, if the library consists of 10 kbp fragments of the E. coli genome (4640 kbp DNA bound 
total), more than 2000 individual clones must be screened to have a 99% probability oe 
(P = 0.99) of finding a particular fragment. Since f = 10/4640 = 0.0022 and P = 0.99, 
N = 2093. For a 99% probability of finding a particular sequence within the 3 X 10° kbp 
human genome, N would equal almost 1.4 million if the cloned fragments averaged © Place disc in| 
10 kbp in size. The need for cloning vectors capable of carrying very large DNA inserts ae A 
becomes obvious from these numbers. i era DNA 


probe. 


12.2b Libraries Can Be Screened for the Presence of Specific Genes 


Any given DNA fragment is unique solely because of its specific nucleotide sequence. The 
only practical way to find one particular gene or DNA segment among a vast population 
of different DNA fragments (such as you might find in genomic DNA preparations) is to 
exploit its unique sequence to identify it. A simple way to screen genomic libraries is to 
carry out a colony hybridization experiment. In a typical experiment, host bacteria contain- 
ing a plasmid-based library are plated out on a petri dish and allowed to grow overnight to 
form colonies (Figure 12.7). A replica of the bacterial colonies is then obtained by overlay- 
ing the plate with a flexible, absorbent disc. The disc is removed, treated with alkali to dis- 
sociate bound DNA duplexes into single-stranded DNA, dried, and placed in a sealed bag 
with a labeled probe. A probe is usually a single-stranded DNA of defined sequence that 
is distinctively labeled with some easily detectable marker, such as a fluorescent tag. The @ Wash disc, visualize 
nucleotide sequence of the probe is designed to be complementary to the sought-for or tar- Ree 
get DNA fragment. To detect a colony that contains the desired DNA, the prehybridized with master 
plate. 


Labeled probe 
will hybridize with 
its complementary DNA. 
Figure 12.7 Screening a genomic library by colony hybridization. Host bacteria transformed with a 

plasmid-based genomic library are plated on a petri plate and incubated overnight to allow bacterial 
colonies to form. (1) A replica of the colonies is obtained by overlaying the plate with a flexible disc 
composed of absorbent material (such as nitrocellulose or nylon). (2) Nitrocellulose strongly binds 


nucleic acids; single-stranded nucleic acids are bound more tightly. Once the disc has taken up an © Labeled probe 


impression of the bacterial colonies, it is removed and the petri plate is set aside and saved. The disc is identifies 

treated with 2 M NaOH, neutralized, and dried. NaOH both lyses any bacteria (or phage particles) colonies 

and dissociates the DNA strands. When the disc is dried, the DNA strands become immobilized on the eres - 
esired. 


filter. (3) The dried disc is placed in a sealable plastic bag, and a solution containing heat-denatured 
(single-stranded), labeled probe is added. (Probes can be labeled in a variety of ways: fluorescent or 
colored dyes, radioactive isotopes, and more.) The bag is incubated to allow annealing of the probe 
DNA to any target DNA sequences that might be present on the disc. (4) The filter is then washed, 
dried, and prepared for visualization of the presence of labeled probe. (5) The position of any spots 
reveals where the labeled probe has hybridized with target DNA. The location of these spots can be Labeling reveals position (s) of 
used to recover the genomic clone from the bacteria on the original petri plate. colonies bearing the desired clone. 
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Known amino acid sequence: 
Phe Met Glu Trp His Lys Asn 


Possible mRNA sequence: 


UUU AUG GAA UGG CAU AGG AAU 
UUC GAG CAC AAA AAC 
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Figure 12.8 Cloning genes using oligonucleotide 
probes designed from a known amino acid sequence. 
A labeled set of DNA (degenerate) oligonucleotides 
representing all possible mRNA coding sequences is 
synthesized and is used to probe the genomic library 
by colony hybridization (see Figure 12.7). (Adapted 
from Figure 19-18, in Watson, J. D., et al., 
Molecular Biology of the Gene 4/e. Menlo Park, 
Calif.: Benjamin/Cummings, 1987.) 


sheet is incubated in a sealed plastic bag with a solution of specific probe molecules (step 
5 in figure). The single-stranded probe DNA anneals with the single-stranded target DNA 
bound to the sheet through specific base pairing to form a DNA duplex. This annealing, 
or hybridization as it is usually called, labels the target DNA, revealing its location. If the 
probe DNA is duplex DNA, it must be denatured by heating at 70°C. The probe and 
target DNA complementary sequences must be in a single-stranded form if they are to 
hybridize with one another. Any DNA sequences complementary to probe DNA will be 
revealed by the location of the hybridized labeled probe on the absorbent disc. Bacterial 
colonies containing clones bearing target DNA can be recovered from the master plate. 


12.2c Probes for Screening Libraries Can Be Prepared 
in a Variety of Ways 


Clearly, specific probes are essential reagents if the goal is to identify a particular gene or 
gene fragment against a background of innumerable DNA sequences. Usually, the probes 
used to screen libraries are nucleotide sequences that are complementary to some part of 
the target gene. Making useful probes requires some information about the gene’s nucleo- 
tide sequence. Sometimes, such information is available. Alternatively, if the amino acid 
sequence of the protein encoded by the gene is known, it is possible to work backward 
through the genetic code to the DNA sequence (Figure 12.8). Because the genetic code 
is degenerate (that is, several codons may specify the same amino acid; see Chapter 30), 
probes designed by this approach are usually degenerate oligonucleotides about 17 to 50 
residues long (such oligonucleotides are 17- to 50-mers). The oligonucleotides are syn- 
thesized so that different bases are incorporated at sites where degeneracies occur in the 
codons. The final preparation thus consists of a mixture of equal-length oligonucleotides 
whose sequences vary to accommodate the degeneracies. Presumably, one oligonucleotide 
sequence in the mixture will hybridize with the target gene. These oligonucleotide probes 
are at least 17-mers because shorter degenerate oligonucleotides might hybridize with 
sequences unrelated to the target sequence. 

A piece of DNA from the corresponding gene in a related organism can also be 
used as a probe in screening a library for a particular gene. Such probes are termed 
heterologous probes because they are not derived from the homologous (same) organ- 
ism. Problems arise if a complete eukaryotic gene is the cloning target; eukaryotic 
genes can be tens or even hundreds of kilobase pairs in size. Genes of this size are 
fragmented in most cloning procedures. Thus, the DNA identified by the probe may 
represent a clone that carries only part of the desired gene. However, most cloning 
strategies are based on a partial digestion of the gnomic DNA, a technique that gener- 
ates an overlapping set of genomic fragments. This being so, DNA segments from the 
ends of the identified clone can now be used to probe the library for clones carrying 
DNA sequences that flanked the original isolate in the genome. Repeating this process 
ultimately yields the complete gene among a subset of overlapping clones. 


12.2d PCR Is Used to Clone and Amplify Specific Genes 


Provided sequence information is available, polymerase chain reaction, or PCR, is a tech- 
nique for identifying and dramatically amplifying the amount of a specific DNA segment, 
and PCR amplification is an effective, indeed the preferred, cloning strategy. PCR relies on 
DNA polymerase to carry out multiple rounds of DNA synthesis. A preparation of dena- 
tured DNA containing the desired segment (such as total genomic DNA) serves as a tem- 
plate for DNA polymerase, and two specific oligonucleotides serve as primers (as shown in 
Figure 12.9). The design of appropriate primers for the DNA polymerase is key to PCR, 
because these primers must base-pair with the template adjacent to the DNA sequence 
of interest (see Figure 11.2). These primers are designed to be complementary to the two 
3’-ends of the specific DNA segment to be amplified; primers are added in excess amounts 
of 1000 times or greater (Figure 12.9). They prime the DNA polymerase-catalyzed syn- 
thesis of the two complementary strands of the desired segment, effectively doubling its 
concentration in the solution. Then the DNA is heated to dissociate the DNA duplexes, 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


12.2 What Is a DNA Library? 
35! 


Figure 12.9 Polymerase chain reaction (PCR). 
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followed by cooling so that more primers can bind to both the newly formed and the old 
strands. Another cycle of DNA synthesis follows. The protocol has been automated by car- 
rying out the reaction in thermal cyclers that alternately heat the reaction mixture to 95°C to 
dissociate the DNA, followed by cooling, annealing of primers, and another round of DNA 
synthesis. The use of heat-stable DNA polymerases from thermophilic bacteria (such as the 

Taq DNA polymerase from Thermus aquaticus) makes it unnecessary to add fresh enzyme 
for each round of synthesis. Because the amount of target DNA theoretically doubles each 
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Identifying Specific DNA Sequences by Southern Blotting (Southern Hybridization) 


As discussed, any given DNA fragment is unique solely by virtue 
of its specific nucleotide sequence. The only practical way to find a 
particular DNA segment among a vast population of different DNA 
fragments (such as you might find in genomic DNA preparations) 
is to exploit its sequence specificity to identify it. In 1975, E. M. 
Southern invented a technique capable of doing just that. 


Electrophoresis 


Southern first fractionated a population of DNA fragments 
according to size by gel electrophoresis (see step 2 in figure). The 
electrophoretic mobility of a nucleic acid is inversely proportional to 
its molecular mass. Polyacrylamide gels are suitable for the separation 
of nucleic acids of 25 to 2000 bp. Agarose gels are better if the 
DNA fragments range up to 10 times this size. Most preparations of 
genomic DNA show a broad spectrum of sizes, from less than 1 kbp 
to more than 20 kbp. Typically, no discrete-size fragments are evident 
following electrophoresis, just a smear of DNA throughout the gel. 


Blotting 


Once the fragments have been separated by electrophoresis 
(step 3), the gel is soaked in alkali to convert the duplex DNA into 
single-stranded DNA. After adjusting the pH of the gel to neutral, a 
sheet of absorbent material is placed over the gel, and salt solution is 
drawn through the gel by electrophoresis in a direction perpendicular 
to the direction of the first electrophoresis (step 4). The salt 


DNA 


(5) DNA fragments are bound 
to the sheet in positions 
identical to those on the gel 


Hybridize sheet with 
labeled probe 


The Southern blotting technique involves 

the transfer of electrophoretically separated 
DNA fragments to an absorbent sheet and 
subsequent detection of specific DNA 
sequences, using labeled, single-stranded probe 
DNA that hybridizes with target DNA sequences 
complementary to it. The location of these target 
sequences is revealed by the labeled probe. 


Labeled probe 
solution 


solution carries the DNA to the absorbent sheet, which binds the 
ssDNA molecules very tightly, effectively immobilizing them on 
the sheet. The distribution pattern of separated DNA fragments 
is maintained when the ssDNA molecules bind to the absorbent 
sheet (step 5 in figure). The sheet is dried and then incubated with 
a solution containing protein (serum albumin, for example) and/or 
a detergent such as sodium dodecylsulfate to saturate any remaining 
binding sites for DNA on the sheet, so no more DNA can bind 
nonspecifically. 


Hybridization 

To detect a particular DNA within the electrophoretic smear of 
countless DNA fragments, the sheet is exposed to a specific probe 
(step 6 in figure). A probe is usually a ssDNA of defined sequence 
that carries a fluorescent tag or some other easily detectable marker. 
The sequence of the probe is designed to be complementary to 
the target DNA fragment. The probe DNA base pairs with the 
target DNA, revealing its position on the sheet (step 7 in figure). 
Southern’s procedure has been extended to the identification of 
specific RNA and protein molecules: In a play on Southern’s name, 
the identification of particular RNAs following separation by gel 
electrophoresis, blotting, and probe hybridization is called Northern 
blotting. The analogous technique for identifying protein molecules is 
termed Western blotting. In Western blotting, the probe of choice is 
usually an antibody specific for the target protein. 
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round, 25 rounds would increase its concentration about 33 million times. In practice, the 
increase is actually more like a million times, which is more than ample for gene isolation. 
Thus, starting with a tiny amount of total genomic DNA, a particular sequence can be 
produced in quantity in a few hours. Because DNA from a single cell can be used as a tem- 
plate, the technique has enormous potential for the clinical diagnosis of infectious diseases 
and genetic abnormalities. With PCR techniques, DNA from a single hair or sperm can be 
analyzed to identify particular individuals in criminal cases without ambiguity. 


12.2e cDNA Libraries Are DNA Libraries Prepared from mRNA 


Complementary DNAs (cDNAs) are DNA molecules copied from mRNA templates. 
cDNA libraries are constructed by synthesizing cDNA from purified cellular mRNA. 
These libraries present an alternative strategy for gene isolation, especially eukaryotic 
genes. Because most eukaryotic mRNAs carry 3’-poly(A) tails, mRNA can be selectively 
isolated from preparations of total cellular RNA by oligo(dT)-cellulose chromatography 
(Figure 12.10). DNA copies of the purified mRNAs are synthesized by first annealing 
short oligo (dT) chains to the poly(A) tails. These oligo(dT) chains serve as primers for 
reverse transcriptase-driven synthesis of DNA (Figure 12.11). Reverse transcriptase is an 
enzyme that synthesizes a DNA strand, copying RNA as the template. DNA polymerase 
is then used to copy the DNA strand and form a double-stranded (duplex DNA) mole- 
cule. Linkers are then added to the DNA duplexes rendered from the mRNA templates, 
and the cDNAs are cloned into a suitable vector to create a cDNA library. 

RT-PCR (reverse transcriptase-PCR), a variation on the basic PCR method, is useful 
for cloning the gene that encodes a specific mRNA. Reverse transcriptase (RT) is used 
to synthesize a cDNA strand complementary with the RNA, and this cDNA serves 
as the template for further cycles of PCR. This amplified DNA is then used to probe 
genomic libraries to isolate the gene of interest. 

Because different cell types in eukaryotic organisms express selected subsets of genes, 
RNA preparations from cells or tissues in which genes of interest are selectively transcribed 
are enriched for the desired mRNAs. cDNA libraries prepared from such mRNA are rep- 
resentative of the pattern and extent of gene expression that uniquely define particular 
kinds of differentiated cells. cDNA libraries of many normal and abnormal human cell 
types are commercially available, including cDNA libraries of tumor cells. Comparison of 
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Figure 12.10 Isolation of eukaryotic mRNA via 
oligo(dT)-cellulose chromatography. (a) In the pres- 
ence of 0.5 M NaCl, the poly(A) tails of eukaryotic 
mRNA anneal with short oligo(dT) chains cova- 
lently attached to an insoluble chromatographic 
matrix such as cellulose. Other RNAs, such as rRNA 
(green), pass right through the chromatography col- 
umn. (b) The column is washed with more 0.5 M 
NaCl to remove residual contaminants. (c) Then the 
poly(A) mRNA (red) is recovered by washing the 
column with water because the base pairs formed 
between the poly(A) tails of the mRNA and the 
oligo(dT) chains are unstable in solutions of low 
ionic strength. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


406 Chapter 12 Recombinant DNA, Cloning, Gene Editing, and Synthetic Biology—An Introduction 


mRNA 5 AAARAA 3' 
Anneal oligo(dT) 191g primers 
y 
mRNA 5 AAAAAA 3' 
TITTTT 
(a) First-strand Add reverse transcriptase and substrates 
cDNA synthesis dATP, dTTP, dGTP, dCTP 
Heteroduplex y 


mRNA 5' AAAAAA 3' 
cDNA 3 5 
Add RNase H, DNA polymerase, and dATP, 

dTTP, dGTP, dCTP; mRNA degraded by RNase H 
(b) y 


5' ] AA 3' 
3! TITT 5' 


DNA polymerase copies first-strand 
cDNA using RNA segments as primer 


DNA 
(c) polymerase \ y 
5' > > AARAAR 3' 
3' 5' 
(d) DNA fragments joined by DNA ligase 
cDNA duplex y 
cDNA Žž 5 AAAARA 3' 
cDNA 3 MEE 5 
(e) EcoRI linkers, 
T4 DNA ligase 
A 
RATITCGGCACGAGI 
GCCETECTCI 


EcoRI-ended cDNA duplexes for cloning 


Figure 12.11 Reverse transcriptase-driven synthesis of cDNA from oligo(dT) primers annealed 

to the poly(A) tails of purified eukaryotic mRNA. (a) Oligo(dT) chains serve as primers for synthesis 
of a DNA copy of the mRNA by reverse transcriptase. Following completion of first-strand cDNA 
synthesis by reverse transcriptase, RNase H and DNA polymerase are added (b). RNase H specifically 
digests RNA strands in DNA;RNA hybrid duplexes. DNA polymerase copies the first-strand cDNA, 
using as primers the residual RNA segments after RNase H has created nicks and gaps (c). DNA poly- 
merase has a 5'—3' exonuclease activity that removes the residual RNA as it fills in with DNA. The 
nicks remaining in the second-strand DNA are sealed by DNA ligase (d), yielding duplex cDNA. 
EcoRI adapters with 5'-overhangs are then ligated onto the cDNA duplexes (e) using phage 

T4 DNA ligase to create EcoRI-ended cDNA for insertion into a cloning vector. 


cDNA libraries, from normal and abnormal cells in conjunction with two-dimensional gel 
electrophoretic analysis (see A Deeper Look: Techniques Used in Protein Purification in 
Chapter 5) of the proteins produced is a useful strategy in clinical medicine to understand 
disease mechanisms. 


Expressed Sequence Tags When a cDNA library is prepared from the mRNAs syn- 
thesized in a particular cell type under certain conditions, these cDNAs represent 
the nucleotide sequences (genes) that have been expressed in this cell type under these 
conditions. Expressed sequence tags (ESTs) are relatively short (~200 nucleotides or 
so) sequences obtained by determining a portion of the nucleotide sequence for each 
insert in randomly selected cDNAs. An EST represents part of a gene that is being 
expressed. Probes derived from ESTs can be labeled and used in hybridization 
experiments to identify which genes in a genomic library are being expressed in the 
cell. For example, labeled ESTs can be hybridized to a gene chip (see following 
discussion). 
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12.2f DNA Microarrays (Gene Chips) Are Arrays of Different 
Oligonucleotides Immobilized on a Chip 


Robotic methods can be used to synthesize combinatorial libraries of DNA 
oligonucleotides directly on a solid support, such that the completed library is a two- 
dimensional array of different oligonucleotides (as discussed in the Critical Develop- 
ments in Biochemistry box on combinatorial libraries earlier in this section). Synthesis is 
performed by phosphoramidite chemistry (Figure 11.31) adapted into a photochemical 
process that can be controlled by light. Computer-controlled masking of the light allows 
the photochemical reaction to take place at some spots in the two-dimensional array of 
growing oligonucleotides and not at others, so each spot on the array is a population 
of identical oligonucleotides of unique sequence. The final products of such proce- 
dures are referred to as gene chips because the oligonucleotide sequences synthesized 
upon the chip represent the sequences of chosen genes. Typically, the oligonucleotides 
are up to 25 nucleotides long (there are more than 10! possible sequence arrangements 
for 25-mers made from four bases), and as many as 1.3 million different oligonucle- 
otides can be arrayed on a chip 1 cm square. The oligonucleotides on such gene chips 
are used as the probes in a hybridization experiment to reveal gene expression patterns. 
Figure 12.12 shows one design for gene chip analysis of gene expression. 
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Figure 12.12 Gene chips (DNA microarrays) in the analysis of gene expression. Here 


a small glass surface (chip), employing phosphoramidite chemistry (Figure 11.31), 
using laserjet printing to deliver microdrops of deoxynucleotides with photo-labile or 


is one of many analytical possibilities based on DNA microarray technology: 
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electrochemically sensitive blocking groups that can be removed after each cycle of 
nucleotide addition. Alternatively, gene segments (for example, ESTs) are isolated and 
amplified by PCR (see Figure 12.9), and the PCR products are robotically printed 


onto coated glass microscope slides to create a gene chip. The gene chip usually is con- 
sidered the probe in a target: probe screening experiment. (b) Target preparation: Total 
RNA from two sets of cell treatments (control and test treatment) are isolated, and 
cDNA is produced from the two batches of RNA via reverse transcriptase. During 


cDNA production, the control is labeled with a specific fluorescent marker (green, for 00 © © 00 © 
example) and the test treatment is labeled with a different fluorescent marker (red, for eos er es P 
example), so the wavelength of fluorescence allows discrimination between the two s ere weer : 


different sets of cDNAs. (c) The two batches of labeled cDNA are pooled and hybrid- 
ized to the gene chip. (d) Laser excitation of the hybridized gene chip with light of 
appropriate wavelength allows collection of data indicating the intensities of fluores- 
cence, and hence the degree of hybridization of the two different probes with the gene 
chips. Because the location of genes on the gene chip is known, which genes are 
expressed (or not) and the degree to which they are expressed is revealed by the fluo- 
rescent patterns. (Adapted from Figure 1 in Duggan, D. J., et al., 1999. Expression 
profiling using cDNA microarrays. Nature Genetics 21 supplement:10-14.) 


Computer analysis 
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Figure 12.13 Expression vectors carrying the 
promoter recognized by the RNA polymerase of 
bacteriophage SP6 are useful for the production of 
multiple RNA copies of any DNA inserted at the 
polylinker. Before transcription is initiated, the 
circular expression vector is linearized by a single 
cleavage at or near the end of the insert so that 
transcription terminates at a fixed point. 


12.3 | Can the Genes in Libraries Be Transcribed 
and Translated? 


Libraries of cloned genes are excellent resources for the investigation of gene regula- 
tion and analysis of the proteins encoded by the genes. To carry out such studies, these 
genes must be expressed via the processes of transcription and translation. 


12.3a Expression Vectors Are Engineered So That the RNA or Protein 
Products of Cloned Genes Can Be Synthesized 


Expression vectors are engineered so that any cloned insert can be transcribed into 
RNA, and, in many instances, even translated into protein. cDNA expression librar- 
ies can be constructed in specially designed vectors. Proteins encoded by the various 
cDNA clones within such expression libraries can be synthesized in the host cells, and 
if suitable assays are available to identify a particular protein, its corresponding cDNA 
clone can be identified and isolated. Expression vectors designed for RNA expression 
or protein expression, or both, are available. 


RNA Expression A vector for in vitro expression of DNA inserts as RNA transcripts 
can be constructed by putting a highly efficient promoter adjacent to a versatile 
cloningsite. Figure 12.13 depicts a typical expression vector. Linearized recombinant 
vector DNA is transcribed in vitro using SP6 RNA polymerase. Large amounts of 
RNA product can be obtained in this manner; if radioactive or fluorescent-labeled 
ribonucleotides are used as substrates, labeled RNA molecules useful as probes 
are made. 


Measuring the Abundance of an mRNA by RT-qPCR Real-time quantitative polymerase 
chain reaction (RT-qPCR) is a technique for directly determining mRNA expression 
without cloning its gene.! RT-qPCR measures the abundance of a particular mRNA 
within the total RNA isolated from a particular cell type. Reverse transcriptase is used 
to synthesize cDNA copies of the mRNAs within the sample, and then specific primers 
are used to amplify the amount of the target mRNA by PCR, using DNA polymerase. 
The amount of dsDNA that accumulates during each PCR cycle is determined as it is 
formed (that is, in real time) by fluorescence, either through intercalation of fluorescent 
dyes (see Figure 11.15) or release of fluorescent tags from mRNA-specific probes as 
DNA polymerase copies the templates. Plots of the fluorescence intensity versus PCR 
cycle provide information about the amount of the target mRNA in the original total 
RNA sample (Figure 12.14). 


Protein Expression Because cDNAs are DNA copies of mRNAs, cDNAs are uninter- 
rupted copies of the exons of expressed genes. Because cDNAs lack introns, it is 
feasible to express these cDNA versions of eukaryotic genes in prokaryotic hosts that 
cannot process the complex primary transcripts of eukaryotic genes. To express a 
eukaryotic protein in E. coli, the eukaryotic cDNA must be cloned into an expression 
vector that contains regulatory signals for both transcription and translation. 
Accordingly, a promoter where RNA polymerase initiates transcription as well as a 
ribosome-binding site to facilitate translation are engineered into the vector just 
upstream from the restriction site for inserting foreign DNA. The AUG initiation 
codon that specifies the first amino acid in the protein (the translation start site) is 
contributed by the insert (Figure 12.15). 

Some expression vectors have strong promoters that drive the synthesis of foreign 
proteins to levels equal to 30% or more of total E. coli cellular protein. An example 
is the hybrid promoter, Pac, which was created by fusing part of the promoter for the 
E. coli genes encoding the enzymes of lactose metabolism (the /ac promoter) with 


'RT is a bit ambiguous in reference to PCR. It can connote “reverse transcriptase” or “real-time,” 
depending on context. 
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Relative fluorescence 
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Figure 12.14 RT-qPCR amplification plots of target mRNA, as relative fluorescence versus number of 
PCR cycles. The different curves equate to differing mRNA abundance in the sample, with the number 
of mRNA copies (103-10’) indicated. (Adapted from Figure 1 in Gibson, U. E., Heid, C. A., and 

Williams, P. M., 1996. A novel method for real time quantitative PCR. Genome Research 6:995-1001.) 
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Figure 12.15 A typical expression-cloning vector. (Adapted from Fig. 19-19, in Watson, J. D., et al., (3) Recovery of eukaryotic 
Molecular Biology of the Gene 4/e. Menlo Park, Calif.: Benjamin/Cummings, 1987.) protein product 


part of the promoter for the genes encoding the enzymes of tryptophan biosynthesis 

(the trp promoter) (Figure 12.16). In cells carrying Piac expression vectors, the Prac Polylinker 

promoter is not induced to drive transcription of the foreign insert until the cells are cloning site N 

exposed to inducers that lead to its activation. Analogs of lactose (a B-galactoside) y 

such as isopropyl-B-thiogalactoside, or IPTG, are excellent inducers of Piac. Thus, 

expression of the foreign protein is easily controlled. “copy 
A widely used protein expression system is based on the pET plasmid. Transcription 

of the cloned gene insert is under the control of the bacteriophage T7 RNA polymerase 

promoter in pET. This promoter is not recognized by the E. coli RNA polymerase, so 

transcription can only occur if the T7 RNA polymerase is present in host cells. Host pst! 

E. coli cells are engineered so that the T7 RNA polymerase gene is inserted in the 

host chromosome under the control of the /ac promoter. IPTG induction triggers T7 

RNA polymerase production and subsequent transcription and translation of the pET 30 

insert. The bacteriophage T7 RNA polymerase is so active that most of the host cell’s 

resources are directed into protein expression and levels of expressed protein approach Figure 12:16 A pa. proteinexpression vector 

50% of total cellular protein. contains the hybrid promoter Piac derived from 
The bacterial production of valuable eukaryotic proteins represents one of the most fusion of the /ac and trp promoters. Isopropyl-B- 

important uses of recombinant DNA technology. For example, human insulin for the D-thiogalactoside, or IPTG, induces expression 

clinical treatment of diabetes is now produced in bacteria. from Prac- 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


410 


Figure 12.17 A typical expression vector for the 
synthesis of a hybrid protein. The cloning site is 
located at the end of the coding region for the 
protein B-galactosidase. Insertion of foreign DNAs 
at this site fuses the foreign sequence to the 
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Analogous systems for expression of foreign genes in eukaryotic cells include vec- 
tors carrying promoter elements derived from mammalian viruses, such as simian virus 
40 (SV40), the Epstein—Barr virus, and the human cytomegalovirus (CMV). A system 
for high-level expression of foreign genes uses insect cells infected with the baculovirus 
expression vector. Baculoviruses infect lepidopteran insects (butterflies and moths). In 
engineered baculovirus vectors, the foreign gene is cloned downstream of the promoter 
for polyhedrin, a major viral-encoded structural protein, and the recombinant vector is 
incorporated into insect cells grown in culture. Expression from the polyhedrin promoter 
can lead to the accumulation of the foreign gene product to levels as high as 500 mg/L. 

Technologies for the expression of recombinant proteins in mammalian cell cultures 
are also commercially available. These technologies have the advantage that the unique 
post-translational modifications of proteins (such as glycosylation; see Chapters 5, 7, and 
31) seen in mammalian cells take place in vivo so that the expressed protein is produced in 
its naturally occurring form. 


Fusion Protein Expression Some expression vectors carry cDNA inserts cloned 
directly into the coding sequence of a vector-borne protein-coding gene (Figure 
12.17). Translation of the recombinant sequence leads to synthesis of a hybrid protein 
or fusion protein. The N-terminal region of the fused protein represents amino acid 
sequences encoded in the vector, whereas the remainder of the protein is encoded by 
the foreign insert. Keep in mind that the triplet codon sequence within the cloned 
insert must be in the same reading frame with codons contributed by the vector 
sequences to make the right protein. Furthermore, adding an N-terminal signal 
sequence that targets the hybrid protein for secretion from the cell simplifies recovery 
of the fusion protein. A variety of gene fusion systems have been developed to facili- 
tate isolation of a specific protein encoded by a cloned insert. The isolation proce- 
dures are based on affinity chromatography purification of the fusion protein 
through exploitation of the unique ligand-binding properties of the vector-encoded 
protein (Table 12.1). 


Screening cDNA Expression Libraries with Antibodies Antibodies that specifically 
cross-react with a particular protein of interest are often available. If so, these anti- 
bodies can be used to screen a cDNA expression library to identify and isolate cDNA 
clones encoding the protein. The cDNA library is introduced into host bacteria, which 
are plated out and grown overnight, as in the colony hybridization scheme previously 
described. DNA-binding nylon membranes are placed on the plates to obtain a replica 
of the bacterial colonies. The nylon membrane is then incubated under conditions that 
induce protein synthesis from the cloned cDNA inserts, and the cells are treated to 
release the synthesized protein. The synthesized protein binds tightly to the nylon 
membrane, which can then be incubated with the specific antibody. Binding of the 


Cloning 
site 


pUR278 
5.2 kbp 


B-galactosidase coding region (the lacZ gene). IPTG Prac 

induces the transcription of the /acZ gene from its Codon: Cys Gln Lys Gly Asp Pro Ser Thr Leu Glu Ser Leu Ser Met 
promoter Piac, Causing expression of the fusion Cloning site: TGT CAA AAA GGG GAT CCG TCG ACT CTA GAA AGC TTA TCG ATG 
protein. (Adapted from Figure 2, Riither, U., and l l l Í I j 
Müller-Hill, B., 1983. EMBO Journal 2:1791-1794.) BamHI Sall Xbal HindIII Clal 
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| Table 12.1 | Gene Fusion Systems for Isolation of Cloned Fusion Proteins 


Fusion Protein Secreted?* Affinity Ligand 

B-Galactosidase No a-Aminophenyl-6-D-thiogalactoside 
(APTG) 

Protein A Yes Immunoglobulin G (IgG) 

FLAG peptide (DYKDDDDK) Yes FLAG-specific monoclonal antibody 

Streptavidin Yes Biotin 

Glutathione-S-transferase (GST) No Glutathione 

Maltose-binding protein (MBP) Yes Starch 

Hexahistidine tag No Nickel or cobalt 

Hemagglutinin (HA) peptide No HA-peptide antibody 


n 
*This indicates whether combined secretion-fusion gene systems have led to secretion of the protein product from the 
cells, which simplifies its isolation and purification. 


antibody to its target protein product reveals the position of any cDNA clones 
expressing the protein, and these clones can be recovered from the original plate. Like 
other libraries, expression libraries can be screened with oligonucleotide probes, too. 


12.3b Reporter Gene Constructs Are Chimeric DNA Molecules 
Composed of Gene Regulatory Sequences Positioned Next 
to an Easily Expressible Gene Product 


Potential regulatory regions of genes (such as promoters) can be investigated by plac- 
ing these regulatory sequences into plasmids upstream of a gene, called a reporter 
gene, whose expression is easy to measure. Such chimeric plasmids are then introduced 
into cells of choice (including eukaryotic cells) to assess the potential function of the 
regulatory sequences because expression of the reporter gene serves as a report on the 
effectiveness of the regulatory element. A number of different genes have been used as 
reporter genes. A reporter gene with many inherent advantages is that encoding the green 
fluorescent protein (or GFP), described in Chapter 4. Unlike the protein expressed by 
other reporter gene systems, GFP does not require any substrate to measure its activity, 
nor is it dependent on any cofactor or prosthetic group. Detection of GFP requires only 
irradiation with near-UV or blue light (400 nm light is optimal), and the green fluores- 
cence (light of 500 nm) that results is easily observed with the naked eye, although it can 
also be measured precisely with a fluorometer. Figure 12.18 demonstrates the use of the 
genes encoding GFP and mCherry, a similar fluorescent protein, as reporter genes. 


Courtesy of Sarah Kucenas 
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Figure 12.18 Green fluorescent protein (GFP) and 
Cherry fluorescent protein (mCherry) as reporter 
genes. In this experiment, GFP expression is under 
the control of the promoter for the Danio rerio 
(zebrafish) hb9 gene. Hb9 encodes a homeobox pro- 
tein (Hb9) expressed specifically in motor neurons. 
mCherry expression depends on the promoter for the 
zebrafish nkx2.2a gene. Nx2.2a is another homeobox 
gene (encoding Nkx2.2a protein) expressed specifi- 
cally in glia. Glia are non-neuronal cells that support 
and maintain neurons. (Bottom) Green fluorescence 
labels motor neuron cell bodies and projections, spe- 
cifically in motor nerves. mCherry expression labels 
glia in both the brain and spinal cord. mCherry is a 
red fluorescent protein encoded by a gene found in 
Discosoma species, which are soft corals, not jelly- 
fish. (Image courtesy of Yunlu Zhu and Sarah 
Kucenas, the University of Virginia.) 
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12.3c Specific Protein-Protein Interactions Can Be Identified 
Using the Yeast Two-Hybrid System 


Specific interactions between proteins (protein-protein interactions) lie at the heart of 
many essential biological processes. One method to identify specific protein—protein 
interactions in vivo is through the expression of a reporter gene whose transcription 
is dependent on a functional transcriptional activator, the GAL4 protein. The GAL4 
protein consists of two domains: a DNA-binding (or DB) domain and a transcriptional 
activation (or TA) domain. Even if expressed as separate proteins, these two domains 
will still work, provided they can be brought together. The method depends on two sepa- 
rate plasmids encoding two hybrid proteins, one consisting of the GAL4 DB domain 
fused to protein X and the other consisting of the GAL4 TA domain fused to protein 
Y (Figure 12.19). If proteins X and Y interact in a specific protein-protein interaction, 
the GAL4 DB and TA domains are brought together so that transcription of a reporter 
gene driven by the GAL4 promoter can take place (Figure 12.19). Protein X, fused to 
the GAL4-DNA-binding domain (DB), serves as the bait to fish for the protein Y target 
and its fused GAL4 TA domain. This method can be used to screen cells for protein tar- 
gets that interact specifically with a particular bait protein. To do so, cDNAs encoding 
proteins from the cells of interest are inserted into the TA-containing plasmid to create 
fusions of the cDNA coding sequences with the GAL4 TA domain coding sequences, 
so a fusion protein library is expressed. Identification of a target of the bait protein 
by this method also directly yields a cDNA version of the gene encoding the target 
protein. Two-hybrid systems to identify protein-protein interactions in bacteria and in 
mammalian cells are available. 


Identifying Protein-Protein Interactions Through Immunoprecipitation If antibodies 
against one protein of a multiprotein complex are available, the entire complex can 
be immunoprecipitated and its composition analyzed. Attachment of such antibodies 
to glass or agarose beads, which easily sediment in a centrifuge, makes recovery of the 
complex very simple. Because antibodies against it are commercially available, the 
hemagglutinin (HA) peptide sequence YPYDVPDYA is a useful protein fusion tag, 
not only for fusion protein purification (Table 12.1), but also for analysis of protein— 
protein interactions. Expressing an HA-tagged protein in vivo, followed by 
immunoprecipitation, allows the isolation of protein complexes of which the HA- 
tagged protein is a member. The other members of the complex can then be identified 
to establish the various interacting partners within the multiprotein complex. 


ChIP-Seq—lsolating Specific DNA Sequences Through Their Interactions with Proteins 
ChIP-Seq is an acronym for chromatin immunoprecipitation-DNA sequencing, 
a technique in which an antibody against a specific DNA-binding protein is used to 
immunoprecipitate complexes from fragmented chromatin preparations that contain 
the specific protein and any DNA sequence associated with it. Recovery of the DNA 
from these immunoprecipitated complexes followed by massively parallel DNA 
sequencing (see Section 11.1) reveals a collection of target DNA sites for the protein of 


Figure 12.19 The yeast two-hybrid system for identifying protein-protein interactions. If proteins X and 
Y interact, the lacZ reporter gene is expressed. Cells expressing lacZ exhibit B-galactosidase activity. 
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interest. Thus, the sequences bound by any DNA-binding protein can be precisely 
identified and mapped globally according to their location within the genome. The 
results of these studies can be used to determine how specific DNA-binding proteins 
affect the phenotype of the organism. 


12.3d In Vitro Mutagenesis—Introducing Changes into Expressed Proteins 


The advent of recombinant DNA technology has made it possible to clone genes, 
manipulate them in vitro, and express them in a variety of cell types under various con- 
ditions. The function of any protein is ultimately dependent on its amino acid sequence, 
which in turn can be traced to the nucleotide sequence of its gene. The introduction of 
purposeful changes in the nucleotide sequence of a cloned gene represents an ideal way 
to make specific structural changes in a protein. The effects of these changes on the 
protein’s function can then be studied. Such changes constitute mutations introduced 
in vitro into the gene. In vitro mutagenesis makes it possible to alter the nucleotide 
sequence of a cloned gene systematically, as opposed to the chance occurrence of muta- 
tions in natural genes. 

One technique for in vitro mutagenesis is PCR-based mutagenesis. Mutant primers 
are added to a PCR reaction in which the gene (or segment of a gene) is undergoing 
amplification. The mutant primers are primers whose sequence has been specifically 
altered to introduce a directed change at a particular place in the nucleotide sequence 
of the gene being amplified (Figure 12.20). Mutant versions of the gene can then be 
cloned and expressed to determine any effects of the mutation on the function of the 
gene product. Alternatively, precise manipulation and revision of any gene’s nucleo- 
tide sequence can be achieved using the CRISPR/Cas9 editing system described in 
Section 12.5. 


12.4 | How Is RNA Interference Used to Reveal 
| the Function of Genes? 


RNA interference (RNAi) has emerged as a useful method in eukaryotic gene inac- 
tivation. RNAi leads to targeted destruction of a selected gene’s RNA transcript. 
The consequences that follow the loss of gene function reveal the role of the gene 
product in cell metabolism. Inactivation of gene expression by RNAi is sometimes 
referred to as gene knockdown. Gene knockdown is a term that contrasts the method 
with gene knockout, a procedure that inactivates a gene by disrupting its nucleotide 
sequence. 

Procedures for silencing gene expression via RNAi depend on the introduction of 
double-stranded RNA (dsRNA) molecules into target cells by transfection, viral infec- 
tion, or other means. One strand of the dsRNA is designed to be an antisense RNA, 
in that its nucleotide sequence is complementary to the RNA transcript of the gene 
selected for silencing. An ATP-dependent endogenous nucleoprotein complex known 
as Dicer processes the dsRNA. Dicer is an RNase III family member that catalyzes 
endonucleolytic cleavage of both strands of dsRNA molecules to produce a double- 
stranded small interfering RNA (siRNA) 21 to 23 nucleotides long with 2-nucleotide-long 
3'-overhangs on each strand (Figure 12.21). The siRNA is then passed to another pro- 
tein complex known as RNA-induced silencing complex (RISC). In an ATP-dependent 
process, RISC unwinds the double-stranded siR NA and selects the antisense strand, 
which is referred to as the guide strand. The other strand, referred to as the passenger 
strand, is discarded. RISC pairs the single-stranded guide strand with a complemen- 
tary region on the targeted gene transcript. RISC then carries out its slicer function 
by cleaving the RNA transcript between nucleotides 10 and 11 of the mRNA region 
that is base-paired with the guide strand. Such cleavage prevents expression of the 
product encoded by the mRNA. The guide strand remains associated with RISC, and 
RISC can use it for multiple cycles of mRNA cleavage and post-transcriptional gene 
silencing. 
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Figure 12.20 One method of PCR-based site- 
directed mutagenesis. (1) Template DNA strands are 
separated and amplified by PCR using mutagenic 
primers (represented as bent arrows) whose 
sequences introduce a single base substitution at site 
X (and its complementary base X’; thus creating the 
desired amino acid change in the protein encoded by 
the gene). Ideally, the mutagenic primers also 
introduce a unique restriction site into the plasmid 
that was not present before. (2) Following many 
cycles of PCR, the DNA product can be used to 
transform E. coli cells. (3) The plasmid DNA can be 
isolated and screened for the presence of the 
mutation by screening for the presence of the unique 
restriction site by restriction endonuclease cleavage. 
For example, the nucleotide sequence GGATCT 
within a gene codes for amino acid residues Gly-Ser. 
Using mutagenic primers of nucleotide sequence 
AGATCT (and its complement AGATCT) changes 
the amino acid sequence from Gly-Ser to Arg-Ser 
and creates a Bg/II restriction site (see Table 10.2). 
Gene expression of the isolated mutant plasmid in 
E. coli allows recovery and analysis of the mutant 
protein. 
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Figure 12.21 Gene knockdown by RNAi. Viral 

The dsRNA is processed by Dicer, which cleaves Artificial infection 

both strands of the dsRNA to form an siRNA, expression Transfection 
a ~20-nucleotide dsRNA with 3’-overhangs. A 

helicase activity associated with Dicer unwinds the 
siRNA, and the guide strand is delivered to the 
RISC protein complex. An Argonaute protein fam- 
ily member (Ago) is the catalytic subunit of RISC. 
Ago has a dsRNA-binding domain that brings 
together the guide strand and a complementary 
nucleotide sequence on the targeted gene transcript. 
Ago also has a RNase H-type catalytic domain that 
cleaves the gene transcript, rendering it incapable of 
translation by ribosomes. This RNase H activity of 
Ago is whimsically referred to as the slicer function 
in RNAi. 


7mG 


Transcript 


A clever innovation in RNAi is to introduce the silencing RNA in the form of 
a short hairpin, or shRNA. A 47-bp DNA sequence is created in which the first 
19 nucleotides are complementary to the target mRNA. Within the larger shRNA, 
this 19 base segment is separated by a short, nine nucleotide spacer from the reverse 
complement of the same 19-bp sequence, so that the DNA encodes a transcript that 
will fold back on itself to form a 19-bp double-stranded stem-loop RNA structure. 
DNA encoding such 47-bp sequences can be introduced into plasmids and expressed 
in target cells. Or, lipid nanoparticles, like the ones used in the mRNA viruses 
developed to innoculate against SARS-CoV-2, can deliver shRNAs to recipient cells. 
Such shRNA transcripts are recognized by Dicer and processed to generate a gene- 


specific siRNA. 


12.5 | Gene Therapy: Is It Possible for Scientists to Alter 


Guide strand 


Guide strand: 
transcript duplex 


or Rewrite the Genome of an Organism? 


Recombinant DNA technology is a powerful tool for the genetic modification of 
organisms. The strategies and methodologies described in this chapter are only an 
overview of the repertoire of experimental approaches that have been devised by 
molecular biologists to manipulate DNA and the information encoded in it. The enor- 
mous success of recombinant DNA technology means that the molecular biologist’s 
task in searching genomes for genes is now akin to that of a lexicographer compiling 
a dictionary, a dictionary in which the letters (the nucleotide sequences) spell out not 
words but rather genes and what they mean. Molecular biologists have no index or 
alphabetic arrangement to serve as a guide through the vast volume of information 
in a genome; nevertheless, this information and its organization are rapidly being dis- 
closed by the imaginative efforts and diligence of these scientists and their growing 


arsenal of analytical schemes. 
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Using recombinant DNA technology, it is now possible to engineer at will the hered- 
ity (or genetic makeup) of organisms for desired ends. The commercial production of 
therapeutic biomolecules in microbial cultures is already established (for example, the 
production of human insulin in quantity in E. coli cells). Agricultural crops with desired 
attributes, such as enhanced resistance to herbicides or elevated vitamin levels, are in 
cultivation. Transgenic mice are widely used as experimental animals to investigate 
models of human disease and physiology. Already, transgenic versions of domestic 
animals such as pigs, sheep, and even fish have been developed for human benefit. 
Perhaps most important, in an increasing number of instances, gene replacement 
therapy (or, more simply, gene therapy) and gene editing are being used to correct 
certain human genetic disorders. 


12.5a Human Gene Therapy Can Repair Genetic Deficiencies 


Human gene therapy seeks to repair the damage caused by a genetic deficiency 
through the introduction of a functional version of the defective gene. To achieve this 
end, a cloned variant of the gene must be incorporated into the organism in such a 
manner that it is expressed only at the proper time and only in appropriate cell types. 
Alternatively, the nucleotide sequence of an inactive or dysfunctional resident gene 
is edited to change it into one with normal function. Although such requirements 
impose serious technical and clinical challenges, successful human gene therapy is 
now a reality. Hundreds of gene therapies have received approval for use in human 
patients, including the introduction of gene constructs into patients. Among these are 
constructs designed to cure ADA- SCID (severe combined immunodeficiency due to 
adenosine deaminase [ADA] deficiency), neuroblastoma, cystic fibrosis, or to treat 
cancer through the expression of the E1A and p53 tumor suppressor genes. Insertion 
of the CD19 surface antigen gene into a patient’s T lymphocytes has been used to 
create the Kymriah therapy for treatment of lymphoma and leukemia. Figure 12.22 
shows some of the possible routes for introduction of functional genes into human 
cells. A reliable methodology to deliver a functional gene into any desired target cell 
type is not yet available. 
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Figure 12.22 Various methods to carry out gene therapy on human cells. Viruses gain entry by inter- 
action with membrane receptors. Liposomes are spherical vesicles containing therapeutic genes. Their 
membranes are composed of phospholipid bilayers designed to fuse with the plasma membrane of 
target cells. Cationic polymers are a non-viral way to deliver DNA to cells. Cationic polymers form 
nanoscale complexes with nucleic acids and deliver them across cell membranes. They act by neutral- 
izing the negative charge on DNA or RNA. (Adapted from Figure 1 in Goswami, R., et al., 2019. 
Gene therapy leaves a vicious cycle. Frontiers in Oncology 9:297, DOI:10.3389/fonc.2019.00297.) 
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12.5b Viruses as Vectors in Human Gene Therapy 


A basic strategy in human gene therapy involves the incorporation of a functional gene 
into target cells in one of two ways. The gene is typically in the form of an expres- 
sion cassette consisting of a cDNA version of the gene downstream from a promoter 
that will drive expression of the gene. One way to incorporate the cassette, the ex vivo 
route, is to introduce a vector carrying the expression cassette into cells that have been 
isolated from a patient and cultured in the laboratory. The modified cells are then rein- 
troduced into the patient. The other way involves the direct incorporation of the gene 
by treating the patient with a viral vector carrying the expression cassette. Retroviruses 
are commonly used in gene therapy. Retroviruses are RNA viruses that replicate their 
RNA genome by first making a DNA intermediate. Because retroviruses can transfer 
their genetic information directly into the genome of host cells, they can be used for 
permanent modification of host cells ex vivo. 

In 2000, scientists at the Pasteur Institute in Paris used a retrovirus-based ex vivo 
approach to successfully treat infants with X-linked severe combined immunodeficiency 
(X-linked SCID). X-linked SCID occurs almost exclusively in males. Boys with X-linked 
SCID lack an effective yc cytokine receptor. This receptor is necessary to produce 
functional lymphocytes, key cells in the immune response. Without treatment, males 
with X-linked SCID seldom survive beyond infancy. Gene therapy was used to deliver a 
functional yc cytokine receptor subunit gene to stem cells harvested from the infants. 
Transformed stem cells were reintroduced into the patients, who were then able to 
produce functional lymphocytes and lead normal lives. This achievement represented 
the first successful outcome in human gene therapy. Unfortunately, several patients 
developed leukemia, apparently due to the chance insertion of the gene into the genome 
at places that altered the expression of nearby cancer-causing genes. Alternative, safer 
strategies for X-linked SCID are in clinical trials. 

Successful human gene therapy has also been achieved using the ex vivo approach 
with recombinant lentiviral vectors (lentiviruses are also retroviruses and include HIV, 
the virus that causes AIDS). Two successes include: (1) the treatment of X-linked adre- 
noleukodystrophy (ALD, a fatal central nervous system disease) by the infection of 
hematopoietic stem cells (HSCs) with an HIV-1-derived lentiviral vector carrying a 
functional ALD gene; and (2) the treatment of B-thalassemia patients via the ex vivo 
transduction of their HSCs with a recombinant lentiviral vector carrying the gene for 
B-globin. (G-Thalassemics cannot produce functional B-globin chains, lack an effective 
population of red blood cells, and suffer life-threatening anemia.) 

Adenovirus vectors, which can carry expression cassettes over 30 kbp in size, are an 
in vivo approach to human gene therapy. Adenoviruses are DNA viruses. Deletion of 
most of the 36-kbp adenovirus genome renders the adenovirus incapable of replication, 
and thus, viral infection. The recombinant adenoviruses enter the patient’s cells via spe- 
cific receptors on the target cell surface (Figure 12.23). 

The transferred genetic information is expressed directly from the adenovirus recom- 
binant DNA and is never incorporated into the host cell genome. An adenovirus vector 
formed the basis of the first commercially approved human gene therapy: Glybera. 
Direct intramuscular injection of the Glybera recombinant adenovirus was used to 
introduce a functional lipoprotein lipase gene into patients that lack one (such patients 
suffer from dramatically elevated levels of serum triglycerides, which can cause fatal 
pancreatitis). Adenovirus-based gene therapies have been developed for a variety of 
specific cancer types. Dozens of viral-based human gene therapy drugs are currently 
available in the global marketplace. 


12.5c Genome Engineering 


Instead of introducing foreign genes from other organisms into host cells to achieve 
some purpose, another approach is to directly alter an organism’s genome, that is, 
change its nucleotide sequence at one or more specific sites. At one end of the scale, 
even a single nucleotide substitution could be made, for example, to reverse the muta- 
tion that causes sickle-cell anemia. Sickle-cell anemia is due to a single amino acid 
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change in the primary structure of the B-globin subunit of hemoglobin. This mutation 
occurs when a T is substituted for A as the middle base in the B-globin gene’s sixth 
codon. This change transforms the sixth codon nucleotide sequence from GAG, which 
codes for glutamate, to GTG, which codes for valine. Changing that T back to A would 
restore full functionality to the B-globin gene. The ability to do such genome editing 
has the potential to correct deleterious mutations, which would be of great benefit in 
human gene therapy. CRISPR-Cas9 is a system that can be used to carry out such 
genome editing (see below). 

At the other end of the genome engineering scale is the creation of entirely new 
genomes—the design and production of synthetic genomes that would direct novel 
organisms endowed with attributes desired by the creator. Although the complete 
Mycobacterium mycoides genome was chemically synthesized in 2010 (see the Critical 
Developments in Biochemistry box Synthetic Life in Chapter | and Figure 12.30), com- 
pletely synthetic genomes that code for new life forms are confined to the future. 


12.5d Genome Editing with CRISPR-Cas9 


The challenge in genome editing is to make a direct change in the nucleotide sequence 
at a specific site within a genome, which may consist of billions of base pairs. One 
way to do this is to take advantage of the information presented by the nucleotide 
sequence around the selected site to target an editing system to the site. Once targeting 
is achieved, editing entails the replacement of the undesired nucleotide (or sequence) 
with the desired one. 

Several targeting strategies have emerged, the most effective of which is CRISPR-Cas9, 
a simple two-component system. CRISPR-Cas9 can target and edit any nucleotide 
sequence in any genome. CRISPR is an acronym for clustered regularly interspaced 
short palindromic repeats. CRISPRs are a family of DNA repeats that are widespread 
among prokaryotic genomes. Cas9 (CRISPR-associated protein 9) is an RNA-guided 
endonuclease. That is, Cas9 is an RNA-binding enzyme that catalyzes the cleavage of 
dsDNA at a specific site determined by the nucleotide sequence of its bound RNA. 
The bound RNA acts as a guide; through complementary base-pairing, this guide 
RNA (gRNA, occasionally written as sgRNA for single guide RNA) can deliver the 
Cas9 endonuclease to any desired nucleotide sequence within a target DNA. 


Figure 12.23 Adenovirus-mediated delivery of ther- 
apeutic DNA. Infection of the target cell allows 
encapsulated therapeutic DNA to gain access to the 
interior through interactions with membrane protein 
receptors such as CAR and integrin. Once inside, 
the virus is released into the cytoplasm and enters 
the nucleus, where transcription of the gene, fol- 
lowed by translation of the mRNA leads to produc- 
tion of the therapeutic protein (which may or may 
not be delivered elsewhere through the blood.) 
(Adapted from Figure 2 in Goswami, R., et al., 
2019. Gene therapy leaves a vicious cycle. Frontiers in 
Oncology 9:297, DOI:10.3389/fonc.2019.00297.) 
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A Deeper Look 


The Biological Role of CRISPR-Cas Systems: 


Adaptive immunity in Bacteria 


The biological function of CRISPR-Cas systems is to serve as 
prokaryotic defense mechanisms against viruses and plasmids. 
Prior exposure of many bacterial or archaeal strains to foreign 
DNA endows these prokaryotes with the ability to fend off subse- 
quent encounters from such intruders. Since the response depends 
on an earlier challenge, it is termed adaptive, and since the prokary- 
ote is now immune to these agents, CRISPR/Cas systems provide 
adaptive immunity. How do they work? 

The cas genes lie adjacent to the CRISPR loci and encode the 
Cas proteins, which are essential to adaptive immunity. cas is an 
acronym for CRISPR-associated. Each CRISPR locus contains a 
series of short DNA repeats (20-50 bp long) that are separated from 
one another by similarly-sized unique sequences to give a repeat- 
spacer-repeat pattern of organization. The spacers are derived from 
viral and plasmid DNA sequences formed when prokaryotic cells 
use Cas endonucleases to chop the foreign DNA of these infectious 
agents into short segments. Other Cas proteins then mediate the 
incorporation of these short segments into the prokaryotic genome 
as the spacers in CRISPRs. 

Therefore, the spacer sequences are homologous to, and thus 
can base pair with, DNA sequences found in bacteriophages and 
plasmids which might infect the prokaryote. So, when the bacteria 
are exposed to a bacteriophage or virus they have previously seen, 
an adaptive immune response is triggered, such that the DNA of the 
invading pathogen is cleaved (and thus inactivated). For this to hap- 
pen, an RNA is transcribed from the CRISPR locus and processed 
to yield a crRNA containing a repeat element and a spacer region 
that is homologous to, and thus can target, the invading foreign 
DNA. A second RNA, called the trans-activating CRISPR RNA, 
or tracrRNA, is required to bind the Cas endonuclease and deliver 
it to the target. The tracrRNA has a region of homology with the 
repeat element within the crRNA, and base-pairing between these 
two RNAs positions the Cas endonuclease at the proper site on 
the target DNA. The Cas endonuclease cleaves both strands of the 
foreign DNA, introducing a double-stranded break (DSB), which 
aborts its ability to infect the bacterial cell (figure). See Section 28.8 
for a discussion of DSBs. 

There are three types of CRISPR/Cas systems. The one used 
for genome editing, CRISPR/Cas9, is a type II system. CRISPR/ 
Cas9 is a three-part system, consisting of a crRNA, a tracrRNA, 
and Cas9, the cas9 gene-encoded endonuclease that catalyzes DSB 


formation, as guided by crRNA base-pairing with foreign DNA. 
To simplify their use in genome editing, the crRNA and tracrRNA 
gene sequences have been fused to form a sgRNA (or single guide 
RNA) sequence that binds the Cas9 protein and positions it pre- 
cisely on the target DNA, as determined by base-pairing between 
the spacer region of the gR NA and the target DNA sequence, which 
is usually referred to as the protospacer sequence because of its 
homology to the spacer (figure). 


PAM 


Cas9 Protospacer 


CRISPR-Cas9 action depends on a short nucleotide sequence 
known as the protospacer adjacent motif (PAM) about two to six 
nucleotides downstream (reading 5’ to 3’) of the DNA sequence tar- 
geted by the gRNA. The Cas endonuclease functions cut the DNA 
three to four nucleotides upstream of the PAM. Because different 
bacteria have different PAM specificities, CRISPR-Cas9 systems 
can be designed to cut any desired target sequence. For example, the 
Streptococcus pyogenes Cas9 recognizes a 5'-NGG-3' PAM (where 
“N” can be any nucleotide base). The Staphylococcus aureus Cas9 
depends on an NGRRN PAM sequence (where R is any purine). 
Pairing the appropriate Cas9 with a gRNA directed at the desired 
DNA sequence allows virtually any DNA sequence to be edited. 
Cas9 will only act to cleave DNA when it finds the correct PAM 
sequence and the gRNA base-pairs with its target sequence. 

Although the site-specific introduction of a double-stranded 
break (DSB) in DNA would not seem to offer any particular ad- 
vantage in editing the DNA to change its nucleotide sequence to a 
more desirable one, it is actually very effective because cells are very 
adept at repairing DSBs. 


At the site determined by the gRNA, Cas9 endonuclease acts to cleave both strands 
of the dsDNA. Since a gRNA of any desired nucleotide sequence is easily made, any 
DNA sequence is a legitimate target for Cas9 endonucleolytic cleavage (see 
Figure 12.24). The gRNA positions Cas9 precisely on the target DNA, as determined 
by base-pairing between a 20-nucleotide region of the gRNA (Figure 12.24). The 
Cas9 HNH nuclease domain cleaves the DNA strand complementary to the gRNA; 
the Cas9 RuvC nuclease domain cleaves the DNA strand that is not complementary 
to the gRNA (the DNA strand whose sequence matches the sequence of the gR NA). 

There are two prominent DNA repair systems: nonhomologous end joining (NHEJ) 
and homology-directed repair (HDR) (Figure 12.24). In NHEJ, the two ends of a DSB 
are rejoined, and in the process, small insertions, deletions, and base substitutions occur. 
Assuming the somewhat random products of CRISPR-Cas9 cleavage followed by 
NHEJ can be screened for those repairs giving the hoped-for changes in nucleotide 
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Figure 12.24 CRISPR-Cas9 structural organization and catalytic function. Cas9 is shown in blue; the 
guide RNA in orange, with its 20-nucleotide DNA-targeting region in bold. Cas9 creates a double-stranded 
break (DSB) in the target DNA when its HNH endonuclease cleaves the DNA strand complementary to 
the gRNA, while its RuvC endonuclease cleaves the other DNA strand. The target DNA’s PAM region is 
highlighted in yellow. Mammalian cells repair the DSB by either of two cellular pathways — the error-prone 
nonhomologous end-joining (NHEJ) pathway or by homology-directed repair, using an exogenous DNA 
as template. The molecular graphic shown here is Streptococcus pyogenes CRISPR-Cas9 in complex with 
sgRNA and double-stranded DNA primed for target DNA cleavage (the protein is blue, the RNA is 
orange, and the DNA is green); PDB id = 5F9R. (Adapted from Jiang, F, and Doudna, J. A., 2017. 
CRISPR-Cas9 Structures and Mechanisms. Annual Review of Biophysics 46:505-529.) 


sequence, then selected genome editing is possible. A more certain way to edit a DNA 
sequence is to use HDR to repair the DSB. Here, short DNA templates that have the 
desired sequence can be introduced to direct the repair process. Since these short tem- 
plates, or donor DNAs, precisely replace the DNA around the DSB, the repaired DNA 
will have been edited to have the desired sequence, as determined by the sequence of the 
donor (Figure 12.24). That is, any DNA sequence can be edited using CRISPR-Cas9, 
just by expressing a gRNA having the same sequence and the proper PAM, then allow- 
ing endogenous systems to repair the DNA, as directed by donor DNA. 

Thus, genomes can be edited by introducing vectors bearing the CRISPR-Cas9 sys- 
tem into target cells and expressing the vector so that its transcription produces the 
desired gRNA, and transcription plus translation yields Cas9 endonuclease. The cells 
will naturally repair the gR NA-targeted, Cas9-catalyzed DSB, as directed by a donor 
DNA, which is also provided. Because it is so easy and straightforward to make a gRNA 
of any desired sequence, CRISPR-Cas9 has revolutionized genome editing. CRISPR/ 
Cas9 genome editing has been performed in dozens of species, including yeast, fruit 
flies, plants, fish, and mammals, including human cell lines. 
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Figure 12.25 CRISPR-Cas9 system engineering for 
base editing and prime editing. (a) Domain organiza- 
tion of the Cas9 protein. Cas9 protein consists of 
two prominent structural domains, the recognition 
(REC) lobe which is responsible for gRNA binding 
and the nuclease (NUC) lobe, the site of Cas9’s two 
endonuclease activities—the HNH nuclease that 
cleaves the DNA strand that is complementary to the 
gRNA and the RuvC nuclease that cuts the noncom- 
plementary DNA strand. The Rev lobe is also the 
location of the PAM-interacting (PI) domain of 
Cas9. The Rec and Nuc lobes are linked by the 
bridge helix (BH). Mutations can be introduced into 
Cas9 to inactivate its endonuclease functions or alter 
its PAM recognition specificity. (b) Engineering the 
Cas9 protein to create a base editor: Cas9 engineered 
to act as a base editor rather than an endonuclease. 
A cytidine deaminase (CBE) domain is fused to the 
N-terminus of Cas9 and its two endonuclease 
domains are inactivated by mutating them. These 
inactivated endonuclease functions are symbolized 
by superscript minus signs, as in HNH” and RuvC’. 
The N-terminal CBE fusion catalyzes base-editing of 
a specific C:G base pair to an T:A base pair, as 
selected by the appropriate gR NA. The gRNA is 
shown in green. Replacing the N-terminal cytidine 
deaminase fusion with a deoxyadenosine deaminase 
(ABE) fusion creates a Cas9 protein engineered to 
edit an A:T base by replacing it with a G:C base pair 
(not shown). (c) Engineering the Cas9 protein to cre- 
ate a prime editor. This Cas9 protein has been engi- 
neered to carry an N-terminal reverse transcriptase 
(RT) domain. The HNH endonuclease in this Cas9 
version has been inactivated by mutation. The RuvC 
endonuclease remains functional, so that it can cut 
the nontemplate DNA strand to create a nick. Its 
guide RNA (green, shown bound to Cas9 and in 
greater detail below) is called a prime-editing guide 
RNA (pegRNA) to denote its new functions. 
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If CRISPR edits are carried out in cells that ultimately could give rise to gametes 
(eggs and sperm) with edited genomes, the edited DNA and the trait it encodes will be 
inherited by future generations. Such experiments have been done in a wide variety of 
plants and animals, but ethical considerations discourage its application in human gam- 
etes. This ethical restraint was violated in 2018, when He Jiankui, an American-trained 
Chinese scientist introduced CRISPR-Cas9 into human eggs that had been fertilized 
with sperm from an HIV-positive father. The eggs were then implanted into the uterus 
of a woman who gave birth to two daughters. These fraternal twins were the first 
humans born with genomes edited by human germline gene therapy. Dr. He’s objective 
was to use CRISPR-Cas9 to inactivate the CCRS5 gene in the fertilized eggs that gave 
rise to these girls. CCR5 encodes a coreceptor for HIV (the virus that causes AIDS), and 
He’s intention was to create children free of the risk of contracting AIDS. Worldwide 
condemnation was swift and He was imprisoned in China for his actions. 


12.5e Base Editing with CRISPR-Cas9 


Even though DSBs are repairable by NHEJ or HDR, it would be preferable in many 
cases if Cas9 could change the nucleotide sequence directly instead of cleaving the 
DNA. That is, could Cas9 be engineered to become a base editor rather than an endo- 
nuclease? This goal has been achieved. Inactivation of both (HNH and RuvC) endo- 
nuclease by mutations introduced into Cas9, coupled with the fusion of a base editor 
domain to the N-terminus of Cas9 creates a base-editing Cas9 (Figure 12.25). Various 


Cytidine deaminase 
(CBE) domain 


DNA binding region 


— eee 


RT PBS 
template 


It consists of the target DNA-binding domain linked to an RT template region (RT template, blue) that 
encodes the editing changes desired in the gene, followed by a primer-binding sequence (PBS, orange). 
The PBS will base-pair with the nicked nontemplate strand of the target DNA and prime RT-catalyzed 
synthesis of new DNA. The nucleotide sequence in this new DNA is directed by the RT template of 
the pegRNA. (Adapted from Figures 3, 6, and 7 in Meaker, G. A., Hzir, E. J., and Gotochowski, T. E., 
2020. Advances in engineering CRISPR-Cas9 as a molecular Swiss Army knife. Synthetic Biology 
5:ysaa021.) 
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12.6 How Does High-Throughput Technology Allow Global Study of Millions of Genes or Molecules at Once? 


repurposed Cas9 proteins have been assembled in this way. Two examples are a Cas9 
with an N-terminal cytosine deaminase base-editing (CBE) domain and a Cas9 with 
N-terminal deoxyadenosine deaminase (ABE) base-editing domain. The CBE domain 
converts a C in the targeted DNA sequence to a U, which then base-pairs with A. The 
result is base-editing of a C:G base pair in the target DNA to a T:A base pair. The 
ABE domain converts adenosine to inosine, which leads to a G:C base pair where an 
A:T base pair was. 


12.5f Prime Editing with CRISPR-Cas9 


A further innovation of CRISPR-Cas9 is to fuse a reverse transcriptase (RT) domain 
onto the N-terminus of Cas9 in which the HNH nuclease (but not the RuvC nuclease) 
has been inactivated. The gRNA is also redesigned by linking a desired RNA tem- 
plate (RT) sequence to the gRNA downstream from the PAM, followed by a primer 
binding site (PBS); this gRNA is termed a pegRNA (for primer editor guide RNA) 
(Figure 12.25). This CRISPR-Cas9 construct will nick the noncomplementary DNA 
strand via its RuvC endonuclease and then the engineered Cas9 will use its RT func- 
tion to copy the RNA template region of the gRNA, as primed by the nicked region 
of the noncomplementary strand. In this manner, the nucleotide sequence of the 
target DNA can be rewritten as desired. Base-editing and prime-editing variants of 
CRISPR-Cas9 are being used to edit the genomes of plants and animals, including 
human cell lines. 


12.5g Gene silencing with CRISPR-Cas9 


Another addition to the CRISPR-Cas9 toolbox is CRISPRoff, which can be used 
to silence genes through DNA methylation (see the Human Biochemistry box in 
Chapter 11, DNA Methylation, CpG Islands, and Epigenetics). Catalytically inac- 
tive Cas9 (the HNH and RuvC endonuclease domains were inactivated by mutations) 
was fused with DNA-methyltransferase domains to create CRISPRoff. CRISPROff 
is a programmable epigenetic memory writer that carries out gene silencing through 
DNA methylation and repressive histone modifications. It is paired with gRNAs 
selected from a gRNA library targeting 20,000 protein-encoding genes. Gene silencing 
using CRISPRoff has been shown to be durable, persisting for over 450 cell divisions 
and permanently silencing the genes it targets. Multiple genes can be turned off at 
the same time using a combination of gRNAs. CRISPRoff gene silencing could be 
reversed by a counterpart construct, CRISPRon, which erases the methylation modifi- 
cations created by CRISPRoff through a Cas9-fused, DNA-demethylase domain and 
CRISPR inactivation of the host cell DNA methylation maintenance pathways. The 
beauty of this on/off CRISPR switch is that the nucleotide sequence of the genes is 
never altered. Instead, the genes are turned off and on at will through reversible DNA 
methylation. 


12.6 | How Does High-Throughput Technology Allow Global 
Study of Millions of Genes or Molecules at Once? 


A central scientific maxim is “To understand a system, you must perturb it.” By scru- 
tinizing the consequences of perturbations, we gain some insight into how things 
work. Mutations and drugs are two obvious examples of this strategy. The beauty 
and challenge of biological systems—cells, organisms—is their extraordinary com- 
plexity. Traditional biochemical and molecular biological research has been limited 
by its inability to dissect and interpret more than one molecule at a time. The emer- 
gence of high-throughput technology (HTT) has dramatically altered the landscape; 
now researchers can look at millions of genes or molecules at once. Such studies are 
appropriately called global, as in global gene screening or global gene expression 
studies. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


421 


422 Chapter 12 Recombinant DNA, Cloning, Gene Editing, and Synthetic Biology—An Introduction 


12.6a High-Throughput Screening 


HTT relies on nanotechnological automation of experimental analysis, so that large num- 
bers of molecules—DNA, RNA, proteins, or small molecules—can be screened for their 
functional role. Many of the relevant scientific questions today are based on screening 
genomes to reveal which genes are involved in a process, or screening total RNA to moni- 
tor those transcripts that are important, or finding which proteins among the thousands 
in a particular cell might be involved. The basic protocols of molecular biology described 
in this chapter underlie this technical revolution, but automation allows these protocols 
to be employed on a million-fold or more greater scale. The instrumentation used in HTT 
includes microtiter plates or chips bearing arrays of the molecules of interest (as shown 
in Figure 12.12), inkjet printers capable of delivering nanodrops of reagents, robotics 
for fluid handling and sample manipulation, software to manage the robotics, sensitive 
detectors that often rely on fluorescent markers to follow the progress of reactions, and 
computer interfaces for data processing and storage. The advent of HTT makes it easy 
to screen millions of DNA sequences (genes), transcripts, proteins, or small molecules in 
a single experiment. Such experiments are the realm of high-throughput screening (HTS). 
An explosion of scientific discovery is being driven by HTS, and a dazzling number of 
applications of this technology is available. A review of these many applications is beyond 
the scope of this chapter, but the few examples which follow convey the scale of these new 
technologies. 


12.6b High-Throughput RNAi Screening of Mammalian Genomes 


In a pioneering study, shR NA-encoding DNA oligonucleotides 96 bases long were 
designed to correspond to all known genes in humans and in mice. A total of almost 
200,000 oligonucleotides for each species was synthesized as a series of microarrays 
using inkjet DNA synthesis technology. These oligonucleotides could be recovered 
and cloned into DNA vectors capable of infecting mammalian cells. Introduction 
of these shRNA libraries into recipient mammalian cells (so that each cell receives a 
unique but different shRNA) leads to knockdown of the gene whose coding sequence 
is matched by the introduced shRNA. (“Knockdown of a gene” is a phrase describing 
a technique that prevents expression of a gene by targeting the mRNA product of that 
gene for destruction.) The challenge at this step is to design high-throughput screens 
to select for cells that have lost a particular function. Two examples would be: (1) 
Coexpress a GFP-labeled protein along with the shRNA to screen for cells deficient in 
protein degradation. Any cells with impaired capacity to degrade protein as a conse- 
quence of shRNA-mediated gene knockdown will display persistent fluorescence and 
can be easily selected. (2) Express the shRNA library in a mammalian cell line whose 
progression through mitosis can be monitored. shR NA knockdown of genes required 
for mitosis can be detected and identified. 


12.6c High-Throughput Protein Screening 


HTS screening of proteins allows efficient examination of a large number of protein 
variants or optimization of a wide range of experimental variables that might influence 
protein expression or purification. An example is the directed evolution of an enzyme, 
in this case, horseradish peroxidase. Directed evolution involves mutation of the 
enzyme-encoding gene and selection for variants displaying greater catalytic efficiency. 
Yeast cells were engineered to express horseradish peroxidase (HRP) from mutagenized 
libraries of HRP cDNA. A fluorogenic substrate allowed direct measurement of 
HRP enzymatic activity. A robotic microfluidic system created aqueous 6 pL drop- 
lets of yeast cells plus fluorogenic substrate that were 23 um in diameter within an 
oil—water emulsion. (Fluorogenic substrates are molecules that, when acted on by 
an enzyme, yield a fluorescent product.) These droplets were passed through a drop 
sorter, which could sort 2000 drops per second over a 3 hour period. The drop sorter 
selected those droplets showing high fluorescence, discarding the rest. This HTS 
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selection scheme identified yeast cells expressing HRP variants that were more than 
twice as efficient as the wild type enzyme, with the best mutant being almost seven 
times more efficient. The amino acid sequence encoded by the cDNA in such a yeast 
cell provides insight regarding HRP structure and activity. 


12.7 | What Is the Field of Synthetic Biology? 


DNA is analogous to software. The information encoded in a genome is a program 
for the operation of an organism; indeed, it defines an organism. This realization 
has spawned the new science of synthetic biology. Synthetic biology (or SynBio) 
brings together genetics, cell biology, biochemistry, and molecular biology with the 
goal of redesigning organisms to have new abilities that serve useful purposes. DNA 
as code has opened the door to engineering new parts, devices, systems, and even 
organisms with features that before now have never existed in nature (Figure 12.26). 
Synthetic biology allows the rational design of new molecules at any level in the bio- 
logical hierarchy: metabolites, protein, RNA, DNA. Engineering new systems with 
off-the-shelf, interchangeable parts easily introduced into well-studied cells, such as 
E. coli, yeast, or plants such as tobacco, presents exciting new possibilities for novel 
organisms to fit almost any imaginable purpose—environmental remediation, sustain- 
ability, biosensors, biofuel production, data storage, pharmaceutical synthesis and 
other therapeutic applications, and much more. Synthetic biology, with its reliance 
on mathematical modeling, engineering principles, and molecular biology, is a cross- 
discipline, integrated endeavor. 


12.7a iGEM and BioBricks (Registry of Standard Biological Parts) 


The International Genetically Engineered Machine (iGEM) competition is a excel- 
lent opportunity for students to gain hands-on experience in SynBio. iGEM is open to 
undergraduate students around the world (see http://igem.org/About). Originating in 
2003, iGEM is dedicated to the advancement of synthetic biology through openness, 
education, and competition. The competition relies on kits of standardized and tested 
biological parts (BioBricks) available from the Registry of Standard Biological Parts 
(see http://parts.igem.org/Help:Contents). A basic part is defined as a DNA sequence 
that encodes for a specific biological function, such as a protein. An essential feature is 
that any part can be readily combined with any other part using standardized cloning 
procedures. A device is a composite of several parts, such as a ribosome-binding site, 
a protein-coding gene, and a terminator. A device has a cellular function, such as the 
promoter-driven expression of a protein-coding gene. At the beginning of each annual 
iGEM competition, kits of biological parts are given to competing collegiate teams. 
Teams can also design and build their own parts. The aim is to build a biological system 
and then express it in an appropriate chassis. A good example of an iGEM competition 
winner is shown under Sustainability-Recycling Mixed Waste below. 

Mixed waste, a costly by-product of trash disposal, consists of non-recyclable plas- 
tics such as polyurethane as well as cellulose-based material (paper, wood) and other 
fibers. The 2013 iGEM team at Imperial College, London, designed synthetic biology 
systems to recover valuable components from mixed-waste trash for subsequent indus- 
trial use. Æ. coli cells were engineered to express and secrete polyurethane esterases 
that degrade polyurethane to form polyethylene glycol and poly(isothiocyanate), both 
of which can be recovered and sold back to the chemical industry. Further, engineered 
E. coli cells also received an operon encoding the three enzymes required for synthesis 
of poly-B-hydroxybutyrate (PHB) from acetyl-CoA, using the sugars released from 
cellulose breakdown as substrates for acetyl-CoA formation (Figure 12.27). PHB is 
a commercially valuable, environment-friendly polymer used in the manufacture of 
biodegradable plastic items, such as containers, eating utensils, food containers, and 
more. 


12.7 What Is the Field of Synthetic Biology? 
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Figure 12.26 The bottom-up organization of 


synthetic biology. 
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Figure 12.28 Structures of artemisinic acid and 
artemisinin. Yeast have been engineered to synthesize 
artemisinic acid, which can be recovered, chemically 
reduced, then photo-oxidized to yield artemisinin, a 
valuable anti-malarial drug. Artemisinin’s pharmaco- 
logical mechanism of action proceeds as follows: 
Plasmodium falciparum parasites infect red blood 
cells and use the hemoglobin they find there for food. 
The iron released from hemoglobin degradation acti- 
vates the endoperoxide (-O-O-) group of artemisinin; 
in turn, activated artemisinin binds to and inactivates 
the plasmodium’s calcium pump, causing death of 
the parasite. 
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Figure 12.27 Synthetic biology applied to sustainability. Students on the 2013 iGEM competition 
team at Imperial College, London, designed new BioBricks so that they could engineer E. coli to pro- 
cess mixed waste and recover valuable commodities from trash. Two new BioBricks were created: one 
that expressed polyurethane esterases (PUR esterases) and another with a three-enzyme cassette for 
poly-6—hydroxybutyrate (PHB) synthesis. (Image courtesy of Imperial College, London). 


12.7b Metabolic Engineering 


Metabolic engineering is a subdiscipline of synthetic biology that aims to optimize pro- 
duction of a desired substance through genetic and regulatory manipulation of living cells. 
Metabolic engineering takes advantage of mathematical modeling, engineering principles, 
molecular biology, recombinant DNA technology, cell biology, and detailed biochemical 
knowledge about metabolic pathways. Cells are redesigned in ways that alter their funda- 
mental metabolic capabilities in order to enhance biosynthesis of chemical compounds 
that have industrial, medical, or other important purposes. The synthesis of artemesinic 
acid provides an illustration of metabolic engineering. 


Synthesis of Artemisinin, an Anti-Malaria Drug, in Yeast Artemisinin is part of the 
standard therapy for treatment of Plasmodium falciparum infection, the most dangerous 
form of malaria. More than 1 million people die each year from malaria. Artemisinin 
can be isolated from Artemisia annua (also known as sweet wormwood), a plant used in 
Chinese medicine for 2000 years. Chemical synthesis of artemisinin is difficult and costly, 
and sweet wormwood has limited availability and inconsistent quantities. These chal- 
lenges prompted the Bill and Melinda Gates Foundation to give Professor Jay D. Keasling 
at the University of California, Berkeley, a $43,000,000 grant to engineer artemisinin 
biosynthesis into yeast cells. The strategy he and his colleagues devised was to redirect 
farnesyl pyrophosphate (see Figure 9.23) production from sterol synthesis to artemisinic 
acid formation. Subsequently, artemisinic acid is used as a precursor for industrial-scale 
artemisinin synthesis (Figure 12.28). Keasling engineered yeast so that its genes encod- 
ing enzymes for farnesyl pyrophosphate (FPP) synthesis were up-regulated and one 
gene for conversion of FPP to sterols was down-regulated. Genes encoding enzymes for 
artemisinic acid from FPP were identified in A. annua, isolated, and cloned into Saccha- 
romyces cerevisiae to create a metabolically engineered strain of yeast that produced more 
pharmacologically useful product than A. annua in less than one-tenth the time. 


Golden Rice—A Metabolic Engineering Success Story Vitamin A deficiency is a serious 
nutritional inadequacy in low-income countries, particularly in Africa and Southeast 
Asia. As many as 500,000 children suffer permanent blindness and half of these 
blinded children die within a year. Vitamin A-deficient children also suffer greater 
likelihood of death from common childhood infections, such as measles. Adequate 
intake of B-carotene, a precursor to vitamin A biosynthesis prevents development of 
deficiency (Figure 12.29). 
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The structure of B—carotene. 


Rice is the principal source of calories in the geographic regions where vitamin A defi- 
ciency is rampant. Genes encoding enzymes in the biosynthetic pathway for B-carotene 
that leads from isoprene (Figure 8.12) have been introduced into rice—a phytoene syn- 
thase from daffodils (or maize) and a phytoene desaturase from Erwinia uredorova, a com- 
mon soil bacterium. These enzymes convert isoprene in the rice endosperm (the part that 
we eat) into lycopene (Figure 8.13) whose rings are then cyclized to give B-carotene. The 
B-carotene-rich rice grains turned a deep golden color, giving the rice its name. Further 
improvements led to a strain containing enough g-carotene that a single serving of this 
Golden Rice could deliver more than half the recommended daily intake for vitamin A. 


Biochemistry of a Burger The Impossible Burger was created by Impossible Food, Inc., 
from a mixture of soy protein, potatoes, coconut fat, sunflower oil, and various bind- 
ers such as corn starch to give it a proper meat texture. A patented process generates 
a meat flavor profile when a mixture of amino acids, sugars, and metabolites is heated 
with leghemoglobin. Leghemoglobin is a 143-residue protein produced by soybeans. 
Soybeans are legumes and leghemoglobin is named for its source. Leghemoglobin shows 
a striking structural similarity to both the a- and B-chains of mammalian hemoglo- 
bin (Figure 5.5) and the protein myoglobin of mammalian muscle. Like these proteins, 
leghemoglobin is intensely red in color, making it an ideal addition to a plant-derived 
burger substitute. The leghemoglobin gene was isolated from soybeans and cloned in 
yeast cells engineered to overexpress the synthesis of leghemoglobin. Yeast-synthesized 
plant leghemoglobin is a key ingredient in Impossible Burgers, now available at fast-food 
chains and grocery stores. 


Chemoenzymatic Synthesis of Starch from CO, and H, Starch supplies the majority of 
calories in the diet of humans and livestock. Researchers have created a cell-free sys- 
tem for starch synthesis from carbon dioxide and hydrogen gas. The system is a | 1-step 
hybrid pathway of enzymatically catalyzed and chemical reactions. The pathway begins 
with the chemical reduction of CO, by H, to form methanol, which is then enzymati- 
cally converted to formaldehyde. The next step, the union of three formaldehyde mol- 
ecules to form dihydroxyacetone (DHA), was achieved by formolase, an enzyme created 
through a combination of computational protein design and directed evolution. DHA 
is a 3-carbon carbohydrate (Figure 7.3). Enzyme-catalyzed phosphorylation of DHA 
yielded a metabolic intermediate, dihydroxyacetone phosphate (DHAP), from which 
glucose-1-P and then starch could be synthesized by enzymes of carbohydrate metabo- 
lism common to plant cells. This chemoenzymatic system synthesized the conversion of 
CO; to starch over eight times faster than corn plants. 


12.7¢ Synthetic Genomes 


The creation of a fully synthetic genome was achieved in 2010 by J. Craig Venter’s 
laboratory. These scientists assembled the complete 1.08-Mbp genome of Mycoplasma 
mycoides genome from 1078 DNA segments, each 1080 bp long, in a three-step process 
(Figure 12.30; see also the Critical Developments box Synthetic Life in Section 1.5). 
Each of these segments was constructed from chemically synthesized oligonucleotides. 
The assembly process was performed in yeast cells, and the completed M. mycoides 
genome was then introduced into M. capricolum recipient cells. Cells that could 
only grow if they contained a functional M. mycoides chromosome were obtained 
and analyzed for the presence of unique watermarks introduced during synthetic 


12.7 What Is the Field of Synthetic Biology? 


Figure 12.29 The structure of B-carotene. 
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Figure 12.30 Assembly of the fully synthetic Mycoplasma mycoides genome in three steps. The entire 
genome was represented on 1078 overlapping DNA molecules, each 1080 bp long, obtained from Blue 
Heron, a company that performs custom DNA synthesis. These DNA cassettes were joined in sets of 

10 to create 109, 10.08 kbp assemblies, then again in sets of 10 to make 11 100 kbp fragments, and finally, 
in the third step, the 11 100 kbp fragments were linked to yield the 1.08 Mbp complete M. mycoides 
genome. Each of these three steps relied on the intrinsic ability of yeast to join DNA fragments by 
homologous recombination. (Adapted from Figure | in Gibson, D. G., et al., 2010. Creation of a 
bacterial cell controlled by a chemically synthesized genome. Science 329:52—S6.) 


chromosome assembly. The presence of these watermarks proved that the fully syn- 
thetic genome was directing the growth of the cells. 

The creation of synthetic genomes faces two formidable challenges—the high cost of 
synthesizing long DNA sequences by chemical means and the difficulties in assembling 
such long sequences to form synthetic chromosomes. To date, the assembly of DNA 
segments within yeast (see Figure 12.30) remains the best alternative. Despite these 
difficulties, efforts are underway to create the first-ever fully synthetic genome of a 
eukaryotic cell. The Sc2.0 Project is an international collaboration to make the entire 
12 megabase pair yeast (S. cerevisiae) genome from scratch. Launched in 2004, Sc2.0 
synthesized seven of the 16 chromosomes as of 2021. The synthesis of giga bp-sized 
genomes remains a considerable challenge. 

One advantage of synthetic genome construction is the opportunity it offers in 
protein design. It is feasible to enhance an organism’s genome by re-purposing some 
of its codons to specify unnatural amino acids. As an example, an E. coli strain has 
been engineered that uses only 57 of the 64 3-base codons found in all organisms, which 
frees seven for re-coding purposes. Such re-coding allows the creation of genomes with 
functions never-before-seen in nature. 
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Foundational Biochemistry 


Foundational Biochemistry Things You Should Know after Reading Chapter 12. 


The laboratory manipulation of nucleic acid molecules has led to the 
fields of recombinant DNA technology and genetic engineering. 


A clone in molecular biology refers to a population of molecules or 
cells identical to an original molecule or cell. 


Plasmids are very useful as vectors for cloning genes. 
Virtually any DNA sequence can be cloned. 


Chimeric plasmids can be constructed by inserting foreign DNA into 
a plasmid that has been cleaved by a restriction endonuclease. 


Directional cloning allows the insertion of foreign DNA into a plas- 
mid or other vector in the desired orientation with respect to adjacent 
DNA elements such as promoters. 


Biologically functional chimeric plasmids have been used to transform 
cells, and the gene(s) they carry have been expressed in the recipient cells. 


Shuttle vectors are plasmids that can be propagated in two different 
organisms. 


Artificial chromosomes can be created from recombinant DNA. 
Genomic libraries are prepared from the total DNA in an organism. 
Genomic libraries can be screened for the presence of specific genes. 
PCR can be used to clone and amplify specific genes. 


Southern hybridization is a method for identifying specific DNA 
sequences in a large population of different DNA fragments. 


Probes for Southern hybridization can be prepared in a variety of ways. 


Northern hybridization is a technique for finding specific RNA 
sequences among a large population of RNA molecules. 


Western blotting is used to a identify specific protein on an electro- 
phoretic gel. 


RT-PCR (reverse transcriptase-PCR) is useful for cloning the gene 
that encodes a specific mRNA. 


cDNA libraries are libraries prepared from mRNA using reverse 
transcriptase. 


DNA microarrays, or gene chips, are arrays of different oligonucle- 
otides immobilized on a chip. 


Expression vectors are designed so that the RNA transcripts or pro- 
tein products encoded by specific genes can be expressed in host cells. 


RT-qPCR (real-time quantitative PCR) is used to determine levels of 
mRNA expression. 


cDNA libraries can be screened for the presence of specific protein- 
encoding cDNAs by using antibodies to the protein. 


Expression of proteins from cDNA libraries where the coding 
sequences of the cDNAs have been fused to easily isolated proteins or 
peptides allows rapid purification of the protein encoded by the cDNA. 


Reporter gene constructs are chimeric DNA molecules composed 
of gene regulatory sequences positioned next to an easily expressible 
gene product, such as GFP. 


Specific protein—protein interactions can be identified by the yeast 
two-hybrid system. 


Protein—protein interactions can also be revealed by immunoprecipi- 
tation using antibodies against one of the proteins. 


In vitro mutagenesis makes it possible to alter the nucleotide sequence 
of a cloned gene in a systematic way. 


RNAi (RNA interference) can be used to reveal the function of genes. 
Synthetic short-hairpin RNAs (shRNAs) target genes of choice. 
Human gene therapy can repair genetic deficiencies. 


Retroviruses and adenoviruses are useful vectors for human gene 
therapy. 


Successful human gene therapy has been achieved using the ex vivo 
approach with recombinant lentiviral vectors. 


Inherited genetic diseases successfully treated by human gene ther- 
apy include X-linked severe combined immunodeficiency (X-linked 
SCID), X-linked adrenoleukodystrophy (ALD), sickle-cell anemia, 
and 6-thalassemia. 


A recombinant adenovirus was used to introduce a functional lipopro- 
tein lipase gene in vivo in the first commercially approved human gene 
therapy. 


Genome engineering seeks to directly alter an organism’s genome by 
changing its nucleotide sequence at one or more specific sites. 


DNA as code: DNA is analogous to software. 


CRISPR-Cas9 is a revolutionary new tool for genome engineering 
and gene editing. 


Base editing: CRISPR-Cas9 can be modified so that it acts as a base 
editor. 


Prime editing: CRISPR-Cas9 can be modified so that a fused reverse 
transcriptase domain will rewrite the genome. 


High-throughput technology (HTT) allows global study of millions 
of genes or molecules at the same time. 


shRNA-encoding DNA oligonucleotides corresponding to all 
known genes in humans and in mice can be used to knock down 
any gene. 


High-throughput protein screening using robotic microfluidic sample 
handling and a mutagenized peroxidase cDNA was used to demon- 
strate the directed evolution of an enzyme. 


Synthetic biology allows the rational design of new molecules at any 
level in the biological hierarchy. 


Examples of synthetic biology include the creation of biosensors, 
engineering organisms for recovery of industrial chemicals from 
waste, and metabolic engineering of organisms to produce pharma- 
ceuticals or enhance their nutritional value. 


Synthetic genomes are chemically synthesized, fully functional 
genomes. 
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Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


aE 


10. 


Method Terminology For each item listed, indicate whether it 
functions in the process of DNA cloning, PCR, RNA interference, or 
CRISPR-Cas9: DNA polymerase, gR NA, primer, passenger strand, 
ligase, PAM, vector, dicer, endonuclease, siRNA, guide strand, linkers, 
and RISC. 


. Protein, RNA, or DNA? For each item listed, indicate whether it is 


composed of protein, RNA, or DNA: passenger strand, primer, Cas9, 
YAC, expressed sequence tags, RNA, RISC, ligase, BAC, PAM, 
endonuclease, reverse transcriptase, HAC, vector, guide strand, 
linkers, DNA polymerase, dicer, and siRNA. 


. Construction of a Recombinant Plasmid A DNA fragment isolated 


from an EcoRI digest of genomic DNA was combined with a plasmid 
vector linearized by EcoRI digestion so that the sticky ends could 
anneal. Phage T4 DNA ligase was then added to the mixture. List all 
possible products of the ligation reaction. 


. Overlapping Restriction Sites in a Plasmid Vector Polylinker The 


nucleotide sequence of a polylinker in a particular plasmid vector is 


-GAATTCCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGC- 


This polylinker contains restriction sites for BamHI, EcoRI, PstI, 
Sall, Smal, Sphl, and Xbal. Indicate the location of each restriction 
site in this sequence. (See Table 10.2 for the cleavage sites of restric- 
tion enzymes.) 


. Design a Vector Polylinker for Directional Cloning A vector has a 


polylinker containing restriction sites in the following order: HindIII, 
Sacl, Xhol, Bglll, Xbal, and Clal. 


a. Give a possible nucleotide sequence for the polylinker. 


b. The vector is digested with HindIII and Clal. A DNA segment 
contains a HindIII restriction site fragment 650 bases upstream 
from a Clal site. This DNA fragment is digested with HindIII and 
Clal, and the resulting HindIII—Clal fragment is directionally 
cloned into the HindI—Clal-digested vector. Give the nucleotide 
sequence at each end of the vector and the insert and show that 
the insert can be cloned into the vector in only one orientation. 


. The Number of Clones Needed to Screen a Yeast Genomic Library at 


99% Confidence Yeast (S. cerevisiae) has a genome size of 1.21 X 107 bp. 
If a genomic library of yeast DNA was constructed in a vector 
capable of carrying 16-kbp inserts, how many individual clones would 
have to be screened to have a 99% probability of finding a particular 
fragment? 


. The Number of Clones Needed to Screen a Very Large Genomic Library 


at 99% Confidence The South American lungfish has a genome size 
of 1.02 x 10!! bp. If a genomic library of lungfish DNA was 
constructed in a vector capable of carrying inserts averaging 45 kbp in 
size, how many individual clones would have to be screened to have a 
99% probability of finding a particular DNA fragment? 


. Calculating Number of Clones The octopus’ genome has been 


estimated to contain 2.7 billion bp. A library is generated using a 
vector capable of carrying 22 kbp inserts. How many individual 
clones would have to be screened to have a 99% probability of finding 
a particular DNA fragment? 


. Designing Primers for PCR Amplification of a DNA Sequence Given 


the following short DNA duplex of sequence (5’—>3’) 
ATGCCGTAGTCGATCATTACGATAGCATAGCACAGGGATCCA- 
CATGCACACACATGACATAGGACAGATAGCAT 
what oligonucleotide primers (17-mers) would be required for PCR 
amplification of this duplex? 


Amplification of DNA Using PCR A sample contains 10 copies of the 
DNA of interest. PCR is used to amplify the DNA of interest using 


Tl. 


12. 


13. 


excess amounts of primer. Three PCR kits are available with an 
average copying (doubling) efficiency of 75%, 85%, or 95%. 
Calculate and graph the number of DNA copies as a function of the 
number of PCR cycles, up to 20 cycles. (Assume that the number of 
copies would double with every cycle if the process were 100% 
efficient.) 


Designing a Primer Consider the segment of DNA shown 


S-AGATACCGCTGTGTATAACGGA-? 


Design a primer that binds anywhere within this sequence with the 
following requirements: it is six nucleotides long and has a minimal 
number of C or G. 


A Polylinker for Expression of a B—Galactosidase Fusion Protein 
Figure 12.3 shows a polylinker that falls within the B-galactosidase 
coding region of the /acZ gene. This polylinker serves as a cloning 
site in a fusion protein expression vector where the closed insert is 
expressed as a B-galactosidase fusion protein. Assume the vector 
polylinker was cleaved with BamHI and then ligated with an insert 
whose sequence reads 


GATCCATTTATCCACCGGAGAGCTGGTATCCCCAAAAGACG- 
GCC... 


What is the amino acid sequence of the fusion protein? Where is the 
junction between -galactosidase and the sequence encoded by the 
insert? (Consult the genetic code table on the inside front cover to 
decipher the amino acid sequence.) 


Using PCR for the In Vitro Mutagenesis of a Protein Coding Sequence 
The amino acid sequence across a region of interest in a protein is 


Asn-Ser-Gly-Met-His-Pro-Gly-Lys-Leu-Ala-Ser-Trp-Phe-Val-Gly-Asn-Ser 


14. 


15. 


16. 


The nucleotide sequence encoding this region begins and ends 

with an EcoRI site, making it easy to clone out the sequence and 
amplify it by PCR. Give the nucleotide sequence of this region. 
Suppose you wished to change the middle Ser residue to a Cys to 
study the effects of this change on the protein’s activity. What is the 
sequence of the mutant oligonucleotide you would use for PCR 
amplification? 


Combinatorial Libraries: Calculating the Number of Sequence Possibili- 

ties for Oligonucleotides and Peptides Combinatorial chemistry can 

be used to synthesize polymers such as oligopeptides or oligonucle- 

otides. The number of sequence possibilities for a polymer is given by 

x’, where x is the number of different monomer types (for example, 

20 different amino acids in a protein or 4 different nucleotides in a 

nucleic acid) and y is the number of monomers in the oligomers. 

a. Calculate the number of sequence possibilities for RNA 
oligomers 15 nucleotides long. 

b. Calculate the number of amino acid sequence possibilities for 
pentapeptides. 


Combinatorial Libraries Revisit Critical Developments in Biochemis- 
try — Combinatorial Libraries. Consider the task of making oligo- 
nucleotides built from the four different nucleotides A, G, C and T. 
a. Calculate the number of possible oligonucleotides that are 2 bp 
long. 
b. Calculate the number of possible oligonucleotides that are 
3 bp long. 
c. Give a general formula for calculating the number of possible 
oligonucleotides of any length. Graph the number of possible 
oligonucleotides versus the length of the oligonucleotide. 


Using the Yeast Two-Hybrid System to Discover Protein-Protein 
Interactions Imagine that you are interested in a protein that interacts 
with proteins of the cytoskeleton in human epithelial cells. Describe 
an experimental protocol based on the yeast two-hybrid system that 
would allow you to identify proteins that might interact with your 
protein of interest. 
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17. Studying Protein-Protein Interactions Purified protein A was reported 
to bind purified protein B in vitro only in the presence of calcium 
ions. How would you confirm this finding in vivo? Use any figure from 
Chapter 12 to model your answer. 


18. Preparing cDNA Libraries from Different Cells Describe an experimen- 
tal protocol for the preparation of two cDNA libraries, one from 
anaerobically grown yeast cells and the second from aerobically 
grown yeast cells. 


19. Using Gene Chips to Study Differential Gene Expression Describe an 
experimental protocol based on DNA microarrays (gene chips) that 
would allow you to compare gene expression in anaerobically grown 
yeast versus aerobically grown yeast. 


20. Using Antibodies to Screen cDNA Libraries You have an antibody 
against yeast hexokinase A (hexokinase is the first enzyme in the 
glycolytic pathway). Describe an experimental protocol using the 
cDNA libraries prepared in problem 18 that would allow you to 
identify and isolate the cDNA for hexokinase. Consulting Chapter 5 
for protein analysis protocols, describe an experimental protocol to 
verify that the protein you have identified is hexokinase A. 


21. Design of a Reporter Gene Construct to Study Promoter Function In 
your experiment in problem 19, you discover a gene that is strongly 
expressed in anaerobically grown yeast but turned off in aerobically 
grown yeast. You name this gene nox (for no oxygen). You have the 
idea that you can engineer a yeast strain that senses O, levels if you can 
isolate the nox promoter. Describe how you might make a reporter gene 
construct using the nox promoter and how the yeast strain bearing this 
reporter gene construct might be an effective oxygen sensor. 


22. CRISPR/Cas9: Design of a gRNA to Target the Human PVALB Gene 
The human PVALB gene, which encodes the Ca**-binding protein 
parvalbumin, can be targeted by CRISPR/Cas9, at the protospacer 
sequence -ATGCAGGAGGGTGGCGAGAGGGGCCGAGAT- 
followed by a -TGG-PAM trinucleotide. Give the sequence of the 
spacer region of a gRNA that will target the complementary 
DNA strand at this site. Include at the 3’-end of your gRNA 
sequence a region that will form a stem-loop structure with a 
5'-AGCAUAGCUGUAAAAC- sequence downstream in the 
gRNA to create the dsRNA-binding site for Cas9. 


23. Design of gRNA for CRISPR-Cas9 Consider the dsDNA shown, with 
the PAM sequence shown in bold and underlined. Design a 
17-nucleotide-long segment of the guide RNA that would target this 
DNA segment. 
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S-CTACGTACAATCGCATTGCAGCTTCAGCAG 
CTAGCATCA-3’ 


3-GATGCATGTTAGCGTAACGTCGAAGTCGTCG 
ATCGTAGT-S 


24. (Integrates with Chapter 10.) In base editing with CRISPR-Cas9, 
specific enzymes are used to convert a C in DNA into a U, or an A 
into an I, through a deamination reaction. 

a. Revisiting the structures shown in Chapter 10, give the chemical 
reaction of the deamination of C into U. 

b. Why is the C to U deamination not an instance of changing a 
DNA nucleotide into an RNA nucleotide? 

c. Using the conversion of C into U as your model, give the 
chemical reactions for the deamination of A and G. 


Biochemistry on the Web 


25. Search the National Center for Biotechnology Information (NCBI) 
website at /ttp-//www.ncbi.nlm.nih. gov/siteslentrez?db=Genome to 
discover the number of organisms whose genome sequences have been 
completed. Explore the rich depository of sequence information 
available here by selecting one organism from the list and browsing 
through the contents available. 


26. Figure 12.1 shows restriction endonuclease sites for the plasmid 
pBR322. You want to clone a DNA fragment and select for it in 
transformed bacteria by using resistance to tetracycline and sensitivity 
to ampicillin as a way of identifying the recombinant plasmid. What 
restriction endonucleases might be useful for this purpose? 


27. Suppose in the amino acid sequence in Figure 12.8, tryptophan was 
replaced by cysteine. How would that affect the possible mRNA 
sequence? (Consult the inside front cover of this textbook for amino 
acid codons.) How many nucleotide changes are necessary in replacing 
Trp with Cys in this coding sequence? What is the total number of 
possible oligonucleotide sequences for the mRNA if Cys replaces Trp? 


Think-Pair-Share Question 


Gene Therapy A single-point mutation in its gene renders an enzyme 
hyperactive in the cytosol of cells. This phenomenon is observed when just 
one copy of the gene carries the mutation. Using the techniques discussed 
in this chapter (or other methods you may know of), propose therapeutic 
approaches to minimize the impact of or reverse this mutation. You can 
target the protein (enzyme), RNA, or DNA. 
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Enzymes—Kinetics 
and Specificity 


At any moment, thousands of chemical reactions are taking place in any living cell. 
Enzymes are essential for these reactions to proceed at rates fast enough to sustain life. 
What are enzymes, and what do they do? 


Ls organisms teem with metabolic activity. Thousands of chemical reactions 
are proceeding very rapidly at any given instant within all living cells. Virtually 
all of these transformations are mediated by enzymes—proteins (and occasionally 
RNA) specialized to catalyze metabolic reactions. The substances transformed in 
these reactions are often organic compounds that show little tendency for reaction 
outside the cell. An excellent example is glucose, a sugar that can be stored indefinitely 
on the shelf with no deterioration. Most cells quickly oxidize glucose, producing 
carbon dioxide and water, and releasing lots of energy: 


C6H20¢ + 6 O —> 6 CO, + 6 H2O + 2870 kJ of energy 


(—2870 kJ/mol is the standard-state free energy change [AG°’] for the oxidation of 
glucose.) In chemical terms, 2870 kJ is a large amount of energy, and glucose can be 
viewed as an energy-rich compound even though at ambient temperature it is not 
readily reactive with oxygen outside of cells. Stated another way, glucose represents 
thermodynamic potentiality: Its reaction with oxygen is strongly exergonic, but it 
doesn’t occur under normal conditions. Enzymes can catalyze such thermodynami- 
cally favorable reactions, causing them to proceed at extraordinarily rapid rates. 
Enzymes accomplish this feat by lowering the energy of activation for the reaction 
(Figure 13.1). In glucose oxidation and countless other instances, enzymes provide 
cells with the ability to exert kinetic control over thermodynamic potentiality. That is, 
living systems use enzymes to accelerate and control the rates of vitally important 
biochemical reactions. 


13 


Part Il 
Protein 
Dynamics 


There is more to life than increasing its speed. 


Mahatma Gandhi (1869-1948) 


The Space Shuttle must accelerate from zero velocity to 
a velocity of more than 18,000 miles per hour in order 
to enter earth orbit. Its three main engines are powered 
by energy released in the reaction: 2H, + O; > 2 H20. 
The rate of Oz consumption per engine is 13,000 moles 
per second. 


13.1 What Characteristic Features 
Define Enzymes? 


13.2 How Can the Rate of an Enzyme- 
Catalyzed Reaction Be Defined in a 
Mathematical Way? 


13.3 What Equations Define the Kinetics 
of Enzyme-Catalyzed Reactions? 

13.4 What Can Be Learned from the 
Inhibition of Enzyme Activity? 


13.5 What Is the Kinetic Behavior of 
Enzymes Catalyzing Bimolecular 
Reactions? 


13.6 How Can Enzymes Be So Specific? 
13.7 Are All Enzymes Proteins? 


13.8 Is It Possible to Design an Enzyme 
to Catalyze Any Desired Reaction? 
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Figure 13.2 The breakdown of glucose by glycolysis 
provides a prime example of a metabolic pathway. 


Free energy, G 


6 CO; + 6 H,O 


Progress of reaction 


Figure 13.1 Reaction profile showing the large AG? for glucose oxidation. Enzymes lower AG}, 
thereby accelerating rate. 


Enzymes Are the Agents of Metabolic Function 


Acting in sequence, enzymes form metabolic pathways by which nutrient molecules 
are degraded, energy is released and converted into metabolically useful forms, and 
precursors are generated and transformed to create the literally thousands of 
distinctive biomolecules found in any living cell (Figure 13.2). Situated at key 
junctions of metabolic pathways are specialized regulatory enzymes capable of sens- 
ing the momentary metabolic needs of the cell and adjusting their catalytic rates 
accordingly. The responses of these enzymes ensure the harmonious integration of 
the diverse and often divergent metabolic activities of cells so that the living state is 
promoted and preserved. 


13.1 | What Characteristic Features Define Enzymes? 


Enzymes are remarkably versatile biochemical catalysts that have three distinctive 
features in common: catalytic power, specificity, and regulation. 


Dr. Maud Menten 


Maud Leonora Menten (1879-1960) was one of 
the first women to receive an M.D. degree from the 
University of Toronto. In 1912, she moved to the 
laboratory of Leonor Michaelis in Berlin because 
women were not permitted to do research in Canada 
at that time. With Michaelis, she studied the kinetics 
of invertase, an enzyme that hydrolyzes sucrose to 
glucose and fructose. Michaelis and Menten found 
evidence for the formation of an enzyme-substrate 
complex, leading to their formulation of the 
Michaelis-Menten equation, which relates the rate 
of an enzyme-catalyzed reaction to the enzyme’s 
affinity for its substrate. 


To establish herself as an independent 
investigator, Menten got a Ph.D. in biochemistry 
from the University of Chicago in 1916 and joined 
the faculty at the University of Pittsburgh School 
of Medicine, where her research ranged from blood 
chemistry and enzymology to immunology. She 
was awarded a full professorship in 1949, the year 
she retired. She drove a Model T Ford around the 
University of Pittsburgh for 32 years, played the 
clarinet, spoke multiple languages, and went on an 
Arctic expedition. 
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13.la Catalytic Power Is Defined as the Ratio of the Enzyme-Catalyzed 
Rate of a Reaction to the Uncatalyzed Rate 


Enzymes display enormous catalytic power, accelerating reaction rates as much as 
10%% over uncatalyzed levels, which is far greater than any synthetic catalysts can 
achieve. Enzymes accomplish these astounding feats in dilute aqueous solutions 
under mild conditions of temperature and pH. For example, the enzyme jack bean 
urease catalyzes the hydrolysis of urea: 


O 


| 
HoN—C—NHp + 2 HO + Ht —> 2 NH4* + HCO3- 


At 20°C, the rate constant for the enzyme-catalyzed reaction is 3 X 104/s; the rate 
constant for the uncatalyzed hydrolysis of urea is 3 X 107!°/s. Thus, 10" is the ratio of 
the catalyzed rate to the uncatalyzed rate of reaction. Such a ratio is defined as the 
relative catalytic power of an enzyme, so the catalytic power of urease is 10, 


13.1b Specificity Is the Term Used to Define the Selectivity 
of Enzymes for Their Substrates 


Any given enzyme is very selective, both in the substances with which it interacts and 
in the reaction that it catalyzes. The substances upon which an enzyme acts are called 
substrates. In an enzyme-catalyzed reaction, none of the substrate is diverted into 
nonproductive side reactions, so no wasteful by-products are produced. It follows 
then that the products formed by a given enzyme are also very specific. This situation 
can be contrasted with your own experiences in the organic chemistry laboratory, 
where yields of 50% or even 30% are viewed as substantial accomplishments 
(Figure 13.3). The selective qualities of an enzyme are collectively recognized as its 
specificity. Intimate interaction between an enzyme and its substrates occurs through 
molecular recognition based on structural complementarity; such mutual recognition 
is the basis of specificity. The specific site on the enzyme where substrate binds and 
catalysis occurs is called the active site. 


13.1c Regulation of Enzyme Activity Ensures That the Rate 
of Metabolic Reactions Is Appropriate 
to Cellular Requirements 


Regulation of enzyme activity is essential to the integration and regulation of metabo- 
lism. Enzyme regulation is achieved in a variety of ways, ranging from controls over 
the amount of enzyme protein produced by the cell to more rapid, reversible interac- 
tions of the enzyme with metabolic inhibitors and activators. Chapter 15 is devoted to 
discussions of this topic. Because most enzymes are proteins, we can anticipate that 
the functional attributes of enzymes are due to the remarkable versatility found in 
protein structures. 


13.1d Enzyme Nomenclature Provides a Systematic 
Way of Naming Metabolic Reactions 


Traditionally, enzymes were named by adding the suffix -ase to the name of the 
substrate upon which they acted, as in urease for the urea-hydrolyzing enzyme or 
phosphatase for enzymes hydrolyzing phosphoryl groups from organic phosphate 
compounds. Other enzymes acquired names bearing little resemblance to their 
activity, such as the peroxide-decomposing enzyme catalase or the proteolytic 
enzymes (proteases) of the digestive tract, trypsin and pepsin. Because of the confusion 
that arose from these historical designations, an International Commission on 
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Figure 13.3 A 90% yield over 10 steps gives an 
overall yield of 35%. Yields in biological reactions 
must be substantially greater, otherwise, unwanted 
by-products would accumulate to unacceptable 
levels. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


436 Chapter 13 Enzymes—Kinetics and Specificity 


Enzymes was established to create a systematic basis for enzyme nomenclature. 
Although common names for many enzymes remain in use, all enzymes now are 
classified and formally named according to the reaction they catalyze. Six classes of 
reactions are recognized (Table 13.1). Within each class are subclasses, and under 
each subclass are sub-subclasses within which individual enzymes are listed. Classes, 
subclasses, sub-subclasses, and individual entries are each numbered so that a series 
of four numbers serves to specify a particular enzyme. A systematic name, descriptive 
of the reaction, is also assigned to each entry. To illustrate, consider the enzyme that 
catalyzes this reaction: 


ATP + D-glucose —> ADP + b-glucose-6-phosphate 


A phosphate group is transferred from ATP to the C-6-OH group of glucose, so the 
enzyme is a transferase (class 2, Table 13.1). Subclass 7 of transferases is enzymes 
transferring phosphorus-containing groups, and sub-subclass 1 includes those phos- 


IE JCA Systematic Classification of Enzymes According to the Enzyme Commission 
E.C. Number Systematic Name and Subclasses E.C. Number Systematic Name and Subclasses 
1 Oxidoreductases (oxidation—reduction reactions) 4.1.2 Aldehyde lyases 
1.1 Acting on CH—OH group of donors 4.2 C—O lyases 
1.1.1 With NAD or NADP as acceptor 4.2.1 Hydrolases 
1.1.3 With O, as acceptor 4.3 C—N lyases 
1.2 Acting on the Sc=o group of donors 4.3.1 Ammonia lyases 
1.2.3 With O; as acceptor 5 Isomerases (isomerization reactions) 
1.3 Acting on the CH—CH group of donors 5.1 Racemases and epimerases 
1.3.1 With NAD or NADP as acceptor 5.1.3 Acting on carbohydrates 
2 Transferases (transfer of functional groups) 5.2 Cis-trans isomerases 
2.1 Transferring C-1 groups 6 Ligases (formation of bonds with ATP cleavage) 
2.1.1 Methyltransferases 6.1 Forming C—O bonds 
2.1.2 Hydroxymethyltransferases and 6.1.1 Amino acid—RNA ligases 
formyltransferases 
2.1.3 Carboxyltransferases and carbamoyltransferases 6.2 Forming C—S bonds 
2.2 Transferring aldehydic or ketonic residues 6.3 Forming C—N bonds 
2.3 Acyltransferases 6.4 Forming C—C bonds 
2.4 Glycosyltransferases 6.4.1 Carboxylases 
2.6 Transferring N-containing groups 7 Translocases (the movement of a molecule or ion 
across a membrane) 
2.6.1 Aminotransferases Tal Translocation of protons 
2.7 Transferring P-containing groups TA Linked to oxidoreductases 
2.7.1 With an alcohol group as acceptor 7.2 Translocation of inorganic cations 
3 Hydrolases (hydrolysis reactions) 7.2.1 Linked to the hydrolysis of a nucleoside triphosphate 
3.1 Cleaving ester linkage 7.3 Translocation of inorganic anions and their chelates 
3.1.1 Carboxylic ester hydrolases 731 Linked to the hydrolysis of a nucleoside triphosphate 
3.1.3 Phosphoric monoester hydrolases 7.4 Translocation of amino acids and peptides 
3.1.4 Phosphoric diester hydrolases 7.4.1 Linked to the hydrolysis of a nucleoside triphosphate 
4 Lyases (bond cleavage by means other than 7.5 Translocation of carbohydrates and their derivatives 
hydrolysis or oxidation) 
4.1 C—C lyases 7.5.1 Linked to the hydrolysis of a nucleoside triphosphate 
4.1.1 Carboxy-lyases 7.6 Translocation of other compounds 


For details, see https://enzyme.expasy.org. 
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photransferases with an alcohol group as an acceptor. Entry 2 in this sub-subclass is 
ATP:p-glucose-6-phosphotransferase, and its classification number is 2.7.1.2. In use, 
this number is preceded by the letters E.C., denoting the Enzyme Commission. 
For example, entry | in the same sub-subclass is E.C.2.7.1.1, ATP:D-hexose-6- 
phosphotransferase, an ATP-dependent enzyme that transfers a phosphate to the 
6-OH of hexoses (that is, it is nonspecific regarding its hexose acceptor). 
These designations can be cumbersome, so in everyday usage, traditional names are 
commonly used. The glucose-specific enzyme E.C.2.7.1.2 is called glucokinase, and 
the nonspecific E.C.2.7.1.1 is known as hexokinase. Kinase is a term for enzymes that 
are ATP-dependent phosphotransferases. 


13.le Coenzymes and Cofactors Are Nonprotein 
Components Essential to Enzyme Activity 


Many enzymes carry out their catalytic function relying solely on their protein 
structure. Many others require nonprotein components, called cofactors (Table 13.2). 
Cofactors may be metal ions or organic molecules referred to as coenzymes. 
Coenzymes and cofactors provide proteins with chemically versatile functions not 
found in amino acid side chains. Many coenzymes are vitamins or contain vitamins 
as part of their structure. Usually coenzymes are actively involved in the catalytic 
reaction of the enzyme, often serving as intermediate carriers of functional groups in 
the conversion of substrates to products. In most cases, a coenzyme is firmly 
associated with its enzyme, perhaps even by covalent bonds, and it is difficult to 
separate the two. Such tightly bound coenzymes are referred to as prosthetic groups of 
the enzyme. The catalytically active complex of protein and prosthetic group is called 
the holoenzyme. The protein without the prosthetic group is called the apoenzyme; it 
is catalytically inactive. 


Enzyme Cofactors: A Few Examples of Metal lons and Coenzymes and the Enzymes with Which They Are Associated 


Metal lons and Some 
Enzymes That Require Them 


Coenzymes Serving as Transient Carriers 
of Specific Atoms or Functional Groups 


Metal lon Enzyme Coenzyme Entity Transferred 
Fe?* or Cytochrome oxidase Thiamine pyrophosphate (TPP) Aldehydes 
Fe Catalase Flavin adenine dinucleotide (FAD) Hydrogen atoms 
Peroxidase Nicotinamide adenine dinucleotide Hydride ion (:H~) 
(NAD) 
Cuw* Cytochrome oxidase 
Zn?* DNA polymerase Coenzyme A (CoA) Acyl groups 
Carbonic anhydrase Pyridoxal phosphate (PLP) Amino groups 
Alcohol dehydrogenase 
Mg?* Hexokinase 5'-Deoxyadenosylcobalamin H atoms and alkyl 
(vitamin Bj) groups 
Glucose-6-phosphatase 
Mn?* Arginase Biotin (biocytin) CO, 
Kt Pyruvate kinase Tetrahydrofolate (THF) Other one-carbon 
(also requires Mg?*) groups, such as formyl 
Ni2+ Uréase and methyl groups 
Mo Nitrate reductase 
Se Glutathione peroxidase 


Representative Enzyme 
Pyruvate dehydrogenase 


Succinate 
dehydrogenase 


Alcohol dehydrogenase 


Acetyl-CoA 
carboxylase 


Aspartate 
aminotransferase 


Methylmalonyl-CoA 
mutase 


Propionyl-CoA 
carboxylase 


Thymidylate synthase 
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13.2 | How Can the Rate of an Enzyme-Catalyzed Reaction 
| Be Defined in a Mathematical Way? 


Kinetics is the branch of science concerned with the rates of reactions. The study of 
enzyme kinetics addresses the biological roles of enzymatic catalysts and how they 
accomplish their remarkable feats. Enzyme kineticists seek to determine the 
maximum reaction velocity that the enzyme can attain and its binding affinities for 
substrates and inhibitors. Coupled with studies on the structure and chemistry of the 
enzyme, analysis of the enzymatic rate under different reaction conditions yields 
insights regarding the enzyme’s mechanism of catalytic action. Such information is 
essential to an overall understanding of metabolism. 

Significantly, this information can be exploited to control and manipulate the 
course of metabolic events. The science of pharmacology relies on such a strategy. 
Pharmaceuticals, or drugs, are often special inhibitors specifically targeted at a particular 
enzyme in order to overcome infection or to alleviate illness. A detailed knowledge of the 
enzyme’s kinetics is indispensable to rational drug design and successful pharmacological 
intervention. 


13.2a Chemical Kinetics Provides a Foundation 
for Exploring Enzyme Kinetics 


Before beginning a discussion of enzyme kinetics, it will be fruitful to review some 
basic principles of chemical kinetics. Chemical kinetics is the study of the rates of 
chemical reactions. Consider a reaction of overall stoichiometry: 


A— >P 


Although we treat this reaction as a simple, one-step conversion of A to P, it more 
likely occurs through a sequence of elementary reactions, each of which is a simple 
molecular process, as in 


A —> I — J —P 


where I and J represent intermediates in the reaction. Precise description of all of the 
elementary reactions in a process is necessary to define the overall reaction mecha- 
nism for A—>P. 

Let us assume that A—>P is an elementary reaction and that it is spontaneous and 
essentially irreversible. Irreversibility is easily assumed if the rate of P conversion to A 
is very slow or the concentration of P (expressed as [P]) is negligible under the condi- 
tions chosen. The velocity, v, or rate, of the reaction A—> is the amount of P formed 
or the amount of A consumed per unit time, t. That is, 


y= AP! or y= Al (13.1) 
The mathematical relationship between reaction rate and concentration of reactant(s) 
is the rate law. For this simple case, the rate law is 
—d[A] 
aa k{A] (13.2) 
From this expression, it is obvious that the rate is proportional to the concentration 
of A, and k is the proportionality constant, or rate constant. The units of k are (time)“!, 
usually s~!. v is a function of [A] to the first power, or in the terminology of kinetics, 
vis first-order with respect to A. For an elementary reaction, the order for any reactant 
is given by its exponent in the rate equation. The number of molecules that must 
simultaneously interact is defined as the molecularity of the reaction. Thus, the simple 
elementary reaction of A—> is a first-order reaction. Figure 13.4 portrays the course 
of a first-order reaction as a function of time. The rate of decay of a radioactive 
isotope, like !4C or ¥P, is a first-order reaction, as is an intramolecular rearrangement, 
such as A—>P. Both are unimolecular reactions (the molecularity equals 1). 
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100 


Slope of tangent to the 


B line at any point = d[A]/dt 


% A remaining 


13.2b Bimolecular Reactions Are Reactions 
Involving Two Reactant Molecules 


Consider the more complex reaction, where two molecules must react to yield products: 
A + B—>P+Q 


Assuming this reaction is an elementary reaction, its molecularity is 2; that is, it is a 
bimolecular reaction. The velocity of this reaction can be determined from the rate of 
disappearance of either A or B, or the rate of appearance of P or Q: 


—d[A] _—d[B] _a[P] _ d[Q] 
d dt d dt gaa) 


y = 


The rate law is 
v = k[A][B] (13.4) 


Since A and B must collide in order to react, the rate of their reaction will be 
proportional to the concentrations of both A and B. Because it is proportional to the 
product of two concentration terms, the reaction is second-order overall, first- 
order with respect to A and first-order with respect to B. Were the elementary 
reaction 2A—>P + Q, the rate law would be v = k[AP, second-order overall and 
second-order with respect to A. Second-order rate constants have the units of 
(concentration) !(time)~!, asin M~! s7!. 

Molecularities greater than 2 are rarely found and those greater than 3, never. The 
likelihood of simultaneous collision of three molecules is very, very small. When the 
overall stoichiometry of a reaction is greater than two (for example, as in A + B + 
C—> or 2A + B—>), the reaction almost always proceeds via unimolecular or bimo- 
lecular elementary steps, and the overall rate obeys a simple first- or second-order 
rate law. 

At this point, it may be useful to remind ourselves of an important caveat that is the 
first principle of kinetics: Kinetics cannot prove a hypothetical mechanism. Kinetic 
experiments can only rule out various alternative hypotheses because they do not fit the 
data. However, through thoughtful kinetic studies, a process of elimination of alternative 
hypotheses leads ever closer to the reality. 


13.2c Catalysts Lower the Free Energy of Activation for a Reaction 


In a first-order chemical reaction, the conversion of A to P occurs because, at any 
given instant, a fraction of the A molecules has the energy necessary to achieve a 
reactive condition known as the transition state. In this state, the probability is very 
high that the particular rearrangement accompanying the A—>P transition will 
occur. This transition state sits at the apex of the energy profile in the energy diagram 


Figure 13.4 Plot of the course of a first-order 
reaction. The half-time, żin, is the time for one-half 
of the starting amount of A to disappear. 
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Figure 13.5 Energy diagram for a chemical reaction (A—>P) and the effects of (a) raising the 
temperature from T; to T» or (b) adding a catalyst. 


describing the energetic relationship between A and P (Figure 13.5). The average free 
energy of A molecules defines the initial state, and the average free energy of P 
molecules is the final state along the reaction coordinate. The rate of any chemical 
reaction is proportional to the concentration of reactant molecules (A in this case) 
having this transition-state energy. The higher this energy is above the average energy, 
the smaller the fraction of molecules that will have this energy and the slower the 
reaction will proceed. The height of this energy barrier is called the free energy of 
activation, AG". Specifically, AG? is the energy required to raise the average energy of 
1 mol of reactant (at a given temperature) to the transition-state energy. The 
relationship between activation energy and the rate constant of the reaction, k, is 
given by the Arrhenius equation: 


k = AeAG RT (13.5) 


where A is a constant for a particular reaction (not to be confused with the reactant 
species, A, that we’re discussing). Another way of writing this is 1/k = (1/A)e4@/87, 
That is, k is inversely proportional to eAGIRT, Therefore, if the energy of activation 
decreases, the reaction rate increases. 


13.2d Decreasing AG# Increases Reaction Rate 


We are familiar with two general ways that rates of chemical reactions may be ac- 
celerated. First, the temperature can be raised. This will increase the kinetic energy 
of reactant molecules, and more reactant molecules will possess the energy to reach 
the transition state (Figure 13.5a). In effect, increasing the average energy of reactant 
molecules makes the energy difference between the average energy and the 
transition-state energy smaller. (Also note that the equation k = Ae~4@'RT demon- 
strates that k increases as T increases.) The rates of many chemical reactions are 
doubled by a 10° C rise in temperature. Second, the rates of chemical reactions can 
also be accelerated by catalysts. Catalysts work by lowering the energy of activation 
rather than by raising the average energy of the reactants (Figure 13.5b). Catalysts 
accomplish this remarkable feat by combining transiently with the reactants in a way 
that promotes their entry into the reactive, transition-state condition. Two aspects of 
catalysts are worth noting: (1) They are regenerated after each reaction cycle (A—~P), 
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and therefore can be used over and over again; and (2) catalysts have no effect on the 
overall free energy change in the reaction, the free energy difference between A 
and P (Figure 13.5b). 


13.3 | What Equations Define the Kinetics 
of Enzyme-Catalyzed Reactions? 


Examination of the change in reaction velocity as the reactant concentration is varied 
is one of the primary measurements in kinetic analysis. Returning to A—»P, a plot of 
the reaction rate as a function of the concentration of A yields a straight line whose 
slope is k (Figure 13.6). The more A that is available, the greater the rate of 
the reaction, v. Similar analyses of enzyme-catalyzed reactions involving only a single 
substrate yield remarkably different results (Figure 13.7). At low concentrations of the 
substrate S, v is proportional to [S], as expected for a first-order reaction. However, v 
does not increase proportionally as [S] increases, but instead begins to level off. At 
high [S], v becomes virtually independent of [S] and approaches a maximal limit. The 
value of v at this limit is written Vmax- Because rate is no longer dependent on [S] at 
these high concentrations, the enzyme-catalyzed reaction is now obeying zero-order 
kinetics; that is, the rate is independent of the reactant (substrate) concentration. This 
behavior is a saturation effect: When y shows no increase even though [S] is increased, 
the system is saturated with substrate. Such plots are called substrate saturation 
curves. The physical interpretation is that every enzyme molecule in the reaction 
mixture has its substrate-binding site occupied by S. Indeed, such curves were the 
initial clue that an enzyme interacts directly with its substrate by binding it. 


13.3a The Substrate Binds at the Active Site of an Enzyme 


An enzyme molecule is often (but not always) orders of magnitude larger than its 
substrate. In any case, its active site, the place on the enzyme where S binds, comprises 
only a portion of the overall enzyme structure. The conformation of the active site 
is structured to form a special pocket or cleft whose three-dimensional architecture is 
complementary to the structure of the substrate. The enzyme and the substrate 
molecules recognize each other through this structural complementarity. The 
substrate binds to the enzyme through relatively weak forces—H bonds, ionic bonds 
(salt bridges), and van der Waals interactions between sterically complementary 
clusters of atoms. 


v= Vinax = 


Substrate 
molecule 


a _-Active site 


Enzyme 


WO molecule 


Substrate concentration, [S] 


Slope = k 


Velocity, v 


Reactant concentration, [A] 


Figure 13.6 A plot of v versus [A] for the 
unimolecular chemical reaction, A—>P, yields a 
straight line having a slope equal to k. 


Figure 13.7 Substrate saturation curve for an 
enzyme-catalyzed reaction. The amount of enzyme 
is constant, and the velocity of the reaction is 
determined at various substrate concentrations. The 
reaction rate, v, as a function of [S] is described 
mathematically by a rectangular hyperbola. The 
H20 molecule provides a rough guide to scale. 
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Concentration 


[Substrate ] 
[Product] 


[Product] 


Concentration 


Time 


Figure 13.8 Time course for a typical enzyme- 
catalyzed reaction obeying the Michaelis-Menten, 
Briggs—Haldane models for enzyme kinetics. The 
early stage of the time course is shown in greater 
magnification in the bottom graph. 


13.3b The Michaelis-Menten Equation Is the Fundamental 
Equation of Enzyme Kinetics 


Leonor Michaelis and Maud L. Menten proposed a general theory of enzyme action 
in 1913 consistent with observed enzyme kinetics. Their theory was based on the 
assumption that the enzyme, E, and its substrate, S, associate reversibly to form an 
enzyme-substrate complex, ES: 


kı 
E + S= ES (13.6) 
kı 


This association/dissociation is assumed to be a rapid equilibrium, and Kg is the 
enzyme-substrate dissociation constant. At equilibrium, 


k_,[ES] = k,[E][S] (13.7) 
and 
_ [ENS] kı 
K,= TEST = > (13.8) 


Product, P, is formed in a second step when ES breaks down to yield E + P. 


kı ky 
E+ S=—ES—>E+ P (13.9) 
kı 


E is then free to interact with another molecule of S. 


13.3c Rate Calculations Assume That [ES] Remains Constant 
During an Enzymatic Reaction 


The interpretations of Michaelis and Menten were refined and extended in 1925 by 
Briggs and Haldane, who assumed the concentration of the enzyme-substrate com- 
plex ES quickly reaches a constant value in such a dynamic system. That is, ES is 
formed as rapidly from E + S as it disappears by its two possible fates: dissociation 
to regenerate E + S and reaction to form E + P. This assumption is termed the 
steady-state assumption and is expressed as 


d[ES] 
dt 


=0 (13.10) 


That is, the change in concentration of ES with time, ¢, is 0. Figure 13.8 illustrates the 
time course for formation of the ES complex and establishment of the steady-state 
condition. 


13.3d A Calculations Are Simplified If Velocity Measurements 
Are Made Immediately after Adding S 


One other simplification will be advantageous. Because enzymes accelerate the rate 
of the reverse reaction as well as the forward reaction, it would be helpful to ignore 
any reverse reaction by which E + P might form ES. The velocity of this back reaction 
would be given by v = k_,[E][P]. However, if we observe only the initial velocity for the 
reaction immediately after E and S are mixed in the absence of P, the rate of any back 
reaction is negligible because its rate will be proportional to [P], and [P] is essen- 
tially 0. Given such simplification, we now analyze the system described by 
Equation 13.9 in order to describe the initial velocity v as a function of [S] and amount 
of enzyme. 
The total amount of enzyme is fixed and is given by the formula 


Total enzyme, [Ey] = [E] + [ES] (13.11) 
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where [E] is free enzyme and [ES] is the amount of enzyme in the enzyme-substrate 
complex. From Equation 13.9, the rate of [ES] formation is 


vy = ki(lEr] — [ESDIS] 


where 
[Er] — [ES] = [E] (13.12) 
From Equation 13.9, the rate of [ES] disappearance is 
va = k_\[ES] + k[ES] = (k-; + k2)[ES] (13.13) 
At steady state, d[ES]/dt = 0, and therefore, vy = vy. So, 
ki([Er] — [ES)[S] = (k-ı + ka)[ES] (13.14) 


Rearranging gives 


((Eq] aes = Ct kə) (13.15) 


13.3e The Michaelis Constant, Km, Is Defined as (k_, + kz)/kı 

The ratio of constants (k_; + k2)/k; is itself a constant and is defined as the Michaelis 

constant, Km 

(k-i + ky) 
ky 

Note from Equation 13.15 that K,, is given by the ratio of two concentrations 

(([Er] — [ES]) and [S]) to one ({ES]), so Km has the units of molarity. (Also, because the 


units of k_; and k, are in time! and the units of kı are M~'time™, the units of Km are 
M.) From Equation 13.15, we can write 


(LEr] — [ES)[S] 
[ES] 


Kn = (13.16) 


= Kn (13.17) 


which rearranges to 

[Er][S] 
Kn + [S] 
Now, the most important parameter in the kinetics of any reaction is the rate of 
product formation. This rate is given by 


[ES] = (13.18) 


_ d[P] 
v= di (13.19) 
and for this reaction 
v = k[ES] (13.20) 


Substituting the expression for [ES] from Equation 13.18 into Equation 13.20 gives 


_ AKIErIS] 


ae (13.21) 


The product k2[E7] has special meaning. When [S] is high enough to saturate all of the 
enzyme, the velocity of the reaction, v, is maximal. At saturation, the amount of [ES] 
complex is equal to the total enzyme concentration, Er, its maximum possible value. 
From Equation 13.20, the initial velocity v then equals k[Er] = Vmax. Written 
symbolically, when [S] >> [Er] (and Km), [Er] = [ES] and v = Vmax. Therefore, 


Vinax = k[Eq] (13.22) 


Substituting this relationship into the expression for v gives the Michaelis-Menten 
equation: 


Vonal S] 
a 


el (13.23) 
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This equation says that the initial rate of an enzyme-catalyzed reaction, v, is deter- 
mined by two constants, Km and Vmax, and the initial concentration of substrate. 


13.3f When [S] = Km, V = Vmax/2 


We can provide an operational definition for the constant K, by rearranging 


Equation 13.23 to give 
Km — [S] (- max 1) 


Then, at v = Vinax/2, Km = [S]. That is, Km is defined by the substrate concentration 
that gives a velocity equal to one-half the maximal velocity. Table 13.3 gives the Km 
values of some enzymes for their substrates. 


(13.24) 


13.3g Plots of v versus [S] Illustrate the Relationships 
between Vmax) Km, and Reaction Order 


The Michaelis-Menten equation (Equation 13.23) describes a curve known from 
analytical geometry as a rectangular hyperbola. In such curves, as [S] is increased, 
v approaches the limiting value, Vmax, in an asymptotic fashion. Vmax can be approx- 
imated experimentally from a substrate saturation curve (Figure 13.7), and K,, can be 
derived from Vmax/2, so the two constants of the Michaelis-Menten equation can be 
obtained from plots of v versus [S]. Note, however, that actual estimation of Vmax, and 
consequently Km, is only approximate from such graphs. That is, according to 
Equation 13.23, to get v = 0.99 Vnax» [S] must equal 99 Km, a concentration that may 
be difficult to achieve in practice. 

From Equation 13.23, when [S] >> Km, then v = Vmax- That is, v is no longer depen- 
dent on [S], so the reaction is obeying zero-order kinetics. Also, when [S] < Km, then 
v~(Vinax/Km)[S]. That is, the rate, v, approximately follows a first-order rate equation, 
v = k'[S], where k' = Vmax’ Km. 

Kmand Vmax once known explicitly, define the rate of the enzyme-catalyzed reaction, 
provided that: 


1. The reaction involves only one substrate, or if the reaction is multisubstrate, the 
concentration of only one substrate is varied and the concentrations of all other 
substrates are held constant. 


| Table 13.3 | Km Values for Some Enzymes 


Enzyme Substrate Km (mM) 
Carbonic anhydrase CO, 12 
Hexokinase Glucose 0.15 
Fructose 1.5 
B-Galactosidase Lactose 4 
Aspartate aminotransferase Aspartate 0.9 
a-Ketoglutarate 0.1 
Oxaloacetate 0.04 
Glutamate 4 
Threonine deaminase Threonine 5 
Pyruvate carboxylase HCO,- 1.0 
Pyruvate 0.4 
ATP 0.06 
Lysozyme Hexa-N-acetylglucosamine 0.006 
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2. The reaction ES—>E + P is irreversible, or the experiment is limited to observing TP EREN MIN Values of k. (Turnover 
= ‘cat 


only initial velocities where [P] = 0. Number) for Some Enzymes 


3. [S]o > [Er] and [Eq] is held constant. 
4. All other variables that might influence the rate of the reaction (temperature, pH, _ Enzyme kat (S~?) 
ionic strength, and so on) are held constant. Catalase 40,000,000 
Carbonic anhydrase 1,000,000 
13.3h Turnover Number Defines the Activity Acetylcholinesterase 14,000 
of One Enzyme Molecule Penicillinase 2000 
The turnover number of an enzyme, Keat, is a measure of its maximal catalytic activity. Lactate dehydrogenase 1000 
Keat is defined as the number of substrate molecules converted into product per enzyme Chymotrypsin 100 
molecule per unit time when the enzyme is saturated with substrate. The turnover DNA polymerase I 15 
number is also referred to as the molecular activity of the enzyme. For the simple 
Lysozyme 0.5 


Michaelis-Menten reaction (Equation 13.9) under conditions of initial velocity 
measurements, ky = keat. Provided the concentration of enzyme, [Er], in the reaction 
mixture is known, Keat can be determined from Vmax. At saturating [S], v = Vmax = ko 
[Er]. Thus, 


ky = Ft = ha (13.25) 
The term keat represents the kinetic efficiency of the enzyme. Table 13.4 lists turnover 
numbers for some representative enzymes. Catalase has the highest turnover number 
known; each molecule of this enzyme can degrade 40 million molecules of H20; in 
1 second! At the other end of the scale, lysozyme requires 2 seconds to cleave a glyco- 
sidic bond in its glycan substrate. 

In many situations, the actual molar amount of the enzyme is not known. However, 
its amount can be expressed in terms of the activity observed. The International 
Commission on Enzymes defines one international unit as the amount that catalyzes the 
formation of 1 micromole of product in 1 minute. (Because enzymes are very sensitive to 
factors such as pH, temperature, and ionic strength, the conditions of assay must be 
specified.) In the process of purifying enzymes from cellular sources, many extraneous 
proteins may be present. Then, the units of enzyme activity are expressed as enzyme 
units per mg protein, a term known as specific activity (see Table 5.1). 


13.31 The Ratio, k.a/Km, Defines the Catalytic Efficiency of an Enzyme 


Under physiological conditions, [S] is seldom saturating and Keat itself is not particu- 
larly informative. That is, the in vivo ratio of [S]/K,, usually falls in the range of 0.01 
to 1.0, so active sites often are not filled with substrate. Nevertheless, we can derive a 
meaningful index of the efficiency of Michaelis-Menten-type enzymes under these 
conditions by using the following equations. As presented in Equation 13.23, if 


y= Vnaxl S| 
Kn + [S] 
and Vmax = Keat [Er], then 
pans keal Er lS] 
v= K. + [S] (13.26) 
When [S] << Km, the concentration of free enzyme, [E], is approximately equal to 
[Er], so 
= (=) E][S (13.27) 
uh [E][S] 


That is, kcat/Km is an apparent second-order rate constant for the reaction of E and S 
to form product. Because Km is inversely proportional to the affinity of the enzyme 
for its substrate and Keat is directly proportional to the kinetic efficiency of the en- 
zyme, kcat/Km provides an index of the catalytic efficiency of an enzyme operating at 
substrate concentrations substantially below saturation amounts. 
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An interesting point emerges for the cases where keat = k2. Then 
Kea _ Kiko 


ene ae 13.2 
Ka k-1 +k ae) 


But kı must always be greater than or equal to kk>/(k_, + k2). That is, the reaction can 
go no faster than the rate at which E and S come together. Thus, k; sets the upper limit 
for Kea/Km. In other words, the catalytic efficiency of an enzyme cannot exceed the 
diffusion-controlled rate of combination of E and S to form ES. In H30, the rate constant 
for such diffusion is approximately 10°/M-s for small substrates (for example, 
glyceraldehyde 3-P) and an order of magnitude smaller (~10*/M - s) for substrates the 
size of nucleotides. Those enzymes that are most efficient in their catalysis have keat/ 
Kn ratios approaching this value. Their catalytic velocity is limited only by the rate at 
which they encounter S; enzymes this efficient have achieved the so-called catalytic 
perfection. All E and S encounters lead to reaction because such catalytically perfect 
enzymes can channel S to the active site, regardless of where S hits E. Table 13.5 lists 
the kinetic parameters of several enzymes in this category. Note that keat and Km both 
show a substantial range of variation in this table, even though their ratio falls around 
108/M - s. 


13.3} Linear Plots Can Be Derived from 
the Michaelis-Menten Equation 


Because of the hyperbolic shape of v versus [S] plots, Vmax can be determined only 
from an extrapolation of the asymptotic approach of v to some limiting value as 
[S] increases indefinitely (Figure 13.7); and Km is derived from that value of [S], 
giving v = Vyax/2. However, several rearrangements of the Michaelis-Menten 
equation transform it into a straight-line equation. The best known of these is the 
Lineweaver—Burk double-reciprocal plot. Taking the reciprocal of both sides of the 
Michaelis—Menten equation, Equation 13.23, yields the equality 


1 Ka \/ 1) 1 
y 7 (=e) l Vmax (13.29) 


This conforms to y = mx + b (the equation for a straight line), where y = 1/v; m, the 
slope, is Km/ Vmax; x = I/[S]; and b = 1/Vmax. Plotting 1/v versus 1/[S] gives a straight 
line whose x-intercept is —1/Km, whose y-intercept is 1/Vinax, and whose slope is 
Km/ Vmax (Figure 13.9). 


| Table 13.5 | Enzymes Whose keat/Km Approaches the Diffusion-Controlled 


Rate of Association with Substrate 


Kat Kii Keatl Km 
Enzyme Substrate (s') (M) (M~'s~') 
Acetylcholinesterase Acetylcholine 1.4x 104 9x10 1.6X 108 
Carbonic anhydrase CO, 1x106 0.012 8.3x107 
HCO; 4x105 0.026 1.5x107 
Catalase H,O; 4x10 Ll 4x 107 
Crotonase Crotonyl-CoA 5.7X 103 2x10 2.8108 
Fumarase Fumarate 800 5x107 1.6Xx108 
Malate 900 2.5X1075 3.6X107 
Triosephosphate Glyceraldehyde- 4.3x103 1.81075 2.4x 108 
isomerase 3-phosphate* 
B-Lactamase Benzylpenicillin 2x10? 2x10 1x108 


*Km for glyceraldehyde-3-phosphate is calculated on the basis that only 3.8% of the substrate in solution is unhydrated 
and therefore reactive with the enzyme. 
Adapted from Fersht, A., Enzyme Structure and Mechanism, 2nd ed. New York: W. H. Freeman, 1985. 
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The Hanes—Woolf plot is another rearrangement of the Michaelis-Menten equation 
that yields a straight line: 
Multiplying both sides of Equation 13.29 by [S] gives 


[S] Kn \(1\, [S] _ Km , [S] 
v = s (7e) Fa Ta Ta (13.30) 
and 
a (+) sj + A 13.31 
v p Vinx l l Vins ( = ) 


Graphing [S]/v versus [S] yields a straight line where the slope is 1/Vinax, the y-intercept 
is Kn/Vinax, and the x-intercept is —K,, as shown in Figure 13.10. The Hanes—Woolf 
plot has the advantage of not overemphasizing the data obtained at low [S], a fault in- 
herent in the Lineweaver—Burk plot. The common advantage of these plots is that they 
allow both Km and Vmax to be accurately estimated by extrapolation of straight lines 
rather than asymptotes. Computer fitting of v versus [S] data to the Michaelis-Menten 
equation is more commonly done than graphical plotting. 
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v max 


Vin 


r K, 
ee yintercept= — 


max 


[S] 


447 


Figure 13.9 The Lineweaver—Burk double-reciprocal 
plot. 


Figure 13.10 A Hanes—Woolf plot of [S]/v versus [S]. 
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A Deeper Look 


An Example of the Effect of Amino Acid Substitutions on Km and keat: 
Wild-Type and Mutant Forms of Human Sulfite Oxidase 


Mammalian sulfite oxidase is the last enzyme in the pathway for 
degradation of sulfur-containing amino acids. Sulfite oxidase (SO) 
catalyzes the oxidation of sulfite (SO3?~) to sulfate (S047), using 
the heme-containing protein, cytochrome c, as electron acceptor: 


SO; + 2 cytochrome Coxidizea + HyO—= 
SO, + 2 cytochrome Creduceg + 2 H* 


Isolated sulfite oxidase deficiency is a rare and often fatal genetic 
disorder in humans. The disease is characterized by severe neurolog- 
ical abnormalities, revealed as convulsions shortly after birth. 
R. M. Garrett and K. V. Rajagopalan at Duke University Medical 
Center have isolated the human cDNA for sulfite oxidase from 
the cells of normal (wild-type) and SO-deficient individuals. The 
expression of these SO cDNAs in transformed Escherichia coli cells 
allowed the isolation and kinetic analysis of wild-type and mutant 
forms of SO, including one (designated R160Q) in which the Arg 
at position 160 in the polypeptide chain is replaced by Gln. A 
genetically engineered version of SO (designated R160K) in which 


Kinetic Constants for Wild-Type and Mutant Sulfite Oxidase 


Enzyme Km f (uM) kat (57°) kat/Km (10° M™' s7’) 
Wild-type 17 18 1.1 
R160Q 1900 3 0.0016 
R160K 360 5.5 0.015 


Replacing R!® in sulfite oxidase by Q increases Km, decreases 
Kat; and markedly diminishes the catalytic efficiency (Keat Km) of 
the enzyme. The R160K mutant enzyme has properties intermedi- 
ate between wild-type and the R160Q mutant form. The substrate, 
SO;7-, is strongly anionic, and R!® is one of several Arg residues 
situated within the SO substrate-binding site. Positively charged 
side chains in the substrate-binding site facilitate SO;?~ binding and 
catalysis, with Arg being optimal in this role. 


Lys replaces Arg!® was also studied. 


13.3k Nonlinear Lineweaver—Burk or Hanes—Woolf 
Plots Are a Property of Regulatory Enzymes 


If the kinetics of the reaction disobey the Michaelis-Menten equation, the violation 
is revealed by a departure from linearity in these straight-line graphs. We shall see in 
Chapter 15 that such deviations from linearity are characteristic of the kinetics of 
regulatory enzymes known as allosteric enzymes. Such regulatory enzymes are very 
important in the overall control of metabolic pathways. 


13.31 Enzymatic Activity Is Strongly Influenced by pH 


Enzyme-substrate recognition and the catalytic events that ensue are greatly 
dependent on pH. An enzyme possesses an array of ionizable side chains and 
prosthetic groups that not only determine its secondary and tertiary structure, but 
may also be intimately involved in its active site. Furthermore, the substrate itself 
often has ionizing groups, and one or another of the ionic forms may preferentially 
interact with the enzyme. Enzymes in general are active only over a limited pH range, 
and most have a particular pH at which their catalytic activity is optimal. These 
effects of pH may be due to effects on Km or Vmax or both. Figure 13.11 illustrates the 
relative activity of four enzymes as a function of pH. Trypsin, an intestinal protease, has 
a slightly alkaline pH optimum, whereas pepsin, a gastric protease, acts in the acidic 
confines of the stomach and has a pH optimum near 2. Papain, a protease found in 
papaya, is relatively insensitive to pHs between 4 and 8. Cholinesterase activity is pH 
sensitive below pH 7 but not between pH 7 and 10. The cholinesterase activity-pH 
profile suggests that an ionizable group with a pK, near six is essential to its activity. 
Might this group be a histidine side chain within its active site? Although the pH 
optimum of an enzyme often reflects the pH of its normal environment, the optimum 
may not be precisely the same. This difference suggests that the pH-activity response 
of an enzyme may be a factor in the intracellular regulation of its activity. 


13.3m The Response of Enzymatic Activity to Temperature Is Complex 


Like most chemical reactions, the rates of enzyme-catalyzed reactions generally 
increase with increasing temperature. However, at temperatures above 50°C to 60°C, 
enzymes typically show a decline in activity (Figure 13.12). Several effects are operating 
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Papain Cholinesterase 

Optimum pH of some enzymes 

5 
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x 

s Pepsin 1.5 

3 Catalase 7.6 

Trypsin Tl 
Fumarase 7.8 
Ribonuclease 7.8 
Arginase 9.7 


Figure 13.11 The pH activity profiles of four different enzymes. 


here: (1) the characteristic increase in reaction rate with temperature; (2) a temperature- 
dependent equilibrium between active enzyme (E,ctive) and the catalytically inactive 
but not denatured state of the enzyme (Einact); and (3) thermal denaturation of protein 
structures at higher temperatures. An equilibrium model of these effects proposed by 
Roy M. Daniel and Michael J. Danson fits well with experimentally observed responses 
of enzyme activity versus temperature, 


Keq k inact 


Eactive = Emt —— X ad 3 .32) 


where X designates the irreversibly inactivated form of the enzyme and kjnact 1s the 
first-order rate constant for thermal denaturation. Most enzymatic reactions double 
in rate for every 10°C rise in temperature (that is, Qio = 2, where Qio is defined as the 
ratio of activities at two temperatures 10 degrees Celsius apart) as long as the enzyme is 
stable and fully active. Some enzymes, such as those catalyzing reactions having very 
high activation energies, show proportionally greater Qio values. The increasing rate 
with increasing temperature is ultimately offset by the temperature-dependent 
equilibrium between active and inactive states of the native enzyme, and, ultimately, 
the denaturation of protein structure at elevated temperatures. The fit of this model 
with the experimental data supports the existence of Einact. Indeed, Einact reflects the 
intrinsic flexibility, and, hence, greater thermal sensitivity of the active site, compared 
with the rest of the enzyme’s structure. Not all enzymes are quite so thermally labile. 
For example, the enzymes of thermophilic prokaryotes (thermophilic = heat-loving) 
found in geothermal springs retain full activity at temperatures in excess of 85°C. 


13.4 | What Can Be Learned from the Inhibition 
of Enzyme Activity? 


Systematic perturbations are a basic tool of experimental scientists; much can be 
learned about the normal workings of any system by inducing changes in it and then 
observing the effects of the change. If the velocity of an enzymatic reaction is 
decreased or inhibited by some agent, the kinetics of the reaction obviously have been 
perturbed. The study of enzyme inhibition has contributed significantly to our 
understanding of enzymes. 


13.4a Enzymes May Be Inhibited Reversibly or Irreversibly 


Enzyme inhibitors are classified in several ways. The inhibitor may interact either 
reversibly or irreversibly with the enzyme. Reversible inhibitors interact with the en- 
zyme through noncovalent association/dissociation reactions. In contrast, irreversible 
inhibitors usually form covalent bonds with side chains or prosthetic groups in the 
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Figure 13.12 The effect of temperature on enzyme 
activity. 
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enzyme. That is, the consequence of irreversible inhibition is a decrease in the 
concentration of active enzyme. The kinetics observed are consistent with this inter- 
pretation, as will be discussed later in this chapter. 


13.4b Reversible Inhibitors May Bind at the Active Site 
or at Some Other Site 


Reversible inhibitors fall into three major categories: competitive, noncompetitive, and 
uncompetitive. Competitive inhibitors are characterized by the fact that the substrate 
and inhibitor compete for the same binding site on the enzyme, the so-called active site 
or substrate-binding site. Thus, increasing the concentration of S favors the likelihood 
of S binding to the enzyme instead of the inhibitor, I. That is, high [S] can overcome the 
effects of I. The effects of the other major types, noncompetitive and uncompetitive 
inhibition, cannot be overcome by increasing [S]. The three types can be distinguished 
by the particular patterns obtained when the kinetic data are analyzed in linear plots, 
such as double-reciprocal (Lineweaver—Burk) plots. A general formulation for common 
inhibitor interactions in our simple enzyme kinetic model would include 


E + IEI and/or I + ES=—IES (13.33) 


Competitive Inhibition Consider the following system: 


ky ky k 
E + S—ES—>E+P E + I5EI (13.34) 
kı k-3 


where an inhibitor, I, binds reversibly to the enzyme at the same site as S. S-binding 
and I-binding are mutually exclusive, competitive processes. Formation of the ternary 
complex, IES, where both S and I are bound, is physically impossible. This condition 
leads us to anticipate that S and I must share a high degree of structural similarity 
because they bind at the same site on the enzyme. Also notice that, in this model, EI 
does not react to give rise to E + P. That is, I is not changed by interaction with E. The 
rate of the product-forming reaction is v = k[ES]. 

It is revealing to compare the equation for the uninhibited case, Equation 13.23 (the 
Michaelis-Menten equation) with Equation 13.43 for the rate of the enzymatic reaction 
in the presence of a fixed concentration of the competitive inhibitor, [I] 


pen VmaxlS] 
e Fi] (13.35) 
y= Vinal S| T) (13.36) 
[S] + ra + m) 


(see also Table 13.6). The Km term in the denominator in the inhibited case is increased 
by the factor (1 + [I]/Kj); thus, v is less in the presence of the inhibitor, as expected. In 
the absence of I, the two equations are identical. Figure 13.13 shows a Lineweaver—Burk 


| Table 13.6 | IEX The Effect of Various Types of Inhibitors on the Michaelis-Menten Rate Equation and on Apparent Km and 


Apparent Vmax 


Inhibition Type Rate Equation Apparent K,, Apparent Vmax 
None v = Vorax{S|/(Km+[S]) Kn Va 
Competitive V= Vinax[SV(S]+ Km(1 +HIVKD) Ka(1+[0/K) Vaz 
Noncompetitive v = (Vinax{SVC.+[I/Kp)/(Kint [S]) Ka Vmax (1 +OK 
Mixed v= VraS +K )K mn + + [IVKy’)[S]) Ka(1+[0/KD/( +IVK:’) Vna (1 +IV/K:') 
Uncompetitive V = VraS Ka +S HINK) Ka/(1 +K’) Vna (1 +IVK 


K, is defined as the enzyme-inhibitor dissociation constant K,=[E][I]/[El]; K,’ is defined as the enzyme-substrate complex/inhibitor dissociation constant Kj’ =[ES][I]/[IES]. 
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Figure 13.13 Lineweaver—Burk plot of competitive inhibition, showing lines for no I, [I], and 2[I]. Note 
that when [S] is infinitely large (1/[S]=0), Vmax is the same, whether I is present or not. (right) Enzyme 
(grey) modeled with three binding sites: oval, substrate-binding site; triangle, uncompetitive inhibitor- 
binding site; and rectangle, noncompetitive inhibitor-binding site. Substrate, yellow; competitive inhibitor, 
orange. Note that S and competitive inhibitor compete for binding at the substrate-binding site. 


plot of competitive inhibition. Several features of competitive inhibition are evident. 
First, at a given [I], v decreases (1/v increases). When [S] becomes infinite, v = Vmax and 
is unaffected by I because all of the enzyme is in the ES form. Note that the value of the 
—x-intercept decreases as [I] increases. This —x-intercept is often termed the apparent 
Km (or Kmapp) because it is the Km apparent under these conditions. The diagnostic 
criterion for competitive inhibition is that Vmax is unaffected by I; that is, all lines share 
acommon j-intercept. This criterion is also the best experimental indication of binding 
at the same site by two substances. Competitive inhibitors resemble S structurally. 


A Deeper Look 


The Equations of Competitive Inhibition 
Given the relationships between E, S, and I described previously and Solving for [E] gives 


recalling the steady-state assumption that d[ES]/dt = 0, from Equa- 
tions (13.14) and (13.16) we can write [E] 


K[E][S] _ [E][S] 
(ky + k) K, 


m 


KıKnlEr] 
(KiKm + Ki[S] + Ka[ID 


(13.42) 


ES 


(13.37) Because the rate of product formation is given by v = k[ES], from 


Equation 13.34 we have 
Assuming that E + I=—EI reaches rapid equilibrium, the rate of 


k,[E][S 
EI formation, v’ = k3[E][I], and the rate of disappearance of EI, v= ee (13.43) 
vy’ = k_[EI], are equal. So, ™ 
KE] = k-[E1] (13.38) 50, 
(k.KiLEr]L$) 
Therefore, y= (KiKa m KS] +4 K,l1) (13.44) 
k 
[EI] = aa (EI (13.39) Because Vmax = ko[Ex], 
=3 
_ VmaxlS] 
If we define Kj as k_3/k3, an enzyme-inhibitor dissociation constant, K Kall] (13.45) 
then Ka + [S] + a 
pen = EW (13.40) of 
Kı 
v ; 
knowing [Er] = [E] + [ES] + [EI]. Then Is] + K (1 ee aN 
E EJI j 1 
nan pe (13.41) 


Kao K 
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Figure 13.14 Structures of succinate, the substrate 


of succinate dehydrogenase (SDH), and malonate, Competitive 
the competitive inhibitor. Fumarate (the product of Substrate Product inhibitor 
SDH acti inate) is also shown. 
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Succinate Dehydrogenase—A Classic Example of Competitive Inhibition The citric acid 
cycle enzyme succinate dehydrogenase (SDH) is competitively inhibited by malonate. 
Figure 13.14 shows the structures of succinate and malonate. The structural similar- 
ity between them is obvious and is the basis of malonate’s ability to mimic succinate 
and bind at the active site of SDH. However, unlike succinate, which is oxidized by 
SDH to form fumarate, malonate cannot lose two hydrogens; consequently, it is 
unreactive. 


Noncompetitive Inhibition Noncompetitive inhibitors interact with both E and ES (or 
with S and ES, but this is a rare and specialized case). In this case, the inhibitor is not 
binding to the same site as S, and the inhibition cannot be overcome by raising [S]. 
There are two types of noncompetitive inhibition: pure and mixed. 


Pure Noncompetitive Inhibition In this situation, the binding of I by E has no effect on 
the binding of S by E. That is, S and I bind at different sites on E, and binding of I 
does not affect binding of S. Consider the system 


1 


Kı I 
E + I==EI ES+I==IES (13.47) 


Pure noncompetitive inhibition occurs if K; = Kr’. This situation is relatively uncom- 
mon; the Lineweaver—Burk plot for such an instance is given in Figure 13.15. Note that 
Km is unchanged by I (the x-intercept remains the same, with or without I). Note also 
that the apparent Vmax decreases. A similar pattern is seen if the amount of enzyme in 
the experiment is decreased. Thus, it is as if I lowered [E]. 


Mixed Noncompetitive Inhibition In this situation, the binding of I by E influences the 
binding of S by E. Either the binding sites for I and S are near one another or confor- 
mational changes in E caused by I affect S binding. In this case, Kı and Kj’, as defined 
previously, are not equal. Both the apparent Km and the apparent Vmax are altered by 
the presence of I, and Ky/Vmax is not constant (Figure 13.16). This inhibitory pattern 
is commonly encountered. A reasonable explanation is that the inhibitor is binding at 
a site distinct from the active site yet is influencing the binding of S at the active site. 
Presumably, these effects are transmitted via alterations in the protein’s conformation. 
Table 13.6 includes the rate equations and apparent Km and Vmax values for both types 
of noncompetitive inhibition. 


Uncompetitive Inhibition Completing the set of inhibitory possibilities is uncompeti- 
tive inhibition. Unlike competitive inhibition (where I combines only with E) or non- 
competitive inhibition (where I combines with E and ES), in uncompetitive inhibition, 
I combines only with ES. 


’ 


I 
ES + I5 IES (13.48) 
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(a) (b) 


ES IES \ a 


Figure 13.15 Lineweaver—Burk plot of pure noncompetitive inhibition. (a) Enzyme (grey) modeled 
with three binding sites: oval, substrate-binding site; triangle, uncompetitive inhibitor-binding site; 
and rectangle, noncompetitive-inhibitor binding site. (b) 1/v versus 1/[S] plot for pure noncompetitive 
inhibition. Substrate, yellow; noncompetitive inhibitor, orange. Note that I does not alter Km but that 
it decreases Vmax- 


(a) Ky < Ky’ (b) Ki < Ky 


1 
+e —= 
aoe a al cae V 
os AW Se max 
one Ver _—— 
ee U e st 


1 
[S 
Figure 13.16 Lineweaver—Burk plot of mixed noncompetitive inhibition. Note that both intercepts 
and the slope change in the presence of I. (a) When K; is less than Kj’; (b) when K; is greater than Kj’. 


Because IES does not lead to product formation, the observed rate constant for 
product formation, k>, is uniquely affected. In simple Michaelis-Menten kinetics, k2 
is the only rate constant that is part of both Vmaxand Km. The pattern obtained 
in Lineweaver—Burk plots is a set of parallel lines (Figure 13.17). A clinically impor- 
tant example is the action of lithium in alleviating manic depression; Li* ions are 
uncompetitive inhibitors of myo-inositol monophosphatase. Some pesticides are 
also uncompetitive inhibitors, such as Roundup, an uncompetitive inhibitor of 
3-enolpyruvylshikimate-5-P synthase, an enzyme essential to aromatic amino acid 
biosynthesis. 


13.4c Enzymes Also Can Be Inhibited in an Irreversible Manner 


If the inhibitor combines irreversibly with the enzyme—for example, by covalent 
attachment—the kinetic pattern seen is like that of noncompetitive inhibition because 
the net effect is a loss of active enzyme. Usually, this type of inhibition can be distin- 
guished from the noncompetitive, reversible inhibition case because the reaction of I 
with E (and/or ES) is not instantaneous. Instead, there is a time-dependent decrease in 
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Figure 13.17 Lineweaver—Burk plot of uncompetitive 
inhibition. (a) Enzyme (grey) modeled with three 
binding sites: oval, substrate-binding site; triangle, 
uncompetitive inhibitor binding site; and rectangle, 
noncompetitive inhibitor binding site. Substrate = 
yellow; uncompetitive inhibitor = orange. (b) 1/v 
versus 1/[S] plot for uncompetitive inhibition. Note 
that both intercepts change but the slope (Km/ Vmax) 
remains constant in the presence of I. 


(a) 
a 
—— 
Ky 
ES TES 
(b) 
1 
D +I 
-I 
[I] 
I ae Ki 
Weave E 
Ti 
Ka 1 
oa 1 == 
7 7 V 
Pa O max 
A < 
1 
ig k TSI 
Ky’ Km 
K 


enzymatic activity as E + I—>EI proceeds, and the rate of this inactivation can be fol- 
lowed. Also, unlike reversible inhibitions, dilution or dialysis of the enzyme-inhibitor 
solution does not shift the equilibrium between E + I and dissociate the EI complex. 
Since dilution does not restore enzyme activity, the inhibitor must be irreversibly 
bound. 


Suicide Substrates—Mechanism-Based Enzyme Inactivators Suicide substrates are 
synthetic inhibitory substrate analogs designed so that, via normal catalytic action of 
the enzyme, a very reactive group is generated. This reactive group then forms a 
covalent bond with a nearby functional group within the active site of the enzyme, 
thereby causing irreversible inhibition. Suicide substrates, also called Trojan horse 
substrates, are a type of affinity label. As substrate analogs, they bind with specificity 
and high affinity to the enzyme active site; in their reactive form, they become 
covalently bound to the enzyme. This covalent link effectively labels a particular 
functional group within the active site, identifying the group as a key player in the 
enzyme’s catalytic cycle. 


Penicillin—A Suicide Substrate Several drugs in current medical use are mechanism- 
based enzyme inactivators. For example, the antibiotic penicillin exerts its effects by 
covalently reacting with an essential serine residue in the active site of glycopeptide 
transpeptidase, an enzyme that acts to crosslink the peptidoglycan chains during synthe- 
sis of bacterial cell walls (Figure 13.18). Penicillin consists of a thiazolidine ring fused to 
a -lactam ring to which a variable R group is attached. A reactive peptide bond in the 
-lactam ring covalently attaches to a serine residue in the active site of the glycopeptide 
transpeptidase. (The conformation of penicillin around its reactive peptide bond 
resembles the transition state of the normal glycopeptide transpeptidase substrate.) The 
penicilloyl-enzyme complex is catalytically inactive. Once cell wall synthesis is blocked, 
the bacterial cells are very susceptible to rupture by osmotic lysis and bacterial growth 
is halted. 
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Penicillin 


Active enzyme 


Figure 13.18 Penicillin is an irreversible inhibitor of 
the enzyme glycopeptide transpeptidase, also known 


Penicilloyl-enzyme complex as glycoprotein peptidase, which catalyzes an 
(enzymatically inactive) essential step in bacterial cell wall synthesis. 


13.5 | What Is the Kinetic Behavior of Enzymes 
Catalyzing Bimolecular Reactions? 


Thus far, we have considered only the simple case of enzymes that act upon a single 
substrate, S. This situation is not common. Usually, enzymes catalyze reactions in 
which two (or even more) substrates take part. 

Consider the case of an enzyme catalyzing a reaction involving two substrates, A and 
B, and yielding the products P and Q: 


A+B==P+Q (13.49) 


Enzymatic reactions involving two substrates are called bisubstrate reactions. In gen- 
eral, bisubstrate reactions proceed by one of two possible routes: 


1. Both A and B are bound to the enzyme and then the reaction occurs to give P + Q: 
E+ A + B—>AEB—> PEQ—>E+P+Q (13.50) 


Reactions of this type are defined as sequential or single-displacement reactions. 

They can be either of two distinct classes: 

a. random, where either A or B may bind to the enzyme first, followed by the other 
substrate, or 

b. ordered, where A, designated the /eading substrate, must bind to E first before B 
can be bound. 

Both classes of single-displacement reactions are characterized by lines that intersect 

to the left of the 1/v axis in Lineweaver—Burk plots where the rates observed with 

different fixed concentrations of one substrate (B) are graphed versus a series of 

concentrations of A (Figure 13.19). 
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Figure 13.19 Single-displacement bisubstrate 
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Human Biochemistry 


Viagra—An Unexpected Outcome in a Program of Drug Design 


Prior to the accumulation of detailed biochemical information on 
metabolism, enzymes, and receptors, drugs were fortuitous discov- 
eries made by observant scientists; the discovery of penicillin as a 
bacteria-killing substance by Fleming is an example. Today, drug 
design is the rational application of scientific knowledge and prin- 
ciples to the development of pharmacologically active agents. A 
particular target for therapeutic intervention is identified (such as an 
enzyme or receptor involved in illness), and chemical analogs of its 
substrate or ligand are synthesized in hopes of finding an inhibitor 
(or activator) that will serve as a drug to treat the illness. Sometimes 
the outcome is unanticipated, as the story of Viagra (sildenafil ci- 
trate) reveals. 

When the smooth muscle cells of blood vessels relax, blood flow 
increases and blood pressure drops. Such relaxation is the result of 
decreases in intracellular [Ca*] triggered by increases in intracellular 
[cGMP] (which in turnis triggered by nitric oxide, NO; see Chapter 32). 
Cyclic GMP (cGMP) is hydrolyzed by phosphodiesterases to form 
5'-GMP, and the muscles contract again. Scientists at Pfizer rea- 
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soned that, if phosphodiesterase inhibitors could be found, they 
might be useful drugs to treat angina (chest pain due to inadequate 
blood flow to heart muscle) or hypertension (high blood pressure). 
The phosphodiesterase (PDE) prevalent in vascular muscle is PDE 
5, one of at least eleven different subtypes of PDE in human cells. 
The search was on for substances that inhibit PDE 5, but not the 
other prominent PDE types, and Viagra was found. Disappoint- 
ingly, Viagra showed no significant benefits for angina or hyper- 
tension, but some men in clinical trials reported penile erection. 
Apparently, Viagra led to an increase in [cGMP] in penile vascular 
tissue, allowing vascular muscle relaxation, improved blood flow, 
and erection. A drug was born. 

In a more focused way, detailed structural data on enzymes, re- 
ceptors, and the ligands that bind to them has led to rational drug 
design, in which computer modeling of enzyme-ligand interactions re- 
places much of the initial chemical synthesis and clinical prescreen- 
ing of potential therapeutic agents, saving much time and effort in 
drug development. 
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Note the structural similarity between 
cGMP (left) and Viagra (right). 
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2. The other general possibility is that one substrate, A, binds to the enzyme and reacts 
with it to yield a chemically modified form of the enzyme (E’) plus the product, P. 
The second substrate, B, then reacts with E’, regenerating E and forming the other 
product, Q. 


CATAS EE eal i 
P B 


Reactions that fit this model are called ping-pong or double-displacement reactions. 
Two distinctive features of this mechanism are the obligatory formation of a modified 
enzyme intermediate, E’, and the pattern of parallel lines obtained in double- 
reciprocal plots of the rates observed with different fixed concentrations of one 
substrate (B) versus a series of concentrations of A (see Figure 13.22). 


13.5a The Conversion of AEB to PEQ Is the Rate-Limiting 
Step in Random, Single-Displacement Reactions 


In this type of sequential reaction, all possible binary enzyme-substrate complexes 
(AE, EB, PE, EQ) are formed rapidly and reversibly when the enzyme is added to a 
reaction mixture containing A, B, P, and Q: 


A+ E = AE EP == P +E 
NS fi 


AEB == PEQ 


re A eer 


The rate-limiting step is the reaction AEB—>+PEQ. It does not matter whether A or 
B binds first to E, or whether Q or P is released first from QEP. Sometimes, reac- 
tions that follow this random order of addition of substrates to E can be distin- 
guished from reactions obeying an ordered, single-displacement mechanism. If A 
has no influence on the binding constant for B (and vice versa) and the mechanism 
is purely random, the lines in a Lineweaver—Burk plot intersect at the 1/[A] axis 
(Figure 13.20). 
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Figure 13.20 Random, single-displacement bisubstrate mechanism where A does not affect B binding, 
and vice versa. 
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Figure 13.21 The structures of creatine and creatine 
phosphate (Creatine-P), guanidinium compounds 
that are important in muscle energy metabolism. 


Creatine Kinase Acts by a Random, Single-Displacement Mechanism An example of a 
random, single-displacement mechanism is seen in the enzyme creatine kinase, a phos- 
phoryl transfer enzyme that uses ATP as a phosphoryl donor to form creatine phos- 
phate (CrP) from creatine (Cr). Creatine-P is an important reservoir of phosphate-bond 
energy in muscle cells (Figure 13.21). 


ADP:E == ADP +E 


N VA 


ATP:E:Cr == ADP:E:CrP 


7 N 


E + Cr == E:Cr E:CrP === E + CrP 


ATP + E == ATP:E 


The overall direction of the reaction will be determined by the relative concentrations 
of ATP, ADP, Cr, and CrP and the equilibrium constant for the reaction. The enzyme 
can be considered to have two sites for substrate (or product) binding: an adenine nucle- 
otide site, where ATP or ADP binds, and a creatine site, where Cr or CrP is bound. In 
such a mechanism, ATP and ADP compete for binding at their unique site, and Cr and 
CrP compete at the specific Cr/CrP-binding site. Note that no modified enzyme form 
(E’), such as an E-P intermediate, appears here. The reaction is characterized by rapid 
and reversible binary ES complex formation, followed by addition of the remaining 
substrate, and the rate-determining reaction taking place within the ternary complex. 


13.5b In an Ordered, Single-Displacement Reaction, 
the Leading Substrate Must Bind First 


Inan ordered reaction, the leading substrate, A (also called the obligatory or compulsory 
substrate), must bind first. Then the second substrate, B, binds. B cannot bind to free 
enzyme in the absence of A. The reaction between A and B occurs in the ternary 
complex and is usually followed by an ordered release of the products of the reaction, 
P and Q. In the following diagrams, P is the product of A and is released last. One 
representation, suggested by W. W. Cleland, is as follows: 


A B Q P 


| d | | 


E AE AEB == PEQ PE E 


Another way of portraying this mechanism is as follows: 
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Note that A and P are competitive for binding to the free enzyme, E, but not A and B 
(or P and B). 


NAD*-Dependent Dehydrogenases Show Ordered Single-Displacement Mechanisms 
Nicotinamide adenine dinucleotide (NAD+*)-dependent dehydrogenases are enzymes that 
typically behave according to the kinetic pattern just described. A general reaction of 
these dehydrogenases is 


NAD* + BH,=—=NADH + Ht +B 
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The leading substrate (A) is nicotinamide adenine dinucleotide (NAD*), and NAD* 
and NADH (product P) compete for a common site on E. A specific example is 
offered by alcohol dehydrogenase (ADH): 


NAD* + CH;CH,OH ==NADH + H* + CH;CHO 
(A) ethanol (P) acetaldehyde 
(B) (Q) 


We can verify that this ordered mechanism is not random by demonstrating that no 
B (ethanol) is bound to E in the absence of A (NAD*). 


13.5c Double-Displacement (Ping-Pong) Reactions Proceed 
Via Formation of a Covalently Modified Enzyme 
Intermediate 


Double-displacement reactions are characterized by a pattern of parallel lines when 
1/v is plotted as a function of 1/[A] at different concentrations of B, the second sub- 
strate (Figure 13.22). Reactions conforming to this kinetic pattern are characterized 
by the fact that the product of the enzyme’s reaction with A (called P in the following 
schemes) is released prior to reaction of the enzyme with the second substrate, B. Asa 
result of this process, the enzyme, E, is converted to a modified form, E’, which then 
reacts with B to give the second product, Q, and regenerate the unmodified enzyme 


form, E: 
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parallel lines in plots of 1/v vs. 1/[A]. 
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Figure 13.23 Glutamate:aspartate aminotransferase, 
an enzyme conforming to a double-displacement 
bisubstrate mechanism. 


7 y YY 
AeA AA 


Note that these schemes predict that A and Q compete for the free enzyme form, E, 
while B and P compete for the modified enzyme form, E’. A and Q do not bind to F’, 
nor do B and P combine with E. 


Aminotransferases Show Double-Displacement Catalytic Mechanisms One class of en- 
zymes that follow a ping-pong-type mechanism are aminotransferases (previously 
known as transaminases). These enzymes catalyze the transfer of an amino group 
from an amino acid to an a-keto acid. The products are a new amino acid and the keto 
acid corresponding to the carbon skeleton of the amino donor: 


amino acid, + keto acid; —> keto acid; + amino acid» 


A specific example is glutamate:aspartate aminotransferase. Figure 13.23 depicts the 
scheme for this mechanism. Note that glutamate and aspartate are 
competitive for E and that oxaloacetate and a-ketoglutarate compete for E’. In 
glutamate:aspartate aminotransferase, an enzyme-bound coenzyme, pyridoxal phos- 
phate (a vitamin Bg derivative), serves as the amino group acceptor/donor in the 
enzymatic reaction. The unmodified enzyme, E, has the coenzyme in the aldehydic 
pyridoxal form, whereas in the modified enzyme, E’, the coenzyme is actually pyri- 
doxamine phosphate (Figure 13.23). Not all enzymes displaying ping-pong—type 
mechanisms require coenzymes as carriers for the chemical substituent transferred in 
the reaction. 
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13.5d Exchange Reactions Are One Way to Diagnose 
Bisubstrate Mechanisms 


A number of diagnostic tests are necessary for the assignment of a reaction mechanism 
to a specific enzyme. One is the graphic analysis of the kinetic patterns observed. It is 
usually easy to distinguish between single- and double-displacement reactions in this 
manner, and examining competitive effects between substrates aids in assigning reac- 
tions to random versus ordered patterns of S binding. A second diagnostic test is to 
determine whether the enzyme catalyzes an exchange reaction: a double-displacement 
reaction where the chemical group transferred from a substrate S to E to form E’ is 
transferred back to a different molecule of the same substrate. Consider, as an example, 
the two enzymes sucrose phosphorylase and maltose phosphorylase. Both catalyze the 
phosphorolysis of a disaccharide and both yield glucose-l-phosphate and a free hexose: 


Sucrose + P;=—glucose-1-phosphate + fructose 
Maltose + P;——glucose-1-phosphate + glucose 


Interestingly, in the absence of sucrose and fructose, sucrose phosphorylase will cata- 
lyze the exchange of inorganic phosphate, P;, into glucose-1-phosphate. This reaction 
can be followed by using *°P, as a radioactive tracer and observing the incorporation of 
32P into glucose-1-phosphate: 


2p. + G-1-P=—P; + G-1-2P 


Maltose phosphorylase cannot carry out a similar reaction. The *P exchange reac- 
tion of sucrose phosphorylase is accounted for by a double-displacement mechanism 
where E’ is E-glucose: 


Sucrose + E=—E-glucose + fructose 
E-glucose + P;=—E + glucose-1-phosphate 


Thus, in the presence of just ?P; and glucose-l-phosphate, sucrose phosphorylase 
still catalyzes the second reaction and radioactive P; is incorporated into glucose- 
1-phosphate over time. 

Maltose phosphorylase proceeds via a single-displacement reaction that necessarily 
requires the formation of a ternary maltose:E:P; (or glucose:E:glucose-1-phosphate) 
complex for any reaction to occur. Exchange reactions are a characteristic of enzymes 
that obey double-displacement mechanisms at some point in their catalysis. 


13.5e Multisubstrate Reactions Can Also Occur in Cells 


Thus far, we have considered enzyme-catalyzed reactions involving one or two substrates. 
How are the kinetics described in those cases in which more than two substrates partici- 
pate in the reaction? An example is the glycolytic enzyme glyceraldehyde-3-phosphate 
dehydrogenase: 


NAD+ + glyceraldehyde-3-P + P,—=NADH + H+ + 1,3-bisphosphoglycerate 


Many other multisubstrate examples abound in metabolism. These situations are man- 
aged by realizing that the interaction of the enzyme with its many substrates can be 
treated as a series of unisubstrate or bisubstrate steps in a multistep reaction pathway. 
Thus, the complex mechanism of a multisubstrate reaction is resolved into a sequence of 
steps, each of which obeys the single- and double-displacement patterns just discussed. 


13.6 | How Can Enzymes Be So Specific? 


The extraordinary ability of an enzyme to catalyze only one particular reaction is a 
quality known as specificity. Specificity means an enzyme acts only on a specific 
substance, its substrate, invariably transforming it into a specific product. That is, an 
enzyme binds only certain compounds, and then, only a specific reaction ensues. 


13.6 How Can Enzymes Be So Specific? 
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New ideas do not always gain immediate acceptance: 
“Although we did many experiments that in my opinion 
could only be explained by the induced-fit theory, gain- 
ing acceptance for the theory was still an uphill fight. 
One (journal) referee wrote, ‘The Fischer Key-Lock 
theory has lasted 100 years and will not be overturned 
by speculation from an embryonic scientist.” Daniel 
Koshland, 1996. How to get paid for having fun. Annual 
Review of Biochemistry 65:1-13. 


Some enzymes show absolute specificity, catalyzing the transformation of only one 
specific substrate to yield a unique product. Other enzymes carry out a particular 
reaction but act on a class of compounds. For example, hexokinase (ATP:hexose- 
6-phosphotransferase) will carry out the ATP-dependent phosphorylation of a number 
of hexoses at the 6-position, including glucose. Specificity studies on enzymes entail 
an examination of the rates of the enzymatic reaction obtained with various structural 
analogs of the substrate. By determining which functional and structural groups within 
the substrate affect binding or catalysis, enzymologists can map the properties of the 
active site, analyzing questions such as: Can the active site accommodate sterically 
bulky groups? Are ionic interactions between E and S important? Are H bonds formed? 


13.6a The Lock-and-Key Hypothesis Was 
the First Explanation for Specificity 


Pioneering enzyme specificity studies at the turn of the twentieth century by the great 
organic chemist Emil Fischer led to the notion of an enzyme resembling a lock and its 
particular substrate the key. This analogy captures the essence of the specificity that 
exists between an enzyme and its substrate, but enzymes are not rigid templates like 
locks. 


13.6b The Induced-Fit Hypothesis Provides 
a More Accurate Description of Specificity 


Enzymes are highly flexible, conformationally dynamic molecules, and many of their 
remarkable properties, including substrate binding and catalysis, are due to their 
structural pliancy. Realization of the conformational flexibility of proteins led Daniel 
Koshland to hypothesize that the binding of a substrate by an enzyme is an interac- 
tive process. That is, the shape of the enzyme’s active site is actually modified upon 
binding S, in a process of dynamic recognition between enzyme and substrate aptly 
called induced fit. In essence, substrate binding alters the conformation of the protein 
so that the protein and the substrate fit each other more precisely. The process is truly 
interactive in that the conformation of the substrate also changes as it adapts to the 
conformation of the enzyme. Indeed, this substrate-induced conformational change 
is key to enzyme specificity. Substrate specificity is ultimately determined by two fac- 
tors: (1) weak binding of S to E, and (2) the rate of the substrate-induced conforma- 
tional change leading to the closed, catalytically competent state. 

This idea also helps explain some of the mystery surrounding the enormous catalytic 
power of enzymes: In enzyme catalysis, precise orientation of catalytic residues com- 
prising the active site is necessary for the reaction to occur; substrate binding induces 
this precise orientation by the changes it causes in the protein’s conformation. 


13.6c Induced Fit Favors Formation of the Transition State 


The catalytically active enzyme substrate complex is an interactive structure in which 
the enzyme causes the substrate to adopt a form that mimics the transition state of the 
reaction. Thus, a poor substrate would be one that was less effective in directing the 
formation of an optimally active enzyme:transition state conformation. This active 
conformation of the enzyme molecule is thought to be relatively unstable in the ab- 
sence of substrate, and free enzyme thus reverts to a different conformational state. 


13.6d Specificity and Reactivity 


Consider, for example, why hexokinase catalyzes the ATP-dependent phosphorylation 
of hexoses such as glucose but not smaller phosphoryl-group acceptors such as glycerol, 
ethanol, or even water. Surely these smaller compounds are not sterically forbidden 
from approaching the active site of hexokinase (Figure 13.24). Indeed, water should 
penetrate the active site easily and serve as a highly effective phosphoryl-group accep- 
tor. Accordingly, hexokinase should display high ATPase activity. It does not. Only the 
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Figure 13.24 A drawing, roughly to scale, of H2O, glycerol, glucose, and an idealized hexokinase 
molecule. Only glucose will trigger a conformational change in hexokinase, even though the glycerol 
and water, as smaller molecules, are not sterically excluded from the active-site cleft. 


binding of hexoses induces hexokinase to assume its fully active conformation. The 
hexose-binding site of hexokinase is located between two protein domains. Binding of 
glucose in the active site induces a conformational change in hexokinase that causes the 
two domains to close upon one another, creating the catalytic site. 


13.7 | Are All Enzymes Proteins? 


Thus far, the discussion has assumed that enzymes are proteins. The traditional view 
of enzymes as proteins did not anticipate that the antibody family of proteins might 
include members with catalytic activity. Even more surprising was the discovery of 
catalytic RNAs. 


13.7a RNA Molecules That Are Catalytic Have Been 
Termed Ribozymes 


It was long assumed that all enzymes are proteins. However, in the early 1980s, 
instances of biological catalysis by RNA molecules were discovered. Catalytic RNAs, 
or ribozymes, satisfy several enzymatic criteria: They are substrate specific, they 
enhance the reaction rate, and they emerge from the reaction unchanged. Most ribo- 
zymes act in RNA processing, cutting the phosphodiester backbone at specific sites 
and religating needed segments to form functional RNA strands while discarding 
extraneous pieces. For example, bacterial RNase P is a ribozyme involved in the for- 
mation of mature tRNA molecules from longer RNA transcripts. RNase P requires 
an RNA component and a protein subunit for its activity in the cell. In vitro, the 
protein alone is incapable of catalyzing the maturation reaction, but the RNA com- 
ponent by itself can carry out the reaction under appropriate conditions. As another 
example, the introns within some rRNAs and mRNAs are ribozymes that can cata- 
lyze their own excision from large RNA transcripts by a process known as self- 
splicing. For instance, in the ciliated protozoan Tetrahymena, formation of mature 
ribosomal RNA from a pre-rRNA precursor involves the removal of an internal RNA 
segment and the joining of the two ends. The excision of this intron and ligation of the 
exons is catalyzed by the intron itself, in the presence of Mg** and a free molecule of 
guanine nucleoside or nucleotide (Figure 13.25). In vivo, the intervening sequence 
RNA probably acts only in splicing itself out; in vitro, however, it can act many times, 
turning over like a true enzyme. 


The Ribosome Is a Ribozyme A particularly significant case of catalysis by RNA occurs 
in protein synthesis. The peptidy] transferase reaction, which is the reaction of peptide 
bond formation during protein synthesis, is catalyzed by the large rRNA of the large 
ribosomal subunit. The substrates for the peptidyl transferase reaction are two tRNA 
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Figure 13.25 RNA splicing in Tetrahymena rRNA maturation: (a) the guanosine-mediated reaction 
involved in the autocatalytic excision of the Tetrahymena rRNA intron and (b) the overall splicing 
process. The cyclized intron is formed via nucleophilic attack of the freed 3’-OH end of the intron on 
the phosphodiester bond that is 15 nucleotides from the 5'-GA end of the spliced-out intron. 
Cyclization frees a linear 15-mer with a 5'-GA end. 


molecules, one bearing the growing peptide chain (the peptidyl-tRNA ) and the other 
bearing the next amino acid to be added (the aminoacyl-tRNA ,). Both the peptidyl 
chain and the amino acid are attached to their respective tR NAs via ester bonds to the 
O atom at the CCA-3’ ends of these tRNAs (see Figure 11.35). Base-pairing between 
these C residues in the two tRNAs and G residues in the 23S rRNA position the sub- 
strates for the reaction to occur (Figure 13.26). The two Cs at the peptidyl-tRNAp CCA 
end pair with G2251 and G2252 of the 23S rRNA, and the last C (C75) at the 3’-end of 
the aminoacyl-tRNA, pairs with G2553. The 3’-terminal A of the aminoacyl-tRNA, 
interacts with G2583, and the terminal A of the peptidyl-tR NAp binds to A2450. Addi- 
tion of the incoming amino acid to the peptidyl chain occurs when the a-amino group 
of the aminoacyl-tRNA, makes a nucleophilic attack on the carbonyl C linking the 
peptidyl chain to its tRNAp. Specific 23S rRNA bases and ribose-OH groups facilitate 
this nucleophilic attack by favoring proton abstraction from the aminoacyl a-amino 
group (Figure 13.27). The products of this reaction are a one-residue-longer peptidyl 
chain attached to the tRNA; and the empty tRNAp. 

The fact that RNA can catalyze such important reactions is experimental support for 
the idea that a primordial world dominated by RNA molecules existed before the 
evolution of DNA and proteins. Sidney Altman and Thomas R. Cech shared the 1989 
Nobel Prize in Chemistry for their discovery of the catalytic properties of RNA. 


13.7b Antibody Molecules Can Have Catalytic Activity 


Antibodies are immunoglobulins, which are proteins. Catalytic antibodies are anti- 
bodies with catalytic activity (catalytic antibodies are also called abzymes, a word 
created by combining “Ab,” the abbreviation for antibody, with enzyme.) Like other 
antibodies, catalytic antibodies are elicited in an organism in response to 
immunological challenge by a foreign molecule called an antigen. To trigger produc- 
tion of an antibody displaying catalytic activity, the antigen is purposefully engi- 
neered to be an analog of the transition state in a reaction. The strategy is based on the 
idea that a protein specific for binding the transition state of a reaction will promote 
entry of the normal reactant into the reactive, transition-state conformation. Thus, a 
catalytic antibody facilitates, or catalyzes, a reaction by forcing the conformation of 
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Figure 13.26 Structure of the 50S ribosomal subunit. (a) Molecular graphic of the 50S subunit from 
Haloarcula marismortui (PDB id = 1FFK). Ribosomal proteins are shown in blue, the large 23S rRNA 
backbone in brown, the 5S rRNA backbone in olive, and a tRNA substrate analog in red. The tRNA 
analog identifies the peptidyl transferase catalytic center of the 50S subunit. (b) The aminoacyl-tRNA, 
(yellow) and the peptidyl-tR NAp (orange) in the peptidy] transferase active site. Bases of the 23S rRNA 
shown in green and labeled according to their position in the 23S rRNA sequence. Interactions between 
the tRNAs and the 23S rRNA are indicated by dotted lines. The a-amino group of the aminoacyl- 
tRNA, (blue) is positioned for the attack on the carbonyl-C (green) of the peptidyl-tR NAp. (Adapted 
from Figure 2 in Beringer, M., and Rodnina, M. V., 2007. The ribosomal peptidyl transferase. Molecular 
Cell 26:311-321.) 
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its substrate in the direction of its transition state. (A prominent explanation for the 
remarkable catalytic power of conventional enzymes is their great affinity for the 
transition state in the reactions they catalyze.) 

One proof of this principle has been to prepare ester analogs by substituting a phos- 
phorus atom for the carbon in the ester group (Figure 13.28). The phosphonate com- 
pound mimics the natural transition state of ester hydrolysis, and antibodies elicited 
against these analogs act like enzymes in accelerating the rate of ester hydrolysis as much 
as 1000-fold. Abzymes have been developed for a number of other classes of reactions, 
including C—C bond formation via aldol condensation (the reverse of the aldolase reac- 
tion [see Figure 13.2, reaction 4]) and the pyridoxal 5'-P-dependent aminotransferase 
reaction shown in Figure 13.23. Abzyme biotechnology is one way to create designer 
enzymes that catalyze desired chemical reactions. 

Catalytic antibodies also occur naturally. Autoimmune diseases are diseases that arise 
because an individual begins to produce antibodies against one of their own cellular 
constituents. Multiple sclerosis (MS), one such autoimmune disease, is characterized by 
gradual destruction of the myelin sheath surrounding neurons throughout the brain and 
spinal cord. Blood serum obtained from some MS patients contains antibodies capable of 
carrying out the proteolytic destruction of myelin basic protein (MBP). That is, these 
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Figure 13.27 The peptidyl transferase reaction. 
Abstraction of an amide proton from the a-amino 
group of the aminoacyl-tRNA; (red) by the 2'-O of 
the terminal A of the peptidyl-tR NAp (blue) is aided 
by hydrogen-bonding interactions with neighboring 
23S rRNA nucleotides (green). These interactions 
facilitate nucleophilic attack by the a-amino group 
of the aminoacyl-tRNA; on the carbonyl C of the 
peptidyl-tR NAp and peptide bond formation 
between the incoming amino acid and the growing 
peptide chain to give a one-residue-longer peptide 
chain attached to the tRNAy,. (Adapted from Figure 
3 in Beringer, M., and Rodnina, M. V., 2007. The 
ribosomal peptidyl transferase. Molecular Cell 
26:311-321.) 
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Figure 13.28 Creation of an abzyme antigen by modeling the transition state of a reaction. The 
transition-state analog will trigger formation of an abzyme if it is antigenic. (a) The intramolecular 
hydrolysis of a hydroxy ester to yield as products a 5-lactone and the alcohol phenol. Note the cyclic 
transition state. (b) The cyclic phosphonate ester analog of the cyclic transition state. 


antibodies were MBP-destructive proteases. Similarly, hemophilia A is a blood-clotting 
disorder due to lack of the factor VIII, an essential protein for formation of a blood clot. 
Serum from some sufferers of hemophilia A contained antibodies with proteolytic activity 
against factor VIII. Thus, some antibodies are proteases. 


13.8 | Is It Possible to Design an Enzyme 
to Catalyze Any Desired Reaction? 


Enzymes have evolved to catalyze metabolic reactions with high selectivity, specificity, 
and rate enhancements. Given these remarkable attributes, it would be very desirable to 
have the ability to create designer enzymes individually tailored to catalyze any imagin- 
able reaction, particularly those that might have practical uses in industrial chemistry, 
the pharmaceutical industry, or environmental remediation. To this end, several 
approaches have been taken to create a desired enzyme de novo (literally “anew” or 
“from scratch.” In biochemistry, the synthesis of some end product from simpler precur- 
sors.) Most approaches begin with a known enzyme and then engineer it using in vitro 
mutagenesis (see Chapter 12) to replace active-site residues with a new set that might cata- 
lyze the desired reaction. This strategy has the advantage that the known protein struc- 
ture provides a stable scaffold into which a new catalytic site can be introduced. As 
pointed out in Chapter 6, despite the extremely large number of possible amino acid se- 
quences for a polypeptide chain, the number of stable tertiary structures that proteins 
can adopt is rather limited. Yet proteins have an extraordinary range of functional pos- 
sibilities. So, this approach is rational. A second, more difficult, approach attempts the 
completely new design of a protein with the desired structure and activity. Often, this 
approach relies on in silico methods, where the spatial and reactive properties of its puta- 
tive active site are modeled, refined, and optimized via computer. Although this approach 
has fewer limitations in terms of size and shape of substrates, it brings other complica- 
tions, such as the design of a proper protein scaffold for the active site, protein folding, 
and protein stability, to say nothing of the difficulties of going from the computer model 
(in silico) to a real enzyme in a cellular environment (in vivo). Nevertheless, computer 
design of enzymes with keat/Km ratios greater than 10° M~'s~! has been achieved. The 
Rosetta protein modeling suite is popular software for such in silico enzyme design. 
Enzymes have shown adaptability over the course of evolution. New enzyme functions 
have appeared time and time again, as mutation and selection according to Darwinian 
principles operate on existing enzymes. Some enzyme designers have coupled natural evolu- 
tionary processes with rational design using in vitro mutagenesis. Expression of mutated 
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versions of the gene encoding the enzyme in bacteria, followed by rounds of selection for 
bacteria producing an enzyme with even better catalytic properties, takes advantage of nat- 
urally occurring processes to drive further mutation and selection for an optimal enzyme. 
This dual approach is whimsically referred to as semirational design because it relies on the 
rational substitution of certain amino acids with new ones in the active site, followed by 
directed evolution (selection for bacteria expressing more efficient versions of the enzyme). 


13.8a A Designer Enzyme for Bioremediation 


An example of active-site engineering is the site-directed mutation of an epoxide 
hydrolase to change its range of substrate selection so that it now acts on chlorinated 
epoxides (Figure 13.29). Degradation of chlorinated epoxides is a major problem in 
the removal of toxic pollutants from water resources. Mutation of a bacterial epoxide 
hydrolase at three active-site residues (F8, I?'°, and C48) and selection in bacteria for 
enhanced chlorinated epoxide hydrolase activity yielded an F108L/I219L/C248I 
mutant enzyme that catalyzed the conversion of cis-dichloroepoxyethane to Cl— ions 
and glyoxal with a dramatically increased Vmax/Km ratio. 


13.8b A Case Study for Designer Enzyme Development — Kemp Eliminases 


A model reaction favored by scientists exploring the fundamental properties of enzyme 
catalysis and the creation of designer enzymes is the Kemp elimination reaction (Figure 
13.30). The basic design strategy is to test various protein structures, such as the TIM 
a/B—barrel (see Figure 6.34d) as scaffolds on which to construct a Kemp eliminase (KE) 
active site. Different active-site bases (B: ~ ) are then tested in such model systems, such 
as a carboxylate group of an aspartate or glutamate side chain. A refinement of this 
approach is to position the imidazole group of a histidine side chain near the catalytic 
D (or E) carboxylate to form an active-site His-Asp (or Glu) dyad. Using computational 
enzyme design algorithms and close observation of protein structural changes within the 
active site introduced during directed evolution, KE designer enzymes have been created 
that show more than a 700-fold increase in catalytic activity over earlier versions. 


NADH + H* é 
O; HO 2HCl 

H H 7% H. /\ H mR ee 2 

`c=c{ Some is ooo 
cl Cl 1,0 + cl cl H H 

NAD* 
DCE Step 1 cis-1,2-dichloro- Step 2 Glyoxal 
epoxyethane 


Figure 13.29 cis-1,2-Dichloroethylene (DCE) is an industrial solvent that poses hazards to human 
health; DCE occurs as a pollutant in water sources. Bacterial metabolism of DCE to form cis-1,2- 
dichloroepoxyethane (step 1) occurs readily, but enzymatic degradation of the epoxide to glyoxal and 
chloride ions (step 2) is limited. Microbial detoxification of DCE in ground water requires enzymes 
for both steps 1 and 2. Genetic engineering of an epoxide hydrolase to create an enzyme capable 

of using cis-1,2-dichloroepoxyethane as a substrate and catalyzing step 2 is a practical example of 

de novo enzyme design. 
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Figure 13.30 The Kemp elimination reaction occurs when a catalytic base (B:~) abstracts a proton from 
a benzisoxazole ring, leading to the opening of the isoxazole ring and formation of o-cyanophenolate. 
X is typically a nitro (—NO>) group, so the substrate is 5-nitrobenzisoxazole. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 13. 


Enzymes are the agents of metabolic function. 
Enzymes exert kinetic control over thermodynamic potentiality. 


Living systems use enzymes to accelerate and control the rates of 
vitally important biochemical reactions. 


Catalytic power is defined as the ratio of the enzyme-catalyzed rate of 
a reaction to the uncatalyzed rate. 


The substance upon which an enzyme acts is termed the substrate of 
the enzyme. 


Specificity is the term used to define the selectivity of enzymes for 
their substrates. 


The specific site on the enzyme to which the substrate binds and where 
catalysis occurs is called the active site. 


Regulation of enzyme activity ensures that the rate of a metabolic 
reaction is appropriate to cellular requirements. 


Enzyme nomenclature provides a systematic way of naming meta- 
bolic reactions. 


Coenzymes and cofactors are nonprotein components of enzymes 
that are essential for enzyme activity. 


Tightly bound coenzymes are termed prosthetic groups; the protein 
without the prosthetic group is called an apoenzyme. 


Enzyme kinetics is the branch of science that studies the properties of 
enzyme-catalyzed reactions. 


Enzymes, like other catalysts, lower the free energy of activation for 
a reaction. 


Decreasing the free energy of activation, AG?, increases reaction 
rate. 


The rate of a simple enzyme-catalyzed reaction can be described by 
the Michaelis-Menten equation. 


The Briggs—Haldane steady-state assumption (d[ES]/dt = 0) provides 
an improved interpretation of the Michaelis-Menten equation. 


Km, the Michaelis constant, is defined as (k_, + ky)/ky. 
The velocity of the reaction, v, is given by v = d[P]/dt = k[ES]. 


The maximal velocity of the reaction, Vmax, is achieved when all of the 
enzyme, E, is in the ES form. Vmax = ko[Er]. 


When [S] = Km, v= Vmaxl2. 


The turnover number, or molecular activity, of an enzyme is defined 
as kear Keat = Vmax[Er]. 


The catalytic efficiency of an enzyme is defined as Keat/Km.- 


The catalytic efficiency of an enzyme cannot exceed the diffusion- 
controlled rate of encounters between E and S. Thus, the upper limit 
on Keat/Km is about 10? M7!s7!. 


Lineweaver—Burk double-reciprocal (1/v vs. 1/[S]) plots and Hanes- 
Woolf ([S]/v vs. [S]) plots convert the hyperbolic Michaelis-Menten 
v vs. [S] response into a straight line. 


Enzyme activity is affected by pH, temperature, and ionic strength, all 
of which can affect protein conformation. 


Enzyme inhibitors are useful probes of the properties of enzymatic 
reactions. 


Many drugs and other pharmacological agents are enzyme inhibitors. 


Enzyme inhibitors are classified as competitive (compete with S for 
binding to the active site), noncompetitive (bind elsewhere on the 
enzyme), or uncompetitive (bind only to the ES complex). 


Irreversible inhibitors bind irreversibly to or form covalent bonds 
with the enzyme. 


Bisubstrate reaction mechanisms may be single-displacement (both 
substrates bind and then the reaction occurs) or double-displacement 
(the first substrate binds, reaction occurs, product is released, and 
then the second substrate binds, further reaction occurs, and the sec- 
ond product is released). 


Single-displacement reactions may be random (it does not matter 
which substrate binds first) or ordered (one substrate, called the 
leading substrate, binds first; only when it is bound can the second 
substrate bind). 


Fischer’s lock-and-key hypothesis was the original explanation for 
enzyme specificity. 


Koshland’s induced-fit hypothesis is a better explanation of enzyme 
specificity because it considers the conformational change induced in 
the enzyme (and the substrate) when S is bound by E. 


Not only protein molecules but RNA molecules may be catalytic; 
catalytic RNA molecules are termed ribozymes. 


The ribosome is a ribozyme. 
Antibodies that have catalytic activity are called abzymes. 


Designer enzymes are enzymes engineered to catalyze a reaction for 
which no naturally occurring enzyme exists. 


Several approaches have been taken to create designer enzymes: 
rational enzyme design (altering the active site of a known enzyme 
so that it now catalyzes a new reaction) or in silico design (using 
computers to model an active site that will catalyze a desired reac- 
tion and then attempting to create a protein that will have such an 
active site). 


Designer enzymes have potential applications in organic synthesis 
(such as the synthesis of new drugs) and environmental remediation 
(to remove pollutants). 
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7. How Varying the Amount of Enzyme or the Addition of Inhibitors 
Affects v versus [S] Plots Using Figure 13.7 as a model, draw curves 
that would be obtained in v versus [S] plots when 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. Understanding Km The activity of two enzymes was studied using 
different carbohydrates as their substrate. The table shows the results. 
How do those two enzymes compare? 


twice as much enzyme is used. 
half as much enzyme is used. 


D-mannose D-galactose a pure noncompetitive inhibitor is added. 


a 

b. 

c. a competitive inhibitor is added. 
D-glucose D-fructose -ribose d 
e 


an uncompetitive inhibitor is added. 


Enzyme A Km=0.25 Km=0.20 Ky = 9.8 Ky = 0.21 Ky = 0.22 
mM mM mM mM mM For each example, indicate how Vmax and Km change. 
EnzymeB K»=7.02 Kn=6.95 Km=0.05 Km=7.25 Ka=7.8 8. Using Graphical Methods to Derive the Kinetic Constants for an 
mM mM mM mM mM Ordered, Single-Displacement Reaction The general rate equation for 


an ordered, single-displacement reaction where A is the leading 


f substrate is 
2. Exploring the Michaelis-Menten Equation - 1 According to the 


Michaelis-Menten equation, what is the v/ Vmax ratio when Vmaxl AJ[B] 


v= 


[S] = 4 Km? 


. Exploring the Michaelis-Menten Equation - II If Vmax = 
100 wmol/mL : s and Km = 2 mM, what is the velocity of the reaction 
when [S] = 20 mM? 


(KSKm + KalB] + KalA] + [A][BD 


Write the Lineweaver—Burk (double-reciprocal) equivalent of 
this equation and from it calculate algebraic expressions for the 
following: 


4. Exploring the Michaelis-Menten Equation - III For a Michaelis-Menten a. the slope 
reaction, kj = 7 X 10’7”/M-s,k_; = 1 X 10°/s, and ky = 2 X 10*/s. What b. the p-intercepts 
are the values of Ks and Km? Does substrate binding approach 


equilibrium, or does it behave more like a steady-state system? c. the horizontal and vertical coordinates of the point of intersec- 


tion when 1/v is plotted versus 1/[B] at various fixed concentra- 
5. Graphing the Results from Kinetics Experiments with Enzyme tions of A 
Inhibitors The following kinetic data were obtained for an enzyme in 9 
the absence of any inhibitor (1) and in the presence of two different 
inhibitors (2) and (3) at 5 mM concentration. Assume [Er] is the same 


in each experiment. 


. Interpreting Kinetics Experiments from Graphical Patterns The 
following graphical patterns obtained from kinetic experiments have 
several possible interpretations depending on the nature of the 
experiment and the variables being plotted. Give at least two 


5) (2) (3) possibilities for each. 
[S] v (umol/ v (umol/ v (umol/ 
(mM) mL : s) mL : s) mL : s) - = 
1 12 43 5.5 as 20 
2 20 8 9 is i 
4 29 14 13 
8 35 21 16 
12 40 26 18 
Graph these data as Lineweaver—Burk plots and use your graph to 
find answers to a. and b. 1 1 
a. Determine Vmax and Km for the enzyme. TS] TS] 


b. Determine the type of inhibition and the K; for each inhibitor. 


6. Calculating K, and K,’ The enzymatic activity of an enzyme was studied 
in the presence of mixed noncompetitive inhibitors A and B. The 
results for inhibitor A (tested at 50 pM) are shown in the Lineweaver— 
Burk format. The results for inhibitor B (tested at 15 uM) are shown 
in the Hanes—Woolf format. Calculate Kj and Kj’ for each inhibitor. 


Enzyme A Enzyme B 
12.0 j 3.5 
l Z sol a 
10.0 3.0 = 
l Z oll Z 
S 80 = = ar 
2 l a g ool 2 
= Ca al 
S 6.0 “fo & isl we 1 
= l +% y= 0.1111x + 0.3175 a’ | oe? y= 0.2778x + 0.3472 an 
0 Gi 1.0 - [S] 
B d — 
l. 


10. Using the Equations of Enzyme Kinetics to Treat Methanol 
Intoxication Liver alcohol dehydrogenase (ADH) is relatively 
nonspecific and will oxidize ethanol or other alcohols, including 
methanol. Methanol oxidation yields formaldehyde, which is quite 
toxic, causing, among other things, blindness. Mistaking it for the 
cheap wine he usually prefers, my dog Clancy ingested about 50 mL 


20% 60” 0.5 be” 
eem Lm 
0.0 E 0.0 a] 
0.0 50.0 100.0 0.0 5.0 10.0 
1/[S] (pM) [S] (aM) 
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11. 


12. 


13. 
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of windshield washer fluid (a solution 50% in methanol). Knowing 
that methanol would be excreted eventually by the dog’s kidneys if its 
oxidation could be blocked, and realizing that, in terms of methanol 
oxidation by ADH, ethanol would act as a competitive inhibitor, I 
offered him some wine. How much (12% ethanol) must he consume in 
order to lower the activity of his ADH on methanol to 5% of its 
normal value if the Km values of canine ADH for ethanol and 
methanol are 1 millimolar and 10 millimolar, respectively? (The K; for 
ethanol in its role as competitive inhibitor of methanol oxidation by 
ADH is the same as its Km.) Both the methanol and ethanol will 
quickly distribute throughout Clancy’s body fluids, which amount to 
about 15 L. Assume the densities of 50% methanol and the wine are 
both 0.9 g/mL. 


Quantitative Relationships between Rate Constants to Calculate Km, 
Kinetic Efficiency (kcat/Km) and Vmax - 1 Measurement of the rate 
constants for a simple enzymatic reaction obeying Michaelis-Menten 
kinetics gave the following results: 


ky = 2X 108 M~! s~! 


k_,=1X 10s"! 

ky =5X 103 87! 

a. What is Ks, the dissociation constant for the enzyme-substrate 
complex? 

b. What is Km, the Michaelis constant for this enzyme? 

c. What is kea (the turnover number) for this enzyme? 

d. What is the catalytic efficiency (Keat/Km) for this enzyme? 

e. Does this enzyme approach “kinetic perfection”? (That is, does 


Ka! Km approach the diffusion-controlled rate of enzyme 
association with substrate?) 

f. Ifa kinetic measurement was made using 2 nanomoles of 
enzyme per mL and saturating amounts of substrate, what would 
Vmax equal? 

g. Again, using 2 nanomoles of enzyme per mL of reaction mixture, 
what concentration of substrate would give v = 0.75 Vmax? 

h. If a kinetic measurement was made using 4 nanomoles of 
enzyme per mL and saturating amounts of substrate, what would 
Vmax equal? What would Km equal under these conditions? 


Quantitative Relationships between Rate Constants to Calculate Km, 
Kinetic Efficiency (kcat/Km) and Vmax - II Triose phosphate isomerase 
catalyzes the conversion of glyceraldehyde-3-phosphate to dihydroxy- 
acetone phosphate. 


Glyceraldehyde-3-P —— dihydroxyacetone-P 


The Km of this enzyme for its substrate glyceraldehyde-3-phosphate is 
1.8 X 1075 M. When [glyceraldehydes-3-phosphate] = 30 uM, the rate 
of the reaction, v, was 82.5 umol mL! s7!. 

a. What is Vina, for this enzyme? 

b. Assuming 3 nanomoles per mL of enzyme was used in this 
experiment ([Fiota] = 3 nanomol/mL), what is keat for this 
enzyme? 

c. What is the catalytic efficiency (kcat/Km) for triose phosphate 
isomerase? 

d. Does the value of kea/ Km reveal whether triose phosphate 
isomerase approaches catalytic perfection? 

e. What determines the ultimate speed limit of an enzyme-catalyzed 
reaction? That is, what is it that imposes the physical limit on 
kinetic perfection? 


Quantitative Relationships between Rate Constants to Calculate 
Km, Kinetic Efficiency (kcat/Km) and Vmax - III The citric acid cycle 
enzyme fumarase catalyzes the conversion of fumarate to form 
malate. 


Fumarate + H,O === malate 


The turnover number, keat, for fumarase is 800/s. The Km of fumarase 
for its substrate fumarate is 5 yM. 


14. 


15. 


16. 


17. 


a. In an experiment using 2 nanomole/mL of fumarase, what 
IS Vinax? 

b. The cellular concentration of fumarate is 47.5 uM. What is v 
when [fumarate] = 47.5 uM? 

c. What is the catalytic efficiency of fumarase? 

d. Does fumarase approach catalytic perfection? 

Quantitative Relationships between Rate Constants to Calculate Km, 


Kinetic Efficiency (Keat/Km) and Vmax- IV Carbonic anhydrase catalyzes 
the hydration of CO»: 


CO, + H20 ==> HCO; 
The Km of carbonic anhydrase for CO; is 12 mM. Carbonic anhydrase 
gave an initial velocity vy, = 4.5 umol HCO; formed/mL - s when 
[CO,] = 36 mM. 
a. What is Vmaxfor this enzyme? 
b. Assuming 5 pmol/mL (5 x 107!’ moles/mL) of enzyme were 
used in this experiment, what is keat for this enzyme? 
c. What is the catalytic efficiency of this enzyme? 
d. Does carbonic anhydrase approach catalytic perfection? 


Quantitative Relationships between Rate Constants to Calculate Km, 
Kinetic Efficiency (keat/Km) and Vmax - V Acetylcholinesterase catalyzes 
the hydrolysis of the neurotransmitter acetylcholine: 


Acetylcholine + H:O —> acetate + choline 


The Km of acetylcholinesterase for its substrate acetylcholine is 

9 x 1075 M. Ina reaction mixture containing 5 nanomoles/mL of 
acetylcholinesterase and 150 uM acetylcholine, a velocity 

Vo = 40 umol/mL - s was observed for the acetylcholinesterase reac- 
tion. 

a. Calculate Vmax for this amount of enzyme. 

b. Calculate keat for acetylcholinesterase. 


c. Calculate the catalytic efficiency (Keat Km) for 
acetylcholinesterase. 


d. Does acetylcholinesterase approach catalytic perfection? 


Quantitative Relationships between Rate Constants to Calculate Km, 
Kinetic Efficiency (kcat/Km) and Vmax - VI The enzyme catalase catalyzes 
the decomposition of hydrogen peroxide: 


2 H,O, == 2 H,O + O: 
The turnover number (keat) for catalase is 40,000,000 s~!. The Km of 
catalase for its substrate HO, is 0.11 M. 
a. In an experiment using 3 nanomole/L of catalase, 
what is Vinax? 


b. What is v when [H,O.] = 0.75 M? 
c. What is the catalytic efficiency of catalase? 
d. Does catalase approach catalytic perfection? 


Calculate Catalytic Efficiency Using Lineweaver—Burk or Hanes—Woolf 
Plots The enzymatic activity of two enzymes, A and B, was 

studied. The results for enzyme A (tested at 20 nM) are shown in a 
Lineweaver—Burk format, while the results for enzyme B (tested at 

5 nM) are shown in a Hanes—Woolf format. Calculate the catalytic 
efficiency for each enzyme. 


Inhibitor A Inhibitor B 


with inhibitor 10.0 | with inhibitor pos 
y= 2.2941 x+ 0.5882 ,6* | J=1.1429x + 0.6429 „o° 


1/v (s/pM) 
a 
[SI/v (s) 


Pad Gauen Without inhibitor = 2<05 „° „77 without inhibitor 
aT a ae y=0.75x+ 0.5 y=0.4x+ 0.3 
0 D E EE ET ST l E SS E a A 
0 20 40 60 80 100 0.0 20 40 60 80 10.0 
1/[S] (pM) [S] (uM) 
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18. The Effect of the Accumulation of P, the Reaction Product, on the 
Michaelis-Menten Equation Equation 13.9 presents the simple 
Michaelis—Menten situation where the reaction is considered to be 
irreversible ([P] is negligible). Many enzymatic reactions are reversible, 
and P does accumulate. 

a. Derive an equation for v, the rate of the enzyme-catalyzed 
reaction S—sP in terms of a modified Michaelis-Menten model 
that incorporates the reverse reaction that will occur in the 
presence of product, P. 

b. Solve this modified Michaelis-Menten equation for the special 
situation when v = 0 (that is, S=—P is at equilibrium, or in 
other words, K., = [P}/[S]). 

(J. B. S. Haldane first described this reversible Michaelis-Menten 

modification, and his expression for Keq in terms of the modified 

M-M equation is known as the Haldane relationship.) 


19. Michaelis-Menten Kinetics Enzyme A follows Simple Michaelis— 
Menten Kinetics. 


a. The K,,of enzyme A for its substrate S is K,S=1 mM. Enzyme A 
also acts on substrate T and its K,'=10 mM. Is S or T the 
preferred substrate for enzyme A? 


b. The rate constant k, with substrate S is 2 X 10*s7!; with 
substrate T, k) = 4 X 10°s~!. Does enzyme A use substrate S or 
substrate T with greater catalytic efficiency? 


20. The Chemical Properties of Lemon Juice When preparing fruit salad 
containing apple slices, lemon juice is sprinkled to prevent the 
darkening of the apple slices. The darkening is an enzymatic oxidation 
reaction (due to exposure to oxygen in air) catalyzed by polyphenol 
oxidase. Lemon juice contains about 75% carbohydrates, 10% organic 
acids, 5% amino acids, 3% inorganics, 2.5% vitamins, and 1% lipids. 
Using the information provided, how would you explain the action of 
lemon juice? 


21. Enzymatic Activity in Clinical Analyses Clinical diagnostics often 
depend on the detection of enzymatic activity in a patient’s blood 
sample. One example is the analysis of lactate dehydrogenase (LDH) 
activity in blood to diagnose a myocardial infarction. For the 
detection of the potential presence of an enzyme in a blood sample, 
substrate is added to the sample at concentrations a 100-times (or 
more) higher than the known value of Km. Give an explanation for 
why such a high substrate concentration is used. 
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22. Enzymatic Perfection As discussed in section 13.31, the rate of 
diffusion is a limiting factor for enzymatic activity. Diffusion rates 
increase with increasing temperature. However, increasing temperature 
does not necessarily increase enzymatic activity. Give an explanation. 


Think-Pair-Share Question 


The study of the kinetics of the degradation of a biopolymer (protein, nu- 
cleic acid, polysaccharide) is complicated, since the number of internal 
bonds that are broken per unit of time needs to be determined. This can be 
done by monitoring the decline in molecular mass as the biopolymer is grad- 
ually broken into smaller pieces. Consider the study of the degradation of 
500 mg of four polysaccharides with different molecular masses. Results of 
the decline in molecular mass for each polysaccharide are listed in the table. 


Reaction Polysaccha- Polysaccha- _—‘ Polysaccha- _—Polysaccha- 
time ride A mol ride B mol ride C mol ride D mol 
(min) mass (g/mol) mass (g/mol) mass (g/mol) mass (g/mol) 
0 250,000 400,000 1,000,000 100,000 
5 97,561 80,000 117,647 66,667 
10 60,606 44,444 62,500 50,000 
15 43,956 30,769 42,533 40,000 


Pick two polysaccharides (another student can pick the remaining two 
polysaccharides). 


a. Plot the molecular mass as a function of the reaction time. 


b. Determine the velocity of the reaction as the moles of glycosidic 
bonds broken per minute. 


c. Plot the reciprocal values of the molecular mass as a function of 
the reaction time. Correlate the slopes of the lines with the 
results from (b). What is the significance of your observation? 


23. Enzyme Activity Profiles Use Figure 13.12 to Answer the Following 

Questions. 

a. Is the enzyme whose temperature versus activity profile 
is shown in Figure 13.12 likely to be from an animal or a 
plant? Why? 

b. What do you think the temperature versus activity profile for an 
enzyme from a thermophilic prokaryote growing in an 80°C pool 
of water would resemble? 
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David W. Grisham 


Mechanisms of Enzyme 
Action 


<d Like the workings of a watch, 
the details of enzyme 
mechanisms are at once 
complex and simple. 


Enzymes derive their catalytic prowess from the simple chemistry of breaking and making 
bonds. This chapter will explore the unique features of this chemistry. The mechanisms of 
thousands of enzymes have been studied in at least some detail. In this chapter, it will be 
possible to examine only a few of these. What are the universal chemical principles that 
influence the mechanisms of enzymes and underlie their enormous catalytic power? 


ll living things depend on enzymes. These remarkable catalysts enable metabolic 

reactions to proceed at rates that sustain life. They are essential for respiration, 
digesting food, muscle and nerve function, and thousands of other roles. This chapter 
describes the chemical foundations of enzyme function. 


14.1 | What Are the Magnitudes of Enzyme-Induced 


Rate Accelerations? 


Enzymes are powerful catalysts. Enzyme-catalyzed reactions are typically 10’ to 10" 
times faster than their uncatalyzed counterparts (Table 14.1). The most impressive 
reaction acceleration known (2 X 10% over the uncatalyzed rate) is that of the alkyl- 
sulfatase from the soil bacterium Coryneform Bla. 

These large rate accelerations correspond to substantial decreases in the free energy 
of activation for the reaction in question. For example, urea is hydrolyzed by urease: 


O 
I 
H,sN—C—NH, + 2 HO + H* — > 2NH,° + HCO; 


The catalyzed reaction shows an energy of activation 84 kJ/mol smaller than that for 
the corresponding uncatalyzed reaction. To fully understand any enzyme reaction, it 
is important to account for the rate acceleration in terms of the structure of the 
enzyme and its mechanism of action. 
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14 


No single thing abides but all things flow. 
Fragment to fragment clings and thus they grow 
Until we know them by name. 


Then by degrees they change and are no more the 
things we know. 
Lucretius (ca. 94 B.c.—50 B.C.) 


14.1 What Are the Magnitudes of Enzyme- 
Induced Rate Accelerations? 


14.2 What Role Does Transition-State 
Stabilization Play in Enzyme 
Catalysis? 


14.3 How Does Destabilization of ES 


Affect Enzyme Catalysis? 


14.4 What Are Transition State Analogs 
and How Tightly Do They Bind to the 


Active Sites of Enzymes? 


14.5 What Are the Mechanisms of Enzyme 


Catalysis? 

What Can Be Learned from the 
Mechanisms of Some Typical 
Enzymes? 


14.6 
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Chapter 14 Mechanisms of Enzyme Action 


| Table 14.1 | CSET A Comparison of Enzyme-Catalyzed Reactions and Their Uncatalyzed Counterparts 


Uncatalyzed Catalyzed 
Reaction Enzyme Rate, v, (sec') Rate, ve (sec ') Vel Vu 
R—O—SO, + HO —> R—OH + S0,- + H* Alkylsulfatase (S—O cleavage) 3 x 10776 6.9 ~2 x 1076 
Fructose-1,6-bisP —> fructose-6-P + P; Fructose-1,6-bisphosphatase 2x 1072 21 1.05 x 107! 
(Glucose), + HO —> (glucose),. + maltose B-amylase 1.9 x 1075 1.4 x 103 7.2 X 10!” 
DNA, RNA cleavage Staphylococcal nuclease 7x 1076 95 1.4 X 10!” 
CH;—O—PO,?- + H,O —> CH;0OH + HPO,- Alkaline phosphatase 1x 107" 14 1.4 x 10!6 
O 
HN é NH, + 2 HO + H —> 2NH,*+HCO, Urease 3 x 10-19 3 x 104 1x 10! 
O 
n__o_CH,CH, + HO —> RCOOH + HOCHCH, Chymotrypsin 1 x 107! 1 x 10? 1 x 10! 
Glucose + ATP —> Glucose-6-P + ADP Hexokinase <1 x 10-8 1.3 X 103 >1.3 X 1010 
O 
CH3CH,OH + NAD’ —> CHCH + NADH + H* Alcohol dehydrogenase <6 X 10°? 2.7X 10% >4.5 x 10° 
CO, + HO —+ HCO,- + Ht Carbonic anhydrase 1072 105 1 x 107 
Creatine + ATP —> Cr-P + ADP Creatine kinase <3 x 107? 4x 10> >1.33 X 104 


Adapted from Koshland, D., 1956. Molecular geometry in enzyme action. Journal of Cellular Comparative Physiology, Supp. 1, 47:217; Wolfenden, R., 2006. Degrees of difficulty of 
water-consuming reactions in the absence of enzymes. Chemical Reviews 106:3379-3396; and Edwards, D. R., Lohman, D. C., and Wolfenden, R., 2012. Catalytic proficiency: the extreme 
case of S—O cleaving sulfatases. Journal of the American Chemical Society 134:525—531. 


In all chemical reactions, the reacting atoms or molecules pass through a state that is 
intermediate in structure between the reactant(s) and the product(s). Consider the 
transfer of a proton from a water molecule to a chloride anion: 


H—O—H+Cl = H-—O?* ..-H---Cl® = HO +H—-Cl 
Reactants Transition state Products 


In the middle structure, the proton undergoing transfer is shared equally by the 
hydroxyl and chloride anions. This structure represents, as nearly as possible, the 
transition between the reactants and products, and it is known as the transition state.! 
All transition states contain at least one partially formed bond. 

Linus Pauling was the first to suggest (in 1946) that the active sites of enzymes bind the 
transition state more readily than the substrate and that, by doing so, they stabilize the 
transition state and lower the activation energy of the reaction. Many subsequent studies 
have shown that this idea is essentially correct, but it is just the beginning in understanding 
what enzymes do. Reaction rates can also be accelerated by destabilizing (raising the energy 
of) the enzyme-substrate complex. Chemical groups arrayed across the active site actually 
guide the entering substrate toward the formation of the transition state. Thus, the enzyme 
active site is said to be preorganized. Enzymes are dynamic, and fluctuations in the active- 
site structure are presumed to organize the initial enzyme-substrate complex into a reactive 
conformation and on to the transition state. Along the way, electrostatic and hydrophobic 
interactions between the enzyme and the substrate mediate and direct these changes that 
make the reaction possible. Often, catalytic groups provided by the enzyme participate 
directly in proton transfers and other bond-making and bond-breaking events. 

This chapter describes and elaborates on each of these contributions to the catalytic 
prowess of enzymes and then illustrates the lessons learned by looking closely at the 
mechanisms of three well-understood enzymes. 


IIt is important to distinguish transition states from intermediates. A transition state is envisioned as an 
extreme distortion of a bond, and thus the lifetime of a typical transition state is viewed as being on the 
order of the lifetime of a bond vibration, typically 107! sec. Intermediates, on the other hand, are 
longer lived, with lifetimes in the range of 107! to 10-3 sec. 
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14.2 What Role Does Transition-State Stabilization Play in Enzyme Catalysis? 


14.2 | What Role Does Transition-State Stabilization 
Play in Enzyme Catalysis? 


Chemical reactions in which a substrate (S) is converted to a product (P) can be pic- 
tured as involving a transition state (which we henceforth denote as X$), a species 
intermediate in structure between S and P (Figure 14.1). As seen in Chapter 13, the 
catalytic role of an enzyme is to reduce the energy barrier between substrate and 
transition state. This is accomplished through the formation of an enzyme-substrate 
complex (ES). This complex is converted to product by passing through a transition 
state, EX? (Figure 14.1). As shown, the energy of EX? is clearly lower than that of X+. 
One might be tempted to conclude that this decrease in energy explains the rate 
enhancement achieved by the enzyme, but there is more to the story. 

The energy barrier for the uncatalyzed reaction (Figure 14.1) is, of course, the differ- 
ence in energies of the S and X? states. Similarly, the energy barrier to be surmounted in 
the enzyme-catalyzed reaction, assuming that E is saturated with S, is the energy differ- 
ence between ES and EX?. Reaction rate acceleration by an enzyme means simply that the 
energy barrier between ES and EX? is less than the energy barrier between S and X. 
In terms of the free energies of activation for the enzyme-catalyzed (e) and uncatalyzed 
(u) reactions, AG, < AG. 

There are important consequences for this statement. The enzyme must stabilize the 
transition-state complex, EX!, more than it stabilizes the substrate complex, ES. Put 
another way, enzymes bind the transition-state structure more tightly than the sub- 
strate (or the product). The dissociation constant for the enzyme-—substrate complex is 


[EIS] 
Ks = = 14.1 
= (14.1) 
and the corresponding dissociation constant for the transition-state complex is 
EJ[X* 
T- EI (14.2) 
[EX] 


Enzyme catalysis requires that Ky < Ks. According to transition-state theory, the rate 
constants for the enzyme-catalyzed (ką) and uncatalyzed (k,) reactions can be related to 
Ks and Ky by 


kelley < KslKt (14.3) 


Thus, the enzymatic rate enhancement is approximately equal to the ratio of the dis- 
sociation constants of the enzyme-substrate and enzyme-transition-state complexes, 
at least when E is saturated with S. 


(a) 


Free energy, G 


Reaction coordinate ——> 


(b) 


s = xt ——~P E+S =~ ES —— Ext 


> E+P 


Figure 14.1 Enzymes catalyze 


477 


reactions by lowering the activation 


energy. Here the free energy of 
activation for (a) the uncatalyzed 
reaction, AG,}, is larger than that 


for (b) the enzyme-catalyzed 
reaction, AG. 
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Reaction coordinate 


Figure 14.2 The intrinsic binding energy of the 
enzyme-substrate (ES) complex (AG,) is 
compensated to some extent by entropy loss due to 
the binding of E and S (TA S) and by destabilization 
of ES (AGa) by strain, distortion, desolvation, and 
similar effects. If AG, were not compensated by TAS 
and A G4, the formation of ES would follow the 
dashed line. 


Figure 14.3 Increasing the energy of the ES 
complex raises the enzyme-catalyzed reaction rate. 
(a) Catalysis does not occur if the ES complex and 
the transition state for the reaction are stabilized to 
equal extents. (b) Catalysis will occur if the 
transition state is stabilized to a greater extent than 
the ES complex (right). Entropy loss and 
destabilization of the ES complex AG, ensure that 
this will be the case. 


14.3 | How Does Destabilization of the Enzyme-Substrate 


Complex Affect Enzyme Catalysis? 


How is it that the transition state Xt is stabilized more than S at the enzyme active 
site? To understand this, consider the formation of the enzyme-substrate complex, ES. 
There are a number of important contributions to the free energy difference between 
the uncomplexed enzyme and substrate (E + S) and the ES complex (Figure 14.2). The 
favorable interactions between the substrate and amino acid residues on the enzyme 
account for the intrinsic binding energy, AG,. The intrinsic binding energy ensures the 
favorable formation of the ES complex, but if uncompensated, it makes the activation 
energy for the enzyme-catalyzed reaction unnecessarily large and wastes some of the 
catalytic power of the enzyme. Gs is partially compensated by entropy loss due to the 
binding of E and S (TAS) and by destabilization of ES (Ga) by strain, distortion, desol- 
vation, and similar effects. 

Compare the two cases in Figure 14.3. If the binding of the substrate is too tight 
(part a), there is no catalysis. On the other hand, if the ES complex is raised in energy 
(part b), catalysis can occur. Because the enzymatic reaction rate is determined by the 
difference in energies between ES and EX}, the smaller this difference, the faster the 
enzyme-catalyzed reaction. Tight binding of the substrate deepens the energy well of 
the ES complex and actually lowers the rate of the reaction. 

The message of Figure 14.3 is that raising the energy of ES will increase the 
enzyme-catalyzed reaction rate. As shown in this figure, this is accomplished in two 
ways: (1) loss of entropy due to the binding of S to E and (2) destabilization of ES by 
strain, distortion, desolvation, or other similar effects. The entropy loss arises from 
the formation of the ES complex (Figure 14.4), a highly organized (low-entropy) 
entity compared with E + S in solution (a disordered, high-entropy situation). 
Because AS is negative for this process, the term — TAS is a positive quantity, and the 
intrinsic binding energy of ES is compensated to some extent by the entropy loss that 
attends the formation of the complex. 

Destabilization of the ES complex can involve structural strain, desolvation, or 
electrostatic effects. Destabilization by strain or distortion is usually just a consequence 
of the fact (noted previously) that the enzyme is designed to bind the transition state more 
strongly than the substrate. When the substrate binds, the imperfect nature of the “fit” 
results in distortion or strain in the substrate, the enzyme, or both. 

Solvation of charged groups on a substrate in solution releases energy, making the 
charged substrate more stable. When a substrate with charged groups moves from water 
into an enzyme active site (Figure 14.4), the charged groups are often desolvated to 
some extent, becoming less stable and therefore more reactive. 

Similarly, when a substrate enters the active site, charged groups may be forced to 
interact (unfavorably) with charges of like sign, resulting in electrostatic destabilization 


(a) (b) 


Destabilization of ES 
facilitates catalysis 


No destabilization, 
thus no catalysis 
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14.4 


(a) 


Enzyme 


A high-entropy situation where 
both substrate and enzyme can 
undergo translational motion 


The highly ordered, low-entropy 
complex 


(b) 


® 
Sees 


7 y 
SES 
& 


Solvation shell around the substrate 


Desolvation of the substrate raises 
the energy of the ES complex 


(c) 


Charges on substrate stabilized 
by interactions with water 


Electrostatic destabilization 
raises the energy of the ES complex 


(Figure 14.4). The reaction pathway acts in part to remove this stress. If the charge on 
the substrate is diminished or lost in the course of reaction, electrostatic destabilization 
can result in rate acceleration. 

Whether by strain, desolvation, or electrostatic effects, destabilization raises the 
energy of the ES complex, and this increase is summed in the term AGg, the free energy 
of destabilization (see Figures 14.2 and 14.3). 


14.4 | What Are Transition State Analogs and 
How Tightly Do They Bind to the Active 
Sites of Enzymes? 


How much more tightly does the transition state bind to an enzyme compared to sub- 
strates, products, and inhibitor molecules? To answer this important question, consider 
the magnitudes of Kgs and Ky and the values of k, and ku in Equation 14.3. The ratio k./ky 
may exceed 10!°, as noted previously. Given a ratio of 10'° and a typical Ks of 1074 M, the 
value of Kr would be 107% M. The value of Ky for fructose-1,6-bisphosphatase is an 
astounding 7 X 107%% M! This is the dissociation constant for the transition state from the 
enzyme, and this very low value corresponds to very tight binding of the transition state 
by the enzyme. 


What Are Transition State Analogs and How Tightly Do They Bind to the Active Sites of Enzymes? 
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Figure 14.4 Interactions that can destabilize the ES 
complex, increasing the rate of the enzyme-catalyzed 
reaction. (a) Formation of the ES complex results in 
entropy loss. Before binding, E and S are free to 
undergo translational and rotational motion. The 
ES complex is a more highly ordered, low-entropy 
complex. (b) Substrates typically lose waters of 
hydration in the formation of the ES complex. 
Desolvation raises the energy of the ES complex, 
making it more reactive. (c) Electrostatic 
destabilization of a substrate may arise from 
juxtaposition of like charges in the active site. If 
such charge repulsion is relieved in the course of the 
reaction, electrostatic destabilization can result in a 
rate increase. 
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A Deeper Look 


Transition-State Analogs Make Our World Better 


Enzymes (human, plant, and bacterial) are often targets for drugs agents that improve the lives of millions and millions of people. A 
and other beneficial agents. Transition-state analogs (TSAs), with few applications of transition-state analogs for human health and 
very high affinities for their enzyme-binding sites, often make ideal for agriculture are shown here. 

enzyme inhibitors, and TSAs have become ubiquitous therapeutic 


Enalapril and Aliskiren Lower Blood Pressure H O 
High blood pressure is a significant risk factor C@H,CH,OOC Nes 
for cardiovascular disease, and drugs that lower H N 
blood pressure reduce the risk of heart attacks, ČH, 
heart failure, strokes, and kidney disease. Blood 
pressure is partly regulated by aldosterone, a 
steroid synthesized in the adrenal cortex and 
released in response to angiotensin II, a peptide 
produced from angiotensinogen in two proteo- 
lytic steps by renin (an aspartic protease) and 
angiotensin-converting enzyme (ACE). Vasotec 
(enalapril) manufactured by Merck and Biovail 
is an ACE inhibitor. Novartis and Speedel have 
developed Tekturna (aliskiren) as a renin inhibi- 
tor. Both are TSAs. 


HOOC 


Enalapril 


Courtesy of James Gathany/CDC 


Atorvastatin (Lipitor) 


AP Images/Amy Sancetta 


Statins Lower Serum Cholesterol 
Statins such as Lipitor are powerful cholesterol-lowering drugs, bo 
because they are transition-state analog inhibitors of HMG-CoA NE O. NH 
reductase, a key enzyme in the biosynthetic pathway for cholesterol ii Oy Se 
(discussed in Section 24.4). Z N | Ä 
Protease Inhibitors Are AIDS Drugs N : N N H 
Crixivan (indinavir) by Merck, Invirase (saquinavir) by Roche, and O = NH, OH 
similar protease inhibitor drugs are transition-state analogs for the ba 
HIV-1 protease, discussed in Section 14.6. O H 

Saquinavir 

(Continued) 
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Tamiflu Is a Viral Neuraminidase Inhibitor 

Influenza is a serious respiratory illness that affects 5% to 15% of the 
earth’s population each year and results in 250,000 to 500,000 deaths 
annually, mostly among children and the elderly. Protection from 
influenza by vaccines is limited by the antigenic variation of the influ- 
enza virus. Neuraminidase, a major glycoprotein on the influenza virus 
membrane envelope, is essential for viral replication and infectivity. 
Tamiflu is a neuraminidase inhibitor and antiviral agent that mimics 
the transition state of the neuraminidase reaction. 


Tamiflu 


Juvenile Hormone Esterase Is a Pesticide Target 

Insects have significant effects on human health, being the transmit- 
ting agents (vectors) for diseases such as malaria, West Nile virus, and 
viral encephalitis, all carried by mosquitoes, and Lyme disease and 
Rocky Mountain spotted fever, carried by ticks. One strategy for con- 
trolling insect populations is to alter the actions of juvenile hormone, 
a terpene-based substance that regulates insect life cycle processes. 
Levels of juvenile hormone are controlled by juvenile hormone ester- 
ase (JHE), and inhibition of JHE is toxic to insects. OTFP (figure) is 
a potent transition-state analog inhibitor of JHE. 


Courtesy of the Otis Historical Archives/National Museum of Health and Medicine 


The 1918 flu pandemic killed more than 20 million people worldwide. 


3-Octylthio-1,1,1-trifluoropropan-2-one (OTFP) 


Darwin Dale/Science Source 


Courtesy of James Gathany/CDC 


How Many Other Drug Targets Might There Be? 

If the human genome contains approximately 20,500 genes, how 
many of these might be targets for drug therapy? Andrew Hopkins 
has proposed the term “druggable genome” to conceptualize the sub- 
set of human genes that might express proteins able to bind druglike 
molecules. The DrugBank database at http://drugbank.com contains 
more than 20,000 Food and Drug Administration (FDA) approved 
prescription drugs. Nearly half of these are directed specifically 
to enzymes. More than 18,000 experimental drugs are presently 
under study and testing. It is easy to imagine that thousands more 
drugs will eventually be developed, with many of these designed as 
transition-state analogs for enzyme reactions. 


It is unlikely that such tight binding in an enzyme transition state will ever be determined 
in a direct equilibrium measurement, however, because the transition state itself is a moving 
target. It exists only for about 10~'* to 107" sec, less than the time required for a bond vibra- 
tion. On the other hand, the nature of the elusive transition state can be explored using 
transition-state analogs, stable molecules that are chemically and structurally similar to the 
transition state. Such molecules should bind more strongly than a substrate and more 
strongly than competitive inhibitors that bear no significant similarity to the transition state. 
Hundreds of examples of such behavior have been reported. For example, Robert Abeles 
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studied a series of inhibitors of proline racemase (Figure 14.5) and found that pyrrole- 
2-carboxylate bound to the enzyme 160 times more tightly than L-proline, the normal sub- 
strate. This analog binds so tightly because it is planar and is similar in structure to the 
planar transition state for the racemization of proline. Two other examples of transition- 
state analogs are shown in Figure 14.6. Phosphoglycolohydroxamate binds 40,000 times 


Proline racemase reaction 


Ht H+ 
E 
À À 1 
H | H _| H 
L-Proline Planar transition p-Proline 
state 


Figure 14.5 The proline racemase reaction LA ea 
interconverts the L- and D-isomers of proline. (Pink COO +7— COO 
highlights show the two configurations of proline.) N N 
Pyrrole-2-carboxylate and A-1-pyrroline-2- il d 


carboxylate mimic the planar transition state of the 
reaction. Pyrrole-2-carboxylate A-1-Pyrroline-2-carboxylate 


(a) Yeast aldolase reaction 


CH,OPO37— 
Pe Glyceraldehyde- 
COROS 3-phosphate c=0 
ome —_ 5 HO—C —H 
HO — CHo H—¢' 0H 
Enediolate 
E e] (Transition state) H —C —OH 
CH,OPO37— 
Fructose-1,6- 
bisphosphate 


Phosphoglycolohydroxamate 


(b) Calf intestinal adenosine deaminase reaction 


NH, Oo 
LO N N 
H 
N EO m k I > 
WN L 
N N 
N< ON NO R 


Adenosine Transition state 


Figure 14.6 Examples of transition state analogs. 
(a) Phosphoglycolohydroxamate is an analog of the 
enediolate transition state of the yeast aldolase 
reaction. (b) Purine riboside, a potent inhibitor of 
the calf intestinal adenosine deaminase reaction, Hydrated form of 

binds to adenosine deaminase as the 1,6-hydrate. purine ribonucleoside 

The hydrated form of purine riboside is an analog [K8 xIM Po 


of the proposed transition state for the reaction. 
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Chemistry in History 


> | 


Dr. John Cornforth 


John Cornforth (1918-2013) earned a B.S. in organic 
chemistry at the University of Sydney in Sydney, 
Australia. The next year, he and another organic 
chemist from Sydney, Rita Harradence, won pres- 
tigious scholarships to study abroad. Both chose 
to go to Oxford to do their doctoral work with Sir 
Robert Robinson, studying cholesterol biosynthesis. 
As World War II unfolded, Robinson’s lab turned to 
study of methods for preparing penicillin and Rob- 
inson assigned Cornforth and Harradence to work 
together on the synthesis of penicillamine, to aid the 
Allies. Cornforth and Harradence married while still 
in graduate school and continued their collabora- 
tion, authoring 41 papers together over their careers. 


Dr. Cornforth suffered from otosclerosis which 
led to progressive hearing loss that left him com- 
pletely deaf in his early 20s. He attributed his pur- 
suit of chemistry as a career to a talented teacher at 
his high school and the impression that his impend- 
ing deafness would not be much of a hinderance in 
the field. Cornforth received the 1975 Nobel Prize 
in Chemistry for his studies of the stereochemistry 
of the enzymes involved in cholesterol biosynthesis. 
In his Nobel biography, Cornforth said of his wife 
Rita: “she has eased for me beyond measure the dif- 
ficulties of communication that accompany deaf- 
ness; her encouragement and fortitude have been my 
strongest supports.” 


more tightly to yeast aldolase than the substrate dihydroxyacetone phosphate. Even more 
remarkable, the 1,6-hydrate of purine ribonucleoside has been estimated to bind to adenos- 
ine deaminase with a Kj of 3 X 1078 M! 

It should be noted that transition-state analogs are only approximations of the tran- 
sition state itself and will never bind as tightly as would be expected for the true 
transition state. These analogs are, after all, stable molecules and cannot be expected to 
resemble a true transition state too closely. 


14.5 | What Are the Mechanisms of Enzyme Catalysis? 


Enzymes employ a variety of chemical strategies to catalyze reactions. This section 
describes those found most often in enzymes — near-attack conformations, protein 
dynamics, covalent catalysis, general acid-base catalysis, low-barrier hydrogen bonds, 
quantum mechanical tunneling, and metal-ion catalysis. 


14.5a Enzymes Facilitate Formation of Near-Attack Conformations 


Exquisite and beautiful details of enzyme active-site structure and dynamics have 
emerged from X-ray crystal structures of enzymes and computer simulations of molecu- 
lar conformation and motion at the active site. Importantly, these studies have shown 
that the reacting atoms and catalytic groups are precisely positioned for their roles. This 
preorganization of the active site allows it to select and stabilize conformations of the 
substrate(s) in which the reacting atoms are in van der Waals contact and at an angle 
resembling the bond to be formed in the transition state. Thomas Bruice has termed such 
arrangements near-attack conformations (NACs), and he has proposed that NACs are the 
precursors to transition states of reactions (Figure 14.7). In the absence of an enzyme, 
potential reactant molecules adopt a NAC only about 0.0001% of the time. On the other 
hand, NACs have been shown to form in enzyme active sites from 1% to 70% of the time. 

The alcohol dehydrogenase (ADH) reaction provides a good example of a NAC on 
the pathway to the reaction transition state (Figure 14.8). The ADH reaction converts a 
primary alcohol to an aldehyde (through an ordered, single-displacement mechanism; 
see Section 13.5). The reaction proceeds by a proton transfer to water followed by a 
hydride transfer to NAD*. In the enzyme active site, Ser*® accepts the proton from the 
alcohol substrate, the resulting negative charge is stabilized by a zinc ion, and the sub- 
strate pro-R hydrogen is poised above the NAD! ring prior to hydride transfer (Figure 
14.8). Computer simulations of the enzyme-substrate complex involving the deproton- 
ated alcohol show that this intermediate exists as a NAC 60% of the time. The kinetic 
advantage of such an enzymatic reaction, compared with its nonezymatic counterpart, 
is the ease of formation of the NAC and the favorable free energy difference between the 
NAC and the transition state (see Figure 14.7). 


Free energy, G 


Reaction coordinate 


Figure 14.7 Reacting atoms and catalytic groups 
must be precisely positioned to form near-attack 
conformations (NACs). (a) For reactions involving 
bonding between O, N, C, and S atoms, NACs are 
characterized as having reacting atoms within 3.2 A 
and an approach angle of +15° of the bonding angle 
in the transition state. (b) In an enzyme active site, 
the enzyme-substrate complex and the NAC are 
separated by a small energy barrier, and NACs form 
readily. In the absence of the enzyme, the energy gap 
between the substrate and the NAC is much greater 
and NACs are rarely formed. The energy separation 
between the NAC and the transition state is 
approximately the same in the presence and absence 
of the enzyme. (Adapted from Bruice, T., 2002. A 
view at the millennium: the efficiency of enzymatic 
catalysis. Accounts of Chemical Research 35:139-148.) 
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A Deeper Look 


How to Read and Write Mechanisms 


The custom among chemists and biochemists of writing chemical 
reaction mechanisms with electron dots and curved arrows began 
with two of the greatest chemists of the twentieth century. Gilbert 
Newton Lewis was the first to suggest that a covalent bond consists 
of a shared pair of electrons, and Sir Robert Robinson was the first 
to use curved arrows to illustrate a mechanism in a paper in the 
Journal of the Chemical Society in 1922. 

Learning to read and write reaction mechanisms should begin with 
a review of Lewis dot structures in any good introductory chemistry 
text. It is also important to understand valence electrons and formal 
charge. The formal charge of an atom is calculated as the number 
of valence electrons minus the electrons owned by an atom. More 
properly 


Formal charge = group number — 
nonbonding electrons — (1/2 shared electrons) 


Students of mechanisms should also appreciate electronegativity— 
the tendency of an atom to attract electrons. Electronegativities of the 
atoms important in biochemistry go in the order 


F>O>N>CO}H 


Thus, in a C-N bond, the N should be viewed as more electron 
rich and C as electron deficient. An electron-rich atom is termed nu- 
cleophilic and will have a tendency to react with an electron-deficient 
(electrophilic) atom. 

In written mechanisms, a curved arrow shows the movement of 
an electron pair from its original position to a new one. The tail of 
the arrow shows where the electron pair comes from, and the head 
of the arrow shows where the electron pair is going. Thus, the ar- 
row represents the actual movement of a pair of electrons from a 
filled orbital into an empty one. By convention, an arrow with a full 
arrowhead -~ represents movement of an electron pair, whereas a 
half arrowhead -N represents a single electron (for example, in a free 
radical reaction). For a bond-breaking event, the arrow begins in the 
middle of the bond, and the arrowhead points at the atom that will 
accept the electrons: 


(X 


A+B —> A Se BR 


For a bond-making event, the arrow begins at the source of 
the electrons (for example, a nonbonded pair), and the arrowhead 
points to the atom where the new bond will be formed: 


at po — > A—B 


It has been estimated that 75% of the steps in enzyme reaction 
mechanisms are proton (H*) transfers. If the proton is donated or 
accepted by a group on the enzyme, it is often convenient (and tradi- 
tional) to represent that group as B, for base, even if B is protonated 
and behaving as an acid: 


BORN — >» B+ HIND 
~ ~ 


It is important to appreciate that a proton transfer can change a 
nucleophile into an electrophile, and vice versa. Thus, it is necessary 
to consider (1) the protonation states of substrate and active-site 
residues and (2) how pK, values can change in the environment of 
the active site. For example, an active-site histidine, which might 
normally be protonated, can be deprotonated by another group and 
then act as a base, accepting a proton from the substrate: 


| ga HB — 
FY 
HN N-H — > HN N 
\Z4 + Z a 
H—O. 
CHo 
HtB— 
Ser 
HN, ,N-H Ya 
-O 
SCH, 
Ser 


Water can often act as an acid or base at the active site through 
proton transfer with an assisting active-site residue: 


= f = n 
/ \ —O / \ | 
— > HN. ,N-H + R—C—O-—R'’ 
NZ +t i 
OH 


These concepts provide a sense of what is reasonable and what 
makes good chemical logic in a reaction. 


14.5b Protein Motions Are Essential to Enzyme Catalysis 


Proteins are constantly moving. As noted in Chapter 6 (Table 6.2), bonds vibrate, side 
chains bend and rotate, backbone loops wiggle and sway, and whole domains move 
with respect to each other. Enzymes depend on such motions to initiate and direct 
catalytic events. Protein motions may support catalysis in several ways: Active site 
conformation changes can 


e Assist substrate binding 
e Bring catalytic groups into position around a substrate 
e Induce formation of a NAC 
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Benzyl 
j ee alcohol 
(substrate) 


(a) ® 


e Assist in bond making and bond breaking 
e Facilitate conversion of substrate to product 


A good example of protein motions facilitating catalysis is human cyclophilin A, which 
catalyzes the interconversion between cis and trans conformations of proline in peptides 
(Figure 14.9). NMR studies of cyclophilin A have provided direct measurements of the 
active-site motions occurring in this enzyme. Certain active-site residues (Lys*”, Leu®’, Ser”, 
Ala!!!) Gln!, Ala!, and Gly!) of the enzyme undergo conformation changes during 
substrate binding, whereas Arg® is involved directly in the cis-to-trans interconversion itself. 
The residues undergoing motion during catalysis are shown in Figure 14.10. 

The protein motions that assist catalysis may be quite complex. Stephen Benkovic and 
Sharon Hammes-Schiffer have characterized an extensive network of coupled protein 
motions in dihydrofolate reductase. This network extends from the active site to the surface 
of the protein, and the motions in this network span time scales of femtoseconds (10715 sec) 


485 


Figure 14.8 The complex of horse liver ADH with 
benzyl alcohol illustrates the approach to a near- 
attack conformation. Computer simulations by 
Bruice and co-workers show that the side-chain 
oxygen of Ser! approaches within 1.8 A of the 
hydroxyl hydrogen of the substrate, benzyl alcohol, 
and that the pro-R hydrogen of benzyl alcohol lies 
2.75 A from the C-4 carbon of the nicotinamide 
ring. The reaction mechanism involves hydroxy] 
proton abstraction by Ser“ and hydride transfer 
from the substrate to C-4 of the NAD+ 
nicotinamide ring (PDB id = 1HLD). 


Figure 14.9 Human cyclophilin A is a prolyl 
isomerase. (a) This enzyme catalyzes the 
interconversion of trans and cis conformations 

of proline in peptides. (b) The active site of 
cyclophilin with a bound peptide containing 
proline in trans (left) and cis (right) conformations 
(PDB id = IRMH). 
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Figure 14.10 Catalysis in enzyme active sites 
depends on motion of active-site residues. NMR 
studies by Dorothee Kern and her co-workers show 
that several cyclophilin active-site residues, including 
Arg” (red dot) and Lys*”, Leu’, Ser”, Ala!°!, Gin!”, 
Ala! , and Gly'!™ (green dots), undergo greater 
motion during catalysis than residues elsewhere in 
the protein. (Adapted from Eisenmesser, E., et al., 
2002. Enzyme dynamics during catalysis. Science 
295:1520-1523.) 


Figure 14.11 Examples of covalent bond formation 
between enzyme and substrate. In each case, a 
nucleophilic center (X:) on an enzyme attacks an 
electrophilic center on a substrate. 
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to milliseconds. Such extensive networks of motion make it likely that the entire folded 
structure of the protein may be involved in catalysis at the active site. 


14.5c Covalent Catalysis 


Some enzyme reactions derive much of their rate acceleration from the formation of 
covalent bonds between enzyme and substrate. Consider the reaction: 


BX + Y—> BY + X 


and an enzymatic version of this reaction involving formation of a covalent intermediate: 
BX + E—> E:B + X + Y — E + BY 


If the enzyme-catalyzed reaction is to be faster than the uncatalyzed case, the acceptor 
group on the enzyme must be a better attacking group than Y and a better leaving group 
than X. Note that most enzymes that carry out covalent catalysis have ping-pong kinetic 
mechanisms. 

The side chains of amino acids in proteins offer a variety of nucleophilic centers for 
catalysis, including amines, carboxylates, aryl and alkyl hydroxyls, imidazoles, and thiol 
groups. These groups readily attack electrophilic centers of substrates, forming cova- 
lently bonded enzyme-substrate intermediates. Typical electrophilic centers in sub- 
strates include phosphoryl groups, acyl groups, and glycosyl groups (Figure 14.11). The 


e 
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N a | 

R—O-P OR A ROZP SF Oe t RO 
* ai 2 @ 
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IP 2. | 
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Acyl enzyme 
HOCH, HOCH, 
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n 
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N 
OH OH 6) 
@- X Glucosyl enzyme 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


14.5 What Are the Mechanisms of Enzyme Catalysis? 


| Table 14.2 _| 14.2 Enzymes That Form Covalent Intermediates 


Enzyme Reacting Group Covalent Intermediate 
Trypsin Serine Acyl-Ser 
Chymotrypsin (Section 14.6) 

Glyceraldehyde-3-P Cysteine Acyl-Cys 
dehydrogenase (Section 18.4) 

Phosphoglucomutase Serine Phospho-Ser 
(Section 15.5) 

Phosphoglycerate mutase Histidine Phospho-His 


(Section 18.4) 
Succinyl-CoA synthetase 
(Section 19.4) 


Aldolase (Section 18.3) Lysine and other Schiff base 
Pyridoxal phosphate enzymes amino groups 
(Sections 13.5, 25.4, and 25.5) 


covalent intermediates thus formed can be attacked in a subsequent step by a water 
molecule or a second substrate, giving the desired product. Covalent electrophilic 
catalysis is also observed, but it usually involves coenzyme adducts that generate elec- 
trophilic centers. Hundreds of enzymes are now known to form covalent intermediates 
during catalysis. Several examples of covalent catalysis will be discussed in detail in later 
chapters, as noted in Table 14.2. 


14.5d General Acid-Base Catalysis 


Nearly all enzyme reactions involve some degree of acid or base catalysis. There are 
two types of acid-base catalysis: (1) specific acid-base catalysis, in which the reaction 
is accelerated by H* or OH™ diffusing in from the solution, and (2) general acid-base 
catalysis, in which H* or OH is created in the transition state by another molecule or 
group, which is termed the general acid or general base, respectively. By definition, 
general acid-base catalysis is catalysis in which a proton is transferred in the transition 
state. Consider the hydrolysis of p-nitrophenylacetate by specific base catalysis or by 
general base catalysis, with imidazole acting as a general base (Figure 14.12). In the 


Reaction 
O O 


| Il 
CH;C— O < Xo, + HO === cH,C—O- + HO < \-vo, + Ht 
Specific base mechanism 
(9 
Il 


CHC — O < \- NO» 
ss 


TOH 


General base mechanism i 

O O 
S Pa N 
g < \- NO, 


Figure 14.12 Catalysis of p-nitrophenylacetate hydrolysis can occur either by specific base hydrolysis 
(where hydroxide from the solution is the attacking nucleophile) or by general base catalysis (in which 
a base like imidazole can promote hydroxide attack on the substrate carbonyl carbon by removing a 
proton from a nearby water molecule). 
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specific base mechanism, hydroxide diffuses into the reaction from solution. In the 
general base mechanism, the hydroxide that catalyzes the reaction is generated from 
water in the transition state. The water has been made more nucleophilic without 
generation of a high concentration of OH- or without the formation of unstable, 
high-energy species. General acid or general base catalysis may increase reaction 
rates 10- to 100-fold. In an enzyme, ionizable groups on the protein provide the H* 
transferred in the transition state. An ionizable group will be most effective as an H* 
transferring agent at or near its pK,. Because the pK, of the histidine side chain is near 
7, histidine is often the most effective general acid or base. 


14.5e Low-Barrier Hydrogen Bonds 


Low-barrier hydrogen bonds contribute to catalysis in some enzyme reactions. As 
previously noted, the typical strength of a hydrogen bond is 10 to 30 kJ/mol. For an 
O—H— O hydrogen bond, the O— O separation is typically 0.28 nm and the H bond 
is a relatively weak electrostatic interaction. The hydrogen is firmly linked to one of 
the oxygens at a distance of approximately 0.1 nm, and the distance to the other oxy- 
gen is thus about 0.18 nm, which corresponds to a bond order of about 0.07. (Bond 
order refers to the number of electron pairs in a bond. For a single bond, the bond 
order is 1.) Not all hydrogen bonds are weak, however. As the distance between het- 
eroatoms becomes smaller, the overall bond becomes stronger, the hydrogen becomes 
centered, and the bond order approaches 0.5 for both O —H interactions (Figure 14.13). 
These interactions are more nearly covalent in nature, and the stabilization energy is 
much higher. Notably, the barrier that the hydrogen atom must surmount to exchange 
oxygens becomes lower as the O—O separation decreases (Figure 14.13). When the 
barrier-to-hydrogen exchange has dropped to the point that it is at or below the zero- 
point energy level of hydrogen, the interaction is referred to as a low-barrier hydrogen 
bond (LBHB). The hydrogen is now free to move anywhere between the two oxygens 
(or, more generally, two heteroatoms). The stabilization energy of LBHBs may 
approach 100 kJ/mol in the gas phase and 60 kJ/mol or more in solution. LBHBs 
require matched pK,s for the two electronegative atoms that share the hydrogen. As 
the two pK, values diverge, the stabilization energy of the LBHB is decreased. Widely 
divergent pK, values thus correspond to ordinary, weak hydrogen bonds. 

How may low-barrier hydrogen bonds affect enzyme catalysis? A weak hydrogen 
bond in an enzyme ground state may become an LBHB in a transient intermediate, or 
even in the transition state for the reaction. In such a case, the energy released in form- 
ing the LBHB is used to help the reaction that forms it, lowering the activation barrier 
for the reaction. Alternatively, the purpose of the LBHB may be to redistribute electron 


(a) (b) (c) (d) 


O—H O Ores He eeeee O Osses Hesses Oo 


kK-+0.28nm—>| ke025nm>|  fko.23 nm>| 


Figure 14.13 Comparison of (a) conventional (weak) hydrogen bonds and (b, c) low-barrier hydrogen 
bonds (b and c). The horizontal line in each case is the zero-point energy of hydrogen. (a) An O—H—O 
hydrogen bond of length 0.28 nm, with the hydrogen attached to one or the other of the oxygens. The 
bond order for the stronger O—H interaction is approximately 1.0, and the weaker O—H interaction is 
0.07. As the O—O distance decreases, the hydrogen bond becomes stronger, and the bond order of the 
weakest interaction increases. In (b), the O—O distance is 0.25 nm, and the barrier is equal to the zero- 
point energy. In (c), the O—O distance is 0.23 to 0.24 nm, and the bond order of each O—H interaction 
is 0.5. (d) If the distance for particle (proton or electron) transfer is sufficiently small, the overlap of 
probability functions (red curves) permits efficient quantum mechanical tunneling between donor (D) 
and acceptor (A) groups. 
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density in the reactive intermediate, achieving rate acceleration by facilitation of hydro- 
gen tunneling. Enzyme mechanisms that will be examined in the next section and also 
later in this chapter (the serine proteases and aspartic proteases) appear to depend upon 
one or the other of these effects. 


14.5f Quantum Mechanical Tunneling in Electron and Proton Transfers 


The fundamental premise of transition-state theory, as described in this chapter, is that 
reactants cannot become products until they acquire enough energy to distort their struc- 
tures to that of the transition state. However, quantum mechanical tunneling offers a path 
around this energy barrier, and many enzymes exploit it. According to quantum theory, 
if an atom or electron is transferred in a chemical reaction from one site to another across 
an activation barrier, there is a finite probability that the particle will appear (as part of 
the product) on the other side of the energy barrier, even though it cannot achieve suffi- 
cient energy to reach the transition state. The likelihood of tunneling depends on the 
distance over which the particle must move and the shape (width and height) of the 
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Human Biochemistry 


Antibiotic Resistance by Superbugs 


Penicillin (Figure 13.18), the first antibiotic drug, became widely avail- 
able in the 1940s, but treating bacterial infections with penicillin and 
other antibiotics precipitated the emergence of antibiotic-resistant 
bacteria. Over the last 70 years, whenever a new antibiotic has been 
introduced, resistant strains of bacteria have appeared quickly—a 
history that evokes images of war. Whenever challenged by a new anti- 
biotic, bacteria fight back with the powerful weapon of spontaneous 
mutations that confer resistance. Chemists retaliate by modifying exist- 
ing drugs or developing new classes of antibiotics, and the battles rage 
back and forth. In recent years, outbreaks of superbugs (in Chicago, 
Los Angeles, and other locations around the world) that are resistant 
to even the most powerful antibiotics and the decline in new antibiotics 
coming to market give evidence that the bacteria are winning. 


HO 


Hf COOH 
Amoxicillin Clavulanic Acid 


Amoxicillin and other B-lactam antibiotics, as noted in Figure 13.18, are 
suicide inhibitors that inhibit by binding covalently and irreversibly to the 
bacterial enzymes involved in essential peptidoglycan-synthesizing enzymes. 
Bacteria respond to antibiotic challenge with B-lactamases that hydrolyze 
and inactivate the drug. For this reason, clavulanic acid, an irreversible 
inhibitor of B-lactamases, is often administered along with amoxicillin, to 
increase its effectiveness. 


The designation as a superbug includes the bacteria that have 
developed resistance even to the carbapenems, a relatively new 
and particularly powerful class of antibiotics. These carbapenem- 
resistant Gram-negative pathogens (CRGNPs), including the 
Enterobacteriaceae, were involved in the outbreaks cited earlier. 
Carbapenem resistance in these organisms is ascribed to NDM-1, 
a metallo-B-lactamase (which depends on a catalytic Zn?* in the 
active site). It is particularly concerning that the NDM-1 gene is 


carried by a plasmid in many of the resistant strains that have been 
studied, allowing the gene to be transferred readily between dif- 
ferent strains of bacteria, thus conferring antibiotic resistance to 
organisms that previously had none. Of even greater concern, no 
current drugs are effective against metallo-B-lactamases. However, 
Gerard Wright and colleagues have shown that aspergillomaras- 
mine (AMA), a fungal metabolite, is a potent inhibitor of NDM-1, 
raising the intriguing possibility that AMA or a similar metallo- 
B-lactamase inhibitor may someday be used as a co-treatment with 
carbapenems as a useful therapy against superbugs. 
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Figure 14.14 Thermolysin is an endoprotease (it 
cleaves polypeptides in the middle of the chain) with 
a catalytic Zn?* ion in the active site. The Zn** ion 
stabilizes the buildup of negative charge on the 
peptide carbonyl oxygen, as a glutamate residue 
deprotonates water, promoting hydroxide attack on 
the carbonyl carbon. Thermolysin is found in certain 
laundry detergents, where it is used to remove 
protein stains from fabrics. 


potential energy barrier that separates reactant from product. Wave/particle duality in 
quantum theory implies that a wavelength A can be calculated for a particle of mass m 
with a given kinetic energy FE, according to the de Broglie equation: 

A= i 14.4) 

VamE um 

where / is Planck’s constant. Tunneling can only play a significant role in a reaction 
when the wavelength of the transferring particle is similar to the distance over which 
it is transferred. In such cases, the overlap of probability functions for the particle 
in the reactant and product states is sufficient to permit effective tunneling 
(Figure 14.13d). 

In enzyme reactions, only electrons and hydrogen (H*, H-, or H:) satisfy this condi- 
tion. Consider the low likelihood of tunneling for atoms larger than hydrogen. 
For example, the de Broglie wavelength of a carbon atom, with a mass of 12 Da, is 
approximately 0.25 A, a distance too short for tunneling in any reaction involving car- 
bon transfer. Note also that tunneling permits the transfer of electrons over substantial 
molecular distances. Assuming a kinetic energy of 10 kJ/mole, Equation 14.4 yields 
de Broglie wavelengths for the electron and hydrogen of 38 A and 0.9 A, respectively. 
The latter wavelength is on the order of distances involved in proton, hydrogen, and 
hydride transfers in enzyme reactions, and it’s likely that tunneling is a contributing fac- 
tor in most, if not all, hydrogen transfer reactions. 


14.5g Metal lon Catalysis 


Many enzymes require metal ions for maximal activity. If the enzyme binds the metal 
very tightly or requires the metal ion to maintain its stable, native state, it is referred to 
as a metalloenzyme. Enzymes that bind metal ions more weakly, perhaps only during the 
catalytic cycle, are referred to as metal activated. One role for metals in metal-activated 
enzymes and metalloenzymesis to act as electrophilic catalysts, stabilizing the increased 
electron density or negative charge that can develop during reactions. Thermolysin is 
one of the enzymes that function in this manner (Figure 14.14). Another potential func- 
tion of metal ions is to provide a powerful nucleophile at neutral pH. Coordination to a 
metal ion can increase the acidity of a nucleophile with an ionizable proton: 


M2+ + NucH — = M?2* (NucH) ——= M?* (Nuc-) + H+ 


The reactivity of the coordinated, deprotonated nucleophile is typically intermediate 
between that of the un-ionized and ionized forms of free nucleophile. Carboxypepti- 
dase, a protease that cleaves residues from the C-terminus of many proteins, contains 
an active site Zn**, which facilitates deprotonation of a water molecule in this manner. 


14.5h Noncatalytic Residues May Still Play a Role in 
Catalysis in Enzyme Active Sites 


Only about half of the common amino acid residues (the polar residues, especially those 
with dissociable protons—His, Cys, Asp, Glu, Arg, Lys, Tyr, Ser, Thr, Asn, and Gln) 
engage directly in catalytic effects in enzyme active sites. However, the catalytic actions 
of these residues can be influenced by other polar and charged residues at the active site 
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and even by hydrophobic residues. The so-called secondary, or noncatalytic, residues at 
enzyme active sites play a variety of roles: 


e Raising or lowering catalytic residue pK, values through electrostatic or hydropho- 
bic interactions. 

e Orientation of catalytic residues, as will be seen in the serine proteases, where 
Asp! orients His” (see Figure 14.21). 

e Charge stabilization, as will be seen in chorismate mutase, where active-site 
arginines stabilize negatively charged carboxyl groups on the substrate 
(see Figures 14.31 and 14.33). 

e Proton transfers via hydrogen tunneling. In such quantum mechanical tunneling, 
the proton transfer is accomplished by molecular motions that lead to degeneracy 
of a pair of localized proton vibrational states (Figure 14.13). Proton tunneling 
can be facilitated by nearby molecular motions of secondary residues coupled to 
the motion and vibration of the bonds in question. 


14.6 | What Can Be Learned from the Mechanisms 
of Some Typical Enzymes? 


The rest of this chapter will be devoted to several classic and representative enzyme 
mechanisms, including those of the serine proteases, the aspartic proteases, and choris- 
mate mutase. Both the serine proteases and the aspartic proteases use general acid-base 
catalysis chemistry; the serine proteases also employ a covalent catalysis strategy. Cho- 
rismate mutase, on the other hand, uses neither of these and depends instead on the 
formation of a NAC to carry out its reaction. These particular cases are well under- 
stood, because the three-dimensional structures of the enzymes and the bound sub- 
strates are known at atomic resolution and because great efforts have been devoted to 
kinetic and mechanistic studies. They are important because they represent reaction 
types that appear again and again in living systems and because they demonstrate many 
of the catalytic principles cited previously. Enzymes are the catalytic machines that sus- 
tain life, and what follows is an intimate look at the inner workings of the machinery. 


14.6a Serine Proteases 


Serine proteases are a class of proteolytic enzymes whose catalytic mechanism is based 
on an active-site serine residue. Serine proteases are one of the best-characterized 
families of enzymes. This family includes trypsin, chymotrypsin, elastase, thrombin, 
subtilisin, plasmin, tissue plasminogen activator, and other related enzymes. The first 
three of these are digestive enzymes and are synthesized in the pancreas and secreted 
into the digestive tract as inactive proenzymes, or zymogens. Within the digestive tract, 
the zymogen is converted into the active enzyme form by cleaving off a portion of the 
peptide chain. Thrombin is a crucial enzyme in the blood-clotting cascade, subtilisin 
is a bacterial protease, and plasmin breaks down the fibrin polymers of blood clots. 
Tissue plasminogen activator (TPA) specifically cleaves the proenzyme plasminogen, 
yielding plasmin. Due to its ability to stimulate breakdown of blood clots, TPA can 
minimize the harmful consequences of a heart attack if administered to a patient 
within 30 minutes of onset. Finally, although not itself a protease, acetylcholinesterase 
is a serine esterase and is related mechanistically to the serine proteases. It degrades 
the neurotransmitter acetylcholine in the synaptic cleft between neurons. 


14.6b The Digestive Serine Proteases 


Trypsin, chymotrypsin, and elastase all carry out the same reaction—the cleavage of 
a peptide chain—and although their structures and mechanisms are quite similar, 
they display very different specificities. Trypsin cleaves peptides on the carbonyl side 
of the basic amino acids, arginine or lysine. Chymotrypsin prefers to cleave on the 
carbonyl side of aromatic residues, such as phenylalanine and tyrosine. Elastase is not 
as specific as the other two; it mainly cleaves peptides on the carbonyl side of small, 
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Chymotrypsinogen Trypsinogen Elastase 
H C 


Figure 14.15 Comparison of the amino acid 
sequences of chymotrypsinogen, trypsinogen, and 
elastase. Each circle represents one amino acid. 
Numbering is based on the sequence of 
chymotrypsinogen. Filled circles indicate residues 
that are identical in all three proteins. Disulfide 
bonds are indicated in orange. The positions of the 
three catalytically important active-site residues 
(His*’, Asp!”, and Ser!°°) are indicated. Scissors 
indicate positions of proteolytic cleavage that 
activates these enzymes. 


neutral residues. These three enzymes all possess molecular weights in the range of 
25,000, and all have similar sequences (Figure 14.15) and three-dimensional struc- 
tures. The structure of chymotrypsin is typical of the three enzymes (Figure 14.16). 
The molecule is ellipsoidal in shape and contains an a-helix at the C-terminal end 
(residues 230 to 245) and several B-sheet domains. Most of the aromatic and hydro- 
phobic residues are buried in the interior of the protein, and most of the charged or 
hydrophilic residues are on the surface. Three polar residues—His*’, Asp!, and 
Ser?5—form what is known as a catalytic triad at the active site (Figures 14.15, 14.16, 
and 14.17). These three residues are conserved in trypsin and elastase as well. 


Figure 14.16 Structure of chymotrypsin (white) in a 
complex with eglin C (blue ribbon), a target protein. 
The residues of the catalytic triad (His*’, Asp!°, and 
Ser!>) are highlighted. His*’ (red) is flanked by 
Asp!°? (gold) and by Ser!’ (green). The catalytic site 
is filled by a peptide segment of eglin. Note how 
close Ser!» is to the peptide that would be cleaved in 
the chymotrypsin reaction 

(PDB id = 1ACB). Figure 14.17 The catalytic triad of chymotrypsin. 
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Trypsin Chymotrypsin 


Figure 14.18 The substrate-binding pockets of trypsin (PDB id = 2CMY), chymotrypsin (PDB id = 
1ACB), and elastase (PDB id = 3EST). Asp!® (aqua) coordinates Arg and Lys residues of peptide 
substrates in the trypsin binding pocket. Val?! (purple) and Thr?™ (green) make the elastase binding 
pocket shallow and able to accommodate only small, nonbulky residues. 


The active site is actually a depression on the surface of the enzyme, with a pocket that 
the enzyme uses to identify the residue for which it is specific (Figure 14.18). Chymo- 
trypsin, for example, has a pocket surrounded by hydrophobic residues and large 
enough to accommodate an aromatic side chain. The pocket in trypsin has a negative 
charge (Asp'*’) at its bottom, facilitating the binding of positively charged arginine 
and lysine residues. Elastase, on the other hand, has a shallow pocket with bulky 
threonine and valine residues at the opening. Only small, nonbulky residues can be 
accommodated in its pocket. In the active sites of all these enzymes, the backbone of 
the peptide substrate is hydrogen bonded in antiparallel fashion to residues 215 to 219 
and bent so that the peptide bond to be cleaved is bound close to His*’ and Ser!». 


14.6¢ The Chymotrypsin Mechanism in Detail: Kinetics 


Much of what is known about the chymotrypsin mechanism is based on studies of the 
hydrolysis of artificial substrates—simple organic esters, such as p-nitrophenylacetate 
(Figure 14.19). p-Nitrophenylacetate is an especially useful model substrate, because 
the nitrophenolate product is easily observed, due to its strong absorbance at 400 nm. 
When large amounts of chymotrypsin are used in kinetic studies with this substrate, 
a rapid initial burst of p-nitrophenolate is observed (in an amount approximately 
equal to the enzyme concentration), followed by a much slower, linear rate of nitro- 
phenolate release (Figure 14.20a). Observation of a burst, followed by slower, steady- 
state product release, is strong evidence for a multistep mechanism, with a fast first 
step and a slower second step. 

In the chymotrypsin mechanism, the nitrophenylacetate combines with the enzyme to 
form an ES complex. This is followed by a rapid step in which an acyl-enzyme intermediate 
is formed, with the acetyl group covalently bound to the very reactive Ser!*°. The nitrophe- 
nyl moiety is released as nitrophenolate (Figure 14.20b, fast step), accounting for the burst 
of nitrophenolate product. Attack of a water molecule on the acyl-enzyme intermediate 
yields acetate as the second product in a subsequent, slower step. The enzyme is now free 
to bind another molecule of p-nitrophenylacetate, and the p-nitrophenolate product pro- 
duced at this point corresponds to the slower, steady-state formation of product 
(Figure 14.20a). In this mechanism, the release of acetate is the rate-limiting step and 
accounts for the observation of burst kinetics. 


14.6d The Serine Protease Mechanism in Detail: 
Events at the Active Site 
A likely mechanism for peptide hydrolysis by chymotrypsin is shown in Figure 14.21. 


As the backbone of the substrate peptide binds adjacent to the catalytic triad, the 
specific side chain fits into its pocket (step 1). Asp! of the catalytic triad positions 


p-Nitrophenylacetate 


Figure 14.19 Chymotrypsin cleaves simple esters, 
in addition to peptide bonds. p-Nitrophenylacetate 
has been used in studies of the chymotrypsin 
mechanism. The cleavage site is indicated with the 
pink scissors. 
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Figure 14.20 Burst kinetics observed in the (a) 
chymotrypsin reaction with para- 
nitrophenylacetate. (a) A burst of 
nitrophenolate is followed by a slower, steady- 
state release. After an initial lag period, 
acetate release is observed. This kinetic 
pattern is consistent with the scheme in (b): 
rapid formation of an acyl-enzyme 
intermediate (and the burst of 
nitrophenolate). The slower, steady-state 
release of products corresponds to rate- 
limiting breakdown of the acyl-enzyme 
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His*’ and immobilizes it through a hydrogen bond as shown. In step 2, His” acts as a 
general base to withdraw a proton from Ser!>, facilitating nucleophilic attack by 
Ser! on the carbonyl carbon of the peptide bond to be cleaved. This is probably a 
concerted step, because proton transfer prior to Ser!” attack on the acyl carbon would 
leave a relatively unstable negative charge on the serine oxygen. In step 3, donation of 
a proton from His” to the peptide’s amide nitrogen creates a protonated amine on the 
covalent, tetrahedral intermediate, facilitating the subsequent bond breaking (step 4) 
and dissociation of the amine product (step 5). The negative charge on the peptide 
oxygen is unstable; the tetrahedral intermediate is short lived and rapidly breaks 
down to expel the amine product. The acyl-enzyme intermediate that results is rea- 
sonably stable; it can even be isolated using substrate analogs for which further reac- 
tion cannot occur. With normal peptide substrates, however, subsequent nucleophilic 
attack at the carbonyl carbon by water (step 6) generates another transient tetrahe- 
dral intermediate (Figure 14.21g). His*’ acts as a general base in this step, accepting a 
proton from the attacking water molecule. The subsequent collapse of the tetrahedral 
intermediate (step 7) is assisted by proton donation from His” to the serine oxygen in 
a concerted manner. Deprotonation of the carboxyl group and its departure from the 
active site (step 8) complete the reaction as shown. 

Asp! does not appear to interact directly with the substrate. The catalytic role 
of Asp'” in trypsin and the other serine proteases had been surmised on the basis 
of its proximity to His” in structures obtained from X-ray diffraction studies, but 
it had never been demonstrated with certainty in physical or chemical studies. As 
can be seen in Figure 14.16, Asp! is buried at the active site; it is normally inacces- 
sible to chemical modifying reagents. In 1987, Charles Craik, William Rutter, and 
their colleagues used site-directed mutagenesis to prepare a mutant trypsin with 
an asparagine in place of Asp!°*. This mutant trypsin possessed a hydrolytic 
activity with ester substrates only 1/10,000 that of native trypsin, demonstrating 
that Asp! is indeed essential for catalysis and that its ability to immobilize and 
orient His*’ by formation of a hydrogen bond is crucial to the function of the 
catalytic triad. 
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Figure 14.21 A detailed mechanism for the 
chymotrypsin reaction. Note the low-barrier 
hydrogen bond (LBHB) in (c) and (g). 
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(b) 


Figure 14.22 Symmetry and pseudosymmetry in the 
aspartic proteases. Structures of (a) HIV-1 protease, 
a dimer (PDB id = 7HVP), and (b) pepsin, a 

monomer. Pepsin’s N-terminal half is shown in red; 
C-terminal half is shown in blue (PDB id = 5PEP). 


The serine protease mechanism relies in part on a low-barrier hydrogen bond between 
Asp!” and His”. In the free enzyme, the pK, values of Asp! and His” are very different, 
and the H bond between them is a weak one. However, donation of the proton of Ser!” to 
His” lowers the pK, of the protonated imidazole ring so it becomes a close match to that of 
Asp!°, and the H bond between them becomes an LBHB. The energy released in the forma- 
tion of this LBHB is used to facilitate the formation of the subsequent tetrahedral interme- 
diate (Figure 14.21c, g). 


14.6e The Aspartic Proteases 


Mammals, fungi, and higher plants produce a family of proteolytic enzymes known 
as aspartic proteases. These enzymes are active at acidic (or sometimes neutral) pH, 
and each possesses two aspartic acid residues at the active site. Aspartic proteases 
carry out a variety of functions (Table 14.3), including digestion (pepsin and chymo- 
sin), lysosomal protein degradation (cathepsin D and E), and regulation of blood 
pressure (renin is an aspartic protease involved in the production of angiotensin, a 
hormone that stimulates smooth muscle contraction and reduces excretion of salts 
and fluid). The aspartic proteases display a variety of substrate specificities, but nor- 
mally they are most active in the cleavage of peptide bonds between two hydrophobic 
amino acid residues. The preferred substrates of pepsin, for example, contain aro- 
matic residues on both sides of the peptide bond to be cleaved. 

Most aspartic proteases are composed of 323 to 340 amino acid residues, with 
molecular weights near 35,000 Da. Aspartic protease polypeptides consist of two 
homologous domains that fold to produce a tertiary structure composed of two similar 
lobes, with approximate twofold symmetry (Figure 14.22). Each of these lobes or 
domains consists of two B-sheets and two short a-helices. The two domains are bridged 
and connected by a six-stranded, antiparallel B-sheet. The active site is a deep and 
extended cleft, formed by the two juxtaposed domains and large enough to accommo- 
date about seven amino acid residues. The two catalytic aspartate residues, residues 32 
and 215 in porcine pepsin, for example, are located deep in the center of the active site 
cleft. The N-terminal domain forms a flap that extends over the active site, which may 
help to immobilize the substrate in the active site. 

On the basis, in part, of comparisons with chymotrypsin, trypsin, and the other 
serine proteases, it was at first hypothesized that aspartic proteases might function by 
formation of covalent enzyme-substrate intermediates involving the active-site aspar- 
tate residues. However, all attempts to trap or isolate a covalent intermediate failed, 
and a mechanism (see following section) favoring noncovalent enzyme—substrate 
intermediates and general acid—general base catalysis is now favored for aspartic 
proteases. 


| Table 14.3 | LERT Some Representative Aspartic Proteases 


Name Source Function 

Pepsin“ Stomach Digestion of dietary protein 

Chymosint Stomach Digestion of dietary protein 

Cathepsin D Spleen, liver, and many Lysosomal digestion of proteins 

other animal tissues 

Renin? Kidney Conversion of angiotensinogen to 
angiotensin I; regulation of blood 
pressure 

HIV-proteaseS AIDS virus Processing of AIDS virus proteins 


“The second enzyme to be crystallized (by John Northrop in 1930). Even more than urease before it, pepsin study by 
Northrop established that enzyme activity comes from proteins. 

TAlso known as rennin, it is the major pepsinlike enzyme in gastric juice of fetal and newborn animals. It has been 
used for thousands of years, in a gastric extract called rennet, in the making of cheese. 

A drop in blood pressure causes release of renin from the kidneys, which converts more angiotensinogen to 
angiotensin. 

SA dimer of identical monomers, homologous to pepsin 
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14.6f The Mechanism of Action of Aspartic Proteases 


A crucial piece of evidence supporting the general acid-base model is the pH depen- 
dence of protease activity (Figure 14.23). For many years, enzymologists hypothesized 
that the aspartate carboxyl groups functioned alternately as general acid and general 
base. This model requires that one of the aspartate carboxyls be protonated and one 
be deprotonated when substrate binds. (This made sense because X-ray diffraction 
data on aspartic proteases had shown that the active-site structure in the vicinity of 
the two aspartates is highly symmetric.) However, Stefano Piana and Paolo Carloni 
reported in 2000 that molecular dynamics simulations of aspartic proteases were con- 
sistent with a low-barrier hydrogen bond involving the two active-site aspartates. This 
led to a new mechanism for the aspartic proteases (Figure 14.24) that begins with 
Piana and Carloni’s model of the LBHB structure of the free enzyme (state E). In this 
model, the LBHB holds the twin aspartate carboxyls in a coplanar conformation, with 
the catalytic water molecule on the opposite side of a ten-atom cyclic structure. 

Following substrate binding, a counterclockwise flow of electrons moves two pro- 
tons clockwise and creates a tetrahedral intermediate bound to a diprotonated enzyme 
form (FT). Then a clockwise movement of electrons moves two protons counterclock- 
wise and generates the zwitterion intermediate bound to a monoprotonated enzyme 
form (ET’). Collapse of the zwitterion cleaves the C—N bond of the substrate. Disso- 
ciation of one product leaves the enzyme in the diprotonated FQ form. Finally, deprot- 
onation and rehydration lead to regeneration of the ten-atom cyclic structure, E. 

What is the purpose of the low-barrier hydrogen bond in the aspartic protease mech- 
anism? It may be to disperse electron density in the ten-atom cyclic structure, accom- 
plishing rate acceleration by means of hydrogen tunneling (Figure 14.25). The barrier 
between the ES and ET’ states of Figure 14.24 is imagined to be large, and the state FT 
may not exist as a discrete intermediate but rather may exist transiently to facilitate 
conversion of ES and ET’. 


14.6g The HIV-1 Protease Is an Aspartic Protease 


Recent research on acquired immunodeficiency syndrome (AIDS) and its causative 
viral agent, the human immunodeficiency virus (HIV-1), has brought a new aspartic 
protease to light. HIV-1 protease cleaves the polyprotein products of the HIV-1 
genome, producing several proteins necessary for viral growth and cellular infection 
(Figure 14.26). HIV-1 protease cleaves several different peptide linkages. For example, 
the protease cleaves between the Tyr and Pro residues of the sequence Ser-Gln-Asn- 
Tyr-Pro-Ile-Val, which joins the p17 and p24 HIV-1 proteins. 


(a) (b) 


Pepsin HIV protease 


Enzyme activity 
e 
Inhibition constants 


Figure 14.23 pH-rate profiles for (a) pepsin and (b) HIV protease. (Adapted from Denburg, J., et al., 
1968. The effect of pH on the rates of hydrolysis of three acylated dipeptides by pepsin. Journal of the 
American Chemical Society 90:479-486; and Hyland, J., et al., 1991. Human immunodeficiency virus-1 
protease 2. Use of pH rate studies and solvent kinetic isotope effects to elucidate details of chemical 
mechanism. Biochemistry 30:8454-8463.) 
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Figure 14.24 Mechanism for the aspartic proteases. The letter titles describe the states as follows: 

E represents the enzyme form with a low-barrier hydrogen bond between the catalytic aspartates, 

F represents the enzyme form with both aspartates protonated and joined by a conventional hydrogen 
bond, S represents bound substrate, T represents a tetrahedral amide hydrate intermediate, 

P represents bound carboxyl product, and Q represents bound amine product. This mechanism is 
based on a mechanism proposed by Dexter Northrop, a distant relative of John Northrop, who had 
first crystallized pepsin in 1930 and the data of Thomas Meek. (Northrop, D. B., 2001. Follow the 
protons: a low-barrier hydrogen bond unifies the mechanisms of the aspartic proteases. Accounts of 
Chemical Research 34:790-797. Meek, T. D., Catalytic mechanisms of the aspartic proteinases. In 
Sinnott, M., ed, Comprehensive Biological Catalysis: A Mechanistic Reference, San Diego: Academic 
Press, 1998.) 
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Figure 14.25 Energy level diagram for the aspartic Figure 14.26 HIV mRNA provides the genetic information for synthesis of a polyprotein. Proteolytic 
protease reaction, showing ground-state hydrogen cleavage of this polyprotein by HIV protease produces the individual proteins required for viral 
tunneling (arrow), with consequent rate acceleration. growth and cellular infection. 


The HIV-1 protease is a remarkable viral imitation of mammalian aspartic pro- 
teases: It is a dimer of identical subunits that mimics the two-lobed monomeric 
structure of pepsin and other aspartic proteases. The HIV-1 protease subunits are 
99-residue polypeptides that are homologous with the individual domains of the 
monomeric proteases. Structures determined by X-ray diffraction studies reveal 
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Frances Arnold and Directed Evolution of Enzymes 


Enzymes have been evolving naturally since the emergence of life 
on Earth, changing slowly to improve the fitness of organisms. The 
methods of directed evolution were developed in laboratories in the 
1990s to speed the evolutionary process, producing enzymes that 
catalyze useful and novel reactions. The methodology developed 
by Frances Arnold at Caltech is based on several steps: 1) selection 
of a suitable starting enzyme with properties similar to the desired 
one; 2) use of an error-prone (low fidelity) DNA polymerase to 
construct a DNA library of mutated versions of the gene for the 
starting enzyme; 3) insertion of these mutant genes into bacteria, 
each expressing a slightly different version of the starting enzyme; 
and screening to test the activity of each of the resulting modified 
enzymes. The genes for enzymes with even slightly improved activity 
are subjected to additional rounds of random mutation, expres- 
sion, and testing. Arnold showed that successive generations of this 
process could lead to dramatic increases in the catalytic prowess of 
these evolved enzymes. 

For an enzyme of a few hundred residues, allowing just one or 
two mutations per variant would require testing of a few thousand 


new enzyme candidates. At the same time, Arnold found that 
useful traits could emerge after as few as five to ten iterations of 
mutagenesis and selection, and her protocol was adopted rapidly 
and modified in laboratories around the world. Directed evolution 
has produced enzymes that synthesize fuels, pharmaceuticals, and 
industrial and agricultural chemicals. Other applications have de- 
veloped therapeutic and diagnostic enzymes and gene editing tools. 
In recognition of her pioneering accomplishments, Frances Arnold 
shared in the Nobel Prize in Chemistry in 2018. She noted at the 
time, “I wanted to rewrite the code of life, to make new molecular 
machines that would solve human problems.” 


References 

Chen, K., and Arnold, F. H., 1993. Tuning the activity of an enzyme for 
unusual environments: sequential random mutagenesis of subtilisin E for 
catalysis in dimethylformamide. Proceedings of the National Academy of 
Sciences 90:5618—5622. 

Arnold, Frances H., 2018. Directed evolution: bringing new chemistry to 
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that the active site of HIV-1 protease is formed at the interface of the homodimer 
and consists of two aspartate residues, designated Asp% and Asp’, one contrib- 
uted by each subunit (Figure 14.27). In the homodimer, the active site is covered by 
two identical flaps, one from each subunit, in contrast to the monomeric aspartic 
proteases, which possess only a single active-site flap. Enzyme kinetic measurements 
by Thomas Meek and his collaborators at SmithKline Beecham Pharmaceuticals 
have shown that the mechanism of HIV-1 protease is very similar to those of other 
aspartic proteases. 


14.6h Chorismate Mutase: A Model for Understanding 
Catalytic Power and Efficiency 


Direct comparison of an enzyme reaction with the analogous uncatalyzed reaction is usu- 
ally difficult, if not impossible. There are several problems: First, many enzyme-catalyzed 
reactions do not proceed at measurable rates in the absence of the enzyme. Second, many 
enzyme-catalyzed reactions involve formation of a covalent intermediate between the 
enzyme and the substrate. Third, a reaction occurring in an enzyme active site might pro- 
ceed through a different transition state than the corresponding solution reaction. Choris- 
mate mutase, which catalyzes a concerted intramolecular rearrangement of chorismate to 
prephenate in the biosynthesis of phenylalanine in microorganisms and plants, is 


Figure 14.27 HIV-1 protease complexed with 
the inhibitor Crixivan (red) made by Merck. 
(Left) The flaps (residues 46-55 from each 
subunit) covering the active site are shown in 
green, and the active-site aspartate residues 
involved in catalysis are shown in light 
purple. (Right) The close-up of the active site 
shows the interaction of Crixivan with the 
carboxyl groups (yellow) of the essential 
aspartate residues (PDB id = 1HSG). 
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Figure 14.28 The chorismate mutase reaction is a 
Claisen rearrangement. (a) The chorismate mutase 
reaction converts chorismate to prephenate. (b) A 
classic Claisen rearrangement. Conversion of allyl 
phenyl ether to 2-allyl alcohol proceeds through a 
cyclohexadienone intermediate, which then 
undergoes a keto-enol tautomerization. 


Figure 14.29 The conversion of chorismate to 
prephenate could occur (in principle) through a boat 
transition state or a chair transition state. The 
difference can be understood by imagining two 
different isotopes of hydrogen (blue and green) at 
carbon 9 of chorismate and the products that would 
result in each case. Knowles and co-workers have 
shown that both the uncatalyzed reaction and the 
reaction on chorismate mutase occur through a 
chair transition state. The molecule shown at right is 
a transition state analog for the chorismate mutase 
reaction. 


an exception to all these potential problems. First, although the rate of the enzyme- 
catalyzed reaction is more than a million times faster, the uncatalyzed solution reaction 
still proceeds at reasonable and measurable rates. Second, the enzyme reaction does not 
employ a covalent intermediate. In this simple reaction, one carbon—oxygen bond is bro- 
ken, and one carbon-carbon bond is formed. It is an example of a Claisen rearrangement, 
familiar to any student of organic chemistry (Figure 14.28). 

What about the transition states for the catalyzed and uncatalyzed reactions? There 
are two possible transition states, one involving a chair conformation and the other a 
boat (Figure 14.29). Jeremy Knowles and his co-workers have shown that both the enzy- 
matic and the solution reactions take place via a chair transition state, and a transition- 
state analog of this state has been characterized (Figure 14.29). 


The Chorismate Mutase Active Site Lies at the Interface of Two Subunits The chorismate 
mutase from Escherichia coli is the N-terminal portion (109 residues) of a bifunctional 
enzyme, termed the P protein, which also has a C-terminal prephenate dehydrogenase 
activity. The N-terminal portion of the P protein has been prepared as a separate protein 
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by recombinant DNA techniques, and this engineered protein is a fully functional 
chorismate mutase. The structure shown in Figure 14.30 is a homodimer, each monomer 
consisting of three a-helices (denoted H1, H2, and H3) connected by short loops. The two 
monomers are dovetailed in the dimer structure, with the H1 helices paired and the H3 
helices overlapping significantly. The long, 10-turn H1 helices form an antiparallel coiled 
coil, with leucines at positions 10, 17, 24, and 31 in a classic 7-residue repeat pattern. 

The chorismate mutase dimer contains two equivalent active sites, each formed from 
portions of both monomers. The structure shown in Figure 14.30 contains a bound 
transition-state analog (Figure 14.29) stabilized by 12 electrostatic and hydrogen-bonding 
interactions (Figure 14.31). Arg% from one subunit and Arg!"" from the other coordinate 
the carboxyl groups of the analog, and a third arginine (Arg*') coordinates a water mole- 
cule, which in turn coordinates both carboxyls of the analog. Each oxygen of the analog is 
coordinated by two groups from the active site. In addition, there are hydrophobic residues 
surrounding the analog, especially Val** on one side and Tle*! and Val®> on the other. 


The Chorismate Mutase Active Site Favors a Near-Attack Conformation The chorismate 


mutase reaction mechanism requires that the carboxyvinyl group fold over the 


(a) (b) 


Val®> 


Val? 


Figure 14.30 Chorismate mutase is a symmetric homodimer, each monomer consisting of three 
a-helices connected by short loops. (a) The dimer contains two equivalent active sites, each formed 
from portions of both monomers (PDB id = 4CSM). (b) A close-up of the active site, showing the 
bound transition-state analog (pink, see Figure 14.29). 


Figure 14.31 In the chorismate mutase active site, 
the transition-state analog is stabilized by 12 
electrostatic and hydrogen-bonding interactions 
(pink dotted lines). Water molecule in blue 
participates in four hydrogen bonds. (Adapted from 
Lee, A., et al., 1995. Atomic structure of the buried 
catalytic pocket of Escherichia coli chorismate 
mutase. Journal of the American Chemical Society 
117:3627-3628.) 
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Human Biochemistry 


Protease Inhibitors-Game Changers for AIDS Patients 


Infection with HIV was once considered a death sentence, but the 
emergence of a new family of drugs called protease inhibitors has 
made it possible for some AIDS patients to improve their overall 
health and extend their lives. These drugs are all specific inhibitors 
of the HIV-1 protease. By inhibiting the protease, they prevent the 
development of new virus particles in the cells of infected patients. 
A combination of drugs—including a protease inhibitor together 
with a reverse transcriptase inhibitor like AZT—can reduce the 
human immunodeficiency virus (HIV) to undetectable levels in 
about 40% to 50% of infected individuals. Patients who respond 
successfully to this combination therapy have experienced dramatic 
improvement in their overall health and a substantially lengthened 
life span. 

Four of the protease inhibitors approved for use in humans by 
the U.S. Food and Drug Administration are shown in the figures: 
Crixivan by Merck, Invirase by Hoffman-LaRoche, Norvir by 
Abbott, and Viracept by Agouron. These drugs were all developed 
from a structure-based design strategy; that is, the drug molecules 
were designed to bind tightly to the active site of the HIV-1 protease. 
The backbone OH-group in all these substances inserts between the 
two active-site carboxyl groups of the protease. 

In the development of an effective drug, it is not sufficient 
merely to show that a candidate compound can cause the desired 
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biochemical effect. It must also be demonstrated that the drug can 
be effectively delivered in sufficient quantities to the desired site(s) 
of action in the organism and that the drug does not cause undesir- 
able side effects. The HIV-1 protease inhibitors shown here fulfill all 
of these criteria. Other drug candidates have been found that are 
even better inhibitors of HIV-1 protease in cell cultures, but many 
of these fail the test of bioavailability—the ability of a drug to be 
delivered to the desired site(s) of action in the organism. 

Candidate protease inhibitor drugs must be relatively specific 
for the HIV-1 protease. Many other aspartic proteases exist in the 
human body and are essential to a variety of body functions, includ- 
ing digestion of food and processing of hormones. An ideal drug 
thus must strongly inhibit the HIV-1 protease, must be delivered 
effectively to the lymphocytes where the protease must be blocked, 
and should not adversely affect the activities of the essential human 
aspartic proteases. 

A final but important consideration is viral mutation. Certain 
mutant HIV strains are resistant to one or more of the protease 
inhibitors, and even for patients who respond initially to protease 
inhibitors, it is possible that mutant viral forms may eventually arise 
and thrive in infected individuals. The search for new and more 
effective protease inhibitors is ongoing. 
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chorismate ring to facilitate the Claisen rearrangement (Figure 14.32). This implies the 
formation of a NAC on the way to the transition state. Bruice and his co-workers have 
carried out extensive molecular dynamics simulations of the chorismate mutase reac- 
tion. Their calculations show that in the nonenzymatic reaction, only 0.0001% of 
chorismate in solution exists in the NAC required for reaction. Similar calculations 
show that in the enzyme active site, chorismate adopts a NAC 30% of the time. The 
computer-simulated NAC in the chorismate mutase active site (Figure 14.33) is similar 
in many ways to the chorismate mutase-TSA complex, with Arg’ and Arg"! coordinat- 
ing the two carboxylate groups of chorismate so as to position the carboxyvinyl group 
in the conformation required for transition-state formation. This conformation is also 
stabilized by Val and Ile, which are in van der Waals contact with the vinyl group 
and the chorismate ring, respectively. Thus, the NAC of chorismate is promoted by 
electrostatic and hydrophobic interactions with active-site residues. 


Figure 14.32 The mechanism of the chorismate 
mutase reaction. The carboxyvinyl group folds up 
and over the chorismate ring, and the reaction 
proceeds via an internal rearrangement. 


Figure 14.33 Chorismate bound to the active site of 
chorismate mutase in a structure that resembles a 
NAC. Arrows indicate hydrophobic interactions, and 
red dotted lines indicate electrostatic interactions. 
(Adapted from Hur, S., and Bruice, T., 2003. The 
near attack conformation approach to the study of 
the chorismate to prephenate reaction. Proceedings 
of the National Academy of Sciences USA 

100: 12015-12020.) 
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Figure 14.34 The energetic profile of the chorismate 
mutase reaction. Computer simulations by Bruice 
and his co-workers show that the NAC and the E-S 
complex are separated by only 0.42 kJ/mol, meaning 
that the NAC forms much more readily in the 
enzyme active site than it does in the absence of 
enzyme. The NAC and the reaction transition state 
are separated by similar energy barriers in either the 
presence or the absence of the enzyme. Thus, the 
catalytic prowess of the enzyme lies in its ability to 
form the NAC at a very small energetic cost. 
(Adapted from Bruice, T., 2002. A view at the 
millennium: the efficiency of enzymatic catalysis. 
Accounts of Chemical Reactions 35:139-148.) 


Free energy, G 


Reaction coordinate 


The energetics of the chorismate mutase reaction are revealing (Figure 14.34). 
Computer simulations by Bruice and his co-workers show that formation of a NAC in 
the absence of the enzyme is energetically costly, whereas formation of the NAC in the 
enzyme active site is facile, with only a modest energy cost. On the other hand, the energy 
required to move from the NAC to the transition state is about the same for the solution 
and the enzyme reactions. Clearly, the catalytic advantage of chorismate mutase is the 
ease of formation of a NAC in the active site. 


Foundational Biochemistry Things You Should Know after Reading Chapter 14. 


Enzyme-catalyzed reactions are typically 10’ to 10° times faster than 
their uncatalyzed counterparts; the alkylsulfatase from Coryneform 
Bla exhibits a rate acceleration of 2 X 10%. 


Linus Pauling was the first to postulate that the active sites of 
enzymes bind the transition state more tightly than the substrate 
or product. 


The catalytic role of enzymes is to reduce the energy barrier between 
substrate and transition state. In terms of free energies of activation, 


AG < AG. 


The intrinsic binding energy of the ES complex is typically compen- 
sated by loss of entropy due to binding of S to E, and destabilization 
of ES by strain, distortion, and de-solvation of the substrate. 


The dissociation constants for transition states from their enzymes 
are postulated to be in the range of 107% to 10% M. 


Transition state analogs are stable molecules that are chemically and 
structurally similar to the transition state and bind more strongly 
than substrates and competitive inhibitors. 


Enzymes facilitate formation of near-attack conformations (NACs), 
in which reacting atoms are in van der Waals contact and at an angle 
resembling the bond to be formed in the transition state. 


Protein motions are essential to enzyme catalysis. They assist 
substrate binding, bring catalytic groups into position around sub- 
strates, induce formation of NACs, assist in bond making and bond 
breaking, and facilitate conversion of substrate(s) to product(s). 


Some enzyme reactions derive much of their rate acceleration from 
the formation of covalent bonds between enzyme and substrate. 


General acid-base catalysis is catalysis in which a proton or OH” is 
transferred in the transition state of a reaction. 


Stabilization energy released in the formation of low-barrier hydro- 
gen bonds can lower the activation barrier for a reaction. 


Quantum mechanical tunneling is a contributing factor in most, if 
not all, hydrogen transfer reactions. 


Metal ions play a variety of catalytic roles in many enzymes. 


The serine proteases employ both covalent catalysis and general 
acid-base catalysis to catalyze peptide bond cleavage. 


Observation of burst kinetics is evidence for involvement of a cova- 
lent intermediate in an enzyme reaction. 


The serine protease mechanism involves two cycles of formation and 
collapse of tetrahedral intermediates. 


The mechanisms of aspartic proteases involve general acid-base 
catalysis and formation and breakdown of low-barrier hydrogen 
bonds, but not covalent catalysis. 


Human aspartic proteases are monomeric enzymes with pseudo- 
symmetric structures. The HIV-1 protease is a symmetric homodimer. 


Chorismate mutase is an enzyme that provides a direct comparison 
of enzyme-catalyzed and uncatalyzed reactions; rate acceleration by 
this enzyme appears to be due entirely to formation of a near-attack 
conformation in the enzyme active site. 
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Problems 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Characterizing a Covalent Enzyme Inhibitor Tosyl-L-phenylalanine 
chloromethyl ketone (TPCK) specifically inhibits chymotrypsin by 
covalently labeling His”. 


O CHo 0 


conh Y-ren 


—/ | 
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Tosyl-L-phenylalanine chloromethyl ketone (TPCK) 


CH.Cl 


a. Propose a mechanism for the inactivation reaction, indicating the 
structure of the product(s). 

b. State why this inhibitor is specific for chymotrypsin. 

c. Propose a reagent based on the structure of TPCK that might be 
an effective inhibitor of trypsin. 


2. Using Site-Directed Mutants to Understand an Enzyme Mechanism In 
this chapter, the experiment in which Craik and Rutter replaced 
Asp!” with Asn in trypsin (reducing activity 10,000-fold) was 
discussed. 

a. On the basis of your knowledge of the catalytic triad structure in 
trypsin, suggest a structure for the uncatalytic triad of Asn-His- 
Ser in this mutant enzyme. 

b. Explain why the structure you have proposed explains the 
reduced activity of the mutant trypsin. 
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6. Deriving an Expression for Catalytic Power Based on the following 
reaction scheme, derive an expression for k,/k,, the ratio of the rate 
constants for the catalyzed and uncatalyzed reactions, respectively, in 
terms of the free energies of activation for the catalyzed (A G+) and 
the uncatalyzed (A G,Ż) reactions. 
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c. See the original journal articles (Sprang, et al., 1987. Science 
237:905—909; and Craik, et al., 1987. Science 237:909-913) to see 
Craik and Rutter’s answer to this question. 


3. Serine vs. Cysteine or Threonine Give a possible explanation as to why 
serine is often a critically important residue in an enzyme’s active site, 
but cysteine and structurally similar threonine are not. 


4. Assessing the Action of an Aspartic Protease Inhibitor Pepstatin (see 
below) is an extremely potent inhibitor of the monomeric aspartic 
proteases, with K; values of less than 1 nM. 

a. On the basis of the structure of pepstatin, suggest an explana- 
tion for the strongly inhibitory properties of this peptide. 

b. Would pepstatin be expected to also inhibit the HIV-1 protease? 
Explain your answer. 
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5. Aspartic Proteases The structures of several ACE inhibitors are 
shown in the figure (also see A Deeper Look: Transition-State 
Analogs Make Our World Better and Section 14.6e). Find the 
common structural features for these compounds and explain why 
they are inhibitors of aspartic proteases. 
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7. Comparison of Enzymatic and Nonenzymatic Rate Constants 


10. 


11. 


The keat for alkaline phosphatase-catalyzed hydrolysis of meth- 
ylphosphate is approximately 14/s at pH 8 and 25°C. The rate 
constant for the uncatalyzed hydrolysis of methylphosphate under 
the same conditions is approximately 107!5/s. What is the differ- 
ence in the free energies of activation of these two reactions? 


. Understanding the Actions of Proteolytic Enzymes Active a- 


chymotrypsin is produced from chymotrypsinogen, an inactive 
precursor, as shown in the color figure. The first intermediate— 
a-chymotrypsin—displays chymotrypsin activity. Suggest 
proteolytic enzymes that might carry out these cleavage reactions 
effectively. 


Chymotrypsinogen 
(inactive) e-—.. 
a-Chymotrypsin 
(active) 
Ser Arg Thr Asn 
a-Chymotrypsin ab a. 


(active) 
eu_ lle 


Understanding the Mechanism of Carboxypeptidase A Consult the 
classic paper by William Lipscomb (1982. Accounts of Chemical 
Research 15:232-238), and on the basis of this article write a simple 
mechanism for the enzyme carboxypeptidase A. 


Ala 


Calculation of Rate Enhancement from Energies of Activation 
The relationships between the free energy terms defined in the 
solution to Problem 4 earlier are shown in the following figure: 


Reaction coordinate 


If the energy of the ES complex is 10 kJ/mol lower than the energy of E 
+ S, the value of AG, is 20 kJ/mol, and the value of A G,Ż is 90 kJ/mol, 
what is the rate enhancement achieved by an enzyme in this case? 


Understanding the Implications of Transition State Stabilization 

As noted in Section 14.4, a true transition state can bind to an 
enzyme active site with a Ky as low as 7 X 107% M. This is a 
remarkable number, with interesting consequences. Consider a 
hypothetical solution of an enzyme in equilibrium with a ligand that 
binds with a Kp of 1077 M. If the concentration of free enzyme, [E], 
is equal to the concentration of the enzyme—ligand complex, [EL], 
what would [L], the concentration of free ligand, be? Calculate the 
volume of solution that would hold one molecule of free ligand at 
this concentration. 


12. 


Substrate vs. Transition State An enzyme is known to bind com- 

pounds A and B with Kg values of 0.15 mM and 0.01 mM, respec- 

tively. The mechanism of the enzymatic reaction for compound A is 

shown in the figure. Although difficult to determine, the Kr values for 

both compounds were estimated to be 5 x 10% Mand 1 X 10” M, 

respectively. 

a. Explain the differences in Kr values between the two com- 
pounds? 

b. What would be the effect on the enzymatic activity if the enzyme 
was incubated with a mix of both compounds A and B? 
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13. 


14. 


15. 


16. 


Understanding the Very Tight Binding of Transition States Another 
consequence of tight binding (problem 11) is the free energy change 
for the binding process. Calculate A G°’ for an equilibrium with a Kp 
of 10-7 M. Compare this value to the free energies of the noncova- 
lent and covalent bonds with which you are familiar. What are the 
implications of this number, in terms of the binding of a transition 
state to an enzyme active site? 


Meaning of Dissociation Constants An enzyme E binds to a ligand L 
with a dissociation constant Kp = {L[E]|L]/[EL] = 107% M. It is 
tempting to assume that if the enzyme is fully saturated with L, then 
[E] = [EL] and that the concentration of free ligand is 

[L] = Kp = 10-7! M. 

a. What is wrong with this assumption? 

b. What is the correct relationship between these four parameters? 


c. What is the concentration of free ligand? What can be calculated 
from the information given? 


(Hint: Revisit Section 13.3 for clues.) 


Quantum Tunneling In Section 14.5f, de Broglie wavelengths for an 
electron and a proton were calculated using Equation 14.4 and assuming 
a kinetic energy of 10 kJ/mole. Deuterium (D) is sometimes used to 
probe the mechanism of enzymatic catalysis. What is the de Broglie 
wavelength of D? The mass of D is 3.343584 X 10” kg, Planck’s 
constant A is 6.6207004 x 10~** (m’kg)/s and 1 J = 1 (m’kg)/s’. 


Effect of Environment on Enzyme Activity Apart from changes in pH, 
temperature, or salts, what other factor or factors could diminish the 
activity of an enzyme? 


. Assessing the Metabolic Consequences of Life without Enzymes The 


incredible catalytic power of enzymes can perhaps best be appreci- 
ated by imagining how challenging life would be without just one of 
the thousands of enzymes in the human body. For example, consider 
life without fructose-1,6-bisphosphatase, an enzyme in the gluconeo- 
genesis pathway in liver and kidneys (see Chapter 22), which helps 
produce new glucose from the food we eat: 


Fructose-1,6-bisphosphate + HO — Fructose-6-P + P; 


The human brain requires glucose as its only energy source, and the 
typical brain consumes about 120 g (or 480 kilocalories) of glucose 
daily. Ordinarily, two pieces of sausage pizza could provide more 
than enough potential glucose to feed the brain for a day. According 
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to a national fast-food chain, two pieces of sausage pizza provide 
1340 kilocalories, 48% of which is from fat. Fats cannot be converted 
to glucose in gluconeogenesis, so that leaves 697 kilocalories 
potentially available for glucose synthesis. The first-order rate 
constant for the hydrolysis of fructose-1,6-bisphosphate in the 
absence of enzyme is 2 X 10~7°/sec. Calculate how long it would take 
to provide enough glucose for one day of brain activity from two 
pieces of sausage pizza without the enzyme. 


The following graphs show the temperature and pH dependencies of four 
enzymes, A, B, X, and Y. Problems 18 through 24 refer to these graphs. 
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Enzymes X and Y in the figure are both protein-digesting enzymes 
found in humans. Where would each enzyme most likely be at work, 
in the mouth, stomach, or the small intestine? 

Which statement is true concerning enzymes X and Y? 


a. They could not possibly be at work in the same part of the body 
at the same time. 


b. They have different temperature ranges at which they work best. 
c. Ata pH of 4.5, enzyme X works slower than enzyme Y. 
At their appropriate pH ranges, both enzymes work equally 
fast. 


What conclusion may be drawn concerning enzymes A and B? Could 
one or both be human enzymes? 


At which temperatures might enzymes A and B both work? 


An enzyme-substrate complex can form when the substrate(s) bind(s) 
to the active site of the enzyme. Which environmental condition 
might alter the conformation of each enzyme (A, B, X, or Y) to the 
extent that its substrate is unable to bind? 
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Problems 


At 35°C, the rate of the reaction catalyzed by enzyme A begins to 
level off. Which hypothesis best explains this observation? 


a. The temperature is too far below optimum. 

b. The enzyme has become saturated with substrate. 
c. Both A and B. 

d. Neither A nor B. 


At what environmental conditions would digestive enzyme Y be 
unable to bring its substrate(s) to the transition state? 


Review the mechanisms of the serine and aspartic proteases, and 
compare these two mechanisms carefully. Are there steps in the 
mechanisms that are similar? How are they similar? How are they differ- 
ent? Suggest experiments that could support or refute your hypotheses. 


Consult the following reference (Samanta, U. and Bahnson, B. J., 
2008. Crystal structure of human plasma platelet-activating factor 
acetylhydrolase. Journal of Biological Chemistry 283:31617—31624), 
consider the active-site structure of this enzyme in the light of 
material in this chapter, and write a detailed mechanism for the PAF 
acetylhydrolase. 


Superbug infections are becoming more common around the world. 
Many of these infections arise from the action of B-lactamases, of 
which there are several types with different mechanisms of action. 
Consult the end-of-chapter reference by von Nussbaum and Schiffer 
and write detailed mechanisms for the serine B-lactamases and 
metallo-B-lactamases. 


A kinase reaction—the transfer of a phosphoryl group from ATP to a 
substrate—would take thousands of years to complete under normal 
conditions without an enzyme, but enzymes carry out phosphoryl 
transfers in milliseconds! Understanding how such rate enhancements 
might occur is one of the ultimate goals of biochemical science. 
Dorothee Kern and her colleagues have provided intriguing new insights 
about enzyme catalysis by studying the microscopic steps in the catalytic 
cycle of adenylate kinase, which catalyzes the reaction: ATP + AMP = 
2ADP. Consult the following reference (Kerns, S. J., et al., 2015. The 
energy landscape of adenylate kinase during catalysis. Nature Structural 
and Molecular Biology 22:124-131.) and describe the experimental 
methods used and the conclusions presented by Kern and her colleagues. 


Think-Pair-Share Question 


Modeling Enzyme Mechanism Esterases, enzymes capable of hydrolyzing 
esters into the carboxylic acid and alcohol components, are common in 
the circulatory system and in many tissues. Acetylsalicylic acid (Aspirin) 
undergoes ester hydrolysis by esterases when absorbed into circulation, 
releasing salicylic acid. The figure shows a model of an esterase with a 
simple diagram of its active site and the presence of three critical amino 
acid residues. Included in the figure are the structures of three possible 
substrates for this enzyme and their Ky values. 


a. Sketch a general enzyme mechanism for the hydrolysis of the 
ester bond in any of the three structures shown that involves (1) 
a transient covalent bond between substrate and enzyme and (2) 
a general base mechanism. (Use any relevant figure from 
Chapter 14 as your guide). 


b. Explain the differences in Ky values. 
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Enzyme Regulation 15 


Allostery is a key chemical process that makes 
possible intracellular and intercellular regulation: 
“.. . the molecular interactions which ensure the 
transmission and interpretation of (regulatory) 
signals rest upon (allosteric) proteins endowed with 
discriminatory stereospecific recognition properties.” 


Jacques Monod 
Chance and Necessity 


< Like the delicate balance of forces in this mobile, 
metabolic regulation is achieved through a responsive 
interplay between enzymes and small molecules. 


Christie's Images/Corbis 


Regulation of enzyme activity ensures the harmonious integration of the diverse and often 15.1 What Factors Influence 

divergent reactions of metabolism. At key positions in metabolic pathways, regulatory enzymes Enzymatic Activity? 

sense the momentary needs of the cell and adjust their catalytic activity accordingly. What are 15.2 What Are the General Features 

the properties of regulated enzymes? How do regulatory enzymes sense the momentary needs of Allosteric Regulation? 

of cells? What molecular mechanisms are used to regulate enzyme activity? 15.3 Can Allosteric Regulation Be 
Explained by Conformational Changes 
in Proteins? 

Eze catalyze essentially all of the thousands of metabolic reactions taking place 15.4 What Kinds of Covalent Modification 

in cells. Many of these reactions are at cross-purposes: Some enzymes catalyze the Regulate the Activity of Enzymes? 
breakdown of substances, whereas others catalyze synthesis of the same substances; 15.5 Is the Activity of Some Enzymes 


Controlled by Both Allosteric 
Regulation and Covalent Modification? 

15.6 What Is the Relationship between 
Quaternary Structure and Allosteric 
Regulation? 


many metabolic intermediates have more than one fate; and energy is released in some 
reactions and consumed in others. 


15.1 | What Factors Influence Enzymatic Activity? 


The activity displayed by enzymes is affected by a variety of factors, some of which 
are essential to the harmony of metabolism. Two ways to regulate the amount of ac- 
tivity at a given time are (1) to increase or decrease the number of enzyme molecules 
and (2) to increase or decrease the activity of each enzyme molecule. Although these 
ways seem straightforward, the cellular mechanisms to achieve them are complex and 
varied. 
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15.la The Availability of Substrates and Cofactors Usually Determines 
How Fast the Reaction Goes 


The availability of substrates and cofactors typically determines the enzymatic reac- 
tion rate. In general, enzymes have evolved such that their K,,, values approximate the 
prevailing in vivo concentration of their substrates. (It is also true that the concentra- 
tion of some enzymes in cells is within an order of magnitude or so of the concentra- 
tions of their substrates.) 


15.1b As Product Accumulates, the Apparent Rate 
of the Enzymatic Reaction Will Decrease 


The enzymatic rate, v = d[P]/dt, slows down as product accumulates and equilibrium 
is approached. The apparent decrease in rate is due to the conversion of P to S by the 
reverse reaction as [P] rises. Once [P]/[S] = Keq, no further reaction is apparent. Keq 
defines thermodynamic equilibrium. Enzymes have no influence on the thermody- 
namics of a reaction. Also, product inhibition can be a kinetically valid phenomenon: 
Some enzymes are inhibited by the products of their action. 


15.1c Genetic Regulation of Enzyme Synthesis Determines 
the Amount of Enzyme Present at Any Moment 


The amount of enzyme synthesized by a cell is determined by transcription regulation. If the 
gene encoding a particular enzyme protein is turned on or off, changes in the amount of 
enzyme activity soon follow. Induction, which is enzyme synthesis through transcrip- 
tional activation, and repression, which is the shutdown of enzyme synthesis, are impor- 
tant mechanisms for the regulation of metabolism. By controlling the amount of an 
enzyme that is present at any moment, cells can either activate or terminate various 
metabolic routes. Genetic controls over enzyme levels have a response time ranging from 
minutes in rapidly dividing bacteria to hours (or longer) in higher eukaryotes. Once 
synthesized, the enzyme may also be degraded, either through normal turnover of the 
protein or through specific decay mechanisms that target the enzyme for destruction. 


15.1d Enzyme Activity Can Be Regulated Allosterically 


Enzymatic activity can also be activated or inhibited through noncovalent interaction 
of the enzyme with small molecules (metabolites) other than the substrate. This form of 
control is termed allosteric regulation because the activator or inhibitor binds to the 
enzyme at a site other than (allo means “other”) the active site. Furthermore, such 
allosteric regulators, or effector molecules, are often quite different sterically from the 
substrate. Because this form of regulation results simply from reversible binding of regu- 
latory ligands to the enzyme, the cellular response time can be virtually instantaneous. 
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15.le Enzyme Activity Can Be Regulated through Covalent Modification 


Enzymes can be regulated by covalent modification, the reversible covalent attach- 
ment of a chemical group. Enzymes susceptible to such regulation are called intercon- 
vertible enzymes because they can be reversibly converted between two forms. Thus, 
a fully active enzyme can be converted into an inactive form simply by the covalent 
attachment of a functional group. Covalent modification reactions are catalyzed by 
special converter enzymes, which are themselves subject to metabolic regulation. Pro- 
tein kinases are one class of converter enzymes. They act in covalent modification by 
attaching a phosphoryl moiety to target proteins (Figure 15.1). Protein kinases cata- 
lyze the ATP-dependent phosphorylation of —OH groups on Ser, Thr, or Tyr side 
chains (or in some cases, the imidazole-N atoms in histidine residues). Removal of the 
phosphate group by a phosphoprotein phosphatase returns the enzyme to its original 
state. In contrast to the example in Figure 15.1, some enzymes exist in an inactive state 
unless specifically converted into the active form through covalent addition of a func- 
tional group. Although covalent modification represents a stable alteration of the 
enzyme, a different converter enzyme operates to remove the modification. Thus, 
when the conditions that favored modification of the enzyme are no longer present, 
the process can be reversed, restoring the enzyme to its unmodified state. Because 
covalent modification events are catalyzed by enzymes, they occur very quickly, with 
response times of seconds or even less for significant changes in metabolic activity. 

Enzyme regulation is an important matter to cells, and evolution has provided a 
variety of additional options, including zymogens, isozymes, and modulator proteins. 
We will discuss these options first and then return to the major topics of this chapter— 
enzyme regulation through allosteric mechanisms and covalent modification. 


15.1f Zymogens Are Inactive Precursors of Enzymes 


Most proteins become fully active as their synthesis is completed and they spontane- 
ously fold into their native, three-dimensional conformations. Some proteins, however, 
are synthesized as inactive precursors, called zymogens or proenzymes, that acquire 
full activity only upon specific proteolytic cleavage of one or several of their peptide 
bonds. Unlike allosteric regulation or covalent modification, zymogen activation by 
specific proteolysis is an irreversible process. Activation of enzymes and other physi- 
ologically important proteins by specific proteolysis is a strategy frequently exploited 
by biological systems to switch on processes at the appropriate time and place, as the 
following examples illustrate. 


Insulin Some protein hormones are synthesized in the form of inactive precursor mol- 
ecules, from which the active hormone is derived by proteolysis. For instance, insulin, 
an important metabolic regulator, is generated by proteolytic excision of a specific 
peptide from proinsulin (Figure 15.2). 


Proteolytic Enzymes of the Digestive Tract Enzymes of the digestive tract that serve to 
hydrolyze dietary proteins are synthesized in the stomach and pancreas as zymogens 
(Table 15.1). Only upon proteolytic activation are these enzymes able to form a 
catalytically active substrate-binding site. The activation of chymotrypsinogen is an 
interesting example (Figure 15.3). Chymotrypsinogen is a 245 residue polypeptide 
chain crosslinked by five disulfide bonds. Chymotrypsinogen is converted to an enzy- 
matically active form called 7-chymotrypsin when trypsin cleaves the peptide bond 
joining Arg" and Ile'®. The enzymatically active 7-chymotrypsin acts upon other 
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Figure 15.1 Enzyme regulation by reversible covalent 
modification. Protein kinases catalyze the ATP- 
dependent phosphorylation of target proteins. 
Phosphoprotein phosphates reverse this action by 
hydrolytic removal of the phosphate group. 


Proinsulin 
NHs 


) Connecting 
peptide 1 
10 10 l 
S 
-Si 
S 
10 S 
l =] 
20 20 S 
40 | 
i 
2i 
30 30 
COO- 


Figure 15.2 Proinsulin is an 86 residue precursor to 
insulin (the sequence shown here is human 
proinsulin). Proteolytic removal of residues 31 to 

65 yields insulin. Residues 1 through 30 (the B chain) 
remain linked to residues 66 through 87 (the A 
chain) by a pair of interchain disulfide bridges. 
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| Table 151 | Pancreatic and Gastric Zymogens 


Zymogen Active Protease Origin 

Trypsinogen Trypsin Pancreas 
Chymotrypsinogen Chymotrypsin Pancreas 
Procarboxypeptidase Carboxypeptidase Pancreas 
Proelastase Elastase Pancreas 
Pepsinogen Pepsin Stomach 


Chymotrypsinogen (inactive zymogen) 
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Cleavage at Arg! 
by trypsin 


a-Chymotrypsin (active so 
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Figure 15.3 The proteolytic activation of chymotrypsinogen. Disulfide bonds are shown in yellow. 


am-chymotrypsin molecules, excising two dipeptides: Ser'*-Arg!> and Thr“—Asn!*. 
The end product of this processing pathway is the mature protease a-chymotrypsin, in 
which the three peptide chains, A (residues | through 13), B (residues 16 through 146), 
and C (residues 149 through 245), remain together because they are linked by two 
disulfide bonds, one from A to B and one from B to C. 


Blood Clotting The formation of blood clots is the result of a series of zymogen 
activations (Figure 15.4). The amplification achieved by this cascade of enzymatic ac- 
tivations allows blood clotting to occur rapidly in response to injury. Seven of the clot- 
ting factors in their active form are serine proteases: kallikrein, XII}, XI,, IXa, VII, Xa, 
and thrombin. Two routes to blood clot formation exist. The intrinsic pathway is insti- 
gated when the blood contacts abnormal surfaces caused by injury; the extrinsic 
pathway is initiated by factors released from injured tissues. The pathways merge at 
factor X and culminate in clot formation. Thrombin excises peptides rich in negative 
charge from fibrinogen, converting it to fibrin, a molecule with a different surface 
charge distribution. Fibrin readily aggregates into ordered fibrous arrays that are sub- 
sequently stabilized by covalent crosslinks. Thrombin specifically cleaves Arg—Gly 
peptide bonds and is homologous to trypsin, which is also a serine protease. 


15.1g Isozymes Are Enzymes with Slightly Different Subunits 
and Kinetic Properties 


A number of enzymes exist in more than one quaternary form, differing in their 
relative proportions of structurally equivalent but catalytically distinct polypeptide 
subunits. A classic example is mammalian lactate dehydrogenase (LDH), which exists 
in two prominent oligomeric forms, M and H. LDH M is an Ay homotetramer (the A 
subunit is encoded by the LDH A gene). LDH H is also a homotetramer, but its 
subunits are products of the LDH B gene, so its subunit composition is By (Figure 15.5). 
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Figure 15.4 The cascade of activation steps leading to blood clotting. The intrinsic and extrinsic 
pathways converge at factor X, and the final common pathway involves the activation of thrombin 
and its conversion of fibrinogen into fibrin, which aggregates into ordered filamentous arrays that 
become crosslinked to form the clot. 
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Figure 15.5 The isozymes of lactate dehydrogenase (LDH). (a) The five possible A and B subunit 
distributions in LDH molecules. (b) Tissue distributions of the five LDH isomer types. Active 
skeletal muscle tissue becomes anaerobic and produces pyruvate from glucose via glycolysis. It 
needs LDH to regenerate NAD* from NADH so that glycolysis can continue. The lactate produced 
is released into the blood. LDH M works best in the pyruvate-to-lactate direction. Heart tissue is 
aerobic and uses lactate as a fuel, converting it to pyruvate via LDH and using the pyruvate to fuel 
the citric acid cycle to obtain energy. LDH H works best in the lactate-to-pyruvate direction. 
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The exponent n is referred to as the Hill coefficient, in 
honor of Archibald V. Hill, who first observed sigmoid 
binding curves in studies of Oz binding by hemoglobin. 


Sigmoid 


Hyperbolic 


[S] 


Figure 15.6 Sigmoid v versus [S] plot. The blue line 
represents the hyperbolic plot characteristic of 
normal Michaelis—Menten-type enzyme kinetics. 
The red line is S-shaped or sigmoid, and v is 
proportional to [S]’, where n>1. As lines with ever 
greater values of n are drawn, the resulting v vs. [S] 
curves approach a step function (the curves more 
and more resemble a step from zero v to Vmax over a 
very narrow range of [S]). As n increases, the 
response of y to S binding is described as being 
more cooperative; that is, the system is showing 
ultrasensitivity: v is very sensitive to [S] over a 
narrow range of [S]. 


Heterotetrameric isozymes (A3B, A2B2, AB;) can also form, depending on the relative 
expression of the LDH A and LDH B genes in different tissues. The kinetic properties 
of the various LDH isozymes differ. Different tissues express different isozyme 
forms, as appropriate to their particular metabolic needs. By regulating the relative 
amounts of A and B subunits they synthesize, the cells of various tissues control 
which isozymic forms are likely to assemble and thus which kinetic parameters 
prevail. 


15.2 | What Are the General Features 
of Allosteric Regulation? 


The two prominent mechanisms of enzyme regulation that ensure rapid responses to 
meet the moment-by-moment metabolic needs of cells are allosteric regulation and 
covalent modification. Allosteric regulation acts to modulate enzymes situated at key 
steps in metabolic pathways. Consider as an illustration the following pathway, where 
A is the precursor for formation of an end product, F, in a sequence of five enzyme- 
catalyzed reactions: 


enz | enz 2 enz 3 enz 4 enz 5 


A 5? B >C >D >E > F 


In this scheme, F symbolizes an essential metabolite, such as an amino acid or a nu- 
cleotide. In such systems, F, the essential end product, inhibits enzyme 1, the first step 
in the pathway. Therefore, when sufficient F is synthesized, it blocks further synthe- 
sis of itself. This phenomenon is called feedback inhibition or feedback regulation. 


15.2a Regulatory Enzymes Have Certain Exceptional Properties 


Enzymes such as enzyme 1, which are subject to feedback regulation, represent a 
distinct class of enzymes, the regulatory enzymes (also called allosteric enzymes). As a 
class, these enzymes have certain exceptional properties: 


1. Their kinetics do not obey the Michaelis-Menten equation. Their v versus [S] plots 
yield sigmoid- or S-shaped curves rather than rectangular hyperbolas (Figure 15.6). 
Such curves suggest a second-order (or higher) relationship between v and [S]; that 
is, v is proportional to [S]", where n > 1. A qualitative description of the mechanism 
responsible for the S-shaped curves is that binding of one S to a protein molecule 
makes it easier for additional substrate molecules to bind to the same protein mol- 
ecule. In the jargon of allostery, substrate binding is cooperative. 

2. Inhibition of a regulatory enzyme by a feedback inhibitor does not conform to any 
normal inhibition pattern, and the feedback inhibitor F bears little structural simi- 
larity to A, the substrate for the regulatory enzyme. F apparently acts at a binding 
site distinct from the substrate-binding site. The term allosteric is apt, because F is 
sterically dissimilar and, moreover, acts at a site other than the site for S. Its effect 
is called allosteric inhibition. 

3. Regulatory or allosteric enzymes like enzyme 1 are, in some instances, regulated by 
activation. That is, whereas some effector molecules such as F exert negative effects 
on enzyme activity, other effectors show stimulatory, or positive, influences on activity. 

4. Allosteric enzymes typically have an oligomeric organization. They are composed 
of more than one polypeptide chain (subunit), and each subunit has a binding 
site for substrate, as well as a distinct binding site for allosteric effectors. Thus, 
allosteric enzymes typically have more than one S-binding site and more than one 
effector-binding site per enzyme molecule. 

5. The interaction of an allosteric enzyme with effectors alters the distribution of 
conformational possibilities or subunit interactions available to the enzyme. That 
is, the regulatory effects exerted on the enzyme’s activity are achieved by conforma- 
tional changes occurring in the protein when effector metabolites bind. 
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In addition to enzymes, noncatalytic proteins may exhibit many of these properties; 
hemoglobin is the classic example. The allosteric properties of hemoglobin are the sub- 
ject of Section 15.6 What Is the Relationship Between Quaternary Structure and 
Allosteric Regulation? 


15.3 | Can Allosteric Regulation Be Explained 
| by Conformational Changes in Proteins? 


Conformational changes lie at the heart of protein function and allosteric regulation 
is no exception. Various models have been proposed to account for the behavior of 
allosteric proteins. All of them note that proteins can exist in different conforma- 
tional states. Models usually propose a small number of conformations (two or, at 
most, three) for a given protein. 


15.3a The Symmetry Model for Allosteric Regulation Is Based 
on Two Conformational States for a Protein 


For example, the model for allosteric behavior of Jacques Monod, Jeffries Wyman, and 
Jean-Pierre Changeux (the MWC model) proposes two conformational states for an 
allosteric protein: the R (relaxed) state and the T (taut) state. The MWC model is some- 
times referred to as the symmetry model because all subunits in an oligomer are assumed 
to have the same conformation, whether it is R or T. R-state and T-state protein molecules 
are in equilibrium, with the T conformation greatly favored over the R ([T] >> [R]), under 
conditions in which no ligands are present. Assume the substrate S binds only to R, with 
A (the activator) and I (the inhibitor) binding to R and T, respectively. The linked equilib- 
ria lead to changes in the relative amounts of R and T and, therefore, shifts in the sub- 
strate saturation curve. The parameters of such a system are that (1) S and A (or I) have 
different affinities for R and T and (2) A (or I) modifies the apparent Ky; for S by shifting 
the relative R versus T population. This model further suggests that substrate and alloste- 
ric activators (positive effectors) bind only to the R state and allosteric inhibitors (negative 
effectors) bind only to the T state. Figure 15.7 illustrates such a model for a dimeric pro- 
tein, each monomer of which has a substrate-binding site and an effector-binding site. 
Because S binds only to the R state, S binding perturbs the R = T equilibrium in favor of 
more R-state conformers and thus more S binding. That is, S binding is cooperative. The 
concentration of ligand giving half-maximal response is defined as Kos. (Like Km, the 
units of Ko,;are molarity; Km cannot be used to describe these constants because the pro- 
tein does not conform to the Michaelis-Menten model for enzyme kinetics.) The MWC 
model accounts for the action of allosteric effectors. Positive effectors bind only to the R 
state and thus cause a shift of the R = T equilibrium in favor of more R and thus easier S 
binding. Negative effectors do the opposite; they perturb the R = T equilibrium in favor 
of T, the conformation that cannot bind S. Note that positive effectors (allosteric activa- 
tors) cause a decline in the Ky; for S (signifying easier binding of S) and negative effectors 
raise Ko; for S (Figure 15.7). Note that the MWC model assumes an equilibrium between 
conformational states, but ligand binding does not alter the conformation of the protein. 


15.3b The Sequential Model for Allosteric Regulation Is Based 
on Ligand-Induced Conformational Changes 


An alternative model proposed by Daniel Koshland, George Nemethy, and David 
Filmer (the KNF model) relies on the well-accepted idea that ligand binding triggers a 
change in the conformation of a protein. And, if the protein is oligomeric, ligand- 
induced conformational changes in one subunit may lead to changes in the conforma- 
tion of its neighbors. Such ligand-induced conformational change could cause the 
subunits of an oligomeric protein to shift from a low-affinity state to a high-affinity 
state. For example, S binding to one monomer may cause the other monomers to 
adopt conformations with higher affinity for S (Figure 15.8). Interestingly, the KNF 
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Figure 15.7 The MWC symmetry model for allosteric behavior (a) The allosteric protein exists in two 
conformational states, R (pink) and T (orange). Assume S binds only to R, with A and I binding to R 
and T, respectively. The linked equilibria lead to changes in the relative amounts of R and T and, 
therefore, shifts in the substrate saturation curve. (b) The parameters of such a system are that (1) S 
and A (or I) have different affinities for R and T and (2) A (or I) modifies the apparent Ko.s for S by 
shifting the relative R versus T population. 
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Figure 15.8 The Koshland—Nemethy—Filmer sequential model for allosteric behavior. (a) S binding can, 
by induced fit, cause a conformational change in the subunit to which it binds. (b) If subunit interactions 
are tightly coupled, binding of S to one subunit may cause the other subunit to assume a conformation 
having a greater or lesser affinity for S. That is, the ligand-induced conformational change in one subunit 
can affect the adjoining subunit. (c) Such effects could be transmitted between neighboring peptide 
domains by changing alignments of nonbonded amino acid residues. (d) Theoretical curves for the 
binding of a ligand to a protein having four identical subunits, each with one binding site for the ligand. 
The fraction of maximal velocity (v/ Vmax) is plotted as a function of [S]/Kos. 


negative cooperativity ensues. 
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model also explains how ligand-induced conformational changes could cause sub- 
units of a protein to adopt conformations with little or no affinity for the ligand, a 
phenomenon referred to as negative cooperativity. The KNF model is termed the 
sequential model because subunits undergo sequential changes in conformation due to 
ligand binding. A comparison of the response of velocity to substrate concentration 
for positive versus negative cooperativity is shown in Figure 15.8d. 


15.3c The Notable Difference between the MWC and KNF Models 


In the MWC model, the different conformations have different affinities for the vari- 
ous ligands, and the concept of ligand-induced conformational changes is ignored. 
In contrast, the KNF model is based on ligand-induced conformational changes. 


15.3d Changes in the Oligomeric State of a Protein 
Can Also Give Allosteric Behavior 


Although the MWC and KNF models are the best-known paradigms for allosteric protein 
behavior, other models have been put forward. For example, instead of R and T, consider 
a monomer-—oligomer equilibrium for an allosteric protein, where only the oligomer binds 
S and [monomer] >> [ oligomer]. This model strongly resembles the MWC model. In yet 
another model, we might have a monomeric protein with distinct binding sites for several 
different ligands. In this case, binding of ligand A to its site might cause a conformational 
change such that the protein shows much greater affinity for S than it would in the absence 
of A. Or, binding of ligand I might result in a conformational change in the protein such 
that its affinity for S is abolished. Although the binding of other ligands may affect the 
affinity of the monomer for S, S binding cannot show cooperativity in monomeric pro- 
teins, because, unlike oligomers, the monomer has only one binding site for S. 

It is important to realize that all of these various models are attempts to use simple con- 
cepts to explain the complex behavior of a protein. Although these models provide reason- 
able approximations and useful insights, the molecular mechanisms underlying allostery 
cannot be expected to conform rigidly to any one of these models. Shortly, we explore the 
regulated behavior of a real protein (glycogen phosphorylase) with these models in mind. 


15.4 | What Kinds of Covalent Modification 
Regulate the Activity of Enzymes? 


Covalent modification of proteins through introduction of phosphoryl groups 
and/or other charged or bulky functional groups results in substantial protein 
conformational changes that can dramatically affect enzyme activity. 


15.4a Covalent Modification through Reversible Phosphorylation 


As we saw in Figure 15.1, enzyme activity can be regulated through reversible phos- 
phorylation; indeed it is the most prominent form of covalent modification in cellular 
regulation. Phosphorylation is accomplished by protein kinases that target specific 
enzymes for modification. Phosphoprotein phosphatases operate in the reverse 
direction to remove the phosphate group through hydrolysis of the side-chain phos- 
phoester bond. Because protein kinases and phosphoprotein phosphatases work in 
opposing directions, regulation must be imposed on these converter enzymes so that 
their interconvertible enzyme targets are locked in the desired state (active versus in- 
active) and a wasteful cycle of ATP hydrolysis is avoided. Thus, converter enzymes 
are themselves the targets of allosteric regulation or covalent modification. 


15.4b Protein Kinases: Target Recognition and Intrasteric Control 


Protein kinases are converter enzymes that catalyze the ATP-dependent phosphory- 
lation of serine, threonine, or tyrosine hydroxyl groups in target proteins (Table 15.2). 
Phosphorylation introduces a bulky group bearing two negative charges, causing 
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Table 15.2 Examples of the Major Protein Kinase Classes Accordin ificity, Consensus Target Sequence, and Activators 


Protein Kinase Class Target Sequence* Activators 


I. Ser/Thr protein kinases 
A. Cyclic nucleotide—-dependent 


cAMP-dependent (PKA) —R(R/K)X(S*/T*)— cAMP 
cGMP-dependent —(R/K)KKX(S*/T*)— cGMP 
B. Ca?*-calmodulin (CaM)-dependent 
Phosphorylase kinase (PhK) —KRKQIS* VRGL— Phosphorylation by PKA 
Myosin light-chain kinase (MLCK) —KKRPQRATS*NV— Ca?*-CaM 
C. Protein kinase C (PKC) Ca**, diacylglycerol 
D. Mitogen-activated protein kinases (MAP kinases) —PXX(S*/T*)P— Phosphorylation by MAPK kinase 
E. G-protein—coupled receptors 


B-Adrenergic receptor kinase (BARK) 
Rhodopsin kinase 


II. Ser/Thr/Tyr protein kinases 
MAP kinase kinase (MAPK kinase) —TEY— Phosphorylation by Raf (a protein kinase) 


II. Tyr protein kinases 
A. Cytosolic tyrosine kinases (src, fgr, abl, etc.) 
B. Receptor tyrosine kinases (RTKs) 
Plasma membrane receptors for hormones such as 
epidermal growth factor (EGF) or platelet-derived 
growth factor (PDGF) 


*X denotes any amino acid. 


conformational changes that alter the target protein’s function. The added phospho- 
ryl group is distinct from naturally occurring amino acids, since no amino acid side 
chain can provide two negative charges. Protein kinases represent a protein super- 
family whose members are widely diverse in terms of size, subunit structure, and 
subcellular localization. Nevertheless, all share a common catalytic mechanism based 
on a conserved catalytic core/kinase domain of approximately 260 amino acid resi- 
dues (Figure 15.9). Protein kinases are classified as Ser/Thr and/or Tyr specific. They 
also differ in terms of the target proteins that they recognize and phosphorylate; tar- 
get selection depends on the presence of an amino acid sequence within the target 
protein that is recognized by the kinase. For example, cAMP-dependent protein 
kinase (PKA) phosphorylates proteins having Ser or Thr residues within an R(R/K) 
X(S*/T*) target consensus sequence (* denotes the residue that becomes phosphory- 
lated). That is, PKA phosphorylates Ser or Thr residues that occur in an Arg-(Arg or 
Lys)-(any amino acid)-(Ser or Thr) sequence segment (Table 15.2). Although target 
sites for phosphorylation were originally defined as linear sequences of contiguous 
amino acids, it is now appreciated that residues that are not adjacent in the primary 
structure of the protein can come into proximity upon protein folding to form a con- 
sensus target motif for a protein kinase. 

Since protein kinases identify their targets through recognition of specific consensus 
sequences, these kinases can be inhibited by a mechanism called intrasteric control 
(Figure 15.10). Intrasteric control occurs when a regulatory subunit (or protein domain) 
has a pseudosubstrate sequence that mimics the target sequence but usually lacks an 
OH-bearing side chain at the right place. For example, the cAMP-binding RI regulatory 
subunits of PKA possess the pseudosubstrate sequence RRGA*I, and this sequence 
binds to the active site of PKA catalytic subunits, blocking their activity. This 
pseudosubstrate sequence has an alanine residue (A*) where serine occurs in the PKA 
target sequence; Ala is sterically similar to serine but lacks a phosphorylatable OH 
group. (There are two isoforms of R subunits. The RII isoform has an RRVS*V 
pseudosubstrate sequence.) When PKA regulatory subunits bind cAMP, they undergo 
a conformational change and dissociate from the catalytic (C) subunits, and the active 
site of PKA is free to bind and phosphorylate its targets. In other protein kinases, the 

; E , pseudosubstrate sequence involved in intrasteric control and the kinase domain are part 
Figure 15.9 Protein kinase A is shown complexed f : ee : : 
with a pseüdosubstrate peptide (orange): This of the same polypeptide chain. In these cases, binding of an allosteric effector (like 
complex also includes ATP (red) and two Mn?* ions | CAMP) induces a conformational change in the protein that releases the pseudosubstrate 
(vellow) bound at the active site (PDB id = 1ATP). sequence from the active site of the kinase domain. 
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(a) B4-B5 loop 


(b) 


RC 


inactive 


R,-(cAMP) , 


The abundance of many protein kinases in cells is an indication of the great impor- 
tance of protein phosphorylation in cellular regulation. Over 500 protein kinase genes 
have been identified in the human genome. (In contrast, there are less than 200 phospho- 
protein phosphatase genes in the human genome.) Tyrosine kinases (protein kinases that 
phosphorylate Tyr residues) occur only in multicellular organisms (yeast has no tyrosine 
kinases). Tyrosine kinases are components of signaling pathways involved in cell-cell 
communication (see Chapter 32). Table 15.3 presents the classification of eukaryotic pro- 
tein kinases genes into eight principal families according to function and conserved se- 
quence similarity. 


=ukaryotic Protein Kinases Accord 


ce Similarity* l 


Family Family Members 

AGC This family of Ser-Thr kinases includes cAMP, cGMP, and Ca?*-phospholipid- 
dependent kinases; ribosomal protein S6-phosphorylating kinases; kinases that 
phosphorylate G protein-coupled receptors; and all close relatives of these 
groups. 

CAMK Ser-Thr kinases activated by Ca**-calmodulin. 

CK1 Ser-Thr kinases important in cell signaling and more; Casein kinase 1 and close 
relatives. 

CMGC Cyclin-dependent kinases, mitogen-activated protein kinases, CDK-like kinases, 
and glycogen synthase kinase. Members of this family may phosphorylate Ser, 
Thr, or Tyr residues. 

RGC Receptor guanylate cyclase kinases. These membrane receptors form cGMP from 
GTP. They are pseudo-kinases that have a domain sequence organization similar 
to TKs. 

STE Ser-Thr kinases functioning in MAP kinase cascades by phosphorylating MAP 
kinases. 

TK Tyrosine kinases phosphorylate Tyr residues; TKs include the growth factor 
receptor tyrosine kinases (RTKs). 

TKL Tyrosine kinase-like: a diverse family that resembles TKs, but are actually 


Ser-Thr kinases. 


*This classification does not include atypical protein kinases (aPKs) because aPKs share no clear sequence similarity 
to any of these families. As such, aPKs are represented by only a small number of eukaryotic protein kinases. 
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Figure 15.10 Cyclic AMP-dependent protein kinase 
(also known as PKA) is a 170 kDa RC; tetramer in 
mammalian cells. (a) The two R (regulatory) subunits 
bind two equivalents of cAMP each (Kp“™? ~ 30-300 
nM). cAMP binding releases the R subunits from the 
C (catalytic) subunits. The C subunits are 
enzymatically active as monomers. In the molecular 
graphic on the left, a CRTI, tetramer, the C subunits 
are pink; the R subunits, blue (PDB id = 3TNP). 
When [cAMP] is low, an R-subunit dimer binds to 
two copies of the C subunit, forming an inactive 
complex (left). Note the highlighted 64-65 loops in 
the RIIS subunits, which dock the C subunits by 
binding to their C-terminal tails. Docking to the R 
subunits forces the C subunits into a closed, inactive 
conformation. When [cAMP] rises, it binds to the R 
subunits, causing their conformation to change, which 
releases the C subunits as enzymatically active 
monomers ((PDB id = 1J3H; the molecular graphic 
on the right). (b) diagram showing inactive R2C> 
tetramer of left. In the presence of cAMP, R3C> 
dissociates to give an R,(cAMP), complex and two 
catalytically active C subunits. 
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Figure 15.11 Activation of the tricarboxylic acid 
cycle enzyme, malate dehydrogenase (MDH) by 
acetylation. (a) Relative MDH activity from human 
liver cells in the absence (—) and presence (+) 

of inhibitors of lysine deacetylating enzymes. 

(b) Activation of MDH in human liver cells treated 
with glucose. The relative MDH activity in human 
liver cells exposed to a range (0, 6.25, 12.5, and 

25 mM) of glucose concentrations is shown. 
Glucose enhances MDH acetylation. Adapted from 
Figure 2 in Zhao, S., et al., 2010. Regulation of 
cellular metabolism by protein lysine acetylation. 
Science 327:1000-1004. 


15.4c Phosphorylation Is Not the Only Form of Covalent Modification 
That Regulates Enzyme Function 


Several hundred different chemical modifications of proteins have been discovered thus 
far, ranging from carboxylation (addition of a carboxyl group), acetylation (addition of 
an acetyl group), prenylation (see Figure 9.23), and glycosylation (see Figures 7.32 
through 7.37), to covalent attachment of a polypeptide to the protein (addition of ubiq- 
uitin to free amino groups on proteins), to name just a few. A compilation of known 
protein modifications can be found in RESID, at https://proteininformationresource 
.org/resid/resid.shtml Only a small number of these covalent modifications are used to 
achieve metabolic regulation through reversible conversion of an enzyme between ac- 
tive and inactive forms. Table 15.4 presents a few examples. Note that three of these 
types of covalent modification require nucleoside triphosphates (ATP, UTP) that are 
related to cellular energy status; another relies on reducing potential within the cell, 
which also reflects cellular energy status. 


15.4d Acetylation Is a Prominent Modification 
for the Regulation of Metabolic Enzymes 


Acetylation of an e-NH;* group on a lysine side chain changes it from a positively 
charged amino group to a neutral amide, which affects the protein's conformation 
and thus its function. The enzymes responsible are acetyl-CoA-dependent lysine acet- 
yltransferases or KATs. Proteomics studies show that acetylation of metabolic en- 
zymes is an important mechanism for regulating the flow of metabolic substrates 
(carbohydrates, fats) through the central metabolic pathways of carbohydrate and fat 
metabolism. Acetylation activates some enzymes and inhibits others. Cellular levels 
of major metabolic fuels such as glucose, fatty acids, and amino acids influence the 
degree of acetylation (Figure 15.11). The in vivo stability of some enzymes may also be 


| Table 15.4 | IEZA Additional Examples of Regulation by Covalent Modification 


Reaction Amino Acid Side Chain Reaction (see figure indicated) 

Adenylylation Tyrosine Transfer of AMP from ATP to Tyr-OH 

Uridylylation Tyrosine Transfer of UMP from UTP to Tyr-OH 

ADP-ribosylation Arginine Transfer of ADP-ribose from NAD* to Arg 

Methylation Glutamate Transfer of methyl group from 
S-adenosylmethionine to Glu 
y-carboxyl group 

Oxidation-reduction Cysteine (disulfide) Reduction of Cys-S—S-Cys to Cys-SH + 
HS-Cys 

Acetylation Lysine Transfer of acetyl group from acetyl-CoA to 
Lys €-amino group 

(a) (b) 
250 400 


Relative MDH activity (%) 
oo 
S 
5 
Sa pe pe 


Relative MDH activity (%) 


mM glucose 
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regulated by acetylation. Deacetylation by KDACs (lysine deacetylases) reverses the 
effects of acetylation. KDACs include sirtuins, a class of NAD*-dependent protein 
deacetylating enzymes. Sirtuins are implicated in the regulation of energy metabolism 
and longevity. 


15.5 | Is the Activity of Some Enzymes Controlled by Both 
Allosteric Regulation and Covalent Modification? 


Many allosteric enzymes are also subject to regulation by covalent modification. As a 
general rule, the covalently modified allosteric enzyme is more active and less respon- 
sive to allosteric regulation. Exceptions to this rule are allosteric enzymes that are less 
active and more responsive to allosteric regulation when they are covalently modified. 
Glycogen phosphorylase, the enzyme that catalyzes the release of glucose units from 
glycogen, serves as an excellent example of the many enzymes regulated both by 
allosteric controls and by covalent modification. 


15.5a The Glycogen Phosphorylase Reaction Converts Glycogen 
into Readily Usable Fuel in the Form of Glucose-1-Phosphate 


The cleavage of glucose units from the nonreducing ends of glycogen molecules is 
catalyzed by glycogen phosphorylase, an allosteric enzyme. The enzymatic reaction 
involves phosphorolysis of the bond between C-1 of the glucose unit and the glyco- 
sidic oxygen, to yield glucose-1-phosphate and a glycogen molecule that is shortened 
by one residue (Figure 15.12). (Because the reaction involves attack by phosphate in- 
stead of H,O, it is referred to as phosphorolysis rather than hydrolysis.) Phosphorolysis 
produces a phosphorylated sugar product, glucose-1-P, which is converted to the gly- 
colytic substrate, glucose-6-P, by phosphoglucomutase (Figure 15.13). 


15.5b Glycogen Phosphorylase Is a Homodimer 


Muscle glycogen phosphorylase is a dimer of two identical subunits (842 residues, 
97 kDa). Each subunit contains an active site at the center of the subunit and an 
allosteric effector site near the subunit interface (Figure 15.14). In addition, a 


CHOH CHOH CHOH CHOH 
O Oo Oo oO 
OH OH OH OH 
HO (0) Oo (0) O 
L_ OH OH OH OH —n 
Nonreducing end residues 
P; 
CHOH CHOH CHOH CHOH 
O Oo O Oo 
+ 
OH 2— OH OH OH 
HO OPO; HO Oo O O 
OH L_ OH OH OH —n-1 
a-b-Glucose-1-phosphate residues 
HOCH, 2-O3POCHy 
O O 
H H H H H H 
OH H 7 OH H 
HO OPO? HO OH 
H OH H OH 
Glucose-1-phosphate Glucose-6-phosphate 


Figure 15.12 The glycogen phosphorylase reaction. 


Figure 15.13 The phosphoglucomutase reaction. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


524 Chapter 15 Enzyme Regulation 


Figure 15.14 Structure of the glycogen Tower helix 
phosphorylase monomer (PDB id = 8GPB). 


AMP at allosteric 
effector site 


Glycogen 
binding 


site % 


Pyridoxal-P at catalytic site 


regulatory phosphorylation site is located at Ser'4 on each subunit. A glycogen-binding 
site on each subunit facilitates prior association of glycogen phosphorylase with its 
substrate and also exerts regulatory control on the enzymatic reaction. 

Each subunit contributes a tower helix (residues 262 to 278) to the subunit—subunit 
contact interface in glycogen phosphorylase. In the phosphorylase dimer, the tower he- 
lices extend from their respective subunits and pack against each other in an antiparallel 
manner. 


15.5c Glycogen Phosphorylase Activity Is Regulated Allosterically 


The binding of the substrate inorganic phosphate (P;) to muscle glycogen phosphorylase 
is highly cooperative (Figure 15.15a), which allows the enzyme activity to increase 
markedly over a rather narrow range of substrate concentration. That is, muscle glyco- 
gen phosphorylase shows cooperativity in substrate binding. 


ATP and Glucose-6-P Are Allosteric Inhibitors of Glycogen Phosphorylase ATP can be 
viewed as the end product of glycogen phosphorylase action, in that the glucose-1-P 
liberated by glycogen phosphorylase is degraded in muscle via metabolic pathways 
whose purpose is energy (ATP) production. Glucose-l-P is readily converted into 


(a) (b) (c) 


+ AMP 


+ ATP or glucose-6-P 


[Pj] [Pi] [Pj] 


Figure 15.15 v versus S curves for glycogen phosphorylase. (a) The response to the concentration of 
the substrate phosphate (P;). (b) ATP and glucose-6-P are feedback inhibitors. (c) AMP is a positive 
effector. It binds at the same site as ATP. 
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glucose-6-P to feed such pathways. (In the liver, glucose-1-P from glycogen is converted 
to glucose and released into the bloodstream to raise blood glucose levels.) Thus, feed- 
back inhibition of glycogen phosphorylase by ATP and glucose-6-P provides a very 
effective way to regulate glycogen breakdown. Both ATP and glucose-6-P act by de- 
creasing the affinity of glycogen phosphorylase for its substrate P; (Figure 15.15b). Be- 
cause the binding of ATP or glucose-6-P has a negative effect on substrate binding, 
these substances act as negative effectors. Note in Figure 15.15b that the substrate satu- 
ration curve is displaced to the right in the presence of ATP or glucose-6-P, and a 
higher substrate concentration is needed to achieve half-maximal velocity (Vnax/2). 
When concentrations of ATP or glucose-6-P accumulate to high levels, glycogen phos- 
phorylase is inhibited; when [ATP] and [glucose-6-P] are low, the activity of glycogen 
phosphorylase is regulated by availability of its substrate, P;. 


AMP Is an Allosteric Activator of Glycogen Phosphorylase AMP also provides a regula- 
tory signal to glycogen phosphorylase. It binds to the same site as ATP, but it stimulates 
glycogen phosphorylase rather than inhibiting it (Figure 15.15c). AMP acts as a positive 
effector, meaning that it enhances the binding of substrate to glycogen phosphorylase. 
Significant levels of AMP indicate that the energy status of the cell is low and that more 
energy (ATP) should be produced. Reciprocal changes in the cellular concentrations of 
ATP and AMP and their competition for binding to the same site (the allosteric site) on 
glycogen phosphorylase, with opposite effects, allow these two nucleotides to exert rapid 
and reversible control over glycogen phosphorylase activity. Such reciprocal regulation 
ensures that the production of energy (ATP) is commensurate with cellular needs. 

To summarize, muscle glycogen phosphorylase is allosterically activated by AMP 
and inhibited by ATP and glucose-6-P. When ATP and glucose-6-P are abundant, gly- 
cogen breakdown is inhibited. When cellular energy reserves are low (i.e., high [AMP] 
and low [ATP] and [G-6-P]), glycogen catabolism is stimulated. Caffeine can also act as 
an allosteric inhibitor of muscle glycogen phosphorylase (Figure 15.16). 

Glycogen phosphorylase conforms to the MWC model of allosteric transitions, with 
the active form of the enzyme designated the R state and the inactive form denoted as 
the T state (Figure 15.16). Thus, AMP promotes the conversion to the active R state, 
whereas ATP, glucose-6-P, and caffeine favor conversion to the inactive T state. 


4 Covalent control > 
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Figure 15.16 The mechanism of covalent 
modification and allosteric regulation of glycogen 
phosphorylase. Covalent modification converts 
glycogen phosphorylase from the feedback- 
regulated, less active b form to the intrinsically more 
active a form that is unresponsive to feedback 
inhibition. 
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Figure 15.17 The major conformational change that 
occurs in the N-terminal residues upon 
phosphorylation of Ser!*. Ser'4 is shown in red. 
N-terminal conformation of unphosphorylated 
enzyme (phosphorylase b): yellow; N-terminal 
conformation of phosphorylated enzyme 
(phosphorylase a): cyan. (Molecular graphic created 
from PDB id = 8GPB and PDB id = 1GPA.) 


X-ray diffraction studies of glycogen phosphorylase in the presence of allosteric effec- 
tors have revealed the molecular basis for the T—— R conversion. Although the structure 
of the central core of the phosphorylase subunits is identical in the T and R states, a sig- 
nificant change occurs at the subunit interface between the T and R states. This conforma- 
tion change at the subunit interface is linked to a structural change at the active site that is 
important for catalysis. In the T state, the negatively charged carboxyl group of Asp? 
faces the active site, so binding of the anionic substrate phosphate is unfavorable. In the 
conversion to the R state, Asp** is displaced from the active site and replaced by Arg>®. 
The exchange of negatively charged aspartate for positively charged arginine at the active 
site provides a favorable binding site for phosphate anion. These allosteric controls serve 
as a mechanism for adjusting the activity of glycogen phosphorylase to meet normal meta- 
bolic demands. However, in crisis situations in which abundant energy (ATP) is needed 
immediately, these controls can be overridden by covalent modification of glycogen phos- 
phorylase. Covalent modification through phosphorylation of Ser!* in glycogen phos- 
phorylase converts the enzyme from a less active, allosterically regulated form (the b form) 
to a more active, allosterically unresponsive form (the a form). 


15.5d Covalent Modification of Glycogen Phosphorylase 
Overrides Allosteric Regulation 


As early as 1938, it was known that glycogen phosphorylase existed in two forms: the 
less active phosphorylase b and the more active phosphorylase a. In 1956 Edwin Krebs 
and Edmond Fischer reported that a converting enzyme could convert phosphorylase 
b to phosphorylase a. Three years later, Krebs and Fischer demonstrated that the 
conversion of phosphorylase b to phosphorylase a involved covalent phosphoryla- 
tion, as shown in Figure 15.16. 

Phosphorylation of Ser!* causes a dramatic conformation change in phosphorylase. 
Upon phosphorylation, the amino-terminal end of the protein (including residues 
10 through 22) swings through an arc of 120°, moving into the subunit interface 
(Figure 15.17). This conformation change moves Ser!* by more than 3.6 nm. The phos- 
phorylated, or a form, of glycogen phosphorylase is much less sensitive to allosteric 
regulation than the b form. Thus, covalent modification of glycogen phosphorylase 
converts this enzyme from an allosterically regulated form into a persistently active 
form. Covalent modification overrides the allosteric regulation. 

Dephosphorylation of glycogen phosphorylase is carried out by phosphoprotein 
phosphatase 1. The action of phosphoprotein phosphatase | inactivates glycogen phos- 
phorylase. The 1992 Nobel Prize in Physiology or Medicine was awarded to Krebs and 
Fischer for their pioneering studies of reversible protein phosphorylation as an impor- 
tant means of cellular regulation. 


15.5e Enzyme Cascades Regulate Glycogen Phosphorylase 
Covalent Modification 


The phosphorylation reaction that activates glycogen phosphorylase is mediated by 
an enzyme cascade (Figure 15.18). The first part of the cascade leads to hormonal 
stimulation (described in the next section) of adenylyl cyclase, a membrane-bound 
enzyme that converts ATP to adenosine-3’, 5’-cyclic monophosphate, denoted as cyclic 
AMP or simply cA MP (Figure 15.19). This regulatory molecule is found in all eukary- 
otic cells and acts as an intracellular messenger molecule, controlling a wide variety 
of processes. Cyclic AMP is known as a second messenger because it is the intracel- 
lular agent of a hormone (the first messenger). 

The hormonal stimulation of adenylyl cyclase is effected by a transmembrane signaling 
pathway consisting of three components, all membrane associated. Binding of hormone 
to the external surface of a hormone receptor causes a conformational change in this 
transmembrane protein, which in turn stimulates a GTP-binding protein (abbreviated G 
protein). G proteins are heterotrimeric proteins consisting of a- (45-47 kDa), B- (35 kDa), 
and y- (7-9 kDa) subunits. The a-subunit binds GDP or GTP and has an intrinsic, slow 
GTPase activity. In the inactive state, the G,g, complex has GDP at the nucleotide site. 
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Figure 15.19 The adenylyl cyclase reaction. The reaction is driven forward by subsequent hydrolysis 
of pyrophosphate by the enzyme inorganic pyrophosphatase. 


When a G protein is stimulated by a hormone-receptor complex, GDP dissociates and 
GTP binds to G,, causing it to dissociate from Gg, and to associate with adenylyl cyclase 
(Figure 15.20). Binding G, (GTP) activates adenylyl cyclase to form cAMP from ATP. 
However, the intrinsic GTPase activity of G, eventually hydrolyzes GTP to GDP, leading 
to dissociation of G, (GDP) from adenylyl cyclase and reassociation with Gg, to form the 
inactive G,g, complex. This cascade amplifies the hormonal signal because a single 
hormone-teceptor complex can activate many G proteins before the hormone dissociates 
from the receptor, and because the G,-activated adenylyl cyclase can synthesize many 
cAMP molecules before bound GTP is hydrolyzed by G,. Over 800 different human genes 
encoding G-protein coupled receptors (GPCRs) have been identified and more than 20 
distinct G, proteins are known. Since hormone-activated GPCRs catalyze GTP/GDP ex- 
change on G proteins, they are sometimes called guanine nucleotide exchange factors 
(GEFs). GPCRs are key components in a large number of eukaryotic signaling cascades. 

Cyclic AMP is an essential activator of cAMP-dependent protein kinase (PKA). Binding 
of cyclic AMP to the regulatory subunits induces a conformation change that causes the 
dissociation of the C monomers from the R dimer (Figure 15.10). The free C subunits are 
active and can phosphorylate other proteins. One of the many proteins phosphorylated by 
PKA is phosphorylase kinase (Figure 15.18). Phosphorylase kinase is inactive in the unphos- 
phorylated state and active in the phosphorylated form. As its name implies, phosphorylase 
kinase functions to phosphorylate (and activate) glycogen phosphorylase. Thus, hormonal 
activation of adenylyl cyclase leads to activation of glycogen breakdown. 
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Figure 15.20 Hormone binding to its receptor leads 
via G-protein activation to cAMP synthesis. 
Adenylyl cyclase and the hormone receptor are 
integral plasma membrane proteins; G, and Gy, are 
membrane-anchored proteins. 


15.6 | What Is the Relationship between Quaternary 
Structure and Allosteric Regulation? 


Allosteric regulation is a property of proteins that likely emerged with the appear- 
ance of quaternary structure. Many proteins that are not enzymes, such as hormone 
receptors, are allosterically regulated. As a monomer and a tetramer, myoglobin (Mb) 
and hemoglobin (Hb) provide insights the relationship between allostery and quater- 
nary organization. This section focuses exclusively on the comparative biochemistry 
of Mb and Hb to reveal these insights. Myoglobin and hemoglobin are oxygen (O>)- 
binding proteins found in animals. Mb serves as an O>-storage protein in muscle; Hb 
functions as an O,-tranport protein in blood. Because they are two of the most-studied 
proteins in nature, they have become paradigms of protein structure and function. 
Moreover, hemoglobin is a model for protein quaternary structure and allosteric 
function. The binding of O, by hemoglobin, and its modulation by effectors such as 
protons, CO), and 2,3-bisphosphoglycerate, depend on interactions between subunits 
in the Hb tetramer. Subunit—subunit interactions in Hb reveal much about the func- 
tional significance of quaternary associations and allosteric regulation. 


15.6a The Comparative Biochemistry of Myoglobin and 
Hemoglobin Reveals Insights into Allostery 


A comparison of the properties of hemoglobin and myoglobin offers insights into allo- 
steric phenomena, even though these proteins are not enzymes. Hemoglobin displays 
sigmoid-shaped O,-binding curves (Figure 15.21). The unusual shape of these curves was 
once a great enigma in biochemistry. Such curves closely resemble allosteric enzyme- 
substrate saturation graphs (see Figure 15.6). In contrast, myoglobin’s interaction with 
oxygen obeys classical Michaelis—Menten-type substrate saturation behavior. 

Myoglobin is a compact globular protein composed of a single polypeptide chain 
153 amino acids in length; its molecular mass is 17.2 kDa (Figure 15.22). It contains 
heme, a porphyrin ring system complexing an iron ion, as its prosthetic group 
(Figure 15.23). Oxygen binds to Mb via its heme. Hemoglobin (Hb) is also a compact 
globular protein, but Hb is a tetramer. It consists of four polypeptide chains, each of 
which is very similar structurally to the myoglobin polypeptide chain, and each bears a 
heme group. Thus, a hemoglobin molecule can bind four O, molecules. In adult human 
Hb, there are two identical chains of 141 amino acids, the a-chains, and two identical 
B-chains, each of 146 residues. The human Hb molecule is an a28-type tetramer of 
molecular mass 64.45 kDa. 

The myoglobin polypeptide chain and the a- and B-chains of hemoglobin are com- 
posed of 8 a-helical segments denoted by the letters A through H. The short, unordered 
regions that connect the helices are named for the segments they connect, as in the AB 
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Figure 15.21 O,-binding curves for hemoglobin and myoglobin. 
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Myoglobin (Mb) Hemoglobin (Hb) 


Figure 15.22 The myoglobin (PDB id = 2MM1) and hemoglobin (PDB id = 2HHB) molecules. 


region or the EF region. In an amino acid numbering system unique to globin chains, 
successive residues in the helices are numbered, such as the histidine at position 8 in the 
F helix, known as His F8. 

The tetrameric nature of Hb is crucial to its biological function: When a molecule of 
O; binds to a heme in Hb, the heme Fe ion is drawn into the plane of the porphyrin ring. 
This slight movement sets off a chain of conformational events that are transmitted to 
adjacent subunits, dramatically enhancing the affinity of their heme groups for O2. That 
is, the binding of O to one heme of Hb makes it easier for the Hb molecule to bind ad- 
ditional equivalents of O2. Hemoglobin is a marvelously constructed molecular ma- 
chine. Let us dissect its mechanism, beginning with its monomeric counterpart, the 
myoglobin molecule. 


15.6b Myoglobin Is an Oxygen-Storage Protein 


Myoglobin is the oxygen-storage protein of muscle. The muscles of diving mammals 
such as seals and whales are especially rich in this protein, which serves as a store for 
O- during the animal’s prolonged periods underwater. Myoglobin is abundant in skel- 
etal and cardiac muscle of nondiving animals as well. Myoglobin is the cause of the 
characteristic red color of muscle. 


15.6c O, Binds to the Mb Heme Group 


Iron prefers to interact with six ligands, four of which share a common plane. The 
fifth and sixth ligands lie above and below this plane (see Figure 15.24). In heme, four 
of the ligands are provided by the nitrogen atoms of the four pyrroles. A fifth ligand 
is donated by the imidazole side chain of amino acid residue His F8. When myoglobin 
binds O, to become oxymyoglobin, the O, molecule adds to the heme iron ion as the 
sixth ligand (Figure 15.24). O, adds end on to the heme iron, but it is not oriented 
perpendicular to the plane of the heme. Rather, it is tilted about 60° with respect to 
the perpendicular. 


15.6d O, Binding Alters Mb Conformation 


What happens when the heme group of myoglobin binds oxygen? X-ray crystallogra- 
phy has revealed that a crucial change occurs in the position of the iron atom relative 
to the plane of the heme. In deoxymyoglobin, the ferrous ion actually lies 0.055 nm 
above the plane of the heme, in the direction of His F8. The iron—porphyrin complex 
is therefore dome shaped. When O, binds, the iron atom is pulled back toward the 
porphyrin plane and is now displaced from it by only 0.026 nm. The consequences of 
this small motion are trivial as far as the biological role of myoglobin is concerned. 
However, as we shall soon see, this slight movement profoundly affects the properties 
of hemoglobin. 


529 


-00C C007 
i a 
CHo CH 
\ H / 
7 Gc C C 
HC ard S NZ D7 Z CH; 
Ne | / 
C—NH N—C 
Í Noa 
HC HC 
Nc=n  HN—C” 
fd ob `a 
HeC=CO~ Ne Se Scl j 
Ly E A 
HCH, i 
CH, 
Protoporphyrin IX 
-00C COO 
im a 
CHp CHp 
\ H 
HC G c 7 CH 
3 4 ZF y SL Ye a) 
\ | / 
C—N N—C 
4 ‘Bx X 
HC Fe HC 
\c=N ag 
ra l d y —CH 
HeC=C~ Nel Sel Ael i 
E E 
H CH; qH 
CHo 
Heme 


(Fe-protoporphyrin IX) 


Figure 15.23 Heme is formed when protoporphyrin 
IX binds Fe?*. 
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Figure 15.24 The six liganding positions of an iron 
ion. 
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A Deeper Look 


The Oxygen-Binding Curves of Myoglobin and Hemoglobin 


Myoglobin 
The reversible binding of oxygen to myoglobin, 


MbO, == Mb + O- 
can be characterized by the equilibrium dissociation constant, K. 


_ [Mb][O;] 


mor (15.1) 


If Y is defined as the fractional saturation of myoglobin with O», 
that is, the fraction of myoglobin molecules having an oxygen mol- 
ecule bound, then 


[MbO,] 
~ [MbO,] + [Mb] 


(15.2) 


The value of Y ranges from 0 (no myoglobin molecules carry an O32) 
to 1.0 (all myoglobin molecules have an O, molecule bound). Substi- 
tuting from Equation 15.1, ([Mb][O>])/K for [MbO,] gives 


[Mb][O,] ( [0] 
K K 


_ _ __ [0] 
or) (103 +1) [0] + K 
K 


(15.3) 


K 


and, if the concentration of O, is expressed in terms of the partial 
pressure (in torr) of oxygen gas in equilibrium with the solution of 
interest, then 


pO, 


2 


(In this form, K has the units of torr.) The relationship defined 
by Equation 15.4 plots as a hyperbola. That is, the MbO, satura- 
tion curve resembles an enzyme-substrate saturation curve. For 
myoglobin, a partial pressure of 1 torr for pO; is sufficient for half- 
saturation (Figure 1). We can define Ps) as the partial pressure of 
O, at which 50% of the myoglobin molecules have a molecule of O, 
bound (that is, Y = 0.5), then 


(15.5) 


1.0 Reena Coens Saeed eee eey Seem 


pOds, torr 


Figure 1 Oxygen saturation curve for myoglobin in the form of Y versus 
pO showing Ps is at a pO, of 1 torr. 


(Note from Equation 15.1 that when [MbO,] = [Mb], K = [O,], 
which is the same as saying when Y = 0.5, K = Psy.) The general 
equation for O; binding to Mb becomes 
Y= pO, 
pO, + Ps 
The ratio of the fractional saturation of myoglobin, Y, to free 
myoglobin, 1 — Y, depends on pO, and K according to the equation 
Y - pO, 
l= yY K 


(15.6) 


(15.7) 


Hemoglobin 

New properties emerge when four heme-containing polypeptides come 
together to form a tetramer. The O, -binding curve of hemoglobin is 
sigmoid rather than hyperbolic (see Figure 15.21), and Equation 15.4 
does not describe such curves. Of course, each hemoglobin molecule 
has four hemes and can bind up to four oxygen molecules. Suppose for 
the moment that O, binding to hemoglobin is an all-or-none phenome- 
non, where Hb exists either free of O, or with four O, molecules bound. 
This supposition represents the extreme case for cooperative binding 
of a ligand by a protein with multiple binding sites. In effect, it says 
that if one ligand binds to the protein molecule, then all other sites are 
immediately occupied by ligand. Or, to say it another way for the case 
in hand, suppose that four O, molecules bind to Hb simultaneously: 


Hb + 40,=—Hb(O)),4 
Then the dissociation constant, K, would be 
[Hb][O,} 
~ (HBO) 


By analogy with Equation 15.4, the equation for fractional satura- 
tion of Hb is given by 


K (15.8) 


__[pOy} 
[pO] + K 


A plot of Y versus pO, according to Equation 15.9 is presented in 
Figure 2. This curve has the characteristic sigmoid shape seen for O, 
binding by Hb. Half-saturation is set to be a pO, of 26 torr. Note that 
when pO, is low, the fractional saturation, Y, changes very little as 
pO, increases. The interpretation is that Hb has little affinity for O, 
at these low partial pressures of O. However, as pO, reaches some 
threshold value and the first O2 is bound, Y, the fractional saturation, 
increases rapidly. Note that the slope of the curve is steepest in the re- 
gion where Y = 0.5. The sigmoid character of this curve is diagnostic 
of the fact that the binding of O, to one site on Hb strongly enhances 
binding of additional O, molecules to the remaining vacant sites on 
the same Hb molecule, a phenomenon aptly termed cooperativity. (If 
each O, bound independently, exerting no influence on the affinity of 
Hb for more O, binding, this plot would be hyperbolic.) 

The experimentally observed oxygen-binding curve for Hb does 
not fit the graph given in Figure 2 exactly. If we generalize Equation 
15.9 by replacing the exponent 4 with n, we can write the equation as 


__ poy 
[pO + K 


(15.9) 


(15.10) 


(Continued) 
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Rearranging yields 


IY K (15.11) 
This equation states that the ratio of oxygenated heme groups (Y) to 
O; -free heme (1 — Y) is equal to the nth power of the pO, divided 
by the apparent dissociation constant, K. 

Archibald Hill demonstrated in 1913, well before any knowl- 
edge about the molecular organization of Hb existed, that the 
O,-binding behavior of Hb could be described by Equation 15.11. If 
a value of 2.8 is taken for n, Equation 15.11 fits the experimentally 


1.0 


Y 0.5 


pOo, torr 


Figure 2 Oxygen saturation curve for Hb in the form of Y versus pO», 
assuming n = 4 and Psy = 26 torr. The graph has the characteristic, 
experimentally observed, sigmoid shape. 


observed O,-binding curve for Hb very well (Figure 3). If the bind- 
ing of O, to Hb were an all-or-none phenomenon, n would equal 
4, as discussed previously. If the O,-binding sites on Hb were com- 
pletely noninteracting, that is, if the binding of one O, to Hb had 
no influence on the binding of additional O, molecules to the same 
Hb, n would equal 1. Figure 3 compares these extremes. Obviously, 
the real situation falls between the extremes of n = 1 or 4. The 
qualitative answer is that O, binding by Hb is highly cooperative, 
and the binding of the first O, markedly enhances the binding of 
subsequent O, molecules. However, this binding is not quite an all- 
or-none phenomenon. 


1.0 


Y 0.5 


pOo, torr 


Figure 3 A comparison of the experimentally observed O, curve for Hb 
yielding a value for n of 2.8 (blue), the hypothetical curve if n = 4 (red), 
and the curve if n = 1 (noninteracting O,-binding sites, purple). 


15.6e Cooperative Binding of Oxygen by Hemoglobin 
Has Important Physiological Significance 


The relative oxygen affinities of hemoglobin and myoglobin reflect their respective 
physiological roles (see Figure 15.21). Myoglobin, as an oxygen storage protein, has a 
greater affinity for O than hemoglobin at all oxygen pressures. Hemoglobin, as the 
oxygen carrier, becomes saturated with O; in the lungs, where the partial pressure of Oz 
(pO) is about 100 torr.! In the capillaries of tissues, pO, is typically 40 torr, and oxygen 
is released from Hb. In muscle, some of it can be bound by myoglobin, to be stored for 
use in times of severe oxygen deprivation, such as during strenuous exercise. 


15.6f Hemoglobin Has an a2, Tetrameric Structure 


As noted, hemoglobin is an af, tetramer. Each of the four subunits has a conformation 
virtually identical to that of myoglobin (Figure 15.22). The subunits pack in a tetrahedral 
array, creating a roughly spherical molecule 6.4 X 5.5 X 5.0 nm. The four heme groups, 
nestled within the easily recognizable cleft formed between the E and F helices of each 
polypeptide, are exposed at the surface of the molecule. The heme groups are quite far 
apart; 2.5 nm separates the closest iron ions, those of hemes a; and Bo, and those of hemes 
a, and ßı. The subunit interactions are mostly between dissimilar chains: Each of the 
a-chains is in contact with both B-chains, but there are few a—a or B-B interactions. 


'The torr is a unit of pressure named for Evangelista Torricelli, inventor of the barometer. One torr 


corresponds to | millimeter of mercury (1/760th of an atmosphere). 
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Figure 15.25 Changes in the position of the heme 
iron atom upon oxygenation lead to conformational 
changes in the hemoglobin molecule. Darker 
shading shows the heme and F helix in the 
unoxygenated state and lighter shading shows the 
conformation of the heme and its movement of the 
F helix after O2-binding. 


15.6g Oxygenation Markedly Alters the Quaternary Structure of Hb 


X-ray crystallographic analysis reveals that oxyhemoglobin and deoxyhemoglobin 
differ markedly in quaternary structure. In particular, specific aB-subunit interac- 
tions change. Hemoglobin, as a conformationally dynamic molecule, consists of two 
dimeric halves, an a,8,-subunit pair and an aB,-subunit pair. Each aB-dimer moves 
as a rigid body, and the two halves of the molecule slide past each other upon oxygen- 
ation of the heme. The two halves rotate some 15° about an imaginary pivot passing 
through the aB-subunits; some atoms at the interface between aB-dimers are relo- 
cated by as much as 0.6 nm. 


15.6h Movement of the Heme Iron by Less Than 0.04 nm Induces 
the Conformational Change in Hemoglobin 


In deoxyhemoglobin, histidine F8 is liganded to the heme iron ion, but steric 
constraints force the Fe**:His-N bond to be tilted about 8° from the perpendicular to 
the plane of the heme. Steric repulsion between histidine F8 and the nitrogen atoms 
of the porphyrin ring system, combined with electrostatic repulsions between the 
electrons of Fe*t and the porphyrin z-electrons, forces the iron atom to lie out of the 
porphyrin plane by about 0.06 nm. Changes in electronic and steric factors upon 
heme oxygenation allow the Fe** atom to move about 0.039 nm closer to the plane of 
the porphyrin, so now it is displaced only 0.021 nm above the plane. It is as if the O2 
were drawing the heme Fe” into the porphyrin plane (Figure 15.25). This modest 
displacement of 0.039 nm seems a trivial distance, but its biological consequences are 
far reaching. As the iron atom moves, it drags histidine F8 along with it, causing helix 
F, the EF corner, and the FG corner to follow. These shifts are transmitted to the 
subunit interfaces, where they trigger conformational readjustments that lead to the 
rupture of interchain salt links. 


15.61 The Physiological Significance of the Hb:O, Interaction 


We can determine quantitatively the physiological significance of the sigmoid nature of 
the hemoglobin oxygen-binding curve, or, in other words, the biological importance of 
cooperativity. The equation 


Y 7 [pO,]" 
1 — Y Ps 


describes the relationship between pO, the affinity of hemoglobin for O, (defined as 
Psy, the partial pressure of O, giving half-maximal saturation of Hb with O,), and 


FG corner 


Heme 


Porphyrin 
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the fraction of hemoglobin with O, bound, Y, versus the fraction of Hb with no O, 
bound, (1 — Y) (see A Deeper Look box: “The Oxygen-Binding Curves of Myoglobin 
and Hemoglobin”. The coefficient n is the Hill coefficient, an index of the cooperativ- 
ity (sigmoidicity) of the hemoglobin oxygen-binding curve. Taking pO; in the lungs as 
100 torr, Psp as 26 torr, and n as 2.8, the fractional saturation of the hemoglobin heme 
groups with O,, is 0.98. If pO, were to fall to 10 torr within the capillaries of an exercis- 
ing muscle, Y would drop to 0.06. The oxygen delivered under these conditions would 
be proportional to the difference, Yiungs — Ymuscie, Which is 0.92. That is, virtually all the 
oxygen carried by Hb would be released. Suppose instead that hemoglobin binding of 
O- were not cooperative; in that case, the hemoglobin oxygen-binding curve would be 
hyperbolic, and = 1.0. Then Yin the lungs would be 0.79 and Y in the capillaries, 0.28; 
the difference in Y values would be 0.51. Thus, under these conditions, the cooperativity 
of oxygen binding by Hb means that 0.92/0.51 or 1.8 times as much O, can be delivered. 


15.6) The Oxy and Deoxy Forms of Hemoglobin Represent 
Two Different Conformational States 


Hemoglobin resists oxygenation (see Figure 15.21) because the deoxy form is stabi- 
lized by specific hydrogen bonds and salt bridges (ion-pair bonds) (Figure 15.26). All 
of these interactions are broken in oxyhemoglobin, as the molecule stabilizes into a 
new conformation. The shift in helix F upon oxygenation leads to rupture of the Tyr 
B145:Val B98 hydrogen bond. In deoxyhemoglobin, with these interactions intact, the 
C-termini of the four subunits are restrained, and this conformational state is termed 
T, the tense or taut form. In oxyhemoglobin, these C-termini have almost complete 
freedom of rotation, and the molecule is now in its R, or relaxed, form. 


15.6k The Allosteric Behavior of Hemoglobin Has Both Symmetry (MWC) 
Model and Sequential (KNF) Model Components 


Oxygen is accessible only to the heme groups of the a-chains when hemoglobin is in 
the T conformational state. Max Perutz has pointed out that the heme environment 
of B-chains in the T state is virtually inaccessible because of steric hindrance by 
amino acid residues in the E helix. This hindrance disappears when the hemoglobin 
molecule undergoes transition to the R conformational state. Binding of O; to the B- 
chains is thus dependent on a T-to-R conformational shift, and this shift is triggered 
by the subtle changes that occur when O, binds to the a-chain heme groups. Together 
these observations describe a behavior that is partially MWC and partially KNF: O, 
binding to one a-subunit and then the other leads to sequential changes in conforma- 
tion, followed by a switch in quaternary structure at the Hb:20, state from T to R. 
Thus, the real behavior of this protein is an amalgam of the two prominent theoretical 
models for allosteric behavior. 


15.61 H* Promotes the Dissociation of Oxygen from Hemoglobin 


Protons, carbon dioxide, and chloride ions, as well as the metabolite 2,3-bisphospho- 
glycerate (or BPG), all affect the binding of O, by hemoglobin. Their effects have 
interesting ramifications, which we shall see as we discuss them in turn. Deoxyhemo- 
globin has a higher affinity for protons than oxyhemoglobin. Thus, as the pH 
decreases, dissociation of O, from hemoglobin is enhanced. In simple symbolism, 
ignoring the stoichiometry of O, or H* involved: 


HbO, + H* =— HbH* + O, 


Expressed another way, H* is an antagonist of oxygen binding by Hb, and the satura- 
tion curve of Hb for O, is displaced to the right as acidity increases (Figure 15.27). 
This phenomenon is called the Bohr effect, after its discoverer, Danish physiologist 
Christian Bohr (the father of Niels Bohr, the atomic physicist). The effect has impor- 
tant physiological significance because actively metabolizing tissues produce acid, 
promoting O, release where it is most needed. About two protons are taken up by 
deoxyhemoglobin. The N-termini of the two a-chains and the His 6146 residues have 


(a) 


z 


(b) 


Figure 15.26 Salt bridges between different subunits 
in human deoxyhemoglobin. These noncovalent, 
electrostatic interactions are disrupted upon 
oxygenation. (a) A focus on those salt bridges and 
hydrogen bonds involving interactions between 
N-terminal and C-terminal residues in the a-chains. 
Residues in the lower center are Arg a; 141 (green) 
with Val a; | (purple), Asp a 126 (orange), Lys az 
127 (yellow), and Val B> 34 (olive); residues at top 
are Val a, 93 (yellow) with Tyr a; 140 (purple). 

(b) A focus on those salt bridges and hydrogen 
bonds involving C-terminal residues of B-chains: 
Val 62 78 (olive) with Tyr 2 145 (purple); His Bz 146 
(light blue) with Asp $ 94 (orange) and Lys a, 40 
(yellow) (PDB id = 2HHB). 
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Figure 15.27 The oxygen saturation curves for 
myoglobin and for hemoglobin at five different pH 
values: 7.6, 7.4, 7.2, 7.0, and 6.8. 
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Figure 15.28 Oxygen-binding curves of blood and of 
hemoglobin in the absence and presence of CO, and 
BPG. From left to right: stripped Hb, Hb + CO,, 
Hb + BPG, Hb + BPG + COs, and whole blood. 
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been implicated as the major players in the Bohr effect. (The pK, of a free amino 
terminus in a protein is about 8.0, but the pK, of a protein histidine imidazole is 
around 6.5.) Neighboring carboxylate groups of Asp 894 residues help stabilize the 
protonated state of the His 6146 imidazoles that occur in deoxyhemoglobin. However, 
when Hb binds O,, changes in the conformation of B-chains upon Hb oxygenation 
move the negative Asp function away, and dissociation of the imidazole protons is 
favored. 


15.6m CO, Also Promotes the Dissociation of O, from Hemoglobin 


Carbon dioxide has an effect on O, binding by Hb that is similar to that of H*, partly 
because it produces H* when it dissolves in the blood: 


carbonic anhydrase _ 


CO, + H-O 4 HCO; == H* + HCO, 


carbonic acid bicarbonate 


The enzyme carbonic anhydrase promotes the hydration of CO. Many of the protons 
formed upon ionization of carbonic acid are picked up by Hb as O, dissociates. The 
bicarbonate ions are transported with the blood back to the lungs. When Hb becomes 
oxygenated again in the lungs, H* is released and reacts with HCO; to re-form 
H,CO;, from which CO, is liberated. The CO, is then exhaled as a gas. 

In addition, some CO, is directly transported by hemoglobin in the form of carba- 
mate (—NHCOO ). Free a-amino groups of Hb react with CO, reversibly: 


R—NH), + CO. == R—NH—COO- + Ht 


This reaction is driven to the right in tissues by the high CO, concentration; the equi- 
librium shifts the other way in the lungs where [CO ] is low. Thus, carbamylation of 
the N-termini converts them to anionic functions, which then form salt links with the 
cationic side chains of Arg a141 that stabilize the deoxy or T state of hemoglobin. 

In addition to CO», Cl” and BPG also bind better to deoxyhemoglobin than to oxy- 
hemoglobin, causing a shift in equilibrium in favor of O, release. These various effects 
are demonstrated by the shift in the oxygen saturation curves for Hb in the presence of 
one or more of these substances (Figure 15.28). Note that the O,-binding curve for 
Hb + BPG + CO; fits that of whole blood very well. 
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15.6n 2,3-Bisphosphoglycerate Is an Important 
Allosteric Effector for Hemoglobin 


The binding of 2,3-bisphosphoglycerate (BPG) to Hb promotes the release of O,» 
(Figure 15.28). Erythrocytes (red blood cells) normally contain about 4.5 mM BPG, 
a concentration equivalent to that of tetrameric hemoglobin molecules. Interest- 
ingly, this equivalence is maintained in the Hb-BPG binding stoichiometry because 
the tetrameric Hb molecule has but one binding site for BPG. This site is situated 
within the central cavity formed by the association of the four subunits. The strongly 
negative BPG molecule (Figure 15.29) is electrostatically bound via interactions with 
the positively charged functional groups of each Lys B82, His 62, His 6143, and the 
NH;*-terminal group of each B-chain. These positively charged residues are 
arranged to form an electrostatic pocket complementary to the conformation and 
charge distribution of BPG (Figure 15.30). In effect, BPG crosslinks the two B- 
subunits. The ionic bonds between BPG and the two -chains aid in stabilizing the 
conformation of Hb in its deoxy form, thereby favoring the dissociation of oxygen. 
In oxyhemoglobin, this central cavity is too small for BPG to fit. Or, to put it another 
way, the conformational changes in the Hb molecule that accompany O, binding 
perturb the BPG-binding site so that BPG can no longer be accommodated. Thus, 
BPG and O, are mutually exclusive allosteric effectors for Hb, even though their 
binding sites are physically distinct. 


15.60 BPG Binding to Hb Has Important Physiological Significance 


The importance of the BPG effect is evident in Figure 15.28. Hemoglobin stripped of 
BPG is virtually saturated with O, at a pO, of only 20 torr, and it cannot release its 
oxygen within tissues, where the pO; is typically 40 torr. BPG shifts the oxygen satu- 
ration curve of Hb to the right, making Hb an O, delivery system eminently suited to 
the needs of the organism. BPG serves this vital function in humans, most primates, 
and a number of other mammals. However, the hemoglobins of cattle, sheep, goats, 
deer, and other animals have an intrinsically lower affinity for O2, and these Hbs are 
relatively unaffected by BPG. 


15.6p Fetal Hemoglobin Has a Higher Affinity for O2 
Because It Has a Lower Affinity for BPG 


The fetus depends on its mother for an adequate supply of oxygen, but its circulatory sys- 
tem is entirely independent. Gas exchange takes place across the placenta. Ideally then, 
fetal Hb should be able to absorb O, better than maternal Hb so that an effective transfer 
of oxygen can occur. Fetal Hb differs from adult Hb in that the B-chains are replaced by 
very similar, but not identical, 146-residue subunits called y-chains (gamma chains). 
Fetal Hb is thus ayy. Recall that BPG functions through its interaction with the B-chains. 
BPG binds less effectively with the y-chains of fetal Hb (also called Hb F). (Fetal y-chains 


Figure 15.29 The structure, in ionic form, of BPG or 2,3-bisphosphoglycerate, an important allosteric 
effector for hemoglobin. 2,3-PG is shown in the three common structural representations used in this 
book. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


535 


536 Chapter 15 Enzyme Regulation 


Figure 15.30 The ionic binding of BPG to the two 
B-subunits of Hb. BPG lies at center of the cavity 
between the two B-subunits. The highlighted 
residues are N-terminal Val B; and Val B, (yellow), 
His 8, 2, His B2 2, His B; 143, and His 62 143 
(purple), Lys Bı 82 and Lys £2 82 (green) 

(PDB id = 1B86). 


Human Biochemistry 


Hemoglobin and Nitric Oxide 


Nitric oxide (NO -) is a simple gaseous molecule whose many re- 
markable physiological functions are still being discovered. For ex- 
ample, NO - is known to act as a neurotransmitter and as a second 
messenger in signal transduction. Furthermore, endothelial relaxing 
factor (ERF, also known as endothelium-derived relaxing factor, 
or EDRF), a once-elusive hormonelike agent that acts to relax the 
musculature of the walls (endothelium) of blood vessels and lower 
blood pressure, has been identified as NO -. It has long been known 
that NO - is a high-affinity ligand for Hb, binding to its heme-Fe* 
atom 10,000 times better than O2. An enigma thus arises: Why isn’t 
NO - instantaneously bound by Hb within human erythrocytes and 
prevented from exerting its vasodilation properties? 

The reason that Hb doesn’t block the action of NO - is due to 
a unique interaction between Cys 936 of Hb and NO - discovered 
by Li Jia, Celia and Joseph Bonaventura, and Jonathan Stamler at 
Duke University. Nitric oxide reacts with the sulfhydryl group of 
Cys 938, forming an S-nitroso derivative: 


—CH»—S+N=O 


This S-nitroso group is in equilibrium with other S-nitroso 
compounds formed by reaction of NO - with small-molecule thiols 
such as free cysteine or glutathione (an isoglutamylcysteinylglycine 
tripeptide): 


H || 
C—C—N—CHg» 
| H 
CHə—S—N- O 


H | 
H; +N c CH,—CH»,—C—N 
H 
COo- 


S-nitrosoglutathione 


COO- 


These small-molecule thiols serve to transfer NO - from erythrocytes 
to endothelial receptors, where it acts to relax vascular tension. NO - 
itself is a reactive free-radical compound whose biological half-life is 
very short (1-5 s). S-nitrosoglutathione has a half-life of several hours. 

The reactions between Hb and NO - are complex. NO - forms a 
ligand with the heme-Fe’* that is quite stable in the absence of Os. 
However, in the presence of O», NO: is oxidized to NO3;° and the 
heme-Fe?* of Hb is oxidized to Fe**, forming methemoglobin. Fortu- 
nately, the interaction of Hb with NO - is controlled by the allosteric 
transition between R-state Hb (oxyHb) and T-state Hb (deoxyHb). 
Cys 936 is more exposed and reactive in R-state Hb than in T-state 
Hb, and binding of NO - to Cys 938 precludes reaction of NO - with 
heme iron. Upon release of O, from Hb in tissues, Hb shifts confor- 
mation from R state to T state, and binding of NO - at Cys 936 is 
no longer favored. Consequently, NO - is released from Cys 938 and 
transferred to small-molecule thiols for delivery to endothelial recep- 
tors, causing capillary vasodilation. This mechanism also explains the 
puzzling observation that free Hb produced by recombinant DNA 
methodology for use as a whole-blood substitute causes a transient 
rise of 10 to 12 mm Hg in diastolic blood pressure in experimental 
clinical trials. (Conventional whole-blood transfusion has no such ef- 
fect.) It is now apparent that the “synthetic” Hb, which has no bound 
NO -, is binding NO - in the blood and preventing its vasoregulatory 
function. 

In the course of hemoglobin evolution, the only invariant amino 
acid residues in globin chains are His F8 (the obligatory heme li- 
gand) and a Phe residue acting to wedge the heme into its pocket. 
However, in mammals and birds, Cys 938 is also invariant, no doubt 
due to its vital role in NO - delivery. 


Adapted from Jia, L., et al., 1996. S-Nitrosohaemoglobin: A dynamic activ- 
ity of blood involved in vascular control. Nature 380:221—226. 
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have Ser instead of His at position 143 and thus lack two of the positive charges in the 
central BPG-binding cavity.) Figure 15.31 compares the relative affinities of adult Hb (also 
known as Hb A) and Hb F for O, under similar conditions of pH and [BPG]. Note that Hb 
F binds O; at pO, values where most of the oxygen has dissociated from Hb A. Much of 
the difference can be attributed to the diminished capacity of Hb F to bind BPG (compare 
Figures 15.28 and 15.31); Hb F thus has an intrinsically greater affinity for O2, and oxygen 
transfer from mother to fetus is ensured. 


15.6q Sickle-Cell Disease Is Characterized by Abnormal Red Blood Cells 


In 1904, a Chicago physician treated a 20-year-old Black college student complaining 
of headache, weakness, and dizziness. The blood of this patient revealed serious 
anemia—only half the normal number of red cells were present. Many of these cells 
were abnormally shaped; in fact, instead of the characteristic disc shape, these eryth- 
rocytes were elongated and crescentlike in form, a feature that eventually gave name 
to the disease. These sickle cells pass less freely through the capillaries, impairing 
circulation and causing tissue damage. Furthermore, these cells are more fragile and 
rupture more easily than normal red cells, leading to anemia. 


15.6r Sickle-Cell Disease Has Been Characterized as a Molecular Disease 


A single amino acid substitution in the B-chains of Hb causes sickle-cell disease. Re- 
placement of the glutamate residue at position 6 in the B-chain by a valine residue 
marks the only chemical difference between Hb A and sickle-cell hemoglobin, Hb S. 
The amino acid residues at position £6 lie at the surface of the hemoglobin molecule. 
In Hb A, the ionic R groups of the Glu residues are suited to this environment. In 
contrast, the aliphatic side chains of the Val residues in Hb S create hydrophobic pro- 
trusions where none existed before. To the detriment of individuals who carry this 
trait, a hydrophobic pocket forms in the EF corner of each B-chain of Hb when it is in 
the deoxy state, and this pocket nicely accommodates the Val side chain of a neigh- 
boring Hb S molecule (Figure 15.32). This interaction leads to the aggregation of 
Hb S molecules into long, chainlike polymeric structures. The obvious consequence 
is that deoxyHb S is less soluble than deoxyHb A. The concentration of hemoglobin 
in red blood cells is high (about 150 mg/mL), so even in normal circumstances it is on 
the verge of crystallization. The formation of insoluble deoxyHb S fibers distorts the 
red cell into the elongated sickle shape characteristic of the disease.” 


*In certain regions of Africa, the sickle-cell trait is found in 20% of the people. Why does such a delete- 
rious heritable condition persist in the population? Individuals with this trait are less susceptible to the 
most virulent form of malaria. The geographic distribution of malaria and the sickle-cell trait are 
positively correlated. 


Figure 15.31 Comparison of the oxygen saturation 
curves of Hb A and Hb F under similar conditions 
of pH and [BPG]. 
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(a) 


Oxy- 
hemoglobin A 


Deoxy- 
hemoglobin A 


Oxy- 
hemoglobin S 


(b) 


Deoxyhemoglobin S polymerizes into filaments 


Deoxy- 
hemoglobin S 


Figure 15.32 The polymerization of Hb S via the interactions between the hydrophobic Val side 
chains at position B6 and the hydrophobic pockets in the EF corners of -chains in neighboring Hb 
molecules. (a) The protruding block on Oxy S represents the Val hydrophobic protrusion. The 
complementary hydrophobic pocket in the EF corner of deoxy B-chains is represented by a square- 
shaped indentation. (This indentation is probably present in Hb A also.) Only the 8, Val protrusions 
and the B; EF pockets are shown. (The £; Val protrusions and the 6 EF pockets are not involved, 
although they are present.) (b) The polymerization of Hb S via 8, Val6 insertions into neighboring 


Bı pockets. (c) Molecular graphic of an Hb S dimer of tetramers. B, Val residues are highlighted 
in blue; heme is shown in red (PDB id = 2HBS). (d) Molecular graphic of the Hb S filament 
(PDB id = 2HBS). Recall that heme is bound in the EF cleft, so heme highlights the location of 
the B, Val-binding site on 6, subunits. 


Foundational Biochemistry Things You Should Know after Reading Chapter 15. 


The availability of substrates and cofactors usually determines the 
rate of an enzymatic reaction. 


As the product of the reaction accumulates, the apparent rate of 
product formation will slow down due to the increasing rate of the 
reverse reaction, which is directly dependent on [P]. 


Genetic regulation of enzyme synthesis and decay determines the 
amount of enzyme present at any moment. 


Enzymes can be regulated allosterically through reversible binding of 
metabolic effectors at sites other than the active site. 


Enzyme activity can also be regulated through covalent modifica- 
tion, the reversible attachment of chemical groups to amino acid side 
chains in the enzyme. 


Enzymes susceptible to reversible covalent modification are called 
interconvertible enzymes. 


The enzymes that catalyze reversible covalent modification are called 
converter enzymes, and such converter enzymes are subject to meta- 
bolic regulation. 


Zymogens are inactive precursors from which active enzymes can be 
generated by proteolytic cleavage. 


e Isozymes are enzymes whose subunits are homologous. The different 
isozyme subunits have different kinetic properties. 

e Allosteric enzymes catalyze committed steps in metabolic pathways. 

e Many allosteric enzymes are susceptible to feedback inhibition by the 
end product of the metabolic pathway. 

e The kinetic behavior of allosteric enzymes does not conform to the 
Michaelis-Menten model for enzyme kinetics. 

e Substrate saturation curves for allosteric enzymes are sigmoid 
(S-shaped). 

e Allosteric effectors interact with allosteric enzymes at binding sites 
distinct from the substrate-binding (active) site. 

e The activity of an allosteric enzyme can be activated or inhibited, 
depending on the nature of the allosteric effector it binds. 

e Allosteric enzymes are oligomers; each subunit in an allosteric 
enzyme has a substrate-binding site and an effector-binding site. 

e The interaction of allosteric enzymes with their substrates or effectors 
alters the conformation of the subunits. 

e 


Conformational changes in allosteric enzymes are the basis of their 
changing affinity for the various ligands. 
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The Monod—Wyman-—Changeux (MWC) model of allosteric behavior 
postulates that 


1. Allosteric enzymes are oligomeric. 

2. The allosteric enzyme can exist in (at least) two conformational 
states, called R and T. 

3. All subunits in any molecule of enzyme are in the same confor- 
mation (either all R or all T). 

4. The different conformations of the allosteric enzyme have dif- 
ferent affinities for the various ligands. 


5. Lis the equilibrium constant for the R = T equilibrium defined 
as L = T)/Ro. If L is large, and S binds only to R, substrate 
binding is cooperative. 

6. The concentration of ligand giving half-maximal saturation is 
defined as Kos. 


The Monod—Wyman—Changeux (MWC) model of allosteric behav- 
ior is based on linked equilibria between conformational states and 
ligand-binding properties of allosteric proteins. 


The Koshland—Nemethy—Filmer (K NF) model for allosteric regulation 
is based on ligand-induced conformational changes in allosteric proteins. 


Reversible changes in the oligomeric state of an allosteric protein can 
also give rise to allosteric behavior. 


Covalent modification through reversible phosphorylation is a promi- 
nent means of metabolic regulation. 


Protein kinases catalyze the ATP-dependent phosphorylation of 
target proteins. 


Protein kinases phosphorylate certain Ser, Thr, or Tyr-OH side chains. 


Target recognition: The specificity of protein kinases is determined 
by the sequence context in which the Ser, Thr, or Tyr residue is found. 


The activity of protein kinases is regulated by intrasteric control: A pseu- 
dosubstrate sequence occupies and blocks the protein kinase active site. 


Binding of metabolic regulators (or phosphorylation) leads to the 
dissociation of the pseudosubstrate sequence from the protein kinase 
active site and activation of its protein kinase function. 


Phosphorylation is not the only form of covalent modification that 
regulates protein function. 


Acetylation, adenylylation, uridylylation, ADP-ribosylation, methyla- 
tion, and disulfide oxidation—reduction are other forms of covalent 
modification acting in metabolic regulation. 


Glycogen phosphorylase is a paradigm of allosteric regulation and 
covalent modification through reversible phosphorylation. 


Problems 539 


AMP allosterically activates glycogen phosphorylase; ATP and glu- 
cose allosterically inhibit glycogen phosphorylase. 


Glycogen phosphorylase is also regulated through reversible phos- 
phorylation of residue Ser!4. 


Glycogen phosphorylase b, the unphosphorylated form of the enzyme, 
is intrinsically less active and more susceptible to allosteric regulation 
than glycogen phosphorylase a, the phosphorylated form. 


Glycogen phosphorylase behavior reveals an important theme 
in metabolic regulation: covalent modification overrides allosteric 
regulation. 


Whether glycogen phosphorylase is covalently modified is determined 
by an enzyme cascade initiated by signal transduction in response to 
hormones. 


The signal transduction/enzyme cascade culminating in glycogen 
phosphorylase phosphorylation and activation involves a hormone 
receptor, G protein, adenylyl cyclase transmembrane signaling path- 
way and activation of a series of protein kinases. 


The comparative biochemistry of the oxygen-binding proteins 
myoglobin and hemoglobin reveals insights into allostery. 


Myoglobin (Mb) is an oxygen-storage protein; hemoglobin (Hb) is an 
oxygen-transport protein. 

Cooperative binding of O, by hemoglobin has important physiologi- 
cal significance in oxygen delivery to tissues. 


Oxygenation markedly alters the conformation of hemoglobin’s 
quaternary structure. 


Deoxy- and oxy-Hb represent two conformational states of the protein. 


The allosteric behavior of Hb has attributes of both the MWC and 
KNF models. 


H* promotes the dissociation of O, from Hb, as does CO, and 
2,3-bisphosphoglycerate (BPG). 


Fetal hemoglobin has a higher affinity for O, than adult Hb because 
it has a lower affinity for BPG. 


Sickle-cell anemia is a molecular disease due to a mutation in the 
B-globin gene that replaces the glutamate residue at position 6 with 
a valine residue. 


The valine residue at position 6 in the B—globin chains of HbS (sickle 
cell Hb) leads to the polymerization of the deoxy form of HbS into 
filaments that distort the shape of red blood cells. 


Answers to all problems are at the end of this book. Detailed solutions are enzl enz2 enz3 enz4 enz5 
ilable in the Student Solutions Manual. 
available in the Student Solutions Manu A B c D E .F 


1. General Controls Over Enzyme Activity List six general ways in which 
enzyme activity is controlled. 

5. Graphical Analysis of MWC Allosteric Enzyme Kinetics (Integrates with 
Chapter 13.) Draw both Lineweaver—Burk plots and Hanes—Woolf 
plots for an MWC allosteric enzyme system, showing separate curves 
for the kinetic response in (a) the absence of any effectors, (b) the 
presence of allosteric activator A, and (c) the presence of allosteric 
inhibitor I. 


4. Allosteric Regulation of a Pathway Consider the sequence of enzy- 6. Graphical Analysis of Negative Cooperativity in KNF Allosteric Enzyme 
matic reactions in the pathway shown in the figure. Why does it make Kinetics The KNF model for allosteric transitions includes the 
biochemical sense that enzyme | would be allosterically regulated possibility of negative cooperativity. Draw Lineweaver—Burk and 
rather than enzyme 5? Hanes—Woolf plots for the case of negative cooperativity in substrate 


2. Enzyme Induction and Repression Describe how induction and 
repression can affect metabolic activity. 


3. Proteases as Zymogens The digestive proteases are made inside 
pancreatic cells as inactive precursors, so-called zymogens. Why is it 
necessary to make these proteases as inactive precursors? 
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Chapter 15 Enzyme Regulation 


binding. (As a point of reference, include a line showing the classic 
Michaelis-Menten response of v to [S].) 


. The Quantitative Advantage of Allosteric Behavior for 


O,-Transporting Heme Proteins 


[pO,]" 
Ps 


; Y 
The equation = 


(Il — Y) 


allows the calculation of Y (the fractional saturation of hemoglobin 
with O3), given Psy and n (see the Deeper Look box: “The Oxygen- 
Binding Curves of Myoglobin and Hemoglobin”). Let Ps) = 26 torr 
and n = 2.8. Calculate Y in the lungs, where pO, = 100 torr, and Yin 
the capillaries, where pO, = 40 torr. What is the efficiency of O, deliv- 
ery under these conditions (expressed as Yiungs — Yeapittaries)? Repeat the 
calculations, but for n = 1. Compare the values for Yiunes — Yeapittaries 
for n = 2.8 versus Yiunes ~ Yeapittaries for n = 1 to determine the effect 
of cooperative O» binding on oxygen delivery by hemoglobin. 


. Predict the Effect of Caffeine Consumption on Glycogen Phosphorylase 


Activity The cAMP formed by adenylyl cyclase (Figure 15.19) 
does not persist because 5’-phosphodiesterase activity prevalent 

in cells hydrolyzes cAMP to give 5'-AMP. Caffeine inhibits 
5'-phosphodiesterase activity. Describe the effects on glycogen 
phosphorylase activity that arise as a consequence of drinking lots 
of caffeinated coffee. 


. Saturation of Hemoglobin with Oxygen A typical red blood cell 


(RBC) contains about 270,000,000 hemoglobin molecules. There are 
approximately 25 trillion RBCs in the human circulatory system. 
What volume of O; (in L) is needed to fully saturate these RBCs? 
(Assume that temperature = 37°C; P = 100 torr and R = 62.36367 
L torr/K mole.) 


Structure of cAMP Consider the chemical structure of cAMP (Figure 
15.19). Is it biochemically possible to synthesize cGMP, cCMP, and 
cUMP? 


Predict the O,-Binding Properties of Stored Blood Whose Hemoglobin 
Is BPG-Depleted If no precautions are taken, blood that has been 
stored for some time becomes depleted in 2,3-BPG. What happens if 
such blood is used in a transfusion? 


Estimate the Quantitative Effects of Allosteric Regulators on Glycogen 
Phosphorylase Activity Enzymes have evolved such that their K, 
values (or Ko 5 values) for substrate(s) are roughly equal to the in vivo 
concentration(s) of the substrate(s). Assume that glycogen phosphor- 
ylase is assayed at [Pj] ~ Ko.s in the absence and presence of AMP or 
ATP. Estimate from Figure 15.15 the relative glycogen phosphorylase 
activity when (a) neither AMP or ATP is present, (b) AMP is present, 
and (c) ATP is present. (Hint: Use a ruler to get relative values for the 
velocity v at the appropriate midpoints of the saturation curves.) 


cAMP and Phosphodiesterase Phosphodiesterase inactivates cAMP. 
Would inhibitors of phosphodiesterase promote or inhibit the 
hydrolysis of glycogen? 


Describe the Effects on cAMP and Glycogen Levels in Cells Exposed 

to Cholera Toxin Cholera toxin is an enzyme that covalently modifies 
the G,-subunit of G proteins. (Cholera toxin catalyzes the transfer 

of ADP-ribose from NAD* to an arginine residue in G,, an ADP- 
ribosylation reaction.) Covalent modification of G, inactivates its 
GTPase activity. Predict the consequences of cholera toxin on cellular 
cAMP and glycogen levels. 


cAMP, G proteins and Glycogen Storage Adenylyl cyclase is an 
important enzyme, playing a central role in many signaling pathways. 
Explain why inhibition of adenylyl cyclase would lead to more 
glycogen storage in the cytosol and why inhibition of the GTPase 
activity of the G protein that activates adenylyl cyclase would lead to 
less glycogen storage. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


One Way Negative Cooperativity Might Make Metabolic Sense 
Allosteric enzymes that sit at branch points leading to several 
essential products sometimes display negative cooperativity for 
feedback inhibition (allosteric inhibition) by one of the products. 
What might be the advantage of negative cooperativity instead of 
positive cooperativity in feedback inhibitor binding by such enzymes? 


Kinases and Phosphatases Enzyme activity is regulated through 
phosphorylation/dephosphorylation by kinases and phosphatases (see 
Figure 15.1). Kinases and phosphatases are in turn regulated by their 
own set of kinases and phosphatases and these are regulated as well. 
What would be the consequence, on an energy level, if kinases and 
phosphatases were unregulated and constantly active? 


Protein Kinase Specificity Via Consensus Target Sequences and 
Intrasteric Control Consult Table 15.2 and 


a. Suggest a consensus amino acid sequence within phosphorylase 
kinase that makes it a target of protein kinase A (the 
cAMP-dependent protein kinase). 


b. Suggest an effective amino acid sequence for a regulatory 
domain pseudosubstrate sequence that would exert intrasteric 
control on phosphorylase kinase by blocking its active site. 


Allosteric Regulation versus Covalent Modification What are the 
relative advantages (and disadvantages) of allosteric regulation versus 
covalent modification? 


Potential Treatment of Sickle-Cell Disease by Drugs Targeted to HbS 
You land a post as scientific investigator with a pharmaceutical 
company that would like to develop drugs to treat people with 
sickle-cell anemia. They want ideas from you! What molecular 
properties of Hb S might you suggest as potential targets of drug 
therapy? 


Nitric Oxide Is a Metabolic Regulator, But What Kind? Under 
appropriate conditions, nitric oxide (NO -) combines with Cys 93 in 
hemoglobin and influences its interaction with O». Is this interaction 
an example of allosteric regulation or covalent modification? 


Is Lactate an Allosteric Effector for Mb? Lactate, a metabolite 
produced under anaerobic conditions in muscle, lowers the affinity of 
myoglobin for O2. This effect is beneficial, because O, dissociation 
from Mb under anaerobic conditions will provide the muscle with 
oxygen. Lactate binds to Mb at a site distinct from the O-binding site 
at the heme. In light of this observation, discuss whether myoglobin 
should be considered an allosteric protein. 


Morpheeins—a Model for Allosteric Regulation Based on the 
Reversible Association-Dissociation of a Monomeric-Oligomeric 
Protein An allosteric model based on multiple oligomeric states of a 
protein has been proposed by E. K. Jaffe (2005. Morpheeins: A new 
structural paradigm for allosteric regulation. Trends in Biochemical 
Sciences 30:490-497). This model coins the term morpheeins to 
describe the different forms of a protein that can assume more than 
one conformation, where each distinct conformation assembles into 
an oligomeric structure with a fixed number of subunits. For example, 
conformation A of the protein monomer forms trimers, whereas 
conformation B of the monomer forms tetramers. If trimers and 
tetramers have different kinetic properties (Km and keat values), as in 
low-activity trimers and high-activity tetramers, then the morpheein 
ensemble behaves like an allosterically regulated enzyme. Drawing 

on the traditional MWC model as an analogy, diagram a simple 
morpheein model in which wedge-shaped protein monomers assemble 
into trimers but the alternative conformation for the monomer (a 
square shape) forms tetramers. Further, the substrate, S, or allosteric 
regulator, A, binds only to the square conformation, and its binding 
prevents the square from adopting the wedge conformation. Describe 
how your diagram yields allosteric behavior. 
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24. Draw Substrate Saturation Curves for the Substrates of CTP Synthetase 
That Show Both Positive and Negative Cooperativity 
CTP synthetase catalyzes the synthesis of CTP from UTP: 


UTP + ATP + glutamine = CTP + glutamate + ADP + P; 


The substrates UTP and ATP show positive cooperativity in their 
binding to the enzyme, which is an a4-type homotetramer. However, 
the other substrate, glutamine, shows negative cooperativity. Draw 
substrate saturation curves of the form v versus [S]/Ko,; for each of 
these three substrates that illustrate these effects. 


25. Draw a Model for the Conformational States of Glyceraldehyde-3-P 
Dehydrogenase, a Tetrameric Enzyme Displaying Negative 
Cooperativity Glyceraldehyde-3-phosphate dehydrogenase catalyzes 
the synthesis of 1,3-bisphosphoglycerate: 


Glyceraldehyde-3-P + P; + NAD* = 1,3-BPG + NADH + H* 


The enzyme is a tetramer. NAD* binding shows negative coopera- 
tivity. Draw a diagram of possible conformational states for this 
tetrameric enzyme and its response to NAD* binding that illustrates 
negative cooperativity. 


26. Proteomics as a Tool to Study Metabolic Regulation Proteomics studies 
have revealed that protein acetylation is an important mechanism for 
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regulation of metabolic enzymes. Describe how proteomics might 
have led to this discovery. 


27. Hb:O, affinity On the basis of the graphs shown in Figures 15.27 and 
15.28 and the relationship between blood pH and respiration 
(Chapter 2), predict the effect of hyperventilation and hypoventilation 
on Hb:O; affinity. 


28. Regulation of Glycogen Phosphorylase Figure 15.18 traces the 
activation of glycogen phosphorylase from hormone to phosphoryla- 
tion of the b form of glycogen phosphorylase to the a form. These 
effects are reversible when hormone disappears. Suggest reactions by 
which such reversibility is achieved. 


Think-Pair-Share Question 


Malaria and Sickle-Cell Disease When the malarial parasite 
Plasmodium falciparcum infects red blood cells (RBCs), those 

RBCs bearing HbS are more likely to be eliminated by the liver or 
spleen than those bearing normal HbA. Propose a mechanism to 
explain why RBCs with Hb S are more likely to be eliminated than 
normal RBCs. (See Sections 15.6k, 15.6p and beyond for discussions 
on the properties of Hb and Hb S). 
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Molecular Motors 16 


Great ideas originate in the muscles. 


— Thomas Edison 


< Michelangelo’s David epitomizes the 
musculature of the human form. 


Bettmann/Corbis 


Movement is an intrinsic property associated with all living things. Within cells, molecules 16.1 What Are the General Features 
undergo coordinated and organized movements, and cells themselves may move across a of Molecular Motors? 

surface. At the tissue level, muscle contraction allows higher organisms to carry out and 16.2 What Is the Molecular Mechanism 
control crucial functions, such as peristalsis in the gut and the beating of the heart. How of Muscle Contraction? 

can biological macromolecules, carrying out conformational changes on the molecular level, 16.3 What Are the Molecular Motors That 
achieve these feats of movement that span the microscopic and macroscopic worlds? Orchestrate the Mechanochemistry 


of Microtubules? 


16.4 How Do Molecular Motors 
Unwind DNA? 


16.5 How Do Bacterial Flagella Use a 
Proton Gradient to Drive Rotation? 


otor proteins, also known as molecular motors, use chemical energy (typically 

ATP, GTP, or ion gradients) to orchestrate movement, transforming the energy 
into the mechanical energy of motion. In all cases, the energy source (for example, 
ATP hydrolysis) is presumed to drive and control protein conformational changes 
that result in sliding or walking movements of one molecule relative to another. To 
carry out directed movements, molecular motors must be able to associate and dis- 
sociate reversibly with a polymeric protein array, a surface or substructure in the cell. 
ATP hydrolysis drives the process by which the motor protein ratchets along the pro- 
tein array or surface. Elucidation of these basically simple processes has been ex- 
tremely challenging for biochemists, involving the application of many sophisticated 
chemical and physical methods in many different laboratories. This chapter describes 
the structures and chemical functions of molecular motor proteins and some of the 
experiments by which we have come to understand them. 


16.1 | What Are the General Features of Molecular Motors? 


All molecular motors couple an energy source to a movement via protein conforma- 
tional changes. Molecular motors may be linear or rotating. Linear motors crawl or 
creep along a polymer lattice, whereas rotating motors consist of a rotating element 
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(the rotor) and a stationary element (the stator), in a fashion much like a simple elec- 
trical motor. 

To carry out directed movements, linear molecular motors must be able to associate 
and dissociate reversibly with a polymeric protein array, a surface or substructure in the 
cell. ATP hydrolysis (or another energy source) drives the process by which the motor 
protein ratchets along the protein array or surface. The linear motors we will discuss 
include kinesins and dyneins (which crawl along microtubules), myosin (which slides 
along actin filaments in muscle), and DNA helicases (which move along a DNA lattice, 
unwinding duplex DNA to form single-stranded DNA). Rotating motors include the 
flagellar motor complex, described in this chapter, and the ATP synthase involved in 
oxidative phosphorylation. 


16.2 | What Is the Molecular Mechanism 
of Muscle Contraction? 


Muscle contraction is the result of interactions between myosin and actin, the two 
predominant muscle proteins. Thick filaments of myosin slide along thin filaments of 
actin to cause contraction. 


16.2a Muscle Contraction Is Triggered by Ca?+ Release 
from Intracellular Stores 


The long and multinucleate cells of skeletal muscle are referred to as muscle fibers. 
Skeletal muscles in higher animals consist of 100 wm-diameter fibers, some as long as 
the muscle itself. Each of these muscle fibers contains hundreds of myofibrils 
(Figure 16.1), each of which spans the length of the fiber and is about 1 to 2 um in diam- 
eter. Myofibrils are linear arrays of cylindrical sarcomeres, the basic structural units of 
muscle contraction. Each myofibril is surrounded by a specialized endoplasmic 


SR membrane Transverse tubule 


Nucleus 


Contractile ~ 
filaments 


Sarcolemma 
Sarcoplasmic reticulum 


Myofibril Z Terminal cisternae 


Figure 16.1 The structure of a skeletal muscle cell, 
showing the manner in which transverse tubules 
enable the sarcolemmal membrane to extend into 
the interior of the fiber. T-tubules and sarcoplasmic 
reticulum (SR) membranes are juxtaposed at 
structures termed triad junctions (inset). 


Transverse tubule 
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Human Biochemistry 


Medical Applications of Smooth Muscle Effectors 


Not all vertebrate muscle is skeletal muscle. Vertebrate organ- 
isms employ smooth muscle for long, slow, and involuntary 
contractions in various organs, including large blood vessels, 
intestinal walls, the gums of the mouth, and in the female, the 
uterus. Smooth muscle contraction is triggered by Ca**-activated 


CHOH 
OH H CH; 
HO A: CHo N l CH; 
4 buy 
Albuterol 


phosphorylation of myosin by myosin light-chain kinase (MLCK). 
The action of epinephrine and related agents forms the basis of 
therapeutic control of smooth muscle contraction. Breathing 
disorders, including asthma and various allergies, can result from 
excessive contraction of bronchial smooth muscle tissue. Treat- 
ment with epinephrine, whether by tablets or aerosol inhalation, 
inhibits MLCK and relaxes bronchial muscle tissue. More specific 
bronchodilators, such as albuterol (see accompanying figure), act 
more selectively on the lungs and avoid the undesirable side effects 
of epinephrine on the heart. Albuterol is also used to prevent 
premature labor in pregnant women because of its relaxing effect 


on uterine smooth muscle. Conversely, oxytocin, and its synthetic 
version Pitocin, stimulates contraction of uterine smooth muscle. 
This natural secretion of the pituitary gland is often administered 
to induce labor. 


+ 
HN — Gly — Leu — Pro Cys — Asn — Gln — Ile — Tyr — Cys — COO 


Oxytocin (Pitocin) 


reticulum called the sarcoplasmic reticulum (SR). The SR contains high concentrations 
of Ca?*, and the release of Ca** from the SR and its interactions within the sarcomeres 
trigger muscle contraction. The muscle fiber is surrounded by the sarcolemma, a spe- 
cialized plasma membrane. Extensions of the sarcolemma, called transverse tubules, or 
t-tubules, reach deep into the muscle fiber, enabling the sarcolemmal membrane to be 
in contact with each myofibril (Figure 16.1). 

Skeletal muscle contractions are initiated by nerve stimuli that act directly on the 
muscle. Nerve impulses produce an electrochemical signal called an action potential 
that spreads over the sarcolemmal membrane and into the fiber along the t-tubule net- 
work. This signal induces the release of Ca?* ions from the SR. These Ca?* ions bind to 
proteins within the muscle fibers and induce contraction. 


16.2b The Molecular Structure of Skeletal Muscle 
Is Based on Actin and Myosin 


Examination of myofibrils under the electron microscope reveals a banded or stri- 
ated structure. The bands are traditionally identified by letters (Figure 16.2). 
Regions of high electron density, denoted A bands, alternate with regions of low 
electron density, the I bands. Small, dark Z lines lie in the middle of the I bands, 
marking the ends of each sarcomere. Each A band has a central region of slightly 
lower electron density called the H zone, which contains a central M disc (also called 
an M line). Electron micrographs of cross sections of each of these regions reveal 
molecular details. The H zone shows a regular, hexagonally arranged array of thick 
filaments of myosin (15 nm diameter), whereas the I band shows a regular, hexago- 
nal array of thin filaments of actin, together with proteins known as troponin and 
tropomyosin (7 nm diameter). In the dark regions at the ends of each A band, the 
thin and thick filaments interdigitate, as shown in Figure 16.2. The thin and thick 
filaments are joined by cross-bridges. These cross-bridges are actually extensions of 
the myosin molecules, and muscle contraction is accomplished by the sliding of the 
cross-bridges along the thin filaments, a mechanical movement driven by the free 
energy of ATP hydrolysis. 


The Composition and Structure of Thin Filaments Actin, the principal component of 
thin filaments, is found in substantial amounts in most eukaryotic cells. At low ionic 
strength, actin exists as a 42 kDa globular protein, called G-actin (Figure 16.3). 
Under physiological conditions (higher ionic strength), G-actin polymerizes to form 
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Courtesy of Hugh Huxley, Brandeis University 


Figure 16.3 The three-dimensional structure of 
an actin monomer from skeletal muscle. This view 
shows the two domains (/eft and right) of actin 
(PDB id = 1J6Z). 


Thin filaments Thick filaments M disc Thick and thin 
filaments 


Figure 16.2 Electron micrograph of a skeletal muscle myofibril (in longitudinal section). The length 
of one sarcomere is indicated, as are the A and I bands, the H zone, the M disc, and the Z lines. 
Cross sections from the H zone show a hexagonal array of thick filaments, whereas the I band cross 
section shows a hexagonal array of thin filaments. 


a fibrous form of actin, called F-actin. As shown in Figure 16.4, F-actin is a 
right-handed helical structure, with a helix pitch of about 72 nm per turn. The 
F-actin helix is the core of the thin filament, to which tropomyosin and the troponin 
complex also add. Tropomyosin winds around actin filaments and prevents myosin 
binding in resting muscle. When a nerve impulse arrives at the sarcolemmal mem- 
brane, Ca?* ions released from the sarcoplasmic reticulum bind to the troponin 
complex, inducing a conformation change that allows myosin to bind to actin, initiat- 
ing contraction. In nonmuscle cells, actin filaments are the highways across which a 
variety of cellular cargo is transported. 


The Composition and Structure of Thick Filaments Myosin, the principal component 
of muscle thick filaments, is a large protein consisting of six polypeptides, with an 
aggregate molecular weight of approximately 540 kDa. As shown in Figure 16.5, 
the six peptides include two 230 kDa heavy chains as well as two pairs of different 
20 kDa light chains, denoted LC1 and LC2. The heavy chains consist of globular 
amino-terminal myosin heads, joined to long a-helical carboxy-terminal segments, 
the tails. These tails are intertwined to form a left-handed coiled coil approxi- 
mately 2 nm in diameter and 130 to 150 nm long. Each of the heads in this dimeric 
structure is associated with an LC] and an LC2. The myosin heads exhibit ATPase 
Figure 16.4 A molecular model of an actin polymer, activity, and hydrolysis of ATP by the myosin heads drives muscle contraction. 


Based on the actin monomer SHUCtüTe SHOWA in LCI is also known as the essential light chain, and LC2 is designated the regulatory 
Figure 16.3 (PDB id = 1A5X). Actin consists of two light chain. Both light chai h l t l duli dt in C 
strands of actin polymer wrapped around each ight chain. Both light chains are homologous to calmodulin and troponin C. 
other. This image shows three monomers in each Dissociation of LC1 from the myosin heads in vitro results in loss of the myosin 
strand. ATPase activity. 
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Figure 16.5 Myosin structure. (a) A schematic drawing of a myosin hexamer, showing the two heavy 
chains and four light chains. The tail is a coiled coil of intertwined a-helices extending from the two 
globular heads. One of each of the myosin light-chain proteins, LC] and LC2, is bound to each of 
the globular heads. (b) A ribbon diagram shows the structure of the myosin head. The head and neck 
domains of the heavy chain are red; the essential light chain is yellow and the regulatory light chain is 
blue (PDB id = 1B7T). 


The myosin head consists of a globular domain where ATP is bound and hydro- 
lyzed, and a long a-helical neck to which the light chains are bound. The most promi- 
nent feature of the globular head is the actin-binding cleft between the so-called upper 
and lower domains. The N-terminal domain and the upper domain together form a 
seven-stranded B-sheet. The ATP-binding site is partially defined by three loops: 
switch 1, switch 2, and the P-loop. Conformation changes driven by ATP hydrolysis 
cause a rotation of the converter domain (Figure 16.5) and the long-neck helix—the 
fundamental event in contraction. 


Repeating Structural Elements Are the Secret of Myosin’s Coiled Coils A number of key 
features of the myosin sequence are responsible for the a-helical coiled coils formed by 
myosin tails. Several orders of repeating structure are found in all myosin tails, includ- 
ing 7, 28, and 196 residue repeating units. Large stretches of the tail domain are com- 
posed of seven residue repeating segments. The first and fourth residues of these seven 
residue units are generally small, hydrophobic amino acids, whereas the second, third, 
and sixth are likely to be charged residues. The consequence of this arrangement is 
shown in Figure 16.6. Seven residues form two turns of an a-helix, and in the coiled 
coil structure of the myosin tails, the first and fourth residues face the interior contact 
region of the coiled coil. Residues b, c, and f (2, 3, and 6) of the seven-residue repeat 
face the periphery, where charged residues can interact with the water solvent. At the 
28 (4 X 7) residue and 196 (28 X 7) residue levels, specialized amino acid sequence 
patterns promote packing of large numbers of myosin tails in offset or staggered 
arrays (Figure 16.7). 
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Figure 16.6 An axial (top-down) view of the 
two-stranded, a-helical coiled coil of a myosin tail. 
Hydrophobic residues a and d of the 7 residue 
repeat sequence align to form a hydrophobic core. 
Residues b, c, and f face the outer surface of the 
coiled coil and are typically ionic. 
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Figure 16.7 The packing of myosin molecules in a 
thick filament. Adjoining molecules are offset by 
approximately 14 nm, a distance corresponding to 
98 residues of the coiled coil. The long tails of 
myosin molecules are anchored at the M disk, 
leaving a bare zone in the middle. 


Location of M disc region 


Myosin heads Bare zone 


16.2c The Mechanism of Muscle Contraction 
Is Based on Sliding Filaments 


When muscle fibers contract, the thick myosin filaments slide or walk along the thin 
actin filaments. The basic elements of the sliding filament model were first described in 
1954 by two different research groups: Hugh Huxley and his colleague Jean Hanson, 
and Andrew Huxley and his colleague Ralph Niedergerke. Several key discoveries paved 
the way for this model. Electron microscopic studies of muscle revealed that sarcomeres 
decreased in length during contraction and that this decrease was due to decreases in the 
width of both the I band and the H zone (Figure 16.8). At the same time, the width of the 
A band (which is the length of the thick filaments) and the distance from the Z discs to 
the nearby H zone (that is, the length of the thin filaments) did not change. These obser- 
vations made it clear that the lengths of both the thin and thick filaments were constant 
during contraction. This conclusion was consistent with a sliding filament model. 


The Sliding Filament Model The shortening of a sarcomere (Figure 16.8) involves slid- 
ing motions in opposing directions at the two ends of a myosin thick filament. Net 
sliding motions in a specific direction occur because the thin and thick filaments both 
have directional character. Actin filaments always extend outward from the Z lines in 
a uniform manner. The myosin thick filaments also assemble in a directional manner. 


Careers in Chemistry 


Dr. Enrique M. De La Cruz 


received postdoctoral training at the University of 


Dr. Enrique M. De La Cruz is a Professor and 
Chair of Molecular Biophysics and Biochemistry 
at Yale University as well as the Head of Branford 
College where he works alongside his wife, 
Leslie Carmin. He received a B.S. in Biology and 
Chemistry at Rutgers University-Newark School 
of Arts and Sciences before earning his Ph.D. in 
Biochemistry, Cell and Molecular Biology from 
Johns Hopkins University School of Medicine. He 


Pennsylvania. 

Dr. De La Cruz and his lab integrate kinetic and 
thermodynamic analyses of catalytic pathways with 
computational modeling to help develop and test 
predictive models of biopolymer fragmentation, 
work output by motor proteins, and enzyme func- 
tion and adaptation. 

When asked how biochemistry shapes his daily 
life, Dr. De La Cruz joked, “It is in all we do!” 
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Figure 16.8 The sliding filament model of a ' b a d Í 
skeletal muscle contraction. The decrease in X ' ‘, r ! d “ 
sarcomere length is due to decreases in the i j b 4 á 
width of the I band and H zone, with no 
change in the width of the A band. These Contracted 


observations mean that the lengths of both 
the thick and thin filaments do not change 
during contraction. Rather, the thick and 
thin filaments slide along one another. 


H zone and I band decrease in width 
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A Deeper Look 


The P-Loop NTPases—Energy to Run the Motors 


Energy obtained from hydrolysis of nucleoside triphosphates (NTPs) 
is fundamental to a myriad of biological processes, and indeed to life 
itself. The P-loop NTPases are an abundant class of enzymes that 
harness the energy of NTPs and direct it to a vast array of critical 
cellular functions, including molecular motors. Genomic analysis 
reveals that 10-18% of prokaryotic and eukaryotic genes contain 
P-loop NTP domains! 

P-Loop NTPases all contain a polypeptide loop with the se- 
quence GxxxxGK(S/T) or GxxGK(S/T) between a B-strand and 
an a-helix. This sequence, the so-called P-loop, also known as the 
Walker A motif, coordinates the triphosphate chain of the NTP 
to be cleaved. The P-loop, and a second, more variable region 
called the Walker B motif (UWWY[D/E], where ¥ is a hydropho- 
bic residue), provide the binding interactions with the nucleoside 
triphosphate (typically ATP or GTP). The P-loop NTPases share 
a common aa core, consisting of a parallel B-sheet sandwiched 
between two sets of a-helices. 

The essential features of NTP binding and hydrolysis are rep- 
resented in the F,Fo-ATP synthase, which binds ADP and P; and The mitochondrial F,|Fo-ATP synthase is a P-loop NTPase (PDB id = 


B-Glu188 


synthesizes ATP at the interface between two subunits. The ATP 1H8E). The ATP binding site includes the a-helix, Walker A motif 
binding site consists of Walker A (P-loop) and Walker B motifs (P-loop), and -sheet shown here from the 6-subunit and an arginine 
from the B subunit and an arginine finger that extends from the finger (Arg373, yellow) from the a-subunit. ADP is shown in salmon, 
adjacent a subunit (see figure). Mg in purple, and AIF,” in blue. 

Here ADP and A1F, bound in the NTP site mimic the pen- 
tavalent transition state for ATP hydrolysis, with Al in place of the The binding and hydrolysis of NTP between adjacent subunits is 
y-phosphorus of ATP and the F atoms in place of oxygens. a common behavior of many P-loop NTPases, as we will see with 


the helicases described later in this chapter. 


The polarity of myosin thick filaments reverses at the M disc, which means that actin 
filaments on either side of the M disc are pulled toward the M disc during contraction 
by the sliding of the myosin heads, causing net shortening of the sarcomere. 


ATP Decreases Myosin’s Affinity for Actin The molecular events of contraction are 
powered by the ATPase activity of myosin. Much of our present understanding of this 
reaction and its dependence on actin can be traced to several key discoveries by 
Albert Szent-Györgyi at the University of Szeged in Hungary in the early 1940s. 

In a series of elegant and insightful experiments, Szent-Györgyi showed the following: 


e Solution viscosity is increased dramatically when solutions of myosin and actin are 
mixed. Increased viscosity is a manifestation of the formation of an actomyosin complex. 

e The viscosity of an actomyosin solution is lowered by the addition of ATP, indicat- 
ing that ATP decreases myosin’s affinity for actin. 

e Myosin ATPase activity is increased substantially by actin. (For this reason, 
Szent-Györgyi gave the name actin to the thin filament protein.) The ATPase turn- 
over number of pure myosin is 0.05/s, but when actin is added, the turnover number 
increases to about 10/s, a number more like that of intact muscle fibers. 


Szent-Gy6rgyi’s experiments demonstrated that ATP hydrolysis and the association 
and dissociation of actin and myosin are coupled. It is this coupling that enables ATP 
hydrolysis to power muscle contraction. 


The Coupling Mechanism: ATP Hydrolysis Drives Conformation Changes in the Myosin 
Heads The only remaining piece of the puzzle is how the close coupling of actin- 
myosin binding and ATP hydrolysis result in the shortening of myofibrils. Put another 
way, how are ATP hydrolysis and the sliding filament model connected? The answer 
to this puzzle is shown in Figure 16.9. The free energy of ATP hydrolysis is translated 
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Figure 16.9 The mechanism of skeletal 
muscle contraction. (a) A scheme for the 
contraction cycle. ATP binding to the 
myosin head dissociates the strongly 
bound actomyosin rigor complex. 
During the recovery step, which occurs 
with ATP bound, the myosin head can 
move freely up and down. ATP 
hydrolysis occurs only in the up state. 
Binding to actin occurs with ATP 
hydrolysis. The power stroke is 
accompanied by dissociation of ADP 
and P; and results in net movement of 
the myosin head along the actin 
filament. (b) Structures representing the 
four states depicted in part (a). (Adapted 
from Malnasi-Csizmadia, A., and 
Kovacs, M., 2010. Emerging complex 
pathways of the actomyosin 
powerstroke. Trends in Biochemical 
Sciences 35:684-690; and Sweeney, 

H. L., and Houdusse, A., 2010. 
Structural and functional insights into 
the myosin motor mechanism. Annual 
Review of Biophysics 39:539-557.) 
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into conformation changes in the myosin head, so dissociation of myosin and actin, 
hydrolysis of ATP, and rebinding of myosin and actin occur with stepwise movement 
of the myosin head along the actin filament. The conformation changes in the myosin 
head are driven by the binding and hydrolysis of ATP. 

As shown in the cycle in Figure 16.9, the myosin heads are mainly dissociated from 
the actin filaments in resting muscle. When the signal to contract is presented (see 
following discussion), the myosin heads move out from the thick filaments to bind to 
actin on the thin filaments. Actin binding closes the cleft in the myosin head, which 
causes a twist in the large B-sheet. The twist causes the P-loop to open, moving away 
from the bound ADP and P;. The -sheet twist also straightens the kink in the relay 
helix. The power stroke occurs almost immediately, accompanied by dissociation of P; 
and then ADP. The power stroke consists of a 60° rotation of the converter domain 
and the long a-helical neck into the down position relative to the myosin head 
(Figure 16.9)—a movement that results in a 100 A movement of the end of the neck in 
the direction of contraction. 

The end of the power stroke is termed the rigor-like state because without access to 
additional ATP, the actin-myosin pair would be locked together, unable to dissociate. 
However, binding another ATP causes dissociation of myosin from actin, as first noticed 
by Szent-Györgyi. 

The conformational changes occurring in this cycle are the secret of the energy cou- 
pling that allows ATP binding and hydrolysis to drive muscle contraction. 


16.3 | What Are the Molecular Motors That Orchestrate 
the Mechanochemistry of Microtubules? 


Most eukaryotic cells contain elaborate networks of protein fibers collectively termed 
the cytoskeleton. The cytoskeleton is a dynamic, three-dimensional structure 
(Figure 16.10) that fills the cytoplasm and functions to: 


e Establish cell shape 

e Provide mechanical strength 

e Facilitate cell movement 

e Support intracellular transport of organelles and other cellular cargo 
e Guide chromosome separation during mitosis and meiosis 


Molecular motors move along the fibers of the cytoskeleton to facilitate these functions. 
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Figure 16.10 The cytoskeleton of eukaryotic cells. (a) Micrograph of the cytoskeleton of a eukaryotic 
cell (microtubules are green; nucleus is blue); (b) schematic drawing of elements of the cytoskeleton. 
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Figure 16.11 (a) The structure of the tubulin 
aB-heterodimer (PDB id = 1JFF). (b) Microtubules 
may be viewed as consisting of 13 parallel, staggered 
protofilaments of alternating a-tubulin and 
B-tubulin subunits. The sequences of the a- and 
-subunits of tubulin are homologous, and the 
aB-tubulin dimers are quite stable if Ca* is 

present. The dimer is dissociated only by strong 
denaturing agents. 


16.3a Filaments of the Cytoskeleton Are Highways 
That Move Cellular Cargo 


Three types of fibers comprise the cytoskeleton: microfilaments of actin (with a 
diameter of 3 to 6 nm), microtubules made from tubulin (20 to 25 nm diameter), and 
intermediate filaments formed from a variety of proteins (about 10 nm diameter). All 
of these have dynamic properties that facilitate the movement of organelles and other 
molecular cargo through the cell. 

Intermediate filaments provide a supporting network that allows cells to resist 
mechanical stress and deformation. Intermediate filaments are dynamic, and short fila- 
ment segments can be transported across cells by motor proteins riding on microfila- 
ments and microtubules. Polymeric actin microfilaments serve at least two functions in 
cells: They form networks just beneath the plasma membrane that link transmembrane 
proteins to cytoplasmic proteins, and they provide transcellular tracks on which organ- 
elles can be transported by myosin-like proteins. Microtubules are the best understood 
components of the cytoskeleton, and they are the focus of this section. 

Microtubules are hollow, cylindrical structures, approximately 30 nm in diame- 
ter, formed from tubulin, a dimeric protein composed of two similar 55 kDa sub- 
units known as a-tubulin and B-tubulin. Eva Nogales, Sharon Wolf, and Kenneth 
Downing determined the structure of the bovine tubulin aB-dimer to 3.7 A resolu- 
tion (Figure 16.11a). Tubulin dimers polymerize as shown in Figure 16.11b to form 
microtubules, which are essentially helical structures, with 13 tubulin monomer 
residues per turn. Microtubules grown in vitro are dynamic structures that are con- 
stantly being assembled and disassembled. Because all tubulin dimers in a microtu- 
bule are oriented similarly, microtubules are polar structures. The end of the 
microtubule at which growth occurs is the plus end, and the other is the minus end. 
Microtubules in vitro carry out a GTP-dependent process called treadmilling, in 
which tubulin dimers are added to the plus end at about the same rate at which 
dimers are removed from the minus end (Figure 16.12). 

Although composed only of 55 kDa tubulin subunits, microtubules can grow suffi- 
ciently large to span a eukaryotic cell or to form large structures such as cilia and fla- 
gella. Inside cells, networks of microtubules play many functions, including formation 
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of the mitotic spindle that segregates chromosomes during cell division, the movement 
of organelles and various vesicular structures through the cell, and the variation and 
maintenance of cell shape. In most cells, microtubules are oriented with their minus 
ends toward the centrosome and their plus ends toward the cell periphery. This consistent 
orientation is important for mechanisms of intracellular transport. 

The human genome includes seven genes for a-tubulin and eight for B-tubulin. 
The variation in amino acid sequence and chain length in these similar genes, termed 
isotypes, is mainly in the unstructured C-terminal tail (CTT) domain, which protrudes 
from the tubulin polymer, whereas the ~400-amino acid structural cores of a and B 
are highly conserved. Tubulin is subject to a large range of post-translational modifi- 
cations (PTMs), almost all of which are confined to the CTT. These modifications 
regulate the function of associated motor proteins such as kinesins and dyneins. The 
various tubulin isotypes and PTMs govern motor velocity, processivity, and microtu- 
bule depolymerization rates, with substantial changes produced by even a single 
amino acid variation. Different molecular motors recognize distinctive tubulin 
structural patterns. 


16.3b Three Classes of Motor Proteins Move Intracellular Cargo 


Three principal classes of motor proteins move organelles and other cellular cargo on 
cytoskeletal filament highways in both eukaryotic and prokaryotic cells. In addition 
to the myosins, most cells contain kinesins and dyneins. Humans possess 40 genes for 
myosins, 45 for kinesins, and at least 14 for dyneins (Table 16.1). The large number of 
genes in each class reflects specialized structures required for a variety of functions. 
This diversity notwithstanding, these three classes of motor proteins share remark- 
able similarities of structure and function. 

Kinesin 1, also called conventional kinesin, is a tetramer consisting of a dimer of 
heavy chains (110 kDa) associated with two light chains (65 kDa). The heavy chains 
contain an N-terminal motor domain, a long coiled-coil stalk with a central hinge, and 
a globular C-terminal tail domain where the light chains bind (Figure 16.13a). The 
motor domain binds to tubulin in microtubules, and the globular tail domain associates 
with the intended cellular cargo, for example, an organelle, an mRNA molecule, or an 
intermediate filament. In different kinesin families, the motor domain is located in dif- 
ferent places in the sequence, depending on the function of the specific family. 

A second class of cytoplasmic motor proteins is that of the dyneins. The first dyneins 
to be discovered were axonemal dyneins, which cause sliding of microtubules in cilia and 
flagella. Cytoplasmic dyneins were first identified in Caenorhabditis elegans, a nematode 
worm. Cytoplasmic dynein consists of a dimer of two heavy chains (500 kDa) with several 
other tightly associated light chains (Figure 16.13c). Each heavy chain contains a large 
motor domain (380 kDa) encompassing six AAA+ domains (see Section 16.4) arranged 
as a hexamer. A 10 nm stalk composed of a coiled coil projects from the head, between the 
fourth and the fifth AAA+ domains. The stalk is the microtubule-binding domain. 


| Table 16.1 | 16.1 Genes for Molecular Motors 


Number of Genes 


Genome Kinesins Dyneins Myosins 
Giardia lamblia (protozoan parasite) 25 10 0 
Saccharomyces cerevisiae (yeast) 25 1 5 
Drosophila melanogaster (fruit fly) 25 13 13 
Caenorhabditis elegans (roundworm) 20 2 17 
Arabidopsis thaliana (flowering plant) 6l 0 17 
Homo sapiens (human) 45 14-16 40 
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Figure 16.12 A model of the GTP-dependent 
treadmilling process. Both a- and 6-tubulin 
possess two different binding sites for GTP. The 
polymerization of tubulin to form microtubules is 
driven by GTP hydrolysis in a process that is only 
beginning to be understood in detail. 
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Figure 16.13 Models of motor proteins. (a) Conventional kinesin (kinesin I); (b) Myosin V; 

(c) Cytoplasmic dynein. Dyneins contain several different light chains. The C-terminal tails of the 
dynein light chains designated LIC (for light intermediate chains) are the cargo-binding domains. 
Adapted from Vale, R., 2003. The molecular motor toolbox for intracellular transport. Cell 112:467-480, 
and Schroeder, C. M., Ostrem, J. M. L., Hertz, N. T., and Vale, R. D., 2014. A Ras-like domain in the 
light intermediate chain bridges the dynein motor to a cargo-binding region. eLife. 3:e03351:1—22. 


Myosin V (a third class of cytoplasmic motor protein) is a multimeric protein that 
consists of 16 polypeptide chains. The structure is built around a dimer of heavy chains, 
each of which includes head, neck, and tail domains (Figure 16.13b). The heavy chain 
head domain is virtually indistinguishable from the head domain of myosin I from 
skeletal muscle (see Figure 16.5), but the neck domain is three times longer than the 
myosin II neck helix and it contains six repeats of a calmodulin-binding domain. Myosin 
V is normally associated with an essential light chain (similar to that of myosin IJ), 
together with several calmodulins. Adjoining the neck is a 30 nm-long coiled-coil 
domain. The tail domain of myosin V also binds a light chain and is adapted to bind 
specific organelles and other cargoes. 


16.3c Dyneins Move Organelles in a Plus-to-Minus Direction; 
Kinesins, in a Minus-to-Plus Direction 


The mechanisms of intracellular, microtubule-based transport of organelles and ves- 
icles were first elucidated in studies of axons, the long projections of neurons that 
extend great distances from the body of the cell. In these cells, it was found that sub- 
cellular organelles and vesicles could travel at surprisingly fast rates—as great as 
1000 to 2000 nm/s—in either direction. 

Cytosolic dyneins specifically move organelles and vesicles from the plus end of a 
microtubule to the minus end. Thus, dyneins move vesicles and organelles from the cell 
periphery toward the centrosome (or, in an axon, from the synaptic termini toward the cell 
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Human Biochemistry 


Effectors of Microtubule Polymerization as Therapeutic Agents 


Microtubules in eukaryotic cells are important for the maintenance 
and modulation of cell shape and the disposition of intracellular ele- 
ments during the growth cycle and mitosis. The inhibition of micro- 
tubule polymerization can block many normal cellular processes. 
The alkaloid colchicine (see accompanying figure), a constituent of 
the swollen, underground stems of the autumn crocus (Colchicum 
autumnale) and meadow saffron, inhibits the polymerization of 
tubulin into microtubules. This blocks the mitotic cycle of plant and 
animal cells. Colchicine also inhibits cell motility and intracellular 
transport of vesicles and organelles (which in turn blocks secretory 
processes of cells). Colchicine has been used for hundreds of years 
to alleviate some of the acute pain of gout and rheumatism. In gout, 
white cell lysosomes surround and engulf small crystals of uric acid. 
The subsequent rupture of the lysosomes and the attendant lysis of 
the white cells initiate an inflammatory response that causes intense 
pain. The mechanism of pain alleviation by colchicine appears to 
involve inhibition of white cell movement in tissues. Interestingly, 
colchicine’s ability to inhibit mitosis has given it an important role 
in the commercial development of new varieties of agricultural 
and ornamental plants. When mitosis is blocked by colchicine, the 
treated cells may be left with an extra set of chromosomes. Plants 
with extra sets of chromosomes are typically larger and more vigor- 
ous than normal plants. Flowers developed in this way may grow 
with double the normal number of petals, and fruits may produce 
much larger amounts of sugar. 

Another class of alkaloids, the vinca alkaloids from Vinca rosea, 
the Madagascar periwinkle, can also bind to tubulin and inhibit 
microtubule polymerization. Vinblastine and vincristine are used 
as potent agents for cancer chemotherapy because of their ability 
to inhibit the growth of fast-growing tumor cells. Colchicine is not 
an effective chemotherapeutic agent, although it appears to act 
similarly to the vinca alkaloids in inhibiting tubulin polymerization. 

The antitumor drug taxol was originally isolated from the bark 
of Taxus brevifolia, the Pacific yew tree. Like vinblastine and 
colchicine, taxol inhibits cell replication by acting on microtu- 
bules. Unlike these other antimitotic drugs, however, taxol stimu- 
lates microtubule polymerization and stabilizes microtubules. The 
remarkable success of taxol in the treatment of breast and ovarian 
cancers stimulated research efforts to synthesize taxol directly and 
to identify new antimitotic agents that, like taxol, stimulate micro- 
tubule polymerization. 


Vinblastine: R = CH3 Il 
Vincristine: R = CHO o 


Colchicine O — CH; 


Taxol 


The structures of vinblastine, vincristine, colchicine, and taxol. 


body). Most kinesins, on the other hand, assist the movement of organelles and vesicles 
from the minus end to the plus end of microtubules, resulting in outward movement of 
organelles and vesicles (Figure 16.14). Certain unconventional kinesins move in the oppo- 
site direction, transporting cargo in the plus-to-minus direction on microtubules. These 
kinesins have their motor domain located at the C-terminus of the polypeptide. 


16.3d Cytoskeletal Motors Are Highly Processive 


The motors that move organelles and other cellular cargo on microtubules and actin 
filaments must be processive, meaning that they must make many steps along their 
cellular journey without letting go of their filamentous highway. Dyneins, nonskeletal 
myosins, and most kinesins are processive motors (Table 16.2). Motors in all these 
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Figure 16.14 Kinesins facilitate outward movement of organelles and vesicles. (a) Rapid axonal transport 
along microtubules permits the exchange of material between the synaptic terminal and the body of the 
nerve cell. (b) Vesicles, multivesicular bodies, and mitochondria are carried through the axon by this 
mechanism. (Adapted from a drawing by Ronald Vale.) 


Table 16.2 Processivity of Motor Proteins 


Distance Processivity 
Traveled (average 
Rate of Before numberof % Chance of 
Movement Step Size Dissociating steps before Dissociating 
Motor (nm/s) (nm) (nm) dissociating) in One Step 
Kinesin 1 600 8 800-1200 100-120 ea 
Myosin V 1000 36 700-2100 20—60 ~2-5 
Dynein (cytoplasmic) 600 24-32 1000 30-40 ~2-3 


classes can carry cargo over roughly similar distances (700 to 2100 nm) before disso- 
ciating.! The step size of kinesin 1 is smaller than those of myosin V and cytoplasmic 
dynein; thus, its overall processivity (the average number of steps made before disso- 
ciating) is necessarily higher. Moving at rates of 600 to 1000 nm/s, these motors can 
carry their cargoes for a second or more before dissociating from their filaments. 


‘Compare these distances with the dimensions of typical cells in Table 1.1. 
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16.3e ATP Binding and Hydrolysis Drive Hand-over-Hand 
Movement of Kinesin 


Kinesin movement along microtubules is driven by the cycle of ATP binding and 
hydrolysis. The molecular details are similar in some ways to those of the skeletal 
muscle myosin—actin motor but are quite different in other ways. Kinesins, like 
skeletal muscle myosin, contain domains that open and close in response to ATP 
binding and hydrolysis. Together these domains act as a y-phosphate sensor, which 
can detect the presence or absence of the y-phosphate of an adenine nucleotide in 
the active site. The conformation changes between the ATP-bound and the 
ADP-bound states thus induce conformation changes that are propagated through 
a relay helix to a neck linker that rotates, in ways similar to skeletal muscle myosin 
(Figure 16.15). Thus, just as in skeletal myosin, small movements of the y-phosphate 
sensor at the ATP site are translated into piston-like movement of a relay helix and 
then into rotations of the neck linker that result in motor movement. 

Here the kinesin and myosin models diverge. The dimer of kinesin heavy chains trans- 
lates these ATP-induced conformation changes into a hand-over-hand movement of its 
motor domain heads along the microtubule filament, in a movement akin to a judo 
expert throwing an opponent with a forward swing of the arm. 
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Figure 16.15 Ribbon structures of the myosin and 
kinesin motor domains and the conformational 
changes triggered by the relay helix. (a) The upper 
panels represent the motor domains of myosin and 
kinesin, respectively, in the ATP- or ADP-P;-like 
state. Similar structural elements in the catalytic 
cores of the two domains are shown in blue, the 
relay helices are dark green, and the mechanical 
elements (neck linker for kinesin, lever arm domains 
for myosin) are yellow. The nucleotide is shown as a 
white space-filling model. (b) The similarity of the 
conformation changes caused by the relay helix in 
going from the ATP/ADP-P;-bound state to the 
ADP-bound or nucleotide-free state is shown in the 
lower panels. In both cases, the mechanical elements 
of the protein shift their positions in response to 
relay helix motion. Note that the direction of 
mechanical element motion is nearly perpendicular 
to the relay helix motion. (Adapted from Vale, R. 
D., and Milligan, R. A., 2000. The way things move: 
Looking under the hood of molecular motor 
proteins. Science 288:88-95.) 
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Kinesin 


Figure 16.16 A model for the motility cycle of 
kinesin. The two heads of the kinesin dimer work 
together to move processively along a microtubule. 
Frame 1: Each kinesin head is bound to the tubulin 
surface. The heads are connected to the coiled coil by 
“neck linker” segments (one shown in orange/yellow, 
and the other in red). Frame 2: Conformation changes 
in the neck linkers flip the trailing head by 160°, over 
and beyond the leading head and toward the next 
tubulin binding site. Frame 3: The new leading head 
binds to a new site on the tubulin surface (with ADP 
dissociation), completing an 80 A movement of the 
coiled coil and the kinesin’s cargo. During this time, 
the trailing head hydrolyzes ATP to ADP and P;. 
Frame 4: ATP binds to the leading head, and P; 
dissociates from the trailing head, completing the 
cycle. (Adapted from Vale, R., and Milligan, R., 
2000. The way things move: Looking under the hood 
of molecular motor proteins. Science 288:88-95.) 


A model of kinesin motor movement is shown in Figure 16.16. Kinesin heads in 
solution (that is, not attached to a microtubule) contain tightly bound ADP. Bind- 
ing of one head of a kinesin multimer to a microtubule causes dissociation of ADP 
from that head. ATP binds rapidly, triggering the neck linker to rotate or ratchet 
forward, throwing the second head forward as well and bringing it near the next 
binding site on the microtubule, 8 nm farther along the filament. The trailing head 
then hydrolyzes ATP and releases inorganic phosphate (but retains ADP), inducing 
its neck linker to return to its original orientation relative to the head. Exchange of 
ADP for ATP on the forward head begins the cycle again. The structure of the 
kinesin—microtubule complex (Figure 16.17) shows kinesin in intimate contact with 
the microtubule at the junction of the a- and B-subunits of a tubulin dimer. 


16.3f Conformation Changes in a Hexameric Ring 
Drive the Dynein Motor 


The movement of myosin motors on cytoskeletal actin filaments is presumed to 
be similar to the myosin—actin interaction in skeletal muscle. The dynein motor, by 
contrast, uses a hexameric ring of AAA+ modules and an associated coiled-coil stalk 
(see Figure 16.13) to transport cellular cargo via a different motor mechanism. 
ATP-dependent conformation changes in the ring of AAA+ modules must be trans- 
lated into movements of the tip of the coiled-coil stalk along a microtubule. A pro- 
posed mechanism for dynein movement (Figure 16.18) suggests that the events of ATP 
binding and hydrolysis and ADP and P; release at an AAA+ module swing a linker 
that joins the AAA+ domain and the dynein tail. 


16.4 | How Do Molecular Motors Unwind DNA? 


The ability of proteins to move in controlled ways along nucleic acid chains is impor- 
tant to many biological processes. For example, when DNA is to be replicated, the 
strands of the double helix must be unwound and separated to expose single-stranded 
DNA templates. Similarly, histone octamers (Figure 11.29) slide along DNA strands in 
chromatin remodeling, Holliday junctions move, and nucleic acids move in and out of 
viral capsids. The motor proteins that move directionally along nucleic acid strands 
and accomplish these many functions are called translocases. The translocases that 
unwind DNA or RNA duplex substrates are termed helicases. Thus, all helicases are 
translocases, but not vice versa. 


Figure 16.17 In the kinesin—microtubule complex, the switch 2 helix (yellow) of kinesin (left) lies in 
contact with the microtubule (right) at the subunit interface of a tubulin dimer (PDB id = 2HXH). 
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Figure 16.18 Structure and mechanism of the dynein motor. (a) Molecular 
graphic model of dynein bound to a microtubule (courtesy of James 
Spudich). A mechanism for the dynein power stroke involves conformation 
changes in the head domain (b) that facilitate movement of the stalk along 
a microtubule (c). ATP binding to the motor domain promotes 
dissociation of dynein from the microtubule. Hydrolysis of ATP causes a 
conformation change that primes the structure for a power stroke. 
Microtubule movement is initiated by tight binding to the tip of the stalk, 
which promotes a conformation change in the head ring (the power 
stroke). Release of ADP and P; from the catalytic site causes tilting of the 
stalk at the end of the cycle. (Adapted from Oiwa, K., and Sakakibara, 

H., 2005. Recent progress in dynein structure and mechanism. Current 
Opinion in Cell Biology 17:98-103.) 


Rebinding 


All translocases and helicases are members of six protein superfamilies (Table 16.3 
and Figure 16.19), all of them related evolutionarily to RecA, a DNA-binding protein 
(See A Deeper Look box “The P-Loop NTPases—Energy to Run the Motors” in 
Section 16.2). Superfamily 1 (SF1) and superfamily 2 (SF2) helicases are monomeric and 
consist of two RecA domains in a tandem repeat. SF1 and SF2 helicases participate in 
several diverse DNA and RNA manipulations. The other helicases (SF3—SF6), which 
often participate at the DNA replication fork, form hexameric rings, each monomer 
built from a single RecA domain. Each superfamily possesses characteristic conserved 
residues and sequence elements (Table 16.3), most of which are shared between several 
superfamilies. All members of a given superfamily move in the same direction on a 
DNA or RNA template (either 5’ to 3’ or 3’ to 5’). In the hexameric helicases, the ATP 
site consists of elements derived from adjacent monomers in the complex. The hexa- 
meric motor proteins of the SF3 and SF6 superfamilies are members of the ancient 
AAA+ ATPase family. (AAA stands for ATPases associated with various cellular activ- 
ities, and the + sign refers to an expanded definition of the family characteristics.) 

Translocases and helicases, like other molecular motors, require energy for their 
function. The energy for movement along a nucleic acid strand, as well as for separa- 
tion of the strands of a duplex (DNA or RNA), is provided by hydrolysis of ATP. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


560 Chapter 16 Molecular Motors 


Fae Mee Helicase SO O 
Quaternary Direction of Representative 
Superfamily Structure Movement Motor 
SFIA Monomeric A* Rep 
SF1B B RecD 
SF2 Monomeric AorB NS3 
SF3 Hexameric A BPV El 
SF4 Hexameric B T7 gp4 
SF5 Hexameric B Rho 
SF6 Hexameric AorB FtsK 
MCM 


*A: helicase moves 3'—>5' on nucleic acid. B: helicase moves 5’—3’. 

Adapted from table in Singleton, M. R., Dillingham, M. S., and Wigley, D. B., 2007. 
Structure and mechanism of helicases and nucleic acid translocases. Annual Review 
of Biochemistry 76:23-50. 


(a) (b) 
Active 
unwinding 
Translocation 
3 ATP 3 3’ ADP +P, 3 
PoS PSD PoSo PoS 
(I) (II) (IIT) q’) 


Figure 16.19 Structures of translocase and helicase motors. (a) Translocase and helicase 
motors of SF1 and SF2 are monomers that consist of two RecA domains in a tandem repeat 
(PDB id = 1QHG). (b) Motor peptides of SF3 through SF6 associate to form hexamers (as 


shown) or dodecamers (PDB id = 1CR0). 


Translocases and helicases move on nucleic acid strands at rates of a few base pairs to 
several thousand base pairs per second. These movements are carefully regulated by 
accessory proteins in nearly all cases. Translocases and helicases typically move along 
the DNA or RNA lattice for long distances without dissociating. This is termed proces- 
sive movement, and helicases are said to have a high processivity. For example, the 
E. coli BCD helicase, which is involved in recombination processes, can unwind 33,000 
base pairs before it dissociates from the DNA lattice. Processive movement is essential 
for helicases involved in DNA replication, where millions of base pairs must be repli- 
cated rapidly. 

Helicases have evolved at least two structural and functional strategies for achieving 
high processivity. The hexameric helicases (of the SF3 through SF6 superfamilies) 
form ringlike structures that completely encircle at least one of the strands of a DNA 
duplex. The SF1 and SF2 helicases, notably Rep helicase from E. coli, are monomeric 
or homodimeric and move processively along the DNA helix by means of a 
hand-over-hand movement that is remarkably similar to that of kinesin’s movement 
along microtubules. A key feature of hand-over-hand movement of a dimeric motor 
protein along a polymer, whether a microtubule or a DNA strand, is that at least one 
of the motor subunits must be bound to the polymer at any moment. 
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16.4a Negative Cooperativity Facilitates 
Hand-over-Hand Movement 


How does hand-over-hand movement of a motor protein along a polymer occur? 
Clues have come from the structures of Rep helicase and its complexes with DNA. 
The Rep helicase from E. coli is a 76 kDa protein that is monomeric in the absence 
of DNA. Binding of Rep helicase to either single-stranded or double-stranded DNA 
induces dimerization, and the Rep dimer is the active species in unwinding DNA. 
Each subunit of the Rep dimer can bind either single-stranded (ss) or double-stranded 
(ds) DNA. However, the binding of Rep dimer subunits to DNA is negatively coop- 
erative. Once the first Rep subunit is bound, the affinity of DNA for the second 
subunit is at least 10,000 times weaker than that for the first! This negative coopera- 
tivity provides an obvious advantage for hand-over-hand walking. When one hand 
has bound the polymer substrate, the other hand releases. A conformation change 
could then move the unbound hand one step farther along the polymer where it can 
bind again. 

But what would provide the energy for such a conformation change? ATP hydrolysis 
is the driving force for Rep helicase movement along DNA, and the negative cooperativ- 
ity of Rep binding to DNA is regulated by nucleotide binding. In the absence of nucleo- 
tide, a Rep dimer is favored, in which only one subunit is bound to ssDNA. In 
Figure 16.20a, this state is represented as PS [a Rep dimer (P2) bound to ssDNA (S)]. 
Binding ATP analogs induces formation of a complex of the Rep dimer with both 
ssDNA and dsDNA, one to each Rep subunit (shown as PSD in Figure 16.20a). In 
their model, unwinding of the dsDNA and ATP hydrolysis occur at this point, leaving 
a P-S, state in which both Rep subunits are bound to ssDNA. Dissociation of ADP and 
P; leave the PS state again (Figure 16.20a). 

Work by Lohman and his colleagues has shown that coupling of ATP hydrolysis and 
hand-over-hand movement of Rep over the DNA involves the existence of the Rep 
dimer in an asymmetric state. A crystal structure of the Rep dimer in complex with 
ssDNA and ADP shows that the two Rep monomers are in different conformations 
(Figure 16.20b). The two conformations differ by a 130° rotation about a hinge region 
between two subdomains within the monomer subunit. The hand-over-hand walking of 
the Rep dimer along the DNA surface may involve alternation of each subunit between 
these two conformations, with coordination of the movements by nucleotide binding 


and hydrolysis. 
(a) (b) 
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Figure 16.20 Rep helicase mechanism and structure. (a) A hand-over-hand model for 
movement along (and unwinding of) DNA by E. coli Rep helicase. The PS state consists of 
a Rep dimer bound to ssDNA. The PSD state involves one Rep monomer bound to ssDNA 
and the other bound to dsDNA. The PS; state has ssDNA bound to each Rep monomer. 
ATP binding and hydrolysis control the interconversion of these states and walking along the 
DNA substrate. (b) Crystal structure of the E. coli Rep helicase monomer with bound 
ssDNA (dark blue, ball and stick) and ADP (red). The monomer consists of four domains 
designated 1A (residues 1-84 and 196-276), 1B (residues 85-195), 2A (residues 277-373 and 
543-670), and 2B (residues 374-542). The open (purple) and closed (green) conformations of 
the 2B domain are superimposed in this figure (PDB id = 1UAA). (From Korolev, S., Hsieh, 
J., Gauss, G., Lohman, T. L., and Waksman, G., 1997. Major domain swiveling revealed by 
the crystal structures of complexes of E. coli Rep helicase bound to single-stranded DNA 
and ADP. Cell 90:635—647. Reprinted by permission of Cell Press.) 
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(b) 


Figure 16.21 Structure of the papillomavirus E1 
helicase. (a) The papillomavirus E1 protein is a 
605-residue monomer that forms hexameric 
assemblies at specific sites on single-stranded DNA 
(PDB id = 2V9P). (b) The C-terminal helicase 
domain shown here includes an oligomerization 
domain (magenta) and the AAA+ domain (blue). 


16.4b Papillomavirus E1 Helicase Moves along DNA 
as on a Spiral Staircase 


Papillomaviruses are tumor viruses that cause both cancerous and benign lesions in 
a host. Replication of papillomaviral DNA within a host cell requires the multifunc- 
tional 605-residue viral E/ protein. Monomers of El assemble at a replication origin 
on DNA and form a pair of hexamers that wrap around a single strand of DNA. 
These assemblies are helicases that operate bidirectionally in the replication of viral 
DNA. The N-terminal half of the El protein includes a regulatory domain and a 
sequence-specific DNA-binding domain, whereas the helicase activity is located in 
the C-terminal half of the protein. The C-terminal helicase domain (Figure 16.21) 
includes a segment involved in oligomer formation (residues 300 to 378) and an 
AAA+ domain (residues 379 to 605). 

AAA+ domains are found in proteins of many functions, including motor activity 
by dyneins (see Section 16.3) and helicases, protein degradation by proteasomes (see 
Chapter 31), and disassembly of SNARE complexes (see Chapter 9). This ubiquitous 
module consists of two subdomains: an N-terminal nucleotide-binding segment known 
as an a/8 Rossman fold, and a C-terminal a-helical domain (Figure 16.22). The Ross- 
man fold is wedge-shaped and has a B-sheet of parallel strands in a B5-81-84-B3-62 
pattern. Key features of this fold include a Lys residue in the Walker A motif, an Asp- 
Asp or Asp-Glu pair in the Walker B motif, and a crucial Arg residue in a structure 
called an arginine finger. These three motifs are essential for ATP binding and hydroly- 
sis. In an AAA+ hexamer, the ATP-binding sites lie at the interface between any two 
subunits, involving the arginine finger of any given subunit and the Walker A and 
Walker B motifs of the adjacent subunit. 

The structure of a large fragment (residues 306 to 577) of the papillomavirus E1 pro- 
tein bound to a segment of ssDNA (Figure 16.23) reveals the remarkable mechanism by 
which this helicase traverses a DNA chain. The oligomerization domains form a sym- 
metric hexamer, but the six AAA+ domains each display a unique conformation. The 
DNA strand is bound in the center pore of the AAA+ hexamer, with six nucleotides of 
the DNA chain each bound to residues from each of the protein subunits. The crucial 
nucleotide-binding residues include Lys*”° and His*”’ on a hairpin loop and Phe*™, all of 
which face the center pore of the protein (Figure 16.24). Lys* interacts with one DNA 
phosphate oxygen, and His% forms a hydrogen bond with the phosphate of an adjacent 


Walker B 


Figure 16.22 The AAA+ domain is composed of an N-terminal, wedge-shaped Rossman fold and a 
C-terminal a-helical domain (upper left). The P-loop (red), the Walker B motif (yellow), and the 
arginine finger (blue) are shown. The ATP-binding sites lie between subunits of the hexamer. Each 
ATP site includes the arginine finger of one subunit and the Walker A and Walker B motifs of the 
adjacent subunit (PDB id = 1D2N). 
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nucleotide in the DNA chain. The aliphatic portion of Lys% and the aromatic groups 
of Phe** and His” share van der Waals interactions with the DNA sugar moiety link- 
ing these two phosphates. The hairpin loops of the six protein subunits form a spiral 
staircase, following the ssDNA as it threads through the central pore of the hexamer 
(Figure 16.25). 

A central hairpin loop from one of the AAA+ subunits coordinates each DNA 
nucleotide as it enters the helicase pore. Then, as each AAA+ domain proceeds 
through the intermediate states of ATP binding and hydrolysis, its hairpin loop steps 
down through the six conformations of the staircase, maintaining continuous contact 
with its nucleotide, as it escorts it through the pore, finally releasing the nucleotide as 
it exits the pore. Following release, the hairpin moves back to the top of the staircase, 
picks up the next available nucleotide, and begins another journey down the staircase. 
For one full cycle of the hexamer, each subunit hydrolyzes one ATP, releases one 
ADP, and translocates one nucleotide through the central pore. A full cycle thus 
translocates six nucleotides with associated hydrolysis of six ATPs and release of 
six ADPs. 


563 


Figure 16.23 A view of the El helicase along the 
ssDNA axis, showing the DNA-binding hairpin 
loops from each monomer (colored) interacting 
with the phosphates of DNA. DNA is shown as 
a ball-and-stick model in the center of the 
structure (PDB id = 2GXA). 


Figure 16.24 The hairpin loops of each subunit in 
the El helicase interact with two adjacent nucleotides 
in the DNA chain. Interactions include an ionic 
bond between Lys*” (yellow) and a DNA phosphate 
oxygen, a hydrogen bond between His” and the 
phosphate of an adjacent nucleotide, and van der 
Waals interactions between the aromatic rings of 
Phe*® (purple) and His>” (olive) and the aliphatic 
chain of Lys*™%, with the sugar linking the two 
phosphates (PDB id = 2GXA). 
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Figure 16.25 The hairpin loops of the E1 helicase hexamer are arranged in a spiral staircase 
that winds around the DNA strand. (a) Axial view (PDB id = 2GXA) and (b) the 
coordinated escort mechanism for DNA translocation by sequential ATP hydrolysis. As the 
helicase moves along the strand, the hairpin loop of one protein monomer binds each 
nucleotide as it enters the central cavity of the helicase. The loop adopts six conformations as 
the helicase moves along the DNA, preserving the loop—nucleotide interaction until the 
nucleotide exits the cavity. The released protein loop then returns to the other end of the 
cavity to bind a new, incoming nucleotide. Every sixth nucleotide of DNA is colored black 
and is escorted by the purple hairpin. Courtesy of Leemor Joshua-Tor. 


16.5 | How Do Bacterial Flagella Use a Proton 
Gradient to Drive Rotation? 


Bacterial cells swim and move by rotating their flagella using a family of motor pro- 
teins that are very different from those already described in this chapter. The flagella 
of E. coli are helical filaments up to 15,000 nm (15 um) in length and 15 nm in diam- 
eter. The direction of rotation of these filaments affects the movements of the cell. 
When the half-dozen flagella on the surface of the bacterial cell rotate in a counter- 
clockwise (CCW) direction, they twist and bundle together in a left-handed helical 
structure and rotate in a concerted fashion, propelling the cell through the medium. 
Every few seconds, the flagellar motor reverses, the helical bundle of filaments (now 
turning clockwise, or CW) unwinds into a jumble, and the bacterium somersaults or 
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tumbles. Alternating between CCW and CW rotations, the bacterium can move 
toward food sources, such as amino acids and sugars. 

The rotations of bacterial flagellar filaments are the result of the rotation of motor 
protein complexes in the bacterial plasma membrane. 


16.5a The Flagellar Rotor Is a Complex Structure 


The flagellum is built from at least 25 proteins and comprises three parts: a rotary 
motor anchored in the bacterial inner membrane, a long filament that serves as a helical 
propellor, and a hook that functions as a universal joint that connects the motor with 
the filament (Figure 16.26). The long filament is composed of many copies of a globular 
protein called flagellin. The rotary motor includes several rings of protein subunits, 
including the C ring, the MS ring, the P ring, and the L ring. The MS ring is built from 
26 copies of the protein FIIF. The C ring is attached to the MS ring and includes three 
rotor proteins—FliG, FliM, and FliN—involved in rotation of the motor. The C ring 
includes 26 copies of FliG, 34 copies of FliM, and 34 X 4 = 136 copies of FIiN. The 
stationary portion of the motor—the stator—is formed from the proteins motA and 
motB. Eight motA,—motB, complexes are embedded in the bacterial inner membrane 
around the MS ring. 


16.5b Gradients of H* and Nat Drive Flagellar Rotors 


What energy source drives the flagellar motor? Gradients of protons and Na* ions 
exist across bacterial inner membranes, typically with more H* and Na* outside the 
cell. In E. coli, spontaneous inward flow of protons through the motA—motB com- 
plexes drives the rotation of the motor (Figure 16.27). In Vibrio cholerae, inward 
Na* ion flow powers the motor. Flagellar motors are thus energy conversion devices. 
In E. coli, each motA—motB complex passes 70 H* per revolution of the motor. With 
a full complement of eight motA—motB complexes, a motor conducts about 560 
protons per revolution. The H*-driven flagellar rotors reach top rotational speeds 
of about 360 Hz (corresponding to 21,600 rpm). Thus, the overall rate of proton flow 
for the motor is approximately 200,000 H*/sec! Flagellar motors driven by Na* ions 
are even faster, with rotational rates of 1700 Hz (100,000 rpm) observed in 
V. cholerae. 

The motA—motB complexes work with FliG in the C ring to transfer protons across 
the membrane. FliG contains 335 residues, and most of the FliG protein structure (resi- 
dues 104 to 335) consists of two compact domains joined by an a-helix (Figure 16.27). 
A ridge on the C-terminal domain contains five charged residues that interact with 
motA and are important for motor rotation. Asp” of motB is essential for rotation of 
the motor and is probably involved in proton transfer. Two membrane channels from 
the transmembrane segments of the motA4—-motB, complex also participate in proton 
movement. Each encounter of a motA—motB complex with a FliG subunit as the motor 
turns results in movement of one proton through each of these channels. The passage 
of about 70 H* through each motA—motB complex in one revolution of the motor 
(which would involve encounters with 34 FliG subunits) is consistent with this sugges- 
tion (70/34 = ~2). 


16.5c The Flagellar Rotor Self-Assembles in a 
Spontaneous Process 


Flagellar rotors are true masterpieces of biological self-assembly. The ring of FliF 
subunits, within the MS ring, is the first to assemble in the plasma membrane. Other 
proteins then attach to this ring one after another, from the base to the tip, to construct 
the motor structure. Once the motor has formed, the flexible hook and the flagellar 
filament are assembled. Precise recognition of the existing template structure allows 
this highly ordered self-assembly process to proceed without error. The flagellar fila- 
ment is made from 20,000 to 30,000 copies of flagellin polymerized into a hollow 
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helical tube structure. Each turn of the helical filament contains about 5000 flagellin 
subunits and is about 2300 nm long. A complete flagellum can have up to six full 
helical turns. Flagellin molecules are transported through the long, narrow, central 
channel of the motor and flagellum from the cell interior to the far end of the flagel- 
lum, where they self-assemble with the help of a pentameric complex of FliD, a capping 
protein (see Figure 16.26). The FliD complex has a plate and five leg domains. It rotates 
in a stepping fashion at the end of the filament, exposing one binding site at a time and 
guiding the binding of newly arriving flagellin molecules in a helical pattern. 


16.5d Flagellar Filaments Are Composed of Protofilaments of Flagellin 


Each cylindrical flagellar filament is composed of 11 fibrils or protofilaments (component 
filaments) that form the cylinder, with each fibril lying at a slight tilt to the cylinder axis 
(Figure 16.28a). An end-on view of the filament shows 11 subunits, each representing 
the end of a protofilament (Figure 16.28b). The flagellin protein of Salmonella 
typhimurium contains 494 residues and consists of four domains, denoted DO 
through D3 (Figure 16.28c). DO and D1 are composed of a-helices, whereas D2 and 
D3 consist primarily of B-strands. The N-terminus of the peptide chain lies at the 
base of DO. The peptide runs from DO to D3 and then reverses and returns to DO, 
where the N- and C-termini are juxtaposed. The structure resembles a Greek capital 
gamma (I), with a height of 140 A and a width of 110 A. Each flagellin protein is 
arranged with DO inside the filament and D3 facing the outside. The central pore, 
20 A in diameter, is lined by the a-helices of DO. 


16.5e Motor Reversal Involves Conformation Switching 
of Motor and Filament Proteins 


The flagellar motor reverses direction every few seconds so that the bacterium can 
change its swimming direction to seek better environments. Motor reversal involves 
conformation changes both in motor proteins and also in the filament itself. In the 
motor structure, the rotor proteins FliG, FliM, and FIiN work together to control 
direction changes of the motor, and they are known collectively as the switch complex. 
FIiN appears to lie at the base of the C ring, FliG lies at the top of the C ring, and 
FliM resides in the middle, contacting both FliN and FliG (see Figure 16.26). 

Reversal of the flagellar motor causes the long filament to switch from a left-handed 
(L) helical structure to a right-handed (R) helical form. This makes the bundle of fla- 
gella fall apart, causing the bacterium to tumble. This left-to-right switch in the filament 
is caused by a conformational change that occurs in the flagellin subunits in some pro- 
tofilaments. Interestingly, the driving force for these conformation changes is probably 
the torque applied to DO and D1 of flagellin subunits along the filament when the 
motor itself reverses. 


Figure 16.26 A model of the E. coli flagellar motor. 
The motor is anchored by interactions of stationary 
motA and motB proteins in the M and S rings with 
the inner membrane. Spontaneous flow of protons 
through the motA—motB complexes and into the 
cell drives the rotation of the motor. Flow rates of 
200,000 protons per second drive the motor at 
speeds approaching 22,000 rpm. (Adapted from 
Thomas, D. R., Morgan, D. G., Francis, N. R., and 
DeRosier, D. J., 2007. Bit by bit The structure of 
the complete flagellar hook/basal body complex. 
Microscopy and Microanalysis 13:34-35. Image 
provided by David J. DeRosier, Brandeis 
University.) 
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Figure 16.27 Interactions between the stationary 
motA—motB complexes and the rotating FliG ring 
drive the flagellar motor. Proton flow through the 
motA—motB complexes is presumably coupled to 
conformation changes that alter ionic interactions 
between charged residues at the motA—motB and 
FliG interface, driving rotation of the FliG ring. 
Other conserved features include a hydrophobic 
patch (light green), a Gly-Gly motif (purple), and 
a EHPQR motif (blue, in the middle domain 
(PDB id = ILKV). 
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Figure 16.28 The E. coli flagellum is composed of 11 protofilaments that run the length of the 
flagellar filament. The filament is shown in (a) cross section and (b) perpendicular to the filament. 
The protofilaments are long polymers of the flagellin protein (c), which consists of two a-helical 
domains (D0 and D1) that lie at a slight tilt to the filament axis and two B-sheet domains (D2 and 
D3) that extend outward from the filament. Flagellin is shown in two conformation forms, R (left; 
PDB id = 1UCU) and L (right; PDB id = 3A5X). (Parts (a) and (b) courtesy of Keiichi Namba, 
Osaka University, Japan.) 
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Chapter 16 Molecular Motors 


Foundational Biochemistry Things You Should Know after Reading Chapter 16. 


Molecular motors convert the chemical energy of ATP to the me- 
chanical energy of motion. 


Linear motors crawl or creep along a polymer lattice. 


Rotating motors consist of a rotating element (the rotor) and a sta- 
tionary element (the stator). 


The cells of skeletal muscle are long and multinucleate and are re- 
ferred to as muscle fibers. 


Release of Ca** ions from the sarcoplasmic reticulum triggers muscle 
contraction. 


The thin filaments of skeletal muscle are composed of actin mono- 
mers associated in long polymers. 


The thick filaments of skeletal muscle are composed primarily of 
myosin. 


The heavy chain of myosin is comprised of a large globular head and 
a long tail. The tails of two heavy chains intertwine to form a coiled 
coil. 


The P-loop NTPases harness the energy of ATP to accomplish a vari- 
ety of cellular functions including those of molecular motors. 


The shortening of sarcomeres involves sliding motions in opposing 
directions at the two ends of a myosin thick filament. 


ATP lowers the viscosity of solutions of myosin and actin, indicating 
that ATP decreases the affinity of myosin for actin. 


ATP hydrolysis drives conformation changes in the myosin heads that 
mediate muscle contraction. 


The cytoskeleton is a dynamic, three-dimensional structure that 
establishes cell shape, provides mechanical strength, facilitates cell 
movement, supports intracellular transport and guides chromosome 
separation during mitosis and meiosis. 


The cytoskeleton is comprised of microfilaments of actin, microtu- 
bules of tubulin, and intermediate filaments. 


Tubulin is an œf dimeric protein that polymerizes to form microtu- 
bules. 


Microtubules in vitro undergo treadmilling, adding af dimers on the 
plus end and removing dimers on the minus end. 


Myosins, kinesins, and dyneins are the motor proteins that transport 
cellular cargo. 


Kinesin | (conventional kinesin) is a tetramer consisting of two heavy 
chains and two light chains. 


Cytoplasmic dynein consists of two heavy chains and several light chains. 


Dyneins move organelles in a plus-to-minus direction, from the cell 
periphery to the centrosome. 


Kinesins move organelles in a minus-to-plus direction, resulting in 
outward movement of organelles and vesicles. 


Cytoskeletal motors are highly processive. 


ATP binding and hydrolysis drives hand-over-hand movement of 
kinesin. 


ATP hydrolysis drives conformation changes that facilitate movement 
of the dynein stalk along microtubules. 


Helicases are translocases that unwind DNA and RNA duplex sub- 
strates, either 3’ — 5’ or 5' > 3’. 


Negative cooperativity facilitates hand-over-hand movement of Rep 
helicase along DNA. 


The hairpin loops of papillomavirus are arranged in a spiral staircase 
that winds around a DNA strand. 


Rotation of flagellar rotors of many bacteria are driven by spontane- 
ous proton flow across the membrane. 


Flow rates of 200,000 protons per second drive the flagellar motor 
at speeds approaching 22,000 rpm. Reversal of the flagellar motor 
causes the long filament to switch from a left-handed helical structure 
to a right-handed helical form. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


Assessing the Performance Consequences of Skeletal Muscle Anatomy 
The cheetah is generally regarded as nature’s fastest mammal, but 
another amazing athlete in the animal kingdom (and almost as fast as 
the cheetah) is the pronghorn antelope, which roams the plains of 
Wyoming. Whereas the cheetah can maintain its top speed of 70 mph 
for only a few seconds, the pronghorn antelope can run at 60 mph for 
about an hour! (It is thought to have evolved to do so in order to 
elude now-extinct ancestral cheetahs that lived in North America.) 
What differences would you expect in the muscle structure and 
anatomy of pronghorn antelopes that could account for their 
remarkable speed and endurance? 


Analyzing the Action of a Muscle Contraction Inhibitor An ATP 
analog, B,y-methylene-ATP, in which a —CH,— group replaces the 
oxygen atom between the B- and y-phosphorus atoms, is a potent 
inhibitor of muscle contraction. At which step in the contraction 
cycle would you expect B,y-methylene-ATP to block contraction? 


Understanding the Role of Phosphocreatine ATP stores in muscle are 
augmented or supplemented by stores of phosphocreatine. During 
periods of contraction, phosphocreatine is hydrolyzed to drive the 
synthesis of needed ATP in the creatine kinase reaction: 


Phosphocreatine + ADP —> creatine + ATP 


Muscle cells contain two different isozymes of creatine kinase, one in 
the mitochondria and one in the sarcoplasm. Explain. 


Understanding the Biochemistry of Rigor Mortis Rigor is a muscle 
condition in which muscle fibers, depleted of ATP and phosphocreatine, 
develop a state of extreme rigidity and cannot be easily extended. 

(In death, this state is called rigor mortis, the rigor of death.) From 

what you have learned about muscle contraction, explain the state of 
rigor in molecular terms. 


. Assessing the Contraction Potential of Human Muscle Skeletal 


muscle can generate approximately 3 to 4 kg of tension or force per 
square centimeter of cross-sectional area. This number is roughly the 
same for all mammals. Because many human muscles have large 
cross-sectional areas, the force that these muscles can (and must) 
generate is prodigious. The gluteus maximus (on which you are 
probably sitting as you read this) can generate a tension of 1200 kg! 
Estimate the cross-sectional area of all of the muscles in your body 
and the total force that your skeletal muscles could generate if they 
all contracted at once. 


Understanding the Structure of Tubulin Calculate a diameter for a 
tubulin monomer, assuming that the monomer MW is 55,000, that 
the monomer is spherical, and that the density of the protein 
monomer is 1.3 g/mL. How does the number that you calculate 
compare to the dimension portrayed in Figure 16.12? 
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12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


. Structure of Tubulin Considering the function of tubulin inside cells, 


suggest an explanation for why it is built from two different subunits, 
a and £ (see Figure 16.12), and not from one type of subunit or from 
three or more different subunits. 


. Assessing the Tubulin Content of a Liver Cell Use the number you 


obtained in problem 6 to calculate how many tubulin monomers 
would be found in a microtubule that stretched across the length of a 
liver cell. (See Table 1.1 for the diameter of a liver cell.) 


. Understanding Axonal Vesicle Transport The giant axon of the squid 


may be up to 4 inches in length. Use the value cited in this chapter 
for the rate of movement of vesicles and organelles across axons to 
determine the time required for a vesicle to traverse the length of 
this axon. 


Axonal Vesicle Transport If an axon is 3.5 inches long, how much 
time will it take for a vesicle to be transported down its entire length? 
If a typical tubulin dimer has a length of 80 A, how many dimers are 
needed to cover this axonal length? 


Understanding the Molecular Structure of Myosin As noted in this 
chapter, the myosin molecules in thick filaments of muscle are offset 
by approximately 14 nm. How many residues of a coiled-coil 
structure are needed to span 14 nm? 


Structure of Myosin Revisit Figure 16.5a for a model of myosin. What 
is the function of LC 1? 


Myosin’s Coiled Coils Consider the peptide segments shown. Which 
sequence best fits the requirements for the coiled coil structure of 
myosin? Explain your answer. 

a) VEDWGRQ 

b) ADKVTEF 

c) DREACHW 

d) AREVNYI 


Assessing the Energetics of Sarcoplasmic Reticulum Calcium 
Transport Use the equations of Chapter 9 to determine the free 
energy difference represented by a Ca?* gradient across the sarco- 
plasmic reticulum membrane if the luminal (inside) concentration of 
Ca?* is 1 mM and the concentration of Ca?” in the solution bathing 
the muscle fibers is 1 uM. 


Understanding the Energetic Driving Force of the Ca-ATPase 

Use the equations of Chapter 3 to determine the free energy of 
hydrolysis of ATP by the sarcoplasmic reticulum Ca-ATPase if the 
concentration of ATP is 3 mM, the concentration of ADP is 1 mM, 
and the concentration of P; is 2 mM. 


Energetics of ATPase Activity (Integrates with Chapter 3) Make a 
graph relating the free energy (AG in kJ/mol) for the hydrolysis of 
ATP at differing ratios of [ADP] to [ATP] (ranging from 0.01 to 10) if 
the concentration of phosphate [Pi] is constant at 20 mM and AG? for 
the hydrolysis is —35.7 kJ/mol (see Table 3.1). 


Understanding the Stoichiometry of Sarcoplasmic Reticulum Calcium 
Transport Under the conditions described in problems 15 and 16. 
What is the maximum number of Ca’* ions that could be transported 
per ATP hydrolyzed by the Ca-ATPase? 


Understanding the Energetics of Motor Proteins For each of the 
motor proteins in Table 16.2, calculate the force exerted over the step 
size given, assuming that the free energy of hydrolysis of ATP under 
cellular conditions is — 50 kJ/mol. 


Relating Myosin Function and Stoichiometry to Weight Lifting When 
you go to the gym to work out, you not only exercise many muscles but 
also involve many myosins (and actins) in any given exercise activity. 
Suppose you lift a 10 kg weight a total distance of 0.4 m. Using the 
data in Table 16.2 for myosin, calculate the minimum number of 
myosin heads required to lift this weight and the number of sliding 
steps these myosins must make along their associated actin filaments. 


20. 


21. 


22. 


23. 


24. 


25% 
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Problems 


Understanding Human Smooth Muscle In which of the following 
tissues would you expect to find smooth muscle? 


a. arteries 

b. stomach 

c. urinary bladder 
d. diaphragm 

e. uterus 

f. 


The gums in your mouth. 


Analyzing the Mechanism of Muscle Contraction When an action 
potential (nerve impulse) arrives at a muscle membrane (sarcolemma), 
in what order do the following events occur? 


a. Release of Ca?* ions from the sarcoplasmic reticulum. 
Hydrolysis of ATP, with release of energy. 
Detachment of myosin from actin. 

Sliding of myosin along actin filament. 


eno Ss 


Opening of switch 1 and switch 2 on myosin head. 


Essay Question on the Functional Relevance of Molecular Motors 

(Essay question.) You are invited by the National Science Foundation 
to attend a scientific meeting to set the agenda for funding of basic 
research related to molecular motors for the next 10 years. Only basic 
research will be funded, ruling out studies on human subjects. You are 
asked to suggest the research area most worthy of scientific research. 
Your presentation must include (1) a brief background on what we 
currently know about the subject; (2) identification of a key research 
topic about which more needs to be known; and (3) a justification of 
why additional knowledge in this area is critical for advancing the field 
(that is, why investigations in this area are especially important). You 
are not being asked to provide the methods or experiments that might 
be used to address the problem—only the concept. Base your presenta- 
tion on what you have learned in this chapter (you may consult and 
include references from the Further Reading section), and limit your 
presentation to 300 words. 


Movement of Kinesin Consult Figure 16.16 and use the data in 
problem 9 to determine how many steps a kinesin motor must take to 
traverse the length of the squid giant axon. 


Biochemistry of Muscle Cramps When athletes overexert themselves 

on hot days, they often suffer immobility from painful muscle 

cramps. Which of the following is a reasonable hypothesis to explain 

such cramps? 

a. Muscle cells do not have enough ATP for normal muscle 
relaxation. 

b. Excessive sweating has affected the salt balance within the 
muscles. 

c. Prolonged contractions have temporarily interrupted blood flow 
to parts of the muscle. 


d. All of the above. 
Genetics of Duchenne Muscular Dystrophy Duchenne muscular 


dystrophy is a sex-linked recessive disorder associated with severe 
deterioration of muscle tissue. The gene for the disease: 


a. is inherited by males from their mothers. 

b. should be more common in females than in males. 
c. both aand b. 

d. neither a nor b. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at 
http://www.psafeonline.net.) 


26. 


Examine the PSAFE model for myosin II, and explain the function of 
each subunit. 
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27. Using the PSAFE model for the kinesin protein K1F1A, identify the 
function of switch 1, and explain why K1F1A cannot move cargo 
when switch | is bound to AMPPNP. 


28. Using the PSAFE model for E1 helicase, describe the features of 
AAA+ domains. 


Think-Pair-Share Question 


Motor Proteins on Polysaccharides This chapter discussed the movement 
of motor proteins over protein- or DNA-based rails. Imagine the 
possibility that polysaccharides could be used as rails for motor proteins. 
What should be the chemical and physical requirements for such 
polysaccharides and for the motor proteins that could bind to them? 
Suggest at least three different properties. (Revisit the discussions of 
Chapter 7 for the structure and properties of polysaccharides.) 
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Metabolism: An Overview 


Essential Question 


The word metabolism derives from the Greek word for “change.” Metabolism represents 
the sum of the chemical changes that convert nutrients, the “raw materials” necessary 
to nourish living organisms, into energy and the chemically complex finished products 
of cells. What are the chemical reactions that collectively constitute the metabolism 

of the cell? 


Me. consists of thousands of enzymatic reactions organized into discrete 
pathways. These pathways proceed in a stepwise fashion, transforming 
substrates into end products through many specific chemical intermediates, called 
metabolites. Metabolism is sometimes referred to as intermediary metabolism to 
reflect this aspect of the process. 


17.1 | Is Metabolism Similar in Different Organisms? 


One of the great unifying principles of modern biology is that organisms show marked 
similarity in their major pathways of metabolism. Given the almost unlimited possi- 
bilities within organic chemistry, this generality would seem unlikely. Yet it’s true, 
and it provides strong evidence that all life has descended from a common ancestral 
form. All forms of nutrition and almost all metabolic pathways evolved in early pro- 
karyotes prior to the appearance of eukaryotes 1 billion years ago. For example, 
glycolysis, the metabolic pathway by which energy is released from glucose and cap- 
tured in the form of ATP under anaerobic conditions, is common to almost every cell. 
It is believed to be the most ancient of metabolic pathways, having arisen prior to the 
appearance of oxygen in abundance in the atmosphere. All organisms, even those 
that can synthesize their own glucose, are capable of glucose degradation and ATP 
synthesis via glycolysis. Other prominent pathways are also virtually ubiquitous 
among organisms. 


1 7 E 


and Its 
Regulation 


All is flux, nothing stays still. 
Nothing endures but change. 


Heraclitus (c. 540—c. 480 B.C.) 


< Anise swallowtail butterfly (Papilio zelicans) with its pupal 
case. Metamorphosis of butterflies is a dramatic example 
of metabolic change. 


Key Questions 


17.1 


17.2 


17.3 


17.4 


17.5 


17.6 


Is Metabolism Similar in Different 
Organisms? 

What Can Be Learned from Metabolic 
Maps? 

How Do Anabolic and Catabolic 
Processes Form the Core of Metabolic 
Pathways? 

What Experiments Can Be Used to 
Elucidate Metabolic Pathways? 

What Can the Metabolome Tell Us 
about a Biological System? 

What Food Substances Form the 
Basis of Human Nutrition? 
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Table 17.1 


Classification Carbon Source 


Photoautotrophs CO, Light 
Photoheterotrophs Organic Light 
compounds 


Chemoautotrophs CO, 


Chemoheterotrophs Organic 


compounds 


Metabolic Classification of Or: 


Energy Source 


Oxidation— 
reduction reactions 
Oxidation— 
reduction reactions 


uirements 


to Their Carbon and Energ 


Electron Donors Examples 


H2O, HS, S, other inorganic 
compounds 


Green plants, algae, cyanobacteria, 
photosynthetic bacteria 


Organic compounds Nonsulfur purple bacteria 


Nitrifying bacteria; hydrogen, sulfur, 
and iron bacteria 


Inorganic compounds: H2, H3S, 
NH,*, NO, Fe?*, Mn?* 


All animals, most microorganisms, 
nonphotosynthetic plant tissue such as 
roots, photosynthetic cells in the dark 


Organic compounds (e.g., 
glucose) 


Solar 
energy 


Photoautotrophic Heterotrophic 
cells cells 


Smee eee” 


2 
co, 


Figure 17.1 The flow of energy in the biosphere is 
coupled primarily to the carbon and oxygen cycles. 


171a Living Things Vary in Their Carbon and Energy Requirements 


Although most cells have the same core set of central metabolic pathways, different cells 
(and, by extension, different organisms) are characterized by the alternative pathways 
they might express. These pathways offer a wide diversity of metabolic possibilities. For 
instance, organisms are often classified according to the major metabolic pathways they 
exploit to obtain carbon or energy. Classification based on carbon requirements defines 
two major groups: autotrophs and heterotrophs. Autotrophs are organisms that can use 
carbon dioxide as their only source of carbon. Heterotrophs require an organic form of 
carbon, such as glucose, in order to synthesize other essential carbon compounds. 

Classification based on energy sources also gives two groups: phototrophs and chemo- 
trophs. Phototrophs are photosynthetic organisms, which use light as a source of energy. 
Chemotrophs use organic compounds such as glucose or, in some instances, oxidizable 
inorganic substances such as Fe**, NO»~, NH,*, or elemental sulfur as sole sources of 
energy. Typically, the energy is extracted through oxidation-reduction reactions. Based 
on these characteristics, every organism falls into one of four categories (Table 17.1). 


Metabolic Diversity Varies Across Life’s Five Kingdoms Prokaryotes (the kingdom Monera, 
which consists of archaea and bacteria) show a greater metabolic diversity than all the four 
eukaryotic kingdoms (Protista, Fungi, Plants, and Animals) put together. Prokaryotes are 
variously chemoheterotrophic, photoautotrophic, photoheterotrophic, or chemoautotro- 
phic. Noprotistsarechemoautotrophs; fungiand animalsareexclusively chemoheterotrophs; 
and plants are characteristically photoautotrophs, although some are heterotrophic in 
their mode of carbon acquisition. 


17.1b Oxygen Is Essential to Life for Aerobes 


A further metabolic distinction among organisms is whether or not they can use oxy- 
gen as an electron acceptor in energy-producing pathways. Those that can are called 
aerobes or aerobic organisms; others, termed anaerobes, can subsist without O3. 
Organisms for which O; is obligatory for life are called obligate aerobes; humans are 
an example. Some species, the so-called facultative anaerobes, can adapt to anaerobic 
conditions by substituting other electron acceptors for O; in their energy-producing 
pathways; Escherichia coli is an example. Yet others cannot use oxygen at all and are 
even poisoned by it; these are the obligate anaerobes. Clostridium botulinum, the bac- 
terium that produces botulin toxin, is such an organism. 


171c The Flow of Energy in the Biosphere and the Carbon 
and Oxygen Cycles Are Intimately Related 


The primary source of energy for life is the sun. Photoautotrophs use light energy to 
drive the synthesis of organic molecules, such as carbohydrates, from atmospheric 
CO, and water (Figure 17.1). Molecular oxygen (O2) is a by-product of this process. 
Heterotrophic cells then use these organic products of photosynthetic cells both as 
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A Deeper Look 


Calcium Carbonate—A Biological Sink for CO, 


Carbon dioxide accumulation in the atmosphere is a major driver 
of global warming. A significant CO, sequestration site that is often 
overlooked is the calcium carbonate shells of corals, molluscs, and 
crustacea. These invertebrate animals deposit CaCO; in the form of 
protective exoskeletons. In some invertebrates, such as the sclerac- 
tinians (hard corals) of tropical seas, photosynthetic dinoflagellates 


(marine eukaryotic algae) live within the animal cells as endosym- 
bionts called zooxanthellae. These phototrophic cells use light to 
drive the resynthesis of organic molecules from CO, released (as 
bicarbonate ion) by the animal’s metabolic activity. In the presence 
of Ca’*, the photosynthetic CO) fixation pulls the deposition of 
CaCO3, as summarized in the following coupled reactions: 


Ca?* + 2 HCO; === CaCO, + HCO; 
H,CO; —= H-0 + CO, 


HO + CO, —> carbohydrate + O, 


fuels and as building blocks, or precursors, for the biosynthesis of their own unique 
complement of biomolecules. O» is reduced to water as a consequence of energy- 
releasing electron-transfer reactions that underlie the use of these organic fuels. Ulti- 
mately, CO, is the end product of heterotrophic carbon metabolism, and CO, is 
returned to the atmosphere for reuse by the photoautotrophs. In effect, solar energy 
is converted to the chemical energy of organic molecules by photoautotrophs, and 
heterotrophs recover this energy by oxidizing the organic substances. The flow of 
energy in the biosphere is thus conveyed within the carbon and oxygen cycles, and the 
force driving these cycles is light energy. 


17.2 | What Can Be Learned from Metabolic Maps? 


Metabolic maps (Figure 17.2) portray the principal reactions of the intermediary 
metabolism of carbohydrates, lipids, amino acids, nucleotides, and their derivatives. 
These maps are confusing at first glance and seem difficult to decipher. For now, view 
this map as an overview of the complexity of intermediary metabolism. Despite their 
appearance, these maps become easy to follow once the major metabolic routes are 
known and their functions are understood. The underlying order of metabolism and 
the important interrelationships between the various pathways then appear as simple 
patterns against the seemingly complicated background. An interactive version of the 
IUBMB-Sigma-Nicholson Metabolic Pathways chart is available online. In addition, 
both an atlas of human metabolism and a virtual metabolic human are available 
online. These tools allow users to model metabolic pathways and explore rela- 
tionships between human health, disease, and nutrition. 


17.2a The Metabolic Map Can Be Viewed as a Set of Dots and Lines 


One interesting transformation of the intermediary metabolism map is to represent 
each intermediate as a black dot and each enzyme as a line (Figure 17.3). Then, the 
more than 1000 different enzymes and substrates depicted in Figure 17.2 are repre- 
sented by just two symbols. This map has about 520 dots (metabolites or intermedi- 
ates). This simplified map reveals the network nature of metabolism, where some 
intermediates (dots) have a variety of fates, depending upon which enzyme (line) 
might act on them. Table 17.2 lists the numbers of dots that have one or two or more 
lines (enzymes) associated with them. Thus, this table classifies intermediates by the 
number of enzymes that act upon them. A dot connected to just a single line must be 
either a nutrient, a storage form, an end product, or an excretory product of metabo- 
lism. Also, because many pathways tend to proceed in only one direction (that is, they 
are essentially irreversible under physiological conditions), a dot connected to just 
two lines is probably an intermediate in only one pathway and has only one fate in 
metabolism. If three lines are connected to a dot, that intermediate has at least two 


| Table 17.2 | Number of Dots 


(Intermediates) in the Metabolic Map 
of Figure 17.2, and the Number of 


Lines Associated with Them 


Lines Dots 
lor2 410 

3 71 

4 20 

5 11 

6 or more 8 
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Figure 17.2 A metabolic map, indicating the reactions of intermediary metabolism and the enzymes 
that catalyze them. More than 500 different chemical intermediates, or metabolites, and a greater 
number of enzymes are represented here. (Source: From Donald Nicholson, Map #22, © International 
Union of Biochemistry and Molecular Biology.) 
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Figure 17.3 The metabolic map as a set of dots and lines. The heavy dots and lines trace the central 
energy-releasing pathways known as glycolysis and the citric acid cycle. (Adapted from Alberts, B., 
et al., Molecular Biology of the Cell, 2nd ed. New York: Garland Publishing Co., 1989.) 


possible metabolic fates; four lines, three fates; and so on. Note that about 80% of the 
intermediates connect to only one or two lines and thus have only a single role in the 
cell. However, intermediates at branch points are subject to a variety of fates. In such 
instances, the pathway followed is an important regulatory choice. Indeed, whether 
any substrate is routed down a particular metabolic pathway is the consequence of a 
regulatory decision made in response to the cell’s (or organism’s) momentary require- 
ments for energy or nutrition. 


17.2b Alternative Metabolic Pathway Models Can Provide New Insights 
into Pathways 


Alternative mappings of metabolic reactions have been proposed for several reasons. 
First the sheer complexity of pathways has prompted biochemists to seek simpler 
portrayals of an organism’s chemistry. Second, traditional metabolite-focused maps 
(such as Figures 17.1 and 17.4a) do not provide insight into the essential role of enzymes 


What Can Be Learned from Metabolic Maps? 
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in metabolism. Even more significantly, the rise of genomics (the study of the whole 
genomes of organisms), transcriptomics (the study of global messenger RNA expres- 
sion), and proteomics (the study of the totality of proteins) has provoked fresh inter- 
pretations of biological order and function. For example, Juliet Gerrard has proposed 
that metabolic maps be recast in protein-centric presentations (Figure 17.4b). In such 
maps, the metabolites and the enzymes that interconvert them are transposed, reveal- 
ing a new emphasis—the metabolites are signals in a cellular network of proteins. 

Protein-centric maps may be condensed and simplified by realizing that some pathway 
enzymes are clustered in multienzyme complexes and that metabolites are literally passed 
from enzyme to enzyme within such clusters (Figure 17.4c). The result is a simplified rep- 
resentation of metabolic networks, containing only the essential signaling information. 
Metabolic maps are representations of large amounts of information. Conceptualizing 
them in different formats enables biochemists to analyze vast amounts of information in 
new and insightful ways. 


17.2c Metabolism Is a Network 


The architecture of metabolism is typically presented as a collection of enzymes 
arranged in linear or cyclic pathways. These pathways are interconnected into a net- 
work consisting of thousands of metabolites and enzymes. In a metabolic network, 
intermediates can be described as nodes linked by the reactions that produce and con- 
sume them, as shown in Figure 17.2 and Table 17.1. A recent detailed study of metabo- 
lism in E. coli revealed a median connectivity of two nodes, but the average was 5.8. 


(a) (b) 


Enzymes Enzymes Figure 17.4 Three different representations of a metabolic 

10 and 11 14 and 15 pathway. (a) The traditional view of a metabolic pathway is 
metabolite-centric. Metabolites are indicated by capital let- 
ters in blue circles; enzymes are indicated above the arrows by 
the letter E, with subscript numbers indicating different reac- 
tions. (b) The protein- (or enzyme-) centric view of Gerrard 
is more informative for some purposes. Here and in (c), 
metabolites are indicated by capital letters above the arrows; 
enzymes are indicated within yellow circles or boxes, with 
subscript numbers indicating different reactions. (c) A simpli- 
fied version of the protein-centric view where proteins in the 
pathway form multifunctional complexes. 


Enzymes 
12 and 13 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


The average is skewed because P;, ADP, and ATP (the three most connected metabo- 
lites) participate in more than a thousand different reactions. Of the more than 800 
known metabolites in E. coli, roughly 500 participate in only a few reactions. Because 
metabolism shows such a high degree of connectivity, mathematical models are 
required to analyze it effectively. The scientific discipline in which computational and 
mathematical modeling is used to study biological systems is called systems biology. 


17.2d Metabolic Pathways with Multiple Enzymes May Take Different Forms 


Every metabolic pathway consists of sequential enzymatic steps (Figure 17.5). Several 
types of organization are possible. The enzymes of some multienzyme systems may exist 
as physically separate, soluble entities, with diffusing intermediates (Figure 17.5a). In 
other instances, the enzymes of a pathway are collected to form a discrete multienzyme 
complex, and the substrate is sequentially modified as it is passed along from enzyme to 
enzyme (Figure 17.5b). This type of organization has the advantage that intermediates 
are not lost or diluted by diffusion. In a third pattern of organization, the enzymes com- 
mon to a pathway reside together as a membrane-bound system (Figure 17.5c). In this 
case, the enzyme participants (and perhaps the substrates as well) must diffuse in just 
the two dimensions of the membrane to interact with their neighbors. 

As research reveals the ultrastructural organization of the cell in ever greater detail, 
more and more of the so-called soluble enzyme systems are found to be physically 
united into functional complexes. Thus, in many (perhaps all) metabolic pathways, the 
consecutively acting enzymes interact with one another to form stable aggregations that 
are sometimes referred to as metabolons, a word meaning “units of metabolism.” 
Further, in more highly evolved organisms such as birds and mammals, evolutionary 
processes have led to the fusion of genes encoding enzymes for several sequential steps 
in a metabolic pathway. Expression of the fused gene produces a single multifunctional 
polypeptide capable of catalyzing a series of steps in a pathway. 
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Figure 17.5 Schematic representation of types of 
multienzyme systems carrying out a metabolic path- 
way: (a) Physically separate, soluble enzymes with 
diffusing intermediates. (b) A multienzyme complex. 
Substrate enters the complex and becomes bound 
and then sequentially modified by enzymes E; to E; 
before product is released. No intermediates are free 
to diffuse away. (c) A membrane-bound multienzyme 
system. 
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Chemistry in History 


Dr. Marie Maynard Daly 


Dr. Marie Maynard Daly (1921-2003) contended 
with the twin adversities of sexism and segregation 
during her hard-fought climb through academia 
where she earned multiple degrees in chemistry. She 
earned her bachelor’s degree from Queens College, 
her master’s from New York University, and her 
PhD from Columbia University. This dedication 
led to her distinction as the first black woman in the 


Dr. Daly is responsible for some of the earliest 
research on the relationship between diet, blood 
cholesterol levels, and cardiac health. She was also 
a major contributor to some of the foundational 
research in the structure and function of DNA. 
When Dr. Daly retired, she created a scholarship 
fund at Queens College to help African-American 
science students pursue their education. 


United States to earn a chemistry PhD. 


17.3 | How Do Anabolic and Catabolic Processes 
| Form the Core of Metabolic Pathways? 


Metabolism serves two fundamentally different purposes: the generation of energy to 
drive vital functions and the synthesis of biological molecules. To achieve these ends, 
metabolism consists largely of two contrasting processes: catabolism and anabolism. 
Catabolic pathways are characteristically energy yielding, whereas anabolic pathways are 
energy requiring. Catabolism involves the oxidative degradation of complex nutrient 
molecules (carbohydrates, lipids, and proteins) obtained either from the environment or 
from cellular reserves. The breakdown of these molecules by catabolism leads to the 
formation of simpler molecules such as lactic acid, ethanol, carbon dioxide, urea, or 
ammonia. Catabolic reactions are usually exergonic, and often the chemical energy 
released is captured in the form of ATP (see Chapter 3). Because catabolism is oxidative 
for the most part, part of the chemical energy may be conserved as energy-rich electrons 
transferred to the coenzymes NAD* (nicotinamide adenine dinucleotide) and NADP* 
(nicotinamide adenine dinucleotide phosphate). These two reduced coenzymes have 
very different metabolic roles: NAD* reduction is part of catabolism; NADPH oxida- 
tion is an important aspect of anabolism. The energy released upon oxidation of NADH 
is coupled to the phosphorylation of ADP in aerobic cells, and so NADH oxidation 
back to NAD* serves to generate more ATP. The coupled process by which NADH 
oxidation leads to ATP synthesis is called oxidative phosphorylation. Most of the ATP 
produced in aerobic cells arises via oxidative phosphorylation. In contrast, NADPH is 
the source of the reducing power needed to drive reductive biosynthetic reactions. 

Thermodynamic considerations demand that the energy necessary for biosynthesis 
of any substance exceed the energy available from its catabolism. Otherwise, organisms 
could achieve the status of perpetual motion machines: A few molecules of substrate 
whose catabolism yielded more ATP than required for its resynthesis would allow the 
cell to cycle this substance and harvest an endless supply of energy. 


17.3a Anabolism Is Biosynthesis 


Anabolism is a biosynthetic process in which the varied and complex biomolecules (pro- 
teins, nucleic acids, polysaccharides, and lipids) are assembled from simpler precursors. 
Such biosynthesis involves the formation of new covalent bonds, and an input of chemical 
energy is necessary to drive these endergonic processes. The ATP generated by catabolism 
provides this energy. Furthermore, NADPH is an excellent donor of high-energy electrons 
for the reductive reactions of anabolism. Despite their divergent roles, anabolism and 
catabolism are interrelated in that the products of one provide the substrates of the other 
(Figure 17.6). Many metabolic intermediates are shared between the two processes, and the 
precursors needed by anabolic pathways are found among the products of catabolism. 


17.3b Anabolism and Catabolism Are Not Mutually Exclusive 


Interestingly, anabolism and catabolism occur simultaneously in the cell. The con- 
flicting demands of concomitant catabolism and anabolism are managed by cells in 
two ways. First, the cell maintains tight and separate regulation of both catabolism 
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Energy-yielding nutrients Cell macromolecules 
Carbohydrates Proteins 
Fats Polysaccharides 
Proteins Lipids 
Nucleic acids 


— 
Catabolism ——~~ »| |Anabolism 
(oxidative, (reductive, 
exergonic) » | | endergonic) 
—— >] 
Energy-poor end products Various metabolic intermediates 
HO Glucose 
CO, Glutamate and glutamine 
NHs Glycerol 
` Pyruvate and other a-keto acids 
Acetyl-CoA 


and anabolism, so metabolic needs are served in an immediate and orderly fashion. 
Second, competing metabolic pathways are often localized within different cellular 
compartments. Isolating opposing activities within distinct compartments, such as 
separate organelles, avoids interference between them. For example, the enzymes 
responsible for catabolism of fatty acids, the fatty acid oxidation pathway, are localized 
within mitochondria. In contrast, fatty acid biosynthesis takes place in the cytosol. 


17.3c The Pathways of Catabolism Converge to a Few End Products 


The catabolism of the principal energy-yielding nutrients (carbohydrates, lipids, and 
proteins) in a typical heterotrophic cell involves a succession of enzymatic reactions. In 
the presence of oxygen (aerobic catabolism), these molecules are degraded ultimately to 
carbon dioxide, water, and ammonia. Aerobic catabolism consists of three distinct 
stages. In stage 1, the nutrient macromolecules are broken down into their respective 
building blocks. Despite the great diversity of macromolecules, these building blocks 
represent a rather limited number of products. Proteins yield up their 20 component 
amino acids, polysaccharides give rise to carbohydrate units that are convertible to 
glucose, and lipids are broken down into glycerol and fatty acids (Figure 17.7). 

In stage 2, the collection of product building blocks generated in stage 1 is further 
degraded to yield an even more limited set of simpler metabolic intermediates. The deami- 
nation of amino acids leaves a-keto acid carbon skeletons. Several of these a-keto acids 
are citric acid cycle intermediates and are fed directly into stage 3 catabolism via this cycle. 
Others are converted either to the three-carbon a-keto acid pyruvate or to the acetyl 
groups of acetyl-coenzyme A (acetyl-CoA). Glucose and the glycerol from lipids also gen- 
erate pyruvate, whereas the fatty acids are broken into two-carbon units that appear as 
acetyl-CoA. Because pyruvate also gives rise to acetyl-CoA, we see that the degradation of 
macromolecular nutrients converges to a common end product, acetyl-CoA (Figure 17.7). 

Stage 3 of catabolism involves the combustion of acetyl-CoA units in the citric acid 
cycle, an oxidative process that is coupled to NAD” reduction, so NADH is formed. CO, 
is a by-product. NADH oxidation through electron transfer to O, is coupled to ATP 
synthesis (oxidative phosphorylation), with H,O as the by-product of O, reduction. Thus, 
CO, and H,O are the ultimate waste products of aerobic catabolism. Stage 3 metabolism 
generates most of the energy produced by the cell. 


17.3d Anabolic Pathways Diverge, Synthesizing an Astounding Variety 
of Biomolecules from a Limited Set of Building Blocks 
A rather limited collection of simple precursor molecules is sufficient to provide for the 


biosynthesis of virtually any cellular constituent, be it protein, nucleic acid, lipid, or 
polysaccharide. All of these substances are constructed from appropriate building 


Figure 17.6 Energy relationships between 

the pathways of catabolism and anabolism. 
Oxidative, exergonic pathways of catabolism 
release free energy and reducing power that are 
captured in the form of ATP and NADPH, 
respectively. Anabolic processes are endergonic, 
consuming chemical energy in the form of ATP 
and using NADPH as a source of high-energy 
electrons for reductive purposes. 
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Large biomolecules 


The various kinds of proteins, polysaccharides, and 
fats are broken down into their component 
building blocks, which are relatively few in number. 


Building block 
molecules 


The various building blocks are degraded into a 
common product, the acetyl groups of acetyl-CoA. 


Amino acids 


Common 
degradation product 


Catabolism converges via the citric acid cycle to 
three principal end products: water, carbon dioxide, 
and ammonia. 


End Simple, small 
products end products of 
catabolism 


NH, 


Glycolysis 


Pentoses, 
hexoses 


Glyceral 


Pyruvate 


Glycerol, fatty acids 


ldehyde-3-phosphate 
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Acetyl-CoA 


Citric acid 
cycle 


| Oxidative phosphorylation 


H,O 
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Figure 17.7 The three stages of catabolism. Stage 1: Proteins, polysaccharides, and lipids are 
broken down into their component building blocks, which are relatively few in number. 
Stage 2: The various building blocks are degraded into the common product, the acetyl groups 


of acetyl-CoA. Stage 3: Catabolism converges to three principal end products: water, carbon dioxide, 


and ammonia. 
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blocks via the pathways of anabolism. In turn, the building blocks (amino acids, nucle- 
otides, sugars, and fatty acids) can be generated from metabolites in the cell. For exam- 
ple, amino acids can be formed by amination of the corresponding a-keto acid carbon 
skeletons, and pyruvate can be converted to hexoses for polysaccharide biosynthesis. 


17.3e Amphibolic Intermediates Play Dual Roles 


Certain central pathways of intermediary metabolism, such as the citric acid cycle, and 
many metabolites of other pathways have dual purposes—they serve in both catabo- 
lism and anabolism. For example, oxaloacetate, an intermediate in the citric acid cycle, 
is the precursor for synthesis of the amino acid aspartate. This dual nature is reflected 
in the designation of such pathways as amphibolic rather than solely catabolic or ana- 
bolic. In any event, in contrast to catabolism—which converges to the common inter- 
mediate, acetyl-CoA—the pathways of anabolism diverge from a small group of simple 
metabolic intermediates to yield a spectacular variety of cellular constituents. 


17.3f Corresponding Pathways of Catabolism 
and Anabolism Differ in Important Ways 


The anabolic pathway for synthesis of a given end product usually does not match 
precisely the pathway used for catabolism of the same substance. Some of the 
intermediates may be common to steps in both pathways, but different enzymatic 
reactions and unique metabolites characterize other steps. A good example of these 
differences is found in a comparison of the catabolism of glucose to pyruvic acid by 
the pathway of glycolysis and the biosynthesis of glucose from pyruvate by the path- 
way called gluconeogenesis. The glycolytic pathway from glucose to pyruvate consists 
of 10 enzymes. Although it may seem efficient for glucose synthesis from pyruvate to 
proceed by a reversal of all 10 steps, gluconeogenesis uses only seven of the glycolytic 
enzymes in reverse, replacing those remaining with four enzymes specific to glucose 
biosynthesis. In similar fashion, the pathway responsible for degrading proteins to 
amino acids differs from the protein synthesis system, and the oxidative degradation 
of fatty acids to two-carbon acetyl-CoA groups does not follow the same reaction 
path as the biosynthesis of fatty acids from acetyl-CoA. 


17.3g Metabolic Regulation Requires Different Pathways for Oppositely 
Directed Metabolic Sequences 


The main reason for different pathways serving in opposite metabolic directions is 
that such pathways must be independently regulated. If catabolism and anabolism 
passed along the same set of metabolic tracks, equilibrium considerations would dic- 
tate that slowing the traffic in one direction by inhibiting a particular enzymatic reac- 
tion would also slow traffic in the opposite direction. Independent regulation of 
anabolism and catabolism can be accomplished only if these two contrasting pro- 
cesses move along different routes or, in the case of shared pathways, the rate-limiting 
steps serving as the points of regulation are catalyzed by enzymes that are unique to 
each opposing sequence (Figure 17.8). Note that opposing pathways are reciprocally 
regulated: Activation of one is accompanied by inhibition of the other. Reciprocal 
regulation means that the opposing metabolic sequences function in only one direc- 
tion at any moment in time. The direction in which they operate is determined by 
allosteric effectors acting on key steps in the opposing pathways. 


17.3h ATP Is Produced and Consumed in a Cellular Energy Cycle 


We saw in Chapter 3 that ATP is the energy currency of cells. In phototrophs, ATP is 
one of the two energy-rich primary products resulting from the transformation of 
light energy into chemical energy. (The other is NADPH; see the following discus- 
sion.) In heterotrophs, the pathways of catabolism have as their major purpose the 
release of free energy that can be captured in the form of energy-rich phosphoric 
anhydride bonds in ATP. In turn, ATP provides the energy that drives the manifold 
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Figure 17.8 Parallel pathways of catabolism and anabolism must differ in at least one metabolic step 
to be regulated independently. Shown here are two possible arrangements of opposing catabolic and 
anabolic sequences between A and P. (a) The parallel sequences proceed via independent routes. 
(b) Only one reaction has two different enzymes, a catabolic one (E3) and its anabolic counterpart (E6). 
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Figure 17.9 The ATP cycle in cells. ATP is formed O, ADP + P 
via photosynthesis in phototrophic cells or catabo- i 
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activities of all living cells—the synthesis of complex biomolecules, the work involved 
in transporting substances into cells, the work of cell motility, and the work of muscle 
contraction. These diverse activities are all powered by energy released in the hydro- 
lysis of ATP to ADP and P;. Thus, there is an energy cycle in cells where ATP serves 
as the vessel carrying energy from photosynthesis or catabolism to the energy- 
requiring processes unique to living cells (Figure 17.9). 


17.31 NAD* Collects Electrons Released in Catabolism 


The substrates of catabolism—proteins, carbohydrates, and lipids—are good sources of 
chemical energy because the carbon atoms in these molecules are in a relatively reduced 
state (Figure 17.10). In the oxidative reactions of catabolism, reducing equivalents are 
released from these substrates, often in the form of hydride ions (a proton coupled with two 
electrons, H:~). These hydride ions are transferred in enzymatic dehydrogenase reactions 
from the substrates to NAD* molecules, reducing them to NADH. A second proton 
accompanies these reactions, appearing in the overall equation as H* (Figure 17.11). 
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Figure 17.10 Comparison of the state of reduction 

of carbon atoms in biomolecules: —CH,— (fats) > 
Moré Less —CHOH-— (carbohydrates) “C=O (carbonyls) > 
reduced reduced —COOH (carboxyls) > CO; (carbon dioxide, the 
state state final product of catabolism). 
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In turn, NADH is oxidized back to NAD* when it transfers its reducing equivalents to 
electron acceptor systems that are part of the metabolic apparatus of the mitochondria. 
The ultimate oxidizing agent (e~ acceptor) is O2, becoming reduced to H,O. 

Oxidation reactions are exergonic, and the energy released is coupled with the forma- 
tion of ATP in the process called oxidative phosphorylation. The NAD*—-NADH system 
can be viewed as a shuttle that carries the electrons released from catabolic substrates to 
the mitochondria, where they are transferred to O», the ultimate electron acceptor in 
catabolism. In the process, the free energy released upon NADH oxidation by O, is 
trapped through phosphorylation of ADP to form ATP. The NADH cycle is an impor- 
tant player in the transformation of the chemical energy of carbon compounds into the 
chemical energy of phosphoric anhydride bonds. Such transformations of energy from 
one form to another are referred to as energy transduction. Oxidative phosphorylation is 
one cellular mechanism for energy transduction. Also, relative NAD* levels are an 
important indicator of the metabolic status of cells, with a high NAD*/NADH ratio 
indicating energy limitation, as in starvation. 


17.3) NADPH Provides the Reducing Power for Anabolic Processes 


Whereas catabolism is fundamentally an oxidative process, anabolism is, by its contrasting 
nature, reductive. The biosynthesis of the complex constituents of the cell begins at the 
level of intermediates derived from the degradative pathways of catabolism; or, at the most 
basic level, biosynthesis begins with oxidized substances available in the inanimate envi- 
ronment, such as carbon dioxide. When the hydrocarbon chains of fatty acids are assem- 
bled from acetyl-CoA units, activated hydrogens are needed to reduce the carbonyl (C=O) 
carbon of acetyl-CoA into a —CH),— at every other position along the chain. When glu- 
cose is synthesized from CO, during photosynthesis in plants, reducing power is required. 
These reducing equivalents are provided by NADPH, the usual source of high-energy 
hydrogens for reductive biosynthesis. NADPH is generated when NADP* is reduced with 
electrons in the form of hydride ions. In heterotrophic organisms, these electrons are 
removed from fuel molecules by NADP*-specific dehydrogenases. In these organisms, 
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NADPH can be viewed as the carrier of electrons from catabolic reactions to anabolic 
reactions (Figure 17.12). In photosynthetic organisms, the energy of light is used to pull 
electrons from water and transfer them to NADP+; O; is a by-product of this process. 


17.3k Coenzymes and Vitamins Provide Unique Chemistry 
and Essential Nutrients to Pathways 


In addition to NAD* and NADPH, a variety of other small molecules are essential to 
metabolism. Some of these essential nutrients are vitamins. 

Vitamins are required in the diet, usually in trace amounts, because they cannot be 
synthesized by the organism itself. The requirement for any given vitamin depends on 
the organism. Not all vitamins are required by all organisms. Vitamins required in the 
human diet are listed in Table 17.3. These important substances are traditionally distin- 
guished as being either water soluble or fat soluble. 

Except for vitamin C (ascorbic acid), the water-soluble vitamins are all compo- 
nents or precursors of important biological substances known as coenzymes. These 
are low-molecular-weight molecules that bring unique chemical functionality to 
certain enzyme reactions. The side chains of the common amino acids provide only 
a limited range of chemical reactivities and carrier properties. Coenzymes, acting 
in concert with appropriate enzymes, provide a broader range of catalytic proper- 
ties for the reactions of metabolism. Coenzymes are typically modified by these 


Figure 17.12 Transfer of reducing equivalents from 
catabolism to anabolism via the NADPH cycle. 


Vitamins and Coenzymes 


Vitamin Coenzyme Form Function 


Water-Soluble 


Thiamine (vitamin B,) Thiamine pyrophosphate Decarboxylation of a-keto acids and formation and 


cleavage of a-hydroxyketones 


Niacin (vitamin B}, nicotinic acid) Nicotinamide adenine Hydride transfer 
dinucleotide (NAD*) 
Nicotinamide adenine dinucleotide Hydride transfer 


phosphate (NADP*) 
Flavin adenine dinucleotide (FAD) 


Flavin mononucleotide (FMN) 


Riboflavin (vitamin B») One- and two-electron transfer 


One- and two-electron transfer 


Pantothenic acid (vitamin Bs) Coenzyme A Activation of acyl groups for transfer by nucleophilic 
attack, and activation of the a-hydrogen of the acyl 


group for abstraction as a proton 


Pyridoxal, pyridoxine, 
pyridoxamine (vitamin Be) 


Pyridoxal phosphate Formation of stable Schiff base (aldimine) adducts 
with a-amino groups of amino acids; serving as an 


electron sink to stabilize reaction intermediates 


Cobalamin (vitamin B,) 


Biotin (vitamin B,) 


Folic acid (vitamin By) 


Fat-Soluble 
Retinol (vitamin A) 


Retinal (vitamin A) 


Retinoic acid (vitamin A) 


Ergocalciferol (vitamin D>) 


Cholecalciferol (vitamin D3) 


a-Tocopherol (vitamin E) 


Menaquinone (vitamin K) 


5'-Deoxyadenosylcobalamin 


Methylcobalamin 


Biotin—lysine complexes (biocytin) 


Tetrahydrofolate 


Intramolecular rearrangement, reduction of 
ribonucleotides to deoxyribonucleotides, and methyl 
group transfer 


Carrier of carboxyl groups in carboxylation reactions 


Acceptor and donor of 1-C units for all oxidation 
levels of carbon except that of CO 
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reactions and are then converted back to their original forms by other enzymes, so 
small amounts of these substances can be used repeatedly. Coenzymes may also act 
as carriers of specific functional groups, such as methyl groups and acyl groups. 
The coenzymes derived from the water-soluble vitamins are listed in Table 17.3. 
The fat-soluble vitamins are not directly related to coenzymes, but they play essen- 
tial roles in a variety of critical biological processes, including vision, maintenance 
of bone structure, and blood coagulation. The mechanisms of action of fat-soluble 
vitamins are not as well understood as their water-soluble counterparts, but mod- 
ern research efforts are gradually closing this gap. 


17.4 | What Experiments Can Be Used 
to Elucidate Metabolic Pathways? 


Research into the chemistry of metabolism and its organization into pathways can be 
traced back to the end of the nineteenth century. A major intellectual advance took 
place then when Eduard Buchner showed that the fermentation of glucose to yield 
ethanol and carbon dioxide can occur in extracts of broken yeast cells. Until this dis- 
covery, many thought that metabolism was a vital property, unique to intact cells; 
even the eminent microbiologist Louis Pasteur, who contributed so much to our 
understanding of fermentation, was a vitalist, one of those who believed that the pro- 
cesses of living substance transcend the laws of chemistry and physics. After Buchner’s 
revelation, biochemists searched for intermediates in the transformation of glucose 
and soon learned that inorganic phosphate was essential to glucose breakdown. This 
observation gradually led to the discovery of a variety of phosphorylated organic 
compounds that serve as intermediates along the fermentative pathway. 


17.4a Enzyme Inhibitors Can Be Used to Reveal Individual Metabolic 
Reactions 


An important tool for elucidating the steps in the pathway was the use of metabolic 
inhibitors. Adding an enzyme inhibitor to a cell-free extract caused an accumulation of 
intermediates in the pathway prior to the point of inhibition (Figure 17.13). Each inhib- 
itor was specific for a particular site in the sequence of metabolic events. As the arsenal 
of inhibitors was expanded, the individual steps in metabolism were revealed. 
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Figure 17.13 The use of inhibitors to reveal the sequence of reactions in a metabolic pathway. 

(a) Control: Under normal conditions, the steady-state concentrations of a series of intermediates 
will be determined by the relative activities of the enzymes in the pathway. (b) Plus inhibitor: In the 
presence of an inhibitor (in this case, an inhibitor of enzyme 4), intermediates upstream of the meta- 
bolic block (B, C, and D) accumulate, revealing themselves as intermediates in the pathway. 

The concentration of intermediates lying downstream (E and F) will fall. 
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Figure 17.14 One of the earliest experiments 

using a radioactive isotope as a metabolic tracer. 
Cells of Chlorella (a green alga) synthesizing carbo- 
hydrate from carbon dioxide were exposed briefly 

(5 s) to 4C-labeled CO;. The products of CO, incor- 
poration were then quickly isolated from the cells, 
separated by two-dimensional paper chromatogra- 
phy, and observed via autoradiographic exposure 

of the chromatogram. Such experiments identified 
radioactive 3-phosphoglycerate (PGA) as the pri- 
mary product of CO, fixation. The 3-phosphoglyc- 
erate was labeled in the 1-position (in its carboxyl 
group). Radioactive compounds arising from the 
conversion of 3-phosphoglycerate to other metabolic 
intermediates included phosphoenolpyruvate (PEP), 
malic acid, triose phosphate, alanine, as well as sugar 
phosphates and diphosphates. 


17.4b Mutations Create Specific Metabolic Blocks 


Genetics provides an approach to the identification of intermediate steps in metabo- 
lism that is somewhat analogous to inhibition. Mutation in a gene encoding an 
enzyme often results in an inability to synthesize the enzyme in an active form. Such 
a defect leads to a block in the metabolic pathway at the point where the enzyme acts, 
and the enzyme’s substrate accumulates. Such genetic disorders are lethal if the end 
product of the pathway is essential or if the accumulated intermediates have toxic 
effects. In microorganisms, however, it is often possible to manipulate the growth 
medium so that essential end products are provided. Then the biochemical 
consequences of the mutation can be investigated. Studies on mutations in genes of 
the filamentous fungus Neurospora crassa led G. W. Beadle and E. L. Tatum to 
hypothesize in 1941 that genes are units of heredity that encode enzymes (a principle 
referred to as the one gene—one enzyme hypothesis). 


17.4c Isotopic Tracers Can Be Used as Metabolic Probes 


Another widely used approach to the elucidation of metabolic sequences is to feed 
cells a substrate or metabolic intermediate labeled with a particular isotopic form of 
an element that can be traced. Two sorts of isotopes are useful in this regard: radioac- 
tive isotopes, such as 4C, and stable heavy isotopes, such as '8O or 5N. Because the 
chemical behavior of isotopically labeled compounds is rarely distinguishable from 
that of their unlabeled counterparts, isotopes provide reliable tags for observing met- 
abolic changes. The metabolic fate of a radioactively labeled substance can be traced 
by determining which atoms in intermediates derived from the labeled compound are 
radioactive (Figure 17.14). 

Heavy isotopes endow the compounds in which they appear with slightly greater 
masses than their unlabeled counterparts. These compounds can be separated and 
quantitated by mass spectrometry (or density gradient centrifugation, if they are mac- 
romolecules). For example, !*O was used in separate experiments as a tracer of the fate 
of the oxygen atoms in water and carbon dioxide to determine whether the atmospheric 
oxygen produced in photosynthesis arose from H20, CO, or both: 


CO, + H:O —> (CHO) + O: 


If '8O-labeled CO, was presented to a green plant carrying out photosynthesis, none 
of the '8O was found in O». Curiously, it was recovered as H,'8O. In contrast, when 
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plants fixing CO, were equilibrated with H,!%O, !8O, was evolved. These latter label- 
ing experiments established that photosynthesis is best described by the equation 


C!6O, +2 H,!80 —_ (CH,!*O) + 180, + H,!°O 


That is, in the process of photosynthesis, the two oxygen atoms in O, come from two 
H,O molecules. One O is lost from CO, and appears in H,O, and the other O of CO, 
is retained in the carbohydrate product. Two of the four H atoms are accounted for in 
(CH,0), and two reduce the O lost from CO, to H,O. 


17.4d NMR Spectroscopy Is a Noninvasive Metabolic Probe 


A technology analogous to isotopic tracers is provided by nuclear magnetic reso- 
nance (NMR) spectroscopy. The atomic nuclei of certain isotopes, such as 'H and #!P, 
have magnetic moments. The resonance frequency of a magnetic moment depends on 
the local chemical environment. That is, the NMR signal of the nucleus is influenced 
in an identifiable way by the chemical nature of its neighboring atoms in the com- 
pound. Such variation in an atom’s nuclear magnetic resonance frequency as a func- 
tion of the electronic environment in its neighborhood is called chemical shift. 
Chemical shifts reveal a great deal of structural information about the environment 
around the atom and thus the nature of the compound containing the atom. Trans- 
formations of substrates and metabolic intermediates labeled with magnetic nuclei 
can be traced by following changes in NMR spectra. Furthermore, NMR spectros- 
copy is a noninvasive procedure. Whole-body NMR spectrometers are being used 
today in hospitals to directly observe the metabolism (and clinical condition) of liv- 
ing subjects (Figure 17.15). NMR is a useful tool for clinical diagnosis and for the 
investigation of metabolism in situ (literally “in site,” meaning in this case “where 
and as it happens”). 


17.4e Cell Fractionation Reveals Which Compartment Contains a Metabolic 
Pathway 


Although the interior of a prokaryotic cell is not subdivided into compartments by 
internal membranes, the cell still shows some spatial segregation of metabolism. For 
example, certain metabolic pathways, such as phospholipid synthesis and oxidative 
phosphorylation, are localized in the plasma membrane. Protein biosynthesis is car- 
ried out on ribosomes. 

In contrast, eukaryotic cells are extensively compartmentalized by an endomem- 
brane system. Each of these cells has a true nucleus bounded by a double membrane 
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Figure 17.15 With NMR spectroscopy, one can 
observe the metabolism of a living subject in real 
time. These NMR spectra show the changes in 
ATP, creatine-P (phosphocreatine), and P; levels in 
the forearm muscle of a human subject before (a) 
and after (b) 19 minutes of exercise. Note that the 
three P atoms of ATP (a, B, and y) have different 
chemical shifts, reflecting their different chemical 
environments. 
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Figure 17.16 Fractionation of a cell extract by dif- 


ferential centrifugation. Cells in a tissue are ruptured 


during homogenization, releasing their contents. 

It is possible to separate organelles and subcellular 
particles in a centrifuge because their inherent size 
and density differences give them different rates of 
sedimentation in an applied centrifugal field. Nuclei 
are pelleted in relatively weak centrifugal fields and 
mitochondria in somewhat stronger fields, whereas 
very strong centrifugal fields are necessary to pel- 
let ribosomes and fragments of the endomembrane 
system. 


called the nuclear envelope. The nuclear envelope is continuous with the endomembrane 
system, which is composed of differentiated regions: the endoplasmic reticulum; the 
Golgi complex; various membrane-bounded vesicles such as lysosomes, vacuoles, and 
microbodies; and, ultimately, the plasma membrane itself. Eukaryotic cells also possess 
mitochondria and, if they are photosynthetic, chloroplasts. Disruption of the cell mem- 
brane and fractionation of the cell contents into the component organelles have allowed 
an analysis of their respective functions (Figure 17.16). Each compartment is dedicated 
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to specialized metabolic functions, and the enzymes appropriate to these specialized 
functions are confined together within the organelle. In many instances, the enzymes of 
a metabolic sequence occur together within the organellar membrane. Thus, the flow of 
metabolic intermediates in the cell is spatially as well as chemically segregated. For 
example, the 10 enzymes of glycolysis are found in the cytosol, but pyruvate, the prod- 
uct of glycolysis, is fed into the mitochondria. These organelles contain the citric acid 
cycle enzymes, which oxidize pyruvate to CO>. The great amount of energy released in 
the process is captured by the oxidative phosphorylation system of mitochondrial mem- 
branes and used to drive the formation of ATP (Figure 17.17). 


17.4f Metabolic Pathways Are Also Time-Dependent 


The spatial compartmentalization of metabolic pathways within cells provides impor- 
tant advantages, one of which is isolating competing pathways from one another. Cells 
and organisms also exhibit temporal compartmentalization of their metabolic path- 
ways. That is, metabolic pathways may be turned on and off in a time-dependent and/ 
or cyclic fashion. For example, the metabolism of many organisms—amicrobes, animals, 
and plants—is regulated in synchrony with the 24-hour cycle of day and night, a pattern 
called circadian rhythmicity and often referred to as the biological clock. Why? Because 
light and/or varying nutrient availability represent key signals regarding the transitory 
nature of the environment, and organisms have evolved and adapted to exploit the 
information in such signals. Photosynthesis and carbon dioxide fixation in plants typi- 
cally is active during daylight hours and down-regulated during the dark. Significantly, 
recent studies have demonstrated that mammalian genes involved in lipid metabolism, 
glycolysis, gluconeogenesis, electron transport, mitochondrial ATP synthesis, and other 
metabolic pathways show a circadian pattern of regulated expression. In particular, the 
genes encoding the rate-limiting enzymes in many of these pathways are under circa- 
dian regulation. Feeding time (nutrient availability) is an important external cue that 
sets the phase of these metabolic gene expression cycles. In the liver, gluconeogenesis 
and glycogen breakdown are active during the sleep/fasting phase of the circadian 
rhythm, apparently to maintain blood glucose levels. On the other hand, glycolysis and 
cholesterol synthesis are active during the wake/feeding phase when substrates become 
available. Notably, disruptions of the circadian clock result in elevated fat accumulation 
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Figure 17.17 Compartmentalization of glycolysis, 
the citric acid cycle, and oxidative phosphorylation. 
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and other metabolic abnormalities. Yeast cells (Saccharomyces cerevisiae) maintained 
under continuous nutrient-limited conditions adopt a 4- to 5-hour cycle of respiratory 
activity, as demonstrated by oscillations in O consumption. Over half of the genes in 
the yeast genome are expressed rhythmically during this yeast metabolic cycle, with 
genes encoding proteins of similar metabolic function showing synchronous oscillations 
in expression. Those genes involved with energy metabolism show particularly robust 
metabolic cycles. 


17.5 | What Can the Metabolome Tell Us 
about a Biological System? 


Metabolomics is a newly emerged field in biochemistry and an obvious next step in the 
—omics series of genomics, transcriptomics, and proteomics. The metabolome is the 
complete set of low-molecular-weight substances involved in metabolism or produced 
by a cell or organism under specified physiological conditions. Metabolomics is the 
systematic identification and quantitation of all of these substances. Moving from the 
genome, which specifies all the genetic information available to an organism as 
encoded in its DNA, to the metabolome takes one from the genotype of the organism 
(its complete genetic potential) to its phenotype (the observed traits of an organism 
achieved through expression of its genes and the interaction of those gene products 
with the environment). As the ultimate expression of an organism’s genetic possibili- 
ties and its responses to its environment, the metabolome is the best predictor of an 
organism’s phenotype. As such, metabolomics is occupying a prominent position in 
the rise of systems biology, where comprehensive information about the genome, the 
transcriptome, the proteome, and the metabolome combine to provide insightful 
descriptions of the behavior of cells and organisms, as well as detailed understanding 
of many diseases (Figure 17.18). 

Metabolomic analyses present daunting challenges, even for simple organisms. More 
than 500 metabolites are represented in the metabolic map shown in Figure 17.2, but far 
more exist in a typical cell. For example, the 40 or so fatty acids occurring in a cell can 
alone account for more than a thousand phosphatidic acids. (Phosphatidic acids, with 
two fatty acids esterified to sn-glycerol-3-phosphate create 40 x 40 = 1600 species by 
themselves!) The Human Metabolomics Database (www.hmdb.ca) provides data on 
more than 115,000 metabolites in cells and fluids (blood, urine, and so on) of the human 
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Figure 17.18 The —-omics of contemporary biochemistry and molecular biology, as it relates to metab- 
olism. The different kinds and relative abundances of the various cellular metabolites are the best 
indicator of the phenotype of the cell, where phenotype connotes the sum total of all of the cell’s (or 
organism’s) characteristics at a given moment in time. Any change in these metabolites indicates that 
a biochemical perturbation (a phenotypic change) has occurred. In human health, such perturbations 
may be useful clinical clues. (Adapted from Figure 2a in Patti, G. J., Yanes, O., and Siuzdak, G., 2012. 
Metabolomics: the apogee of the omics trilogy. Nature Reviews Molecular Cell Biology 13:263-269.) 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


17.5 What Can the Metabolome Tell Us about a Biological System? 


body. The metabolomes of plants are even more complex, with estimates suggesting 
hundreds of thousands of different metabolites across the plant kingdom. Metabolomic 
assays must be able to resolve and discriminate this extraordinary array of small mole- 
cules. Moreover, concentrations of metabolites in a cell vary widely, from less than 1071? 
M (for many hormones) to 0.1 M (for Na* ions). 

The principal tools of metabolomics are NMR spectroscopy and mass spectrome- 
try. Mass spectrometry (Chapter 5) offers unmatched sensitivity for detection of 
metabolites at low concentrations and NMR spectroscopy can provide exceptional 
resolution and discrimination of metabolites in complex mixtures (Figure 17.20). 
Coupling these techniques with a variety of chromatographic separation protocols 
makes it possible to rapidly analyze thousands of metabolites in biological samples at 
low cost. 

Metabolomic studies take two forms: Targeted metabolomics is the scrutiny of a par- 
ticular metabolic pathway or class of metabolites (for example, lipids or amino acids). 
Because the metabolites of interest are known, standard compounds can be used to 
predict the chromatographic, magnetic resonance, or mass spectrometric behavior of 
the molecules of interest. Figure 17.20 shows the results of a targeted metabolomic 
analysis. Untargeted or discovery metabolomics has a more global aim—to identify and 
quantify all of the metabolites present in a biological sample. 


Discovery Metabolomics Mass spectrometry (MS) is the key tool in discovery metabo- 
lomics, though in some cases, liquid chromatographic (or for hydrophobic substances, 
gas chromatographic) separation of metabolites into classes (e.g., hydrophobic, hydro- 
philic) precedes MS (LC/MS analyses). Tandem MS (MS/MS) for fragmentation of 
molecules and subsequent analysis (as in protein sequencing, see Section 5.4) is also 
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Figure 17.19 Mass spectrometry offers remarkable sensitivity for metabolomic analyses. Shown here 
are desorption electrospray ionization mass spectra for urine samples from individuals with inborn 
errors of metabolism. PKU = phenylketonuria; HCY = homocystinuria; MSUD = maple syrup 
urine disease. (Adapted from Pan, Z., et al., 2007. Principal component analysis of urine metabolites 
by NMR and DESI-MS in patients with inborn errors of metabolism. Analytical and Bioanalytical 
Chemistry 387:539-549.) 
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Figure 17.20 Targeted analysis of 
the water-soluble metabolites in (a) 'H NMR 
a plant. (a) One-dimensional 'H 
NMR spectrum of an equimolar 
mixture of 26 small-molecule 
standards. (b) Two-dimensional 
NMR spectrum of the same mixture 
(red) overlaid onto a spectrum of 
aqueous whole-plant extract from 
Arabidopsis thaliana, a model 
organism for the study of plant 
molecular biology and biochemistry. 
(Adapted from Lewis, I., et al., 
2007. Method for determining (b) 1H-8C NMR 
molar concentrations of metabolites 
in complex solutions from two- 
dimensional !H-!3C NMR spectra. 
Analytical Chemistry 79:9385—9390.) Ornithine, Putrescine 
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used. Technological innovations allow direct MS analysis of single bacterial colonies 
(Figure 17.21) or very small tissues through microscope interfaces. For example, me- 
tabolomic analysis has been carried out on the developing mouse embryonic spinal 
cord. A significant challenge is that data sets from discovery metabolomics analyses are 
extraordinarily complex. Software programs are available to collate the MS informa- 
tion and create networks of structural similarities that lead to unambiguous identifica- 
tion of individual metabolites through comparisons with established MS databases. 
These databases include the Human Metabolome Database and METLIN. METLIN 
is a metabolite and tandem MS database and data management system maintained by 
the Scripps Center for Metabolomics. METLIN is the world’s largest tandem MS data- 
base, containing hundreds of thousands of molecular standards. 

Small molecules modulate all steps in the flow of information from genotype to phe- 
notype. Metabolites are the substrates and products of enzymes whose actions deter- 
mine the phenotype. Metabolites influence gene expression through interactions with 
DNA-binding proteins that activate or repress transcription. Metabolites serve as 
allosteric effectors, not just for enzymes, but also for hormone receptors that have 
broad influences on a cell’s expression of its metabolic possibilities. A select set of small 
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molecules, such as phosphate and acetyl-CoA, are used in the regulation of protein 
function through post-translational modification. Metabolites determine the possibili- 
ties of epigenetic modifications. In humans, metabolomic changes due to external fac- 
tors—diet, lifestyle, the gut microbiome, age, drugs, and sex—generate consequences 
across all the —omes. In short, metabolomics is key to understanding an organism’s 
biology. Placing metabolomics at the center of information flow to the —-omics not only 
highlights its direct influence on the other —omics, but also emphasizes its importance as 
the master regulator. Small molecules are at the center of life (Figure 17.22). 


Fluxomics Determining the true phenotype of a cell depends not only on knowledge 
of all the metabolites in a cell at any given moment, but also on information about the 
flow of metabolites through its metabolic networks. The quantitative study of metabo- 
lite flow, or flux, is termed fluxomics. The fluxome is defined by all of the metabolic 
fluxes in a metabolic network, and thus represents systems-level information on those 
cellular processes defined by the metabolic network. As such, the fluxome depends on 
the metabolome and the proteome, with the proteome representing those enzymes that 
mediate the reactions comprising the metabolic network. Thus, the fluxome is a dy- 
namic view of the metabolic processes in a cell or organism. 

The direct measurement of intracellular metabolic flux is a daunting challenge. 
Instead, attempts are made to model the fluxome. Such models require detailed knowl- 
edge about the metabolome, the prevailing concentrations of enzymes in the metabolic 
networks, and the kinetic constants for these enzymes under intracellular conditions. 
When such models become predictive, the effects of disease states or pharmacological 
interventions can be more productively addressed. 
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Figure 17.21 A direct and highly sensitive analysis 
of metabolites by nanospray desorption electrospray 
ionization mass spectroscopy (NanoDESI MS). 
(a) Almost any surface, such as a bacterial colony, 
plant leaf, or animal skin, can be analyzed by this 
method. (b) Solvent is delivered by a capillary and 
then pumped from the sample surface and directly 
sprayed into an ESI-tandem mass spectrometer. Ions 
generated from the individual metabolites drawn 
from the surface are fragmented and analyzed by 
tandem MS. (c) Spectral networking: The masses of 
metabolites are collated, based on MS/MS spectral 
similarity, and displayed as nodes within a network. 
Individual nodes represent metabolites that can be 
identified through comparison with the MS profiles 
of entries in metabolomic databases. (Adapted from 
Figure 1 in Traxler, M. F., and Kolter, R., 2012. A 
massively spectacular view of the chemical lives of 
microbes. Proceedings of the National Academy of 
Sciences, USA 109:10128-10129.) 
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Figure 17.22 Small molecules are at the center of 
life. The flux of biological information is centered 
on metabolomics. Metabolic intermediates drive all 
the other —omics. In humans, external factors such 
as diet, lifestyle, the gut microbiome, age, drug, and 
sex directly affect the metabolome, and through 

it, all the other —-omes. (Adapted from Figure | in 
Costa dos Santos, Jr., G. C., Renovato-Martins, M., 
and Mesquita de Brito, N., 2021. The remodel of 
the “central dogma”: a metabolomics interaction 
perspective. Metabolomics 17:48, DOI: 10.1007/ 
s11306-021-01800-8 


Gut-microbiome 


METABOLOMICS 


Environment 


What Food Substances Form the Basis 
of Human Nutrition? 


17.6 


The use of foods by organisms is termed nutrition. The ability of an organism to use a 
particular food material depends upon its chemical composition and digestibility. In 
addition to indigestible fiber, food includes the macronutrients—protein, carbohy- 
drate, and lipid—and the micronutrients—including vitamins and minerals. 


17.6a Humans Require Protein 


Humans must consume protein to make new proteins. Dietary protein is a rich source 
of nitrogen, and certain amino acids—the so-called essential amino acids—cannot be 
synthesized by humans (and various animals) and can be obtained only in the diet. 
The average adult in the United States consumes far more protein than required for 
synthesis of essential proteins. Excess dietary protein is then merely a source of meta- 
bolic energy. Some of the amino acids (termed glucogenic) can be converted into glu- 
cose, whereas others, the ketogenic amino acids, can be converted to fatty acids and/ 
or keto acids. If fat and carbohydrate are already adequate for the energy needs of the 
individual, then both kinds of amino acids will be converted to triacylglycerol and 
stored in adipose tissue. 

An individual’s protein undergoes a constant process of degradation and resynthesis, 
a cycle known as protein turnover. Together with dietary protein, this recycled protein 
material participates in a nitrogen equilibrium. For most adults, the amount of protein 
synthesis is equaled by the amount of protein degradation, and the dietary amount of 
protein is equaled by the amount of nitrogen excreted in the urine (as urea), in the feces, 
and in sweat. Persons whose nitrogen intake is balanced by their nitrogen excretion are 
in zero nitrogen balance. A positive nitrogen balance occurs whenever there is a net 
increase in the organism’s protein content, such as during periods of growth. Nitrogen 
balance is positive in healthy individuals from birth through adolescence, in pregnant 
women, and in people recovering from illness or protein insufficiency. A negative 
nitrogen balance exists when dietary intake of nitrogen is insufficient to meet the 
demands for new protein synthesis, as in people who are starving or who need to replace 
tissue due to illness or injury. 
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17.6b Carbohydrates Provide Metabolic Energy 


The principal purpose of carbohydrate in the diet is production of metabolic energy. 
Simple sugars are metabolized in the glycolytic pathway. Complex carbohydrates are 
degraded into simple sugars, which then can enter the glycolytic pathway. Carbohy- 
drates are also essential components of nucleotides, nucleic acids, glycoproteins, and 
glycolipids. Human metabolism can adapt to a wide range of dietary carbohydrate 
levels, but the brain depends on glucose for fuel. When dietary carbohydrate con- 
sumption exceeds the energy needs of the individual, excess carbohydrate is converted 
to triacylglycerols and glycogen for long-term energy storage. On the other hand, 
when dietary carbohydrate intake is low, ketone bodies are formed from acetate units 
to provide metabolic fuel for the brain and other organs. 


17.6c Lipids Are Essential, but in Moderation 
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Fatty acids and triacylglycerols can be used as fuel by many tissues in the human body, 
and phospholipids are essential components of all biological membranes. Even though 
the human body can tolerate a wide range of fat intake levels, there are disadvantages 
in either extreme. Excess dietary fat is stored as triacylglycerols in adipose tissue, but 
high levels of dietary fat can also increase the risk of atherosclerosis and heart disease. 
Moreover, high dietary fat levels are also correlated with increased risk for colon, 
breast, and prostate cancers. When dietary fat consumption is low, there is a risk of 
essential fatty acid deficiencies. The human body cannot synthesize linoleic and linole- 
nic acids, so these must be acquired in the diet. In addition, arachidonic acid can be 
synthesized in humans only from linoleic acid, so it too is classified as essential. The 
essential fatty acids are key components of biological membranes, and arachidonic acid 
is the precursor to prostaglandins, which mediate a variety of processes in the body. 


Foundational Biochemistry Things You Should Know after Reading Chapter 17. 


Organisms show marked similarity in their metabolic pathways. 
Living things exhibit metabolic diversity. 


Autotrophs are organisms that can subsist on carbon dioxide as the 
only source of carbon; heterotrophs require an organic form of carbon. 


Phototrophs are photosynthetic organisms; chemotrophs use organic 
or inorganic compounds as a source of energy. 


Prokaryotes show greater metabolic diversity than found in all the 
eukaryotic organisms. 


Oxygen is essential for aerobic organisms. 


The flow of energy in the biosphere is intimately related to the carbon 
and oxygen cycles. 


Metabolic maps portray the principal reactions comprising the inter- 
mediary metabolism of carbohydrates, lipids, amino acids, nucleo- 
tides, and their derivatives. 


Traditional metabolic maps emphasize metabolites, but alternative 
models focused on the enzymes or enzyme complexes can provide 
new insights. 


Multienzyme systems may take several different forms. 


Multifunctional polypeptides are proteins that can catalyze several of 
the reactions in a metabolic pathway. 


Catabolic pathways are characteristically energy yielding, whereas 
anabolic pathways are energy requiring. 


Catabolism involves the oxidative degradation of complex nutrient 
molecules, such as carbohydrates, lipids, and proteins. 


The energy released in catabolic reactions is often captured in ATP. 


During catabolism, energy-rich electrons are transferred to the coen- 
zymes NAD* and NADP*. 


Energy released upon oxidation of NADH is coupled to ATP synthesis. 


The ADP/ATP and NAD*/NADH ratios are key indicators of the 
energy status of cells. 


NADPH is the source of reducing potential to drive reductive biosyn- 
thetic reactions. 


Anabolism is biosynthesis; as such, it is endergonic. 
Anabolism and catabolism are not mutually exclusive. 


The pathways of catabolism converge to a few end products, such as 
lactate, ethanol, CO», H20, and NH,*. 


Anabolic pathways diverge, synthesizing an astounding variety of 
biomolecules from a limited set of metabolic intermediates that serve 
as building blocks. 


Amphibolic intermediates play dual roles, serving roles in both cata- 
bolic and anabolic pathways. 


Corresponding pathways of catabolism and anabolism, such as gly- 
colysis and gluconeogenesis or fatty acid oxidation and fatty acid bio- 
synthesis, differ in important ways. 


Metabolic regulation requires different pathways for oppositely 
directed metabolic sequences. 


Opposing metabolic pathways are reciprocally regulated. 


ATP serves in a cellular energy cycle, being produced in catabolism 
(or photosynthesis) and consumed in biosynthesis, osmotic work, or 
motility. 


NAD* collects electrons released in catabolism. 


The energy released in NADH oxidation is coupled with the forma- 
tion of ATP; the coupled process is called oxidative phosphorylation. 
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e Transformation of the energy released in an oxidation—reduction 
reaction into the chemical energy of a phosphoric anhydride bond is 
a form of energy transduction. 

© Coenzymes provide unique chemistry to metabolic pathways. 

e Water-soluble vitamins are precursors for coenzyme synthesis. 

e Metabolic pathways have been elucidated by a variety of experimen- 
tal approaches. 

e Metabolic inhibitors block pathways at their site of inhibition. 

e Mutations create specific metabolic blocks. 

e Either radioactive isotopes or stable heavy isotopes can be used to 
trace the course of metabolic reactions. 

e NMR spectroscopy is a noninvasive metabolic probe. 

e Metabolic pathways are compartmentalized with cells. 

e Cells and organisms also exhibit temporal compartmentalization of 
their metabolic pathways. 

e The metabolism of many organisms shows circadian rhythmicity. 

e The metabolome is the complete set of metabolites involved in metab- 
olism or produced by a cell or organism under specified physiological 
conditions. 

Problems 


e The metabolome is an excellent predictor of a cell or organism’s 


phenotype. 


e The metabolome is an important aspect of systems biology. 


e NMR spectroscopy and mass spectrometry are the principal tools of 


metabolomic analyses. 


e Fluxomics is the quantitative study of metabolic flux. 


e Small molecules are at the center of life: The flow of biological infor- 


mation is centered on metabolomics. 


e Human nutrition depends on carbohydrates, lipids, and proteins for 


energy and replenishment of metabolic building blocks. 


e Vitamins and minerals are also essential to human nutrition. 


e Proteins undergo a continuous cycle of synthesis and degradation 


termed protein turnover. 


e Most adults excrete as much nitrogen as they consume and thus are in 


zero nitrogen balance. 


e Carbohydrates and triglycerides are important sources of nutritional 


energy. 


e Essential fatty acids cannot be synthesized by humans and must be 
obtained in the diet. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


Global Carbon Dioxide Cycling Expressed as Human Equivalents 
If3 X 10’ kg of CO, are cycled through the biosphere annually, how 
many human equivalents (70 kg people composed of 18% carbon by 
weight) could be produced each year from this amount of CO,? 


. The Different Modes of Carbon and Energy Metabolism Define the 


differences in carbon and energy metabolism between photoautotrophs 
and photoheterotrophs as well as between chemoautotrophs and 
chemoheterotrophs. 


. Metabolic Strategies in Plants Why can plants be considered both 


photoautotrophic and chemoheterotrophic? 


. Where Do the O Atoms in Organisms Come From? Name three 


principal inorganic sources of oxygen atoms that are commonly 
available in the inanimate environment and readily accessible to the 
biosphere. 


. How Do Catabolism and Anabolism Differ? What are the features that 


generally distinguish pathways of catabolism from pathways of 
anabolism? 


. Anabolic vs Catabolic Pathways How do anabolic and catabolic 


pathways differ in terms of the thermodynamic parameters enthalpy 
(H) and entropy (S)? 


. Oxidation States of Carbon A main feature of catabolism is the 


oxidation of carbon atoms in organic molecules from a reduced state 
(CH; or -CH,-) to the fully oxidized state of CO. An atom is 
oxidized when it loses electrons (as in Fe** > Fe**). In its oxidized 
form, carbon is still surrounded by four valence electrons making four 
chemical bonds. How does carbon lose electrons when oxidized from 
—CH; or -CH to CO? (Hint: The electronegativity value of H is 2.1, 
of Cis 2.5, and of O is 3.5.) 


8. How Are the Enzymes of Metabolic Pathways Organized? Name 
the four principal modes of enzyme organization in metabolic 
pathways. 


9. Why Do Anabolic and Catabolic Pathways Differ? Why is the pathway 
for the biosynthesis of a biomolecule at least partially different 
from the pathway for its catabolism? Why is the pathway for the 
biosynthesis of a biomolecule inherently more complex than the 
pathway for its degradation? 


10. What Are Primary Metabolic Roles of the Three Principal 
Nucleotides? (Integrates with Chapters 1 and 3.) What are the 


metabolic roles of ATP, NAD*, and NADPH? 


11. NAD* vs. NADP* The structure shown is NADP*. Compare this to 
the structure of NAD* shown in Figure 17.11. What is the difference 
between these two structures and why is this difference so important? 


(0) 
“a NH, 
SY 
Nt 
o (0) 
| 
O=P 
| No- 
O OH OH NH, 
| NAA 
N 
O=P\ ¢ | 
| OL N 2 
(0) N 
OH OPO,?- 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


12. How Is Metabolism Regulated? (Integrates with Chapter 15.) 
Metabolic regulation is achieved via regulating enzyme activity in 
three prominent ways: allosteric regulation, covalent modification, 
and enzyme synthesis and degradation. Which of these three modes 
of regulation is likely to be the quickest; which the slowest? For each 
of these general enzyme regulatory mechanisms, cite conditions in 
which cells might employ that mode in preference to either of the 
other two. 


13. What Are the Advantages of Metabolic Compartmentalization? What 
are the advantages of compartmentalizing particular metabolic 
pathways within specific organelles? 


14. Why Are Metabolomic Analyses Challenging? Name and discuss four 
challenges associated with metabolomic measurements in biological 
systems. 


15. Which Is Better: NMR or MS? Compare and contrast mass spectrom- 
etry and NMR in terms of their potential advantages and disadvan- 
tages for metabolomic analysis. 


16. How Do Vitamin-Derived Coenzymes Aid Metabolism? What 
chemical functionality is provided to enzyme reactions by pyridoxal 
phosphate (see Chapter 13)? By coenzyme A (see Chapter 19)? By 
vitamin Bj, (see Chapter 23)? By thiamine pyrophosphate (see 
Chapter 19)? 


17. What Are the Features of the Series of -omes? Define the following 


terms: 

a. genome 

b. transcriptome 
c. proteome 

d. metabolome 


e. fluxome 


18. What Is the Alcohol Dehydrogenase Enzyme Mechanism? The alcohol 
dehydrogenase reaction, described in Figure 17.11, interconverts ethyl 
alcohol and acetaldehyde and involves hydride transfer to and from 
NAD* and NADH, respectively. Write a reasonable mechanism for 
the conversion of ethanol to acetaldehyde by alcohol dehydrogenase. 


19. Where Are Various Metabolic Pathways Localized and What Are Their 
Inputs? For each of the following metabolic pathways, describe where 
in the cell it occurs and identify the starting material and end product(s): 
a. citric acid cycle 
b. glycolysis 
c. oxidative phosphorylation 
d. fatty acid synthesis 


20. A Possible Solution to Global Warming Many solutions to the 
problem of global warming have been proposed. One of these involves 
strategies for carbon sequestration—the removal of CO, from the 
earth’s atmosphere by various means. From your reading of this 
chapter, suggest and evaluate a strategy for carbon sequestration in the 
ocean. 


21. Radioactive Isotopes as Metabolic Tracers Both °P and *°S are 
radioactive isotopes that decay to stable isotopes of other elements 
by emission of an electron (B~ decay). Consult the Periodic Table to 
determine the elemental products (daughter isotopes) of °P and of 
35S decay. Given the half-life of 3P as 14.3 days and that of 35S as 
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Problems 


87.4 days, calculate what percentage of each would remain from a 
sample that contained both and decayed for 100 days. 


22. Nitrogen Balance Section 17.6 discusses the importance of the 
nitrogen balance. List classes of biomolecules other than proteins 
that depend on a well-regulated nitrogen balance. 


23. Organisms and Their Source of Energy What three categories of 
organisms are described below? 


a. Lives in an environment where there is no sunlight. Derives 
energy from redox reactions initiated by inorganic compounds. 
Capable of fixing CO). 

b. Lives in an environment where there is plenty of 
sunlight. Derives energy from redox reactions involving 
inorganic compounds but driven by visible light. Capable of 
fixing CO). 

c. Lives in an environment where there may or may not be sunlight. 
Derives energy from redox reactions initiated by organic 
compounds. Capable of fixing CO). 


The Metabolism of Obligate Anaerobes 


24. Obligate Anaerobes Which statement is most likely to be true 
concerning obligate anaerobes? 
a. These organisms can use oxygen if it is present in their environ- 
ment. 


b. These organisms cannot use oxygen as their final electron 
acceptor. 

c. These organisms carry out fermentation for at least 50% of their 
ATP production. 


d. Most of these organisms are vegetative fungi. 


Methods to Study a Cell’s Metabolism 


25. Methods to Assess Metabolism Which of the following experimental 
approaches is likely to give the most complete picture of a particular 
cell’s metabolism: NMR spectroscopy, metabolic inhibitors, mass 
spectroscopy, or isotopic tracers? Explain. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


26. Examine the ActiveModel for alcohol dehydrogenase and identify the 
amino acid side chains involved in binding the NAD coenzyme. 


Think-Pair-Share Question 


Section 17.2 discusses different ways metabolic pathway diagrams can be 
organized (metabolite-or protein-centric). Consider alternative ways to 
organize metabolic pathways similar as shown in Figure 17.4 but 
focusing on the chemical changes that occur within the biomolecules. 
Try to apply any of these alternatives to the pathway shown at the top of 
the next page. Provide pathways for a carbon-centric scheme focused on 
the increase or decrease in the number of carbon-carbon bonds in the 
molecules and an electron-centric scheme focused on changes in the 
oxidation number of carbon within the molecules (revisit Figure 17.10 for 
the oxidation states of carbon). Provide at least one other possible 
metabolic pathway organization. 
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Glycolysis 


Essential Question 


Nearly every living cell carries out a catabolic process known as glycolysis—the stepwise 
degradation of glucose (and other simple sugars). Glycolysis is a paradigm of metabolic 
pathways. Carried out in the cytosol of cells, it is basically an anaerobic process; its principal 
steps occur with no requirement for oxygen. What are the chemical basis and logic for this 
central pathway of metabolism; that is, how does glycolysis work? 


Lo things first appeared in an environment lacking O2, and glycolysis was an 
early and important pathway for extracting energy from nutrient molecules. 
It played a central role in anaerobic metabolic processes during the first 2 billion years 
of biological evolution on earth. Contemporary organisms still employ glycolysis to 
provide precursor molecules for aerobic catabolic pathways (such as the tricarboxylic 
acid cycle) and as a short-term energy source when oxygen is limiting. A complex array 
of molecular signals fine-tune and adapt glycolysis to a variety of metabolic needs. 


18.1 


In the glycolysis pathway (Figure 18.1), which will be explained in detail below, a 
molecule of glucose is converted by 10 enzyme-catalyzed steps into two molecules of 
3-carbon pyruvate. Most of the details of this pathway (the first metabolic pathway 
to be elucidated) were worked out in the first half of the twentieth century by the 
German biochemists Otto Warburg, Gustav Embden, and Otto Fritz Meyerhof. 
In fact, the sequence of reactions in Figure 18.1 is often referred to as the Embden- 
Meyerhof pathway. 

Why is glycolysis so important to organisms? There are several reasons. For some 
tissues (such as brain, kidney medulla, and rapidly contracting skeletal muscles) and 
for some cells (such as erythrocytes and sperm cells), glucose is the only source of 
metabolic energy. In addition, the product of glycolysis—pyruvate—is a versatile 
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Living organisms, like machines, conform to the law 
of conservation of energy, and must pay for all their 
activities in the currency of catabolism. 


Ernest Baldwin 
Dynamic Aspects of Biochemistry (1952) 


<4 Louis Pasteur’s scientific investigations into fermentation 
of grape sugar were pioneering studies of glycolysis. 


Key Questions 


18.1 


18.2 
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18.4 
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18.6 
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What Are the Essential Features 
of Glycolysis? 

Why Are Coupled Reactions 
Important in Glycolysis? 

What Are the Chemical Principles 
and Features of the First Phase 
of Glycolysis? 

What Are the Chemical Principles 
and Features of the Second 
Phase of Glycolysis? 

What Are the Metabolic Fates of 
NADH and Pyruvate Produced 

in Glycolysis? 

How Do Cells Regulate Glycolysis? 
Are Substrates Other Than 
Glucose Used in Glycolysis? 


How Do Cells Respond 
to Hypoxic Stress? 
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Phase 1 


Phosphorylation of glucose and conversion to 2 molecules of 


glyceraldehyde-3-phosphate; 2 ATPs are used to prime these reactions. 


Phase 2 


Conversion of glyceraldehyde-3-phosphate to pyruvate and 
coupled formation of 4 ATP and 2 NADH. 
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Figure 18.1 The glycolytic pathway. The numbered steps are indicated in blue circles. 


metabolite that can be used in several ways. In most tissues, when oxygen is plentiful 
(aerobic conditions), pyruvate is oxidized (with loss of the carboxyl group as CO2), and 
the remaining two-carbon unit becomes the acetyl group of acetyl-coenzyme A (acetyl- 
CoA) (Figure 18.2). This acetyl group is metabolized in the tricarboxylic acid (TCA) 


cycle (and fully oxidized) to yield CO). 


Alternatively, in the absence of oxygen (anaerobic conditions), pyruvate can be 
reduced to lactate through oxidation of NADH to NAD*—a process termed lactic acid 
fermentation. In microorganisms such as brewer’s yeast and in certain plant tissues, 
pyruvate can be reduced to ethanol, again with oxidation of NADH to NAD*. Most 


students will recognize this process as alcoholic fermentation. 
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2 NAD* 2 CoASH 
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2 AC 0 conditions C NADH conditions 
2 NAD* 2 NAD* 
y 
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+ 
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Lactic acid fermentation Alcoholic fermentation 
in contracting muscle in yeast 


Figure 18.2 Pyruvate produced in glycolysis can be used by cells in several ways. In animals, pyruvate 
is normally converted to acetyl-coenzyme A, which is then oxidized in the TCA cycle to produce CO). 
When oxygen is limiting, pyruvate can be converted to lactate. Alcoholic fermentation in yeast 
converts pyruvate to ethanol and CO). 


Glycolysis consists of two phases. In the first phase, a series of five reactions, glu- 
cose is broken down into two molecules of glyceraldehyde-3-phosphate. In the second 
phase, five subsequent reactions convert these two molecules of glyceraldehyde- 
3-phosphate into two molecules of pyruvate. Phase 1 consumes two molecules of ATP 
(Figure 18.1). The later stages of glycolysis result in the production of four molecules 
of ATP. The net is 4 — 2 = 2 molecules of ATP produced per molecule of glucose. 
Microorganisms, plants, and animals (including humans) carry out the 10 reactions 
of glycolysis in more or less similar fashion, although the rates of the individual reac- 
tions and the means by which they are regulated differ from species to species. 


18.2 | Why Are Coupled Reactions 
Important in Glycolysis? 


The process of glycolysis converts some, but not all, of the metabolic energy of the 
glucose molecule into ATP. The free energy change for the conversion of glucose into 
two molecules of lactate (the anaerobic route in contracting muscle) is —183.6 kJ/mol: 


CoH ).05 — 2 H3C—CHOH—COO- + 2 Ht (18.1) 
AG” = —183.6 kJ/mol 


This process occurs with no net oxidation or reduction. Although several individual 
steps in the pathway involve oxidation or reduction, these steps compensate each 
other exactly. Thus, the conversion of a molecule of glucose into two molecules of 
lactate involves a rearrangement of bonds, with no net loss or gain of electrons. The 
energy made available through this rearrangement is a relatively small part of the 
total energy obtainable from glucose (see Figure 1.17). 

The production of two molecules of ATP in glycolysis is an energy-requiring 
process: 


2 ADP + 2 P; — 2 ATP + 2 H,O (18.2) 
AG” = 2 X 30.5 kJ/mol = 61.0 kJ/mol 


Glycolysis couples these two reactions: 


Glucose + 2 ADP + 2 P; — 2 lactate + 2 ATP +2 Ht +2H,O (18.3) 
AG” = —183.6 + 61 = —122.6 kJ/mol 
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Careers in Chemistry 


Figure 18.3 Just as a water pump must be primed 
with water to get more water out, the glycolytic 
pathway is primed with ATP in steps | and 3 in 
order to achieve net production of ATP in the 
second phase of the pathway. 


Neem J. Patel 


Neem J. Patel is a Ph.D. candidate at the University 
of California studying soil recovery after forest 
fires. She completed her undergraduate studies at 
Georgia State University with a B.S. in Biology 
and a minor in Chemistry. She was so intrigued 
with her biochemistry courses that she took more 
than her degree required. During these years, Patel 
directly applied that knowledge to study petroleum- 
contaminated sediments from the DeepWater 
Horizon Spill. 


She later became an assistant research scientist 
at New York University while she earned her M.S. 
with a focus on the meiotic regulation of brewer’s 
yeast. Patel credits the molecular and biochemical 
methods honed through her education for preparing 
her for such rigorous laboratory research. From 
amplicon sequencing to western blotting, Patel’s 
love for biochemistry has given her the tools to 
pursue her passion. 


Thus, under standard-state conditions, (61/183.6) X 100%, or 33%, of the free energy 
released is preserved in the form of ATP in these reactions. However, as discussed in 
Section 3.2, solution conditions, such as pH, concentration, ionic strength, and pres- 
ence of metal ions, can substantially alter the free energy change for such reactions. 
Under cellular conditions, the free energy change for the synthesis of ATP (Equation 
18.2) is much larger, and approximately 50% of the available free energy is converted 
into ATP. Clearly, then, more than enough free energy is available in the conversion 
of glucose into lactate to drive the synthesis of two molecules of ATP. 


18.3 


What Are the Chemical Principles and Features 


of the First Phase of Glycolysis? 


In the first phase of glycolysis, glucose is phosphorylated at C-1 and C-6, and the six- 
carbon skeleton of glucose is cleaved to yield two three-carbon molecules of glyceral- 


©Charles D. Winters 


a-D-Glucose + ATP*~ —-> a-p-glucose-6-phosphate?” + ADP?~ + H+ 


dehyde-3-phosphate. Phosphorylation and cleavage reorganize the glucose molecule 
so that molecules of ATP can be produced in the second phase of glycolysis. 


18.3a Reaction 1: Glucose Is Phosphorylated by Hexokinase 
or Glucokinase—The First Priming Reaction 


The initial reaction of the glycolysis pathway involves phosphorylation of glucose at 
carbon atom 6 by either hexokinase or glucokinase. (Recall that kinases are enzymes 
that transfer the y-phosphate of ATP to nucleophilic acceptors.) Phosphorylation acti- 
vates glucose for the subsequent reactions in the pathway. However, the formation of 
such a phosphoester is thermodynamically unfavorable and requires energy input to 
operate in the forward direction (see Section 3.4). The energy comes from ATP, a 
requirement that at first seems counterproductive. Glycolysis is designed to make ATP, 
not consume it. However, the hexokinase or glucokinase reaction (Figure 18.1, Step 1) is 
one of two priming reactions in the pathway. Just as old-fashioned, hand-operated water 
pumps (Figure 18.3) have to be primed with a small amount of water to deliver more 
water to the thirsty pumper, the glycolysis pathway requires two priming ATP mole- 
cules to start the sequence of reactions and delivers four molecules of ATP in the end. 
The complete reaction for the first step in glycolysis is 


(18.4) 
AG” = -16.7 kJ/mol 


The hydrolysis of ATP makes 30.5 kJ/mol available in this reaction, and the phos- 


phorylation of glucose costs 13.8 kJ/mol (Table 18.1). Thus, the reaction liberates 


16.7 kJ/mol under standard-state conditions (1 M concentrations), and the equilibrium 
of the reaction lies far to the right (K., = 850 at 25°C; see Table 18.1). Under cellular 
conditions (Table 18.2), this first reaction of glycolysis is more favorable than at stan- 


dard state, with a AG of —33.9 kJ/mol (see Table 18.1). 
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Table 18.1 

AG” Keq AG 
Reaction Enzyme (kJ/mol) at 25°C (kJ/mol) 
a-D-Glucose + ATP* == glucose-6-phosphate?~ + ADP?- + H+ Hexokinase —16.7 850 =33.9* 

Glucokinase 
Glucose-6-phosphate?” =— fructose-6-phosphate*~ Phosphoglucoisomerase +1.67 0.51 —2.92 
Fructose-6-phosphate?~ + ATP == Phosphofructokinase —14.2 310 —18.8 
fructose-1,6-bisphosphate*~ + ADP- + H+ 
Fructose-1,6-bisphosphate*~ ——= Fructose bisphosphate +23.9 6.43 X 1075 —0.23 
dihydroxyacetone-P?~ + glyceraldehyde-3-P?— aldolase 
Dihydroxyacetone-P?- —— glyceraldehyde-3-P?— Triose phosphate isomerase +7.56 0.0472 +2.41 
Glyceraldehyde-3-P?~ + P?~ + NAD+ == Glyceraldehyde-3-P +6.30 0.0786 —1.29 
1,3-bisphosphoglycerate + NADH + H* dehydrogenase 

1,3-Bisphosphoglycerate*~ + ADP?~ == 3-P-glycerate?~ + ATP*— Phosphoglycerate kinase —18.9 2060 +0.1 
3-Phosphoglycerate>~ == 2-phosphoglycerate?~ Phosphoglycerate mutase +4.4 0.169 +0.83 
2-Phosphoglycerate*>~ —— phosphoenolpyruvate*>~ + H20 Enolase +1.8 0.483 +1.1 
Phosphoenolpyruvate*~ + ADP?~ + H* == pyruvate” + ATP*- Pyruvate kinase —31.7 3.63 X 10° —23.0 
Pyruvate” + NADH + H* == lactate” + NAD+ Lactate dehydrogenase —25.2 2.63 X 104 —14.8 


*AG values calculated for 310K (37°C) using the data in Table 18.2 for metabolite concentrations in erythrocytes. AG® values are assumed to be the same at 25° and 37°C. 


The Cellular Advantages of Phosphorylating Glucose The incorporation of a phosphate into 
glucose in this energetically favorable reaction is important for several reasons. First, 
phosphorylation keeps the substrate in the cell. Glucose is a neutral molecule and could 
diffuse across the cell membrane, but phosphorylation confers a negative charge on glu- 
cose and the plasma membrane is essentially impermeable to glucose-6-phosphate (Figure 
18.4). Moreover, rapid conversion of glucose to glucose-6-phosphate keeps the intracellular 
concentration of glucose low, favoring facilitated diffusion of glucose into the cell. In addi- Table 18.2 
tion, because regulatory control can be imposed only on reactions not at equilibrium, the 
favorable thermodynamics of this first reaction makes it an important site for regulation. 


Metabolite mM 


The Isozymes of Hexokinase In most animal, plant, and microbial cells, the enzyme 
that phosphorylates glucose is hexokinase. Magnesium ion (Mg”") is required for this ee 
reaction, as for the other kinase enzymes in the glycolytic pathway. The true sub- eee peepee -W083 
strate for the hexokinase reaction is MgATP?-. There are four isozymes of _ Fructose-6-phosphate 0.014 
Fructose-1,6-bisphosphate 0.031 
Dihydroxyacetone phosphate 0.14 
Glyceraldehyde-3-phosphate 0.019 
€ ; 1,3-Bisphosphoglycerate 0.001 
& xlucose £ = -2,3-Bisphosphoglycerate 40 
E — È 3-Phosphoglycerate 0.12 
2-Phosphoglycerate 0.030 
Phosphoenolpyruvate 0.023 
Pyruvate 0.051 
Lactate 2.9 
ATP 1.85 
ADP 0.14 
P; 1.0 
Figure 18.4 Phosphorylation of glucose to glucose-6-phosphate by ATP creates a charged molecule ee A ees 


that cannot easily cross the plasma membrane. Biochemical and Biophysical Research Communications 18:345. 
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Glycolysis continues 


Figure 18.5 Glucose-6-phosphate is the branch point for several metabolic pathways. 


hexokinase in most animal tissues. Hexokinase I is the principal form in the brain. 
Hexokinase in skeletal muscle is a mixture of types I (70% to 75%) and II (25% to 
30%). The Km for glucose is 0.03 mM for type I and 0.3 mM for type II; thus, hexoki- 
nase operates efficiently at normal blood glucose levels of 4 mM or so. The animal 
isozymes are allosterically inhibited by the product, glucose-6-phosphate. High levels 
of glucose-6-phosphate inhibit hexokinase activity until consumption by glycolysis 
lowers its concentration. The hexokinase reaction is one of three points in the gly- 
colysis pathway that are regulated. As the generic name implies, hexokinase can 
phosphorylate a variety of hexose sugars, including glucose, mannose, and fructose. 

The type IV isozyme of hexokinase, called glucokinase, is found predominantly in 
the liver and pancreas. Compared to the other isozymes, type IV requires 5 mM glucose 
to reach 0.5 Vmax and is relatively unaffected by glucose-6-P. Thus, glucokinase becomes 
important metabolically only when liver glucose levels are high (for example, when the 
individual has consumed large amounts of carbohydrates). When glucose levels are low, 
hexokinase is primarily responsible for phosphorylating glucose. However, when glu- 
cose levels are high, glucose is converted by glucokinase to glucose-6-phosphate and is 
eventually stored in the liver as glycogen. Glucokinase is an inducible enzyme—the 
amount present in the liver is controlled by insulin (secreted by the pancreas). (In patients 
with type 2 diabetes, glucokinase expression is dramatically reduced. These individuals 
cannot tolerate high levels of blood glucose and produce little liver glycogen.) Because 
glucose-6-phosphate is common to several metabolic pathways (Figure 18.5), it occu- 
pies a branch point in glucose metabolism. 


Hexokinase Binds Glucose and ATP with an Induced Fit In most organisms, hexokinase 
occurs in a single form: a two-lobed 50 kDa monomer that resembles a clamp, with a 
large groove in one side (Figure 18.6; see also Figure 13.24). As predicted by Daniel 
Koshland (Section 13.6), hexokinase undergoes an induced fit, closing around the 
substrates ATP and glucose when they are bound (Figure 18.6). 

The human hexokinase isozymes I, II, and HI are twice as big as those of lower 
organisms. They are composed of two separate domains, each similar to the yeast 
enzyme, and connected head to tail by a long a-helix (Figure 18.7). The sequence and 
structure similarity apparently arose from the duplication and fusion of a primordial 
hexokinase gene. Interestingly, both halves of hexokinase II support catalysis, but only 
the C-terminal half of isozymes I and III performs phosphorylation of glucose. The 
N-terminal half, on the other hand, has apparently evolved into a form that allosteri- 
cally regulates the activity of the C-terminal half! Type IV hexokinase (glucokinase) is 
similar in structure to the yeast enzyme, with a single clamp domain, a single active site, 
and a mass of 50 kDa (Figure 18.7). 


Figure 18.6 The (a) open and (b) closed states of yeast hexokinase. Binding of glucose (green) induces 
a conformation change that closes the active site, (a: PDB id = 1IG8; b: PDB id = 1BDG). 
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Figure 18.7 Structures of mammalian hexokinase and glucokinase. (a) Mammalian hexokinase I 
(PDB id = 1CZA) contains an N-terminal domain (top) and a C-terminal domain (bottom) 
joined by a long a-helix. Each of these domains is similar in sequence and structure to yeast 
hexokinase. (b) Human glucokinase undergoes an induced fit upon binding glucose (green). 


(left: PDB id = 1V4T; right: PDB id = 1V4S). 


A Deeper Look 


Glucokinase—An Enzyme with Different Roles in Different Cells 


Glucokinase has historically received less attention than its hexo- 
kinase type I-III cousins, even though they share the distinction of 
catalyzing the first step of glycolysis—the phosphorylation of glu- 
cose. Interestingly, however, glucokinase plays quite different roles 
in different cell types. In liver, glucokinase is important for synthesis 
and storage of glycogen in the post-prandial state (following a meal, 
when glucose levels are high). On the other hand, in the B-cells of 
the pancreas, glucokinase is the glucose sensor that recognizes glu- 
cose and initiates a signaling pathway that results in glucose-induced 
insulin secretion. 

When blood glucose increases following a meal, glucose uptake 
by glucose transporters in the B-cell plasma membrane elevates 
the intracellular glucose concentration to millimolar levels. Under these 
conditions, glucokinase is the dominant initiator of glycolysis, since 
hexokinase is inhibited by glucose-6-P (figure). Glucokinase-driven 
glycolysis leads to an increased ATP/ADP ratio, which results in K* 
channel closure and subsequent opening of Ca?™ channels. Increased 
intracellular Ca** triggers secretion of insulin by the B-cells. In this 
way, the action of glucokinase in the B-cells results in glucose uptake 
(and energy gain) by insulin-sensitive organs throughout the body. 


Reference 
Lenzen, S., 2014. A fresh view of glycolysis and glucokinase regulation: 
history and current status. Journal of Biological Chemistry 289:12189-12194. 


Glucose 


Hexokinase | Glucokinase 


Y 
el Glucose-6-P 


B-Cell 


Glucokinase facilitates glucose-induced insulin secretion in B-cells of the 
pancreas. 


18.3b Reaction 2: Phosphoglucoisomerase Catalyzes 
the Isomerization of Glucose-6-Phosphate 


The second step in glycolysis is a common type of metabolic reaction: the isomeriza- 
tion of a sugar. In this particular case, the carbonyl oxygen of glucose-6-phosphate is 
shifted from C-1 to C-2. This amounts to isomerization of an aldose (glucose- 
6-phosphate) to a ketose (fructose-6-phosphate) as shown in Figure 18.8. The reaction 
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Figure 18.8 The phosphoglucoisomerase 
mechanism involves (1) opening of the 
pyranose ring, (2) proton abstraction leading 
to enediol formation, and (3) proton addition 
to the double bond followed by ring closure. 


Phosphoglucoisomerase, with fructose-6-P (blue) 
bound (PDB id = 1HOX). 
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is necessary for two reasons. First, the next step in glycolysis is phosphorylation at 
C-1, and the hemiacetal —OH of glucose would be more difficult to phosphorylate 
than a simple primary hydroxyl. Second, the isomerization to fructose (with a car- 
bonyl group at position 2 in the linear form) activates C-3, facilitating C-C bond 
cleavage in the fourth step of glycolysis. The enzyme responsible for this isomeriza- 
tion is phosphoglucoisomerase, also known as phosphoglucose isomerase and glucose 
phosphate isomerase. In humans, the enzyme requires Mg’* for activity and is highly 
specific for glucose-6-phosphate. The AG” is 1.67 kJ/mol, and the value of AG under 
cellular conditions (Table 18.1) is —2.92 kJ/mol. This small value means that the reac- 
tion operates near equilibrium in the cell and is readily reversible. Phosphoglucoi- 
somerase proceeds through an enediol intermediate, as shown in Figure 18.8. Although 
the predominant forms of glucose-6-phosphate and fructose-6-phosphate in solution 
are the ring forms, the isomerase interconverts the open-chain form of G-6-P with the 
open-chain form of F-6-P. 


18.3c Reaction 3: ATP Drives a Second Phosphorylation 
by Phosphofructokinase—The Second Priming Reaction 


The action of phosphoglucoisomerase, moving the carbonyl group from C-1 to C-2, 
creates a new primary alcohol function at C-1 (see Figure 18.8). The next step in 
the glycolytic pathway is the phosphorylation of this group by phosphofructokinase 
(PKF). Once again, the substrate that provides the phosphoryl group is ATP. Like the 
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hexokinase/glucokinase reaction, the phosphorylation of fructose-6-phosphate is 
a priming reaction and is endergonic: 


Fructose-6-P + P; —> fructose-1,6-bisphosphate (18.5) 
AG” = 16.3 kJ/mol 


When coupled (by phosphofructokinase) with phosphoryl transfer from ATP, the 
overall reaction becomes exergonic: 


Fructose-6-P + ATP —> fructose-1,6-bisphosphate + ADP (18.6) 
AG” = —14.2 kJ/mol 
AG (in erythrocytes) = —18.8 kJ/mol 


At pH 7 and 37°C, the phosphofructokinase reaction equilibrium lies far to the 
right. Just as the hexokinase reaction commits the cell to taking up glucose, the 
phosphofructokinase reaction commits the cell to metabolizing glucose rather than 
converting it to another sugar or storing it. Similarly, just as the large free energy 
change of the hexokinase reaction makes it a likely candidate for regulation, so the 
phosphofructokinase reaction is an important site of regulation—indeed, the most 
important site in the glycolytic pathway. 


Regulation of Phosphofructokinase Phosphofructokinase acts as a valve controlling 
the rate of glycolysis. In addition to its role as a substrate, ATP is an allosteric 
inhibitor of this enzyme. Thus, phosphofructokinase has two distinct binding sites 
for ATP; a high-affinity substrate site and a low-affinity regulatory site. In the pres- 
ence of high ATP concentrations, phosphofructokinase behaves cooperatively; plots 
of enzyme activity versus fructose-6-phosphate are sigmoid, and the K,, for 
fructose-6-phosphate is increased (Figure 18.9). Thus, when ATP levels are suffi- 
ciently high in the cytosol, glycolysis is turned off. Under most cellular conditions, 
however, the ATP concentration does not vary over a large range. The ATP concen- 
tration in muscle during vigorous exercise, for example, is only about 10% lower than 
that during the resting state. The rate of glycolysis, however, varies much more. 
A large range of glycolytic rates cannot be directly accounted for by only a 10% 
change in ATP levels. 

AMP reverses the inhibition due to ATP, and AMP levels in cells can rise dramati- 
cally when ATP levels decrease, due to the action of the enzyme adenylate kinase, which 
catalyzes the reaction 


ADP + ADP == ATP + AMP 


with the equilibrium constant: 


[ATP][AMP] _ 
eq [ADPF 


0.44 (18.7) 


Adenylate kinase rapidly interconverts ADP, ATP, and AMP to maintain this equilib- 
rium. ADP levels in cells are typically 10% of ATP levels, and AMP levels are often less 
than 1% of the ATP concentration. Due to the nature of the adenylate kinase equilib- 
rium, a small decrease in ATP concentration due to ATP consumption results in a 
much larger relative increase in the AMP levels because of adenylate kinase activity. 

Clearly, the activity of phosphofructokinase depends on both ATP and AMP levels 
and is a function of the cellular energy status. Phosphofructokinase activity is increased 
when the energy status falls and is decreased when the energy status is high. The rate of 
glycolytic activity thus decreases when ATP is plentiful and increases when more ATP is 
needed. 

Glycolysis and the citric acid cycle (to be discussed in Chapter 19) are coupled via phos- 
phofructokinase, because citrate, an intermediate in the citric acid cycle, is an allosteric 
inhibitor of phosphofructokinase. When the citric acid cycle reaches saturation, glycolysis 
(which feeds the citric acid cycle under aerobic conditions) slows down. The citric acid 
cycle directs electrons into the electron-transport chain (for the purpose of ATP synthesis 


Phosphofructokinase with ADP (orange) and fructose- 
6-phosphate (red) (PDB id = 4PFK). 
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High [ATP] 


Reaction velocity 


[Fructose-6-phosphate ] 


Figure 18.9 At high [ATP], phosphofructokinase 
(PFK) behaves cooperatively and the plot of enzyme 
activity versus [fructose-6-phosphate] is sigmoid. High 
[ATP] thus inhibits PFK, decreasing the enzyme’s 
affinity for fructose-6-phosphate. 
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Figure 18.10 Fructose-2,6-bisphosphate activates 
phosphofructokinase, increasing the affinity of the 
enzyme for fructose-6-phosphate and restoring the 
hyperbolic dependence of enzyme activity on 
substrate concentration. 
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Figure 18.11 Fructose-2,6-bisphosphate decreases the 
inhibition of phosphofructokinase due to ATP. 
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in oxidative phosphorylation) and also provides precursor molecules for biosynthetic 
pathways. Inhibition of glycolysis by citrate ensures that glucose will not be committed to 
these activities if the citric acid cycle is already saturated. 

Phosphofructokinase is also regulated by B-D-fructose-2,6-bisphosphate, a potent 
allosteric activator that increases the affinity of phosphofructokinase for the substrate 
fructose-6-phosphate (Figure 18.10). Stimulation of phosphofructokinase is also 
achieved by decreasing the inhibitory effects of ATP (Figure 18.11). Fructose- 
2, 6-bisphosphate increases the net flow of glucose through glycolysis by stimulating 
phosphofructokinase and by inhibiting fructose-1,6-bisphosphatase, the enzyme that 
catalyzes this reaction in the opposite direction. 
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18.3d Reaction 4: Cleavage by Fructose Bisphosphate 
Aldolase Creates Two 3-Carbon Intermediates 


Fructose bisphosphate aldolase is a glycolytic enzyme that cleaves fructose-1,6- 
bisphosphate between the C-3 and C-4 carbons to yield two triose phosphates. The 
products are dihydroxyacetone phosphate (DHAP) and glyceraldehyde-3-phosphate. 
The reaction has an equilibrium constant of approximately 1074 M, and a correspond- 
ing AG” of +23.9 kJ/mol. These values might imply that the reaction does not proceed 
effectively from left to right as written. However, the reaction makes two molecules 
(glyceraldehyde-3-P and dihydroxyacetone-P) from one molecule (fructose-1,6- 
bisphosphate), and the equilibrium is thus greatly influenced by concentration. The 
value of AG in erythrocytes is actually —0.23 kJ/mol (see Table 18.1). At physiological 
concentrations, the reaction is essentially at equilibrium. 
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(FBP) phosphate (DHAP) 


AG® = 23.9 kJ/mol 


3-phosphate (G-3-P) 


Two classes of aldolase enzymes are found in nature. Animal tissues produce a Class 
I aldolase, characterized by the formation of a covalent Schiff base intermediate 
between an active-site lysine and the carbonyl group of the substrate. Class I aldolases 
do not require a divalent metal ion. Class II aldolases are produced mainly in bacteria 
and fungi and do not form a covalent E-S intermediate, but they contain an active-site 
metal (normally zinc, Zn?™). Cyanobacteria and some other simple organisms possess 
both classes of aldolase. 

The aldolase reaction is merely the reverse of the aldol condensation well known to 
organic chemists. The latter reaction involves an attack by a nucleophilic enolate anion 
of an aldehyde or ketone on the carbonyl carbon of an aldehyde. The opposite reaction, 
aldol cleavage, begins with removal of a proton from the B-hydroxyl group, which is 
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followed by the elimination of the enolate anion. A mechanism for the aldol cleavage 
reaction of fructose-1,6-bisphosphate in the Class I-type aldolases is shown in 
Figure 18.12a. In Class II aldolases, an active-site metal such as Zn** behaves as an 
electrophile, polarizing the carbonyl group of the substrate and stabilizing the enolate 
intermediate (Figure 18.12b). 


18.3e Reaction 5: Triose Phosphate Isomerase 
Completes the First Phase of Glycolysis 


Of the two products of the aldolase reaction, only glyceraldehyde-3-phosphate goes 
directly into the second phase of glycolysis. The other triose phosphate, dihydroxy- 
acetone phosphate, must be converted to glyceraldehyde-3-phosphate by the enzyme 
triose phosphate isomerase. This reaction thus permits both products of the aldolase reac- 
tion to continue in the glycolytic pathway and in essence makes the C-1, C-2, and C-3 
carbons of the starting glucose molecule equivalent to the C-6, C-5, and C-4 carbons, 
respectively. The reaction mechanism involves an enediol intermediate that can donate 
either of its hydroxyl protons to a basic residue on the enzyme and thereby become either 
dihydroxyacetone phosphate or glyceraldehyde-3-phosphate (Figure 18.13). Triose phos- 
phate isomerase is one of the enzymes that have evolved to a state of catalytic perfection, 
with a turnover number near the diffusion limit (see Table 13.5). 
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Figure 18.12 A mechanism for the fructose-1, 
6-bisphosphate aldolase reaction. (a) The Schiff base 
formed between the substrate carbonyl and an 
active-site lysine acts as an electron sink, increasing 
the acidity of the B-hydroxyl group and facilitating 
cleavage as shown. The catalytic residues in the 
rabbit muscle enzyme are Lys”? and Asp”. (b) In 
Class II aldolases, an active-site Zn** stabilizes the 
enolate intermediate, leading to polarization of the 
substrate carbonyl group. 


Triose phosphate isomerase with substrate analog 
2-phosphoglycerate (cyan; PDB id = 1YPI). 
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Figure 18.13 A reaction mechanism for triose 
phosphate isomerase. In the enzyme from yeast, 
the catalytic residue is Glu'!®. 
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The triose phosphate isomerase reaction completes the first phase of glycolysis, each 
glucose that passes through being converted to two molecules of glyceraldehyde-3- 
phosphate. Although the last two steps of the pathway are energetically unfavorable, the 
overall five-step reaction sequence has a net AG®’ of +2.2 kJ/mol (Keq ~ 0.43). It is the 
free energy of hydrolysis from the two priming molecules of ATP that brings the overall 
equilibrium constant close to 1 under standard-state conditions. The net AG under cel- 
lular conditions is quite negative (— 53.4 kJ/mol in erythrocytes). 


Methylglyoxal and Advanced Glycation End-Products—The Dark Side of Glycolysis 


Sugars can react nonenzymatically and covalently with many pro- 
teins to form advanced glycation end-products (AGEs), altering the 
functions of the proteins. AGEs are associated with multiple pa- 
thologies, including diabetes, aging, and neurodegenerative diseases. 
Recent studies have shown that the dicarbonyl compound methylg- 
/yoxal is an even greater contributor to AGE formation than sugars 
themselves. Methylglyoxal covalently modifies cysteine, arginine, 
and lysine residues at multiple sites on many cellular proteins. 

Methylglyoxal is produced from dihydroxyacetone phosphate 
(DHAP) in glycolysis by both enzymatic and nonenzymatic routes 
and accounts for as much as 0.4% of the total glycolytic flux. Intra- 
cellular concentrations of methylglyoxal are kept in the low-micro- 
molar range via the glyoxalase cycle (figure), which is composed of 
two enzymes, glyoxalase 1 (GLO1) and glyoxalase 2 (GLO2). GLO1 
couples methylglyoxal with the ubiquitous antioxidant peptide 
glutathione to form lactoylglutathione. Hydrolysis of this interme- 
diate by GLO2 yields D-lactate, which is excreted from the body. 
Methylglyoxal is elevated in diabetes, cancer, cardiovascular disease, 
aging, and renal failure. Future insights into the particular sites of 
modification on specific proteins will be central to understanding 
the involvement of methylglyoxal and AGEs in diabetes and other 
diseases. 
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Methylglyoxal is produced from DHAP and covalently 
modifies Cys, Arg, and Lys residues at multiple sites on many 


proteins. 
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18.4 | What Are the Chemical Principles and Features 
of the Second Phase of Glycolysis? 


The second half of the glycolytic pathway involves the reactions that convert the 
metabolic energy in the glucose molecule into ATP. Altogether, four new ATP mole- 
cules are produced. If two are considered to offset the two ATPs consumed in phase 1, 
a net yield of two ATPs per glucose is realized. Phase 2 starts with the oxidation of 
glyceraldehyde-3-phosphate, a reaction with a large enough energy kick to produce the 
high-energy phosphate, 1,3-bisphosphoglycerate (see Figure 18.1, Step 6). Phosphoryl 
transfer from 1,3-BPG to ADP to make ATP is highly favorable. The product, 
3-phosphoglycerate, is converted via several steps to phosphoenolpyruvate (PEP), 
another high-energy phosphate. PEP readily transfers its phosphoryl group to ADP in 
the pyruvate kinase reaction to make another ATP. 


18.4a Reaction 6: Glyceraldehyde-3-Phosphate Dehydrogenase 
Creates a High-Energy Intermediate 


In the first glycolytic reaction to involve oxidation—reduction, glyceraldehyde-3- 
phosphate is oxidized to 1,3-bisphosphoglycerate by glyceraldehyde-3-phosphate 
dehydrogenase. Although the oxidation of an aldehyde to a carboxylic acid is a highly 
exergonic reaction, the overall reaction involves both formation of a carboxylic— 
phosphoric anhydride and the reduction of NAD* to NADH and is therefore slightly 
endergonic at standard state, with a AG®’ of +6.30 kJ/mol. The free energy that might 
otherwise be released as heat in this reaction is directed into the formation of a high- 
energy phosphate compound, 1,3-bisphosphoglycerate, and the reduction of NAD*. 
The reaction mechanism involves nucleophilic attack by a cysteine —SH group on the 
carbonyl carbon of glyceraldehyde-3-phosphate to form a hemithioacetal (Figure 18.14). 
The hemithioacetal intermediate decomposes by hydride (H:>) transfer to NAD* to 
form a high-energy thioester. Nucleophilic attack by phosphate displaces the product, 
1,3-bisphosphoglycerate, from the enzyme. The enzyme can be inactivated by reaction 
with iodoacetate, which reacts with and blocks the essential cysteine sulfhydryl. 
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The glyceraldehyde-3-phosphate dehydrogenase reaction is the site of action of 
arsenate (AsO,3~), a toxic anion analogous to phosphate. Arsenate is an effective sub- 
strate in this reaction, forming /-arseno-3-phosphoglycerate, but acyl arsenates are 
quite unstable and are rapidly hydrolyzed. 1-Arseno-3-phosphoglycerate breaks down 
to yield 3-phosphoglycerate, the product of the seventh reaction of glycolysis. The result 
is that glycolysis continues in the presence of arsenate, but the molecule of ATP formed 
in reaction 7 (phosphoglycerate kinase) is not made because this step has been bypassed. 
The lability of 1-arseno-3-phosphoglycerate effectively uncouples the oxidation and 
phosphorylation events, which are normally tightly coupled in the glyceraldehyde- 
3-phosphate dehydrogenase reaction. 


18.4b Reaction 7: Phosphoglycerate Kinase Is the Break-Even Reaction 


The glycolytic pathway breaks even in terms of ATPs consumed and produced with 
reaction 7. The enzyme phosphoglycerate kinase transfers a phosphoryl group from 
1,3-bisphosphoglycerate to ADP to form an ATP. Because each glucose molecule 
sends two molecules of glyceraldehyde-3-phosphate into the second phase of 
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Figure 18.14 A mechanism for the glyceraldehyde-3- R 
phosphate dehydrogenase reaction. Reaction of an | + 
enzyme sulfhydryl with the carbonyl carbon of ZN 
glyceraldehyde-3-P forms a thiohemiacetal, which H.N | 
loses a hydride to NAD* to become a thioester. 2 AN 
Phosphorolysis of this thioester releases a Oo yH 
1,3-bisphosphoglycerate. In the enzyme from rabbit pam 
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glycolysis and because two ATPs were consumed per glucose in the first-phase reac- 
tions, the phosphoglycerate kinase reaction pays off the ATP debt created by the 
priming reactions. As might be expected for a phosphoryl transfer enzyme, Mg** ion 
is required for activity and the true nucleotide substrate for the reaction is MgADP-. 
It is appropriate to view the sixth and seventh reactions of glycolysis as a coupled 
pair, with 1,3-bisphosphoglycerate as an intermediate. The phosphoglycerate kinase 
reaction is sufficiently exergonic at standard state to pull the G-3-P dehydrogenase 
reaction along. (In fact, the aldolase and triose phosphate isomerase are also pulled 
forward by phosphoglycerate kinase.) The net result of these coupled reactions is 


Glyceraldehyde-3-phosphate + ADP + P; + NAD+ —> 
3-phosphoglycerate + ATP + NADH + H* 
AG” = —12.6 kJ/mol (18.8) 


(b) 


(0) 

X A 2 
C— OPO; COO 
| ae | 

HCOH + ADP = > HCOH + ATP 
| Phosphoglycerate | 
CHORD kinase CH,OPO37— 

1,3-Bisphosphoglycerate 3-Phosphoglycerate 

(1,3-BPG) (3-PG) 


AG” = -18.9 kJ/mol 


Another consequence of the coupling between these reactions is revealed in their val- 
ues of AG under cellular conditions (Table 18.1). Despite its strongly negative A G°’, the 
The operand M) closed forms of phosphoglycerate phosphoglycerate kinase reaction operates at equilibrium in the erythrocyte (AG J 0.1 
kinase. ATP (cyan), 3-phosphoglycerate (purple), and KJ/mol). In essence, the free energy available in the phosphoglycerate kinase reaction is 
Mg?* (gold) (a: PDB id = 3PGK; b: PDB id = 1VPE). used to bring the three previous reactions closer to equilibrium. Viewed in this context, it 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


18.4 What Are the Chemical Principles and Features of the Second Phase of Glycolysis? 617 


is clear that ADP has been phosphorylated to form ATP at the expense of a substrate, 
namely, glyceraldehyde-3-phosphate. This is an example of substrate-level phosphoryla- 
tion, a concept that will be encountered again. (The other kind of phosphorylation, oxida- 
tive phosphorylation, is driven energetically by the transport of electrons from appropriate 
coenzymes and substrates to oxygen.) Even though the coupled reactions exhibit a very 
favorable AG°’, there are conditions (for example, high ATP and 3-phosphoglycerate 
levels) under which the phosphoglycerate kinase reaction can be reversed so that 3-phos- 
phoglycerate is phosphorylated from ATP. 

An important regulatory molecule, 2,3-bisphosphoglycerate, is synthesized and metab- 
olized by a pair of reactions that make a detour around the phosphoglycerate kinase reac- 
tion. 2,3-BPG, which stabilizes the deoxy form of hemoglobin and is primarily responsible 
for the cooperative nature of oxygen binding by hemoglobin (see Section 15.6), is formed 
from 1,3-bisphosphoglycerate by bisphosphoglycerate mutase (Figure 18.15). Interestingly, 
3-phosphoglycerate is required for this reaction, which involves phosphoryl transfer from 
the C-1 position of 1,3-bisphosphoglycerate to the C-2 position of 3-phosphoglycerate 
(Figure 18.16). Hydrolysis of 2,3-BPG is carried out by 2,3-bisphosphoglycerate phospha- 
tase. Although other cells contain only a trace of 2,3-BPG, erythrocytes typically contain 
4 to 5 mM 2,3-BPG. 


18.4c Reaction 8: Phosphoglycerate Mutase 
Catalyzes a Phosphoryl Transfer 


The remaining steps in the glycolytic pathway prepare for synthesis of the second ATP 
equivalent. This begins with the phosphoglycerate mutase reaction, in which the phos- 
phoryl group of 3-phosphoglycerate is moved from C-3 to C-2. (The term mutase is 
applied to enzymes that catalyze migration of a functional group within a substrate 
molecule.) The free energy change for this reaction is very small under cellular condi- 
tions (AG = 0.83 kJ/mol in erythrocytes). Phosphoglycerate mutase enzymes isolated 
from different sources exhibit different reaction mechanisms. As shown in Figure 18.17, 
the enzymes isolated from yeast and from rabbit muscle form phosphoenzyme inter- 
mediates, use 2,3-bisphosphoglycerate as a cofactor, and undergo intermolecular 
phosphoryl group transfers (in which the phosphate of the product 2-phosphoglycerate 
is not that from the 3-phosphoglycerate substrate). The prevalent form of phospho- 
glycerate mutase is a phosphoenzyme, with a phosphoryl group covalently bound to a 
histidine residue at the active site. This phosphoryl group is transferred to the C-2 
position of the substrate to form a transient, enzyme-bound 2,3-bisphosphoglycerate, 
which then decomposes by a second phosphoryl transfer from the C-3 position of the 
intermediate to the histidine residue on the enzyme. About once in every 100 enzyme 
turnovers, the intermediate, 2,3-bisphosphoglycerate, dissociates from the active site, 


COO COO 
| | 
A N s = 
HCOH = nem 
CH,OPO3- CH OH 
3-Phosphoglycerate 2-Phosphoglycerate 
(3-PG) (2-PG) 


Phosphoglycerate mutase 
AG” = +4.4 kJ/mol 


Glu®s 


SO," 


Asn!6 


Gly! 


The catalytic histidine (His'**) at the active site 
of Escherichia coli phosphoglycerate mutase 
(PDB id = 1E58). Note that His!’ is phosphorylated. 


O 
N N N 
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| i | j | 
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Figure 18.15 Formation and decomposition of 2,3-bisphosphoglycerate. 


1 ® 1 1 1 
2 + 2 = 2 + 2 
3 ® 3 3 3 
1,3-BPG 3-PG 3-PG 2,3-BPG 


Figure 18.16 The mutase that forms 2,3-BPG from 
1,3-BPG requires 3-phosphoglycerate. The reaction 
is an intermolecular phosphoryl transfer from C-1 
of 1,3-BPG to C-2 of 3-PG. 
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Figure 18.17 A mechanism for the phosphoglycerate mutase 
reaction in rabbit muscle and in yeast. Zelda Rose of the (0J 
Institute for Cancer Research in Philadelphia showed that the coo O 

enzyme requires a small amount of 2,3-BPG to phosphorylate | | 

the histidine residue before the mechanism can proceed. Prior to © HC—O—P—O™ 

her work, the role of the phosphohistidine in this mechanism 
was not understood. 


leaving an inactive, unphosphorylated enzyme. The unphosphorylated enzyme can 
be reactivated by binding 2,3-BPG. For this reason, maximal activity of phospho- 
glycerate mutase requires the presence of small amounts of 2,3-BPG. 


18.4d Reaction 9: Dehydration by Enolase Creates PEP 


Recall that prior to synthesizing ATP in the phosphoglycerate kinase reaction, it was 
necessary to first make a substrate having a high-energy phosphate. Reaction 9 of 
glycolysis, the enolase reaction, similarly makes a high-energy phosphate in prepara- 
tion for ATP synthesis. Enolase catalyzes the formation of phosphoenolpyruvate from 
2-phosphoglycerate. The reaction involves the removal of a water molecule to form the 
enol structure of PEP. The AG” for this reaction is relatively small at 1.8 kJ/mol 
(Keq = 0.5); and, under cellular conditions, AG is very close to zero. How can the 


COO COO 
| | 
2+ 
HC — O— PO, Mg C—O— PO, + HO 
| or ee 
CH,OH casa CH, 
2-Phosphoglycerate Phosphoenolpyruvate 
(2-PG) (PEP) 


AG” = +1.8 kJ/mol 


enolase reaction transform a substrate with a relatively low free energy of hydrolysis 
into a product (PEP) with a very high free energy of hydrolysis? This is possible because 
2-phosphoglycerate and PEP contain about the same amount of potential metabolic 
energy, with respect to decomposition to P;, CO», and H2O. What the enolase reaction 
does is rearrange the substrate into a form from which more of this potential energy 
can be released upon hydrolysis. The enzyme is strongly inhibited by fluoride ions in 
the presence of phosphate. Thomas Nowak has shown that fluoride, phosphate, and a 
divalent cation form a transition-state—like complex in the enzyme active site, with 
fluoride apparently mimicking the hydroxide ion nucleophile in the enolase reaction. 

Yeast enolase is a dimer of identical subunits and X-ray crystal structures of enzymes 
like this typically show identical active site structures for both subunits. However, if yeast 
enolase is crystallized in the presence of a mixture of the substrate (2-phosphoglycerate) 
and the product (phosphoenolpyruvate), the crystallized dimer is asymmetric! One sub- 
unit active site contains 2-phosphoglycerate, and the other contains PEP (Figure 18.18), 
thus providing a before-and-after picture of this glycolytic enzyme. 


18.4e Reaction 10: Pyruvate Kinase Yields More ATP 


The second ATP-synthesizing reaction of glycolysis is catalyzed by pyruvate kinase, 
which brings the pathway at last to its pyruvate branch point. Pyruvate kinase medi- 
ates the transfer of a phosphoryl group from phosphoenolpyruvate to ADP to make 
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(a) (b) 


Mg (=) Phosphoglycerate 


Li® 


His!?° 


Figure 18.18 The yeast enolase dimer is asymmetric. The active site of one subunit (a) contains 
2-phosphoglycerate, the enolase substrate. Also shown are a Mg?* ion (blue), a Li* ion (purple), and 
His!>’, which participates in catalysis. The other subunit (b) binds phosphoenolpyruvate, the product 
of the enolase reaction. An active-site water molecule (yellow), Mg?* (blue), and His'® are also shown 
(PDB id = 2ONE). 


ATP and pyruvate. The reaction requires Mg?* ion and is stimulated by K* and 
certain other monovalent cations. 


COO- CcOO- 
l Mg? | is 
Umo- ro + mw + ap — eo + = 
Il Kt | 
CH, Pyruvate CH, 

kinase 
PEP Pyruvate 


AG” = -31.7 kJ/mol 


The corresponding Keq at 25°C is 3.63 X 105, and it is clear that the pyruvate kinase 
reaction equilibrium lies very far to the right. Concentration effects reduce the magni- 
tude of the free energy change somewhat in the cellular environment, but the AG in eryth- 
rocytes is still quite favorable at —23.0 kJ/mol. The high free energy change for the 
conversion of PEP to pyruvate is due largely to the highly favorable and spontaneous 
conversion of the enol tautomer of pyruvate to the more stable keto form (Figure 18.19) 
following the phosphoryl group transfer step. 


The structure of the pyruvate kinase tetramer is sensitive to bound ligands. The inactive E. coli enzyme 
in the absence of ligands (left, PDB id = 1E0U). The active form of the yeast dimer, with fructose-1, 
6-bisphosphate (an allosteric regulator, blue), substrate analog (red), and K* (gold) (PDB id = 1A3W). 
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Figure 18.19 The conversion of phosphoenol- 
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pyruvate (PEP) to pyruvate may be viewed as ARA g fet be 
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The large negative A G of this reaction makes pyruvate kinase a suitable target site for 
regulation of glycolysis. For each glucose molecule in the glycolysis pathway, two ATPs 
are made at the pyruvate kinase stage (because two triose molecules were produced per 
glucose in the aldolase reaction). Because the pathway broke even in terms of ATP at 
the phosphoglycerate kinase reaction (two ATPs consumed and two ATPs produced), 
the two ATPs produced by pyruvate kinase represent the payoff of glycolysis—a net 
yield of two ATP molecules. 

Pyruvate kinase possesses allosteric sites for numerous effectors. It is activated by 
AMP and fructose-1,6-bisphosphate and inhibited by ATP, acetyl-CoA, and alanine. 
(Note that alanine is the a-amino acid counterpart of the a-keto acid, pyruvate.) 
Furthermore, liver pyruvate kinase is regulated by covalent modification. Hormones 
such as glucagon activate cAMP-dependent protein kinase, which transfers a phosphoryl 
group from ATP to the enzyme. The phosphorylated form of pyruvate kinase is more 
strongly inhibited by ATP and alanine and has a higher Km for PEP, so in the presence 
of physiological levels of PEP, the enzyme is inactive. Then PEP is used as a substrate 
for glucose synthesis in the gluconeogenesis pathway, instead of going on through gly- 
colysis and the citric acid cycle (or fermentation routes). A suggested active-site 
geometry and mechanism for pyruvate kinase, based on NMR and EPR studies by 


Human Biochemistry 


Pyruvate Kinase M2 Moonlights as a Protein Kinase in Cancer 


purine biosynthesis when nucleotides for DNA and RNA synthesis are 
needed—a hallmark of proliferating tissues such as tumors. 


The mammalian pyruvate kinase gene family is represented by two 
genes that encode four isoforms—L, R, M1, and M2. L and R 
predominate in liver and red blood cells, respectively. M1 is found 
mainly in brain and skeletal muscle, but M2 is the dominant form 
in proliferating cells. New tissues always express M2 first and switch 
to other forms during tissue differentiation. However, numerous 
studies have shown that a shift back to M2 expression occurs during 
tumor formation. 

PK M2 normally exists as a tetramer that exhibits high pyruvate 
kinase activity. However, Zhi-Ren Liu and colleagues have shown 
that PK M2 can dissociate to form dimers that localize to the cell nu- 
cleus. PK M2 dimers have low PK activity, but they can act as protein 


kinases, using PEP as the phosphorylation substrate. This novel 
PK M2 protein kinase activity is essential to cellular prolifera- 
tion—at least a hundred proteins can be phosphorylated by this 
PEP-dependent protein kinase. 

More recently, Young-Sam Lee and colleagues have added a sur- 
prising twist to this story. The protein kinase activity of PK M2 is 
stimulated by N-succinylo-5-aminoimidazole-4-carboxamide-1-ribose- 
5’-phosphate (SAICAR), an intermediate from the purine biosynthetic 
pathway. PK M2, which acts to direct glucose to the pentose phosphate 
pathway for ribose-5-phosphate synthesis (see Critical Developments 
in Biochemistry—The Warburg Effect and Cancer), also acts through 
its protein kinase function to activate the expression of the genes for 


N-succinylo-5-aminoimidazole-4-carboxamide 
ribonucleotide (SAICAR) 
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Figure 18.20 A mechanism for the pyruvate kinase reaction, based on NMR and EPR studies by Albert 
Mildvan and colleagues. Phosphoryl transfer from phosphoenolpyruvate (PEP) to ADP occurs in four 
steps: (1) A water on the Mg?* ion coordinated to ADP is replaced by the phosphoryl group of PEP, (2) 
Mg?* dissociates from the a-P of ADP, (3) the phosphoryl group is transferred, and (4) the enolate of 
pyruvate is protonated. (Adapted from Mildvan, A., 1979. The role of metals in enzyme-catalyzed 
substitutions at each of the phosphorus atoms of ATP. Advances in Enzymology 49:103-126.) 


Albert Mildvan and colleagues, is presented in Figure 18.20. The carbonyl oxygen of 
pyruvate and the y-phosphorus of ATP lie within 0.3 nm of each other at the active site, 
consistent with direct transfer of the phosphoryl group without formation of a phos- 
phoenzyme intermediate. 


18.5 | What Are the Metabolic Fates of NADH 
and Pyruvate Produced in Glycolysis? 


In addition to ATP, the products of glycolysis are NADH and pyruvate. Their processing 
depends upon other cellular pathways. NADH must be recycled to NAD*, lest NAD+ 
become limiting in glycolysis. NADH can be recycled by both aerobic and anaerobic 
paths, either of which results in further metabolism of pyruvate. What a given cell does 
with the pyruvate produced in glycolysis depends in part on the availability of oxygen. 
Under aerobic conditions, pyruvate can be sent into the citric acid cycle (also known as 
the TCA cycle). There, it is oxidized to CO) with the production of additional NADH 
(and FADH)). Under aerobic conditions, the NADH produced in glycolysis and the cit- 
ric acid cycle is reoxidized to NAD* in the mitochondrial electron-transport chain. 


18.5a Anaerobic Metabolism of Pyruvate Leads to Lactate or Ethanol 


Under anaerobic conditions, the pyruvate produced in glycolysis is reduced to etha- 
nol in yeast, or reduced to lactate in microorganisms and animals. These processes 
are examples of fermentation—the production of ATP energy by reaction pathways in 
which organic molecules function as donors and acceptors of electrons. In either case, 
the reduction of pyruvate provides a means of reoxidizing the NADH produced in the 
glyceraldehyde-3-phosphate dehydrogenase reaction of glycolysis (Figure 18.21). 
In yeast, alcoholic fermentation is a two-step process. Pyruvate is decarboxylated 
to acetaldehyde by pyruvate decarboxylase in an essentially irreversible reaction. 
Thiamine pyrophosphate is a required cofactor for this enzyme. The second step, the 
reduction of acetaldehyde to ethanol by NADH, is catalyzed by alcohol dehydrogenase 
(Figure 18.2la). At pH 7, the reaction equilibrium strongly favors ethanol. The end 
products of alcoholic fermentation are thus ethanol and carbon dioxide. Alcoholic 
fermentations are the basis for the brewing of beers and the fermentation of grape 
sugar in wine making. Lactate produced by anaerobic microorganisms during lactic 
acid fermentation is responsible for the taste of sour milk and for the characteristic 
taste and fragrance of sauerkraut, which in reality is fermented cabbage. 
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(a) Alcoholic fermentation 


O 
x 
CHO HPOJ COPO% 
H—C— OH > H—CG—OH 
G3PDH 
CH,OPO2— CH,OPO2- 
p-Glyceraldehyde- 1,3-BPG 
3-phosphate 
v 
NAD* NADH + H* | 
uN 
O 
| 5 
CH,C — COO- = 
C E Pyruvate 2 
CHCH OH < CH,CHO CO, 
Alcohol 
Ethanol dehydrogenase Acetaldehyde 
(b) Lactic acid fermentation o 
N 
CHO HPO} No OPO;- 
| k 
H —C— OH > H—C—OH 
G3PDH | 
CH,OPO;— CHyOPO;- 
p-Glyceraldehyde- 1,3-BPG 
3-phosphate 
v E 
8 5 
NAD* NADH + H* = 8 
aN 2 2 
OH o F : 
| | 7 j 
CH; — C — COO“ < CHC — COOT 
| Lactate t 
H dehydrogenase a 


Lactate 


Figure 18.21 Anaerobic metabolism of pyruvate leads to lactate or ethanol. (a) Pyruvate reduction to 
ethanol in yeast provides a means for regenerating NAD* consumed in the glyceraldehyde-3-P 
dehydrogenase reaction. Photo: Fermentation at a bourbon distillery. A mash of corn and other grains 
is fermented by yeast, producing ethanol and carbon dioxide, which can be seen bubbling to the 
surface. Woodford Reserve Distillery, Versailles, KY. (b) In oxygen-depleted muscle, for example, during 
strenuous exercise (photo), NAD* is regenerated in the lactate dehydrogenase reaction. Hibernating 
fresh-water turtles, trapped beneath ice and lying in mud, become anoxic, and convert glucose reserves 
mainly to lactate. Their shells release minerals to buffer the lactate throughout hibernation (photo). 


18.5b Lactate Accumulates Under Anaerobic 
Conditions in Animal Tissues 


In animal tissues experiencing anaerobic conditions, pyruvate is reduced to lactate. 
Pyruvate reduction occurs in tissues that normally experience minimal access to 
blood flow (for example, the cornea of the eye) and also in rapidly contracting skeletal 
muscle. When skeletal muscles are exercised strenuously, the available tissue oxygen 
is consumed and the pyruvate generated by glycolysis can no longer be oxidized in the 
TCA cycle. Instead, excess pyruvate is reduced to lactate by lactate dehydrogenase 
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The Warburg Effect and Cancer 


Otto Warburg observed in 1924 that rapidly proliferating cancer 
cells metabolize glucose in a manner distinct from that of quiescent 
(non-dividing) tissues. Warburg found that cancer cells tend to con- 
vert glucose into lactate, even in the presence of oxygen sufficient to 
support mitochondrial oxidative phosphorylation. (Because this oc- 
curs even in an abundance of oxygen, it is referred to as aerobic gly- 
colysis.) This is at first surprising, because conversion of glucose to 
lactate produces only 2 molecules of ATP per molecule of glucose, 
whereas complete metabolism of glucose in oxidative phosphoryla- 
tion produces more than 30 molecules of ATP. Why do proliferating 
cells switch to an apparently less efficient metabolism? 

Lewis Cantley and co-workers have suggested that proliferating 
cells need more than just ATP—they must synthesize large amounts 
of nucleotides, amino acids, and lipids. For these purposes, cells re- 
quire carbon for biomass and NADPH (reducing equivalents for 
anabolic metabolism). These needs cannot be met merely by con- 
verting glucose to CO, and ATP via oxidative phosphorylation. 
Some glucose must be diverted to macromolecular precursors such 
as glycolytic intermediates for nonessential amino acids, ribose for 
nucleotides, and acetate units for fatty acids. C-13 NMR measure- 
ments have shown in fact that glioblastoma (a type of cancer) cells 
convert as much as 90% of acquired glucose and 60% of acquired 
glutamine into lactate or alanine, producing large amounts of 
NADPH. The lactate and alanine produced in this way can be re- 
cycled by nearby specialized nonproliferating cells, and also by the 
liver. 

Numerous signaling proteins and pathways regulate the glycolytic 
pathway. Phosphoinositide 3-kinase (PI3K) activates glucose trans- 
porter expression, enhancing glucose capture by hexokinase. PI3K 
also stimulates PFK. In contrast to quiescent cells, proliferating cells 
selectively express the M2 (embryonic) isoform of pyruvate kinase 
(PK M2), which is regulated by tyrosine-phosphorylated proteins. The 
activity of PK M2 is reduced by binding phosphotyrosine-containing 


AP Images 


Otto Warburg received the Nobel Prize in Physiology or Medicine in 1931 
with the citation “for his discovery of the nature and mode of action of the 
respiratory enzyme.” Warburg’s “respiratory enzyme” is cytochrome oxidase, 
the last enzyme in the electron transport pathway. Warburg, who studied 
under the great Emil Fischer, mentored three future recipients of the Nobel 


Prize, including Hans Krebs, discoverer of the tricarboxylic acid cycle. 
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proteins, allowing PK M2 to act as a gatekeeper to control the flow of 
carbon into biosynthetic pathways versus catabolism for ATP produc- 
tion. p53, the protein encoded by a prominent tumor suppressor gene, 
is implicated in the regulation of cellular responses to stress. The p53 
gene is also mutated in most cancer cells. TIGAR, a p53-activated 
protein, is a fructose bisphosphatase. TIGAR inhibits PFK by lower- 
ing fructose-2,6,bisphosphate levels, thereby redirecting glucose to the 
pentose phosphate pathway. The pentose phosphate pathway pro- 
duces NADPH and a variety of sugars essential to anabolic pathways. 
p53 has additional effects on glycolysis and related pathways. For 
example, p53 inhibits the expression of glucose transporter proteins 
(which bring glucose into the cell) and also regulates expression of 
hexokinase and phosphoglycerate mutase. These effects of p53 occur 
in the nucleus, but cytosolic p53 directly inhibits glucose-6-P dehydro- 
genase (which inhibits the shunting of glucose-6-P into the pentose 
phosphate pathway, forestalling NADPH production for fatty acid 
synthesis and ribose-5-P formation for nucleotide and nucleic acid 
biosynthesis). 

The emerging evidence highlights the role of glycolysis as a 
dynamically regulated system that is programmed to meet the spe- 
cific needs of quiescent differentiated tissues or fit the requirements 
of rapidly proliferating cells. 
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Figure 18.22 A comparison of free energy changes 
for the reactions of glycolysis (step 1 = hexokinase) 
under (a) standard-state conditions and (b) actual 
intracellular conditions in erythrocytes. The values 
of AG” provide little insight into the actual free 
energy changes that occur in glycolysis. On the other 
hand, under intracellular conditions, seven of the 
glycolytic reactions operate near equilibrium (with 
AG near zero). The driving force for glycolysis lies in 
the hexokinase (1), phosphofructokinase (3), and 
pyruvate kinase (10) reactions. The lactate 
dehydrogenase (step 11) reaction also exhibits a 
large negative AG under cellular conditions. 


(Figure 18.21b). The rate of anaerobic glycolysis in skeletal muscle can increase up to 
2000-fold almost instantaneously, for example, to support the intense demands of a 
sprinting animal. Large amounts of ATP are generated rapidly, at the expense of lac- 
tate accumulation. In anaerobic muscle tissue, lactate represents the end of glycolysis. 
Most of this lactate must be carried out of the muscle by the blood and transported to 
the liver, where it can be resynthesized into glucose in gluconeogenesis. Moreover, 
because glycolysis generates only a fraction of the total energy available from the 
breakdown of glucose (the rest is generated by the TCA cycle and oxidative phos- 
phorylation), the onset of anaerobic conditions in skeletal muscle also means a reduc- 
tion in the energy available from the breakdown of glucose. 

Freshwater turtles (Figure 18.21b) use a remarkable adaptation of their form to sur- 
vive the harsh conditions of winter, under frozen lakes for months, without breathing. 
Turtles in such conditions become anoxic (subsisting without oxygen). Their heart beats 
only once every ten minutes, and brain activity is reduced to almost nothing. They 
metabolize only a limited store of glycogen, rather than fat (which requires oxygen to be 
metabolized). The glycogen is converted to glucose, and then to lactate via glycolysis. 
The turtle’s shell releases mineral reserves to buffer the otherwise fatal buildup of lactate 
in the turtle’s blood, and the shell also traps some of the lactate, keeping it out of the 
bloodstream. Without the biochemical adaptations provided by its shell, the turtle 
could not survive the winter. 


18.6 | How Do Cells Regulate Glycolysis? 


The elegance of nature’s design for the glycolytic pathway may be appreciated through an 
examination of Figure 18.22. The standard-state free energy changes for the 10 reactions 
of glycolysis and the lactate dehydrogenase reaction (Figure 18.22a) are variously positive 
and negative and, taken together, offer little insight into the coupling that occurs in the 
cellular milieu. On the other hand, the values of AG under cellular conditions 
(Figure 18.22b) fall into two distinct classes. For reactions 2 and 4 through 9, AG is very 
close to zero, meaning these reactions operate essentially at equilibrium. Small changes in 
the concentrations of reactants and products could push any of these reactions either for- 
ward or backward. By contrast, the hexokinase, phosphofructokinase, and pyruvate 
kinase reactions all exhibit large negative AG values under cellular conditions. These reac- 
tions are thus the sites of glycolytic regulation. When these three enzymes are active, gly- 
colysis proceeds and glucose is readily metabolized to pyruvate or lactate. Inhibition 
of the three key enzymes by allosteric effectors brings glycolysis to a halt. In 
gluconeogenesis—the biosynthesis of glucose—different enzymes are used to carry out 
reactions 1, 3, and 10 in reverse, effecting the net synthesis of glucose. The maintenance of 
reactions 2 and 4 through 9 at or near equilibrium permits these reactions (and their 
respective enzymes!) to operate effectively in either the forward or reverse direction. 


18.7 | Are Substrates Other Than Glucose 
Used in Glycolysis? 


The glycolytic pathway described in this chapter begins with the breakdown of glucose, 
but other sugars, both simple and complex, can enter the cycle if they can be converted 
by appropriate enzymes to one of the intermediates of glycolysis. Figure 18.23 shows 
the routes by which several simple metabolites can enter the glycolytic pathway. 


18.7a Fructose Catabolism in Liver Is Unregulated— 
and Potentially Harmful 
Dietary fructose is degraded almost exclusively in the liver in an unregulated process 


that circumvents the PFK reaction. In the liver, fructose is phosphorylated at C-1 by 
the enzyme fructokinase: 


D-Fructose + ATP*~ —> p-fructose-1-phosphate?~ + ADP?~ + H+ (18.9) 
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(a) (b) 
Glucose 
UDP-Glucose 
Mannose Glucose-1-P 
Fructose Fracrokinase Fructose-1-P | 
Mannose-6-P — 
ATP ADP Aldolase B 
Dihydroxy- 
acetone Glyceraldehyde 
phosphate ATP J 
Triose kinase 
Triose ADP 
phosphate Glyceraldehyde-3-P 
isomerase f 
pes Figure 18.23 The routes by which fructose, 
Pyruvate mannose, and galactose enter glycolysis. 
(a) Fructose metabolism in the liver is 
unregulated. By contrast, glucose 
breakdown in glycolysis is highly regulated, 
Acetyl-CoA particularly by the phosphofructokinase 
reaction. Aldolase B, a class I aldolase 
unique to the liver, produces glyceraldehyde 
Fatty acids and (not glyceraldehyde-3-P), which must be 
triglycerides phosphorylated by triose kinase to continue 
through the balance of the glycolytic 
pathway. (b) Mannose and galactose enter 
the glycolytic pathway via hexokinase 


2 Pyruvate 


Subsequent action by fructose-1-phosphate aldolase cleaves fructose-1-P to produce 
dihydroxyacetone phosphate and pD-glyceraldehyde: 


D-Fructose-1-P?~ —> p-glyceraldehyde + dihydroxyacetone phosphate?” (18.10) 


Dihydroxyacetone phosphate is of course an intermediate in glycolysis. D-Glyceraldehyde 
can be phosphorylated by triose kinase in the presence of ATP to form D-glyceraldehyde- 
3-phosphate, another glycolytic intermediate. 

Most of the dihydroxyacetone phosphate and glyceraldehyde-3-P produced from 
fructose is converted directly to pyruvate and acetyl-CoA, and then to fatty acids and 
triglycerides in an unregulated process that can result in accumulation of fat in the liver 
(Figure 18.23a). Thus, diets high in fructose can lead to fatty liver disease and eventu- 
ally cirrhosis. Liver fats can also be released into the circulation, with resulting accumu- 
lation by fat cells in other tissues. American dietary consumption of fructose has 
increased from 15 g/day in 1900 to approximately 90 g/day in modern times, a conse- 
quence of the prevalence of sucrose and high fructose corn syrup in beverages and 
processed foods. Diets high in fructose promote insulin resistance, metabolic syndrome, 
and obesity. 

The small amounts of dietary fructose that reach the kidneys and muscle can be phos- 
phorylated by hexokinase, which, as pointed out previously, can utilize several different 
hexose substrates. The free energy of hydrolysis of ATP drives the reaction forward: 


D-Fructose + ATP*~ —> p-fructose-6-phosphate?~ + ADP?~ + H+ (18.11) 


Fructose-6-phosphate generated in this way enters the glycolytic pathway directly in 
step 3, the second priming reaction. 


ahead of the phosphofructokinase reaction. 
Thus their catabolism is subject to the same 
regulation as glucose. 
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18.7b Mannose Enters Glycolysis in Two Steps 


Another simple sugar that enters glycolysis at the same point as fructose is mannose, 
which occurs in many glycoproteins, glycolipids, and polysaccharides (see Section 7.5). 
Mannose is also phosphorylated from ATP by hexokinase, and the mannose-6-phosphate 
thus produced is converted to fructose-6-phosphate by phosphomannoisomerase. 


pD-Mannose + ATP*~ —> D-mannose-6-phosphate?~ + ADP*~ + H+ (18.12) 


D-Mannose-6-phosphate*~ ——> D-fructose-6-phosphate?— (18.13) 


18.7c Galactose Enters Glycolysis via the Leloir Pathway 


A somewhat more complicated route into glycolysis is followed by galactose, 
another simple hexose sugar. The process, called the Leloir pathway after Luis 
Leloir, its discoverer, begins with phosphorylation from ATP at the C-1 position by 
galactokinase: 


Human Biochemistry 


Tumor Diagnosis Using Positron Emission Tomography (PET) 


More than 90 years ago, Otto Warburg at the Kaiser Wilhelm 
Institute of Biology in Germany demonstrated that most animal 
and human tumors displayed a very high rate of glycolysis com- 
pared to that of normal tissue (see also the Critical Developments 
in Biochemistry box in Section 18.6). This observation from long 
ago is the basis of a modern diagnostic method for tumor detection 
called positron emission tomography, or PET. PET uses molecular 


(a) 
CH,OH 
(0) 
OH HOH 
HO 
18Ẹ 
2-[!8F]Fluoro-2-deoxyglucose 
(b) 


511 kev Photon 


Emitted 
positron 


Electron in tissue 


511 kev Photon 


Positrons emitted by !ŝF are annihilated by collision with electrons in tissue. 


probes that contain a neutron-deficient, radioactive element such 
as carbon-11 or fluorine-18. An example is 2-['*F]fluoro-2-deoxy- 
glucose (FDG), a molecular mimic of glucose. The '8F nucleus is 
unstable and spontaneously decays by emission of a positron (an 
antimatter* particle) from a proton, thus converting a proton to 
a neutron and transforming the '8F to '8O. The emitted positron 
typically travels a short distance (less than a millimeter) and collides 
with an electron, annihilating both particles and creating a pair of 
high-energy photons—gamma rays. Detection of the gamma rays 
with special cameras can be used to construct three-dimensional 
models of the location of the radiolabeled molecular probe in the 
tissue of interest. 

FDG is taken up by human cells and converted by hexoki- 
nase to 2-['8F]fluoro-2-deoxy-glucose-6-phosphate in the first step of 
glycolysis. Cells of a human brain, for example, accumulate FDG in 
direct proportion to the amount of glycolysis occurring in those cells. 
Tumors can be identified in PET scans as sites of unusually high FDG 
accumulation. 


(c) PET image of human brain following administration of 
'SFDG. Red area indicates a large malignant tumor. 
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Science Source 


*The existence of antimatter in the form of positrons was first postulated by Robert Oppenheimer, the father of the atomic bomb. 
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D-Galactose + ATP*~ ——> p-galactose-1-phosphate*~ + ADP? + H* (18.14) 


Galactose-l-phosphate is then converted into UDP-galactose (a sugar nucleotide) 
by galactose-1-phosphate uridylyltransferase (Figure 18.24), with concurrent produc- 
tion of glucose-l-phosphate and consumption of a molecule of UDP-glucose. The 
uridylyltransferase reaction (Figure 18.25) proceeds via a ping-pong mechanism 
(see Section 13.5) with a covalent enzyme-UMP intermediate. The glucose-l- 
phosphate produced by the transferase reaction is a substrate for the phosphogluco- 
mutase reaction (Figure 18.24), which produces glucose-6-phosphate, a glycolytic 
substrate. The other transferase product, UDP-galactose, is converted to UDP- 
glucose by UDP-glucose-4-epimerase. The combined action of the uridylyltrans- 
ferase and epimerase thus produces glucose-1-P from galactose-1-P, with regeneration 
of UDP-glucose. 

A rare hereditary condition known as galactosemia involves defects in galactose-1-P 
uridylyltransferase that render the enzyme inactive. Toxic levels of galactose accumulate 
in afflicted individuals, causing cataracts and permanent neurological disorders. These 
problems can be prevented by removing galactose and lactose from the diet. In adults, 
the toxicity of galactose appears to be less severe, due in part to the metabolism of 
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Figure 18.24 Galactose metabolism via the Leloir pathway. 
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Figure 18.26 The UDP-glucose pyrophosphorylase (0) 

reaction utilizes galactose-1-P as a substrate, as well 

as glucose-1-P. CH,OH HN | 
Il 
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galactose-1-P by UDP-glucose pyrophosphorylase, which apparently can accept galactose- 
1-P in place of glucose-1-P (Figure 18.26). The levels of this enzyme may increase in 
galactosemic individuals to accommodate the metabolism of galactose. 


18.7d Glycerol Can Also Enter Glycolysis 


Glycerol is the last important simple substance whose ability to enter the glycolytic 
pathway must be considered. This metabolite, which is produced in substantial amounts 
by the decomposition of triacylglycerols, can be converted to glycerol-3-phosphate by 
the action of glycerol kinase and then oxidized to dihydroxyacetone phosphate by the 
action of glycerol phosphate dehydrogenase, with NAD* as the required coenzyme. The 
dihydroxyacetone phosphate thereby produced enters the glycolytic pathway as a 
substrate for triose phosphate isomerase. 


CH,OH CHOH 
Glycerol kinase | Mg?* | 
yoe HOCH + ADP = ——> HocH + ADP 
reaction | | 
CHOH CH,OPO3- 
Glycerol sn-Glycerol-3-phosphate 
CHOH CHOH 
| 
HOCH + Nab’ ——> c=0 + NAD + [BR ClycerolP dehydrogenase 
reaction 
CH,OPO3- CH,OPO3- 
sn-Glycerol-3-phosphate Dihydroxyacetone 
phosphate 


18.8 | How Do Cells Respond to Hypoxic Stress? 


Glycolysis is an anaerobic pathway—it does not require oxygen. On the other hand, 
operation of the TCA cycle depends on oxygen, so it is aerobic. When oxygen is 
abundant, cells prefer aerobic metabolism, which yields more energy per glucose 
consumed. However, as Louis Pasteur first showed, when oxygen is limited, cells 
adapt to make the most of glycolysis, the less energetic, anaerobic alternative. In 
mammalian tissues, hypoxia (oxygen limitation) can cause changes in gene expression 
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Lactose and Lactose Intolerance: From Mother’s Milk to Yogurt 


Lactose is an interesting sugar in many ways. In placental mammals, 
it is synthesized only in the mammary gland, and then only during 
late pregnancy and lactation. The synthesis is carried out by lactose 
synthase, a dimeric complex of two proteins: galactosyl transferase 
and a-lactalbumin. Galactosyl transferase is present in all human 
cells, and it is normally involved in incorporation of galactose into 
glycoproteins. In late pregnancy, the pituitary gland in the brain 
releases a protein hormone, prolactin, which triggers production 
of a-lactalbumin by certain cells in the breast. a-Lactalbumin, a 
123-residue protein, associates with galactosyl transferase to form 
lactose synthase, which catalyzes the reaction: 


UDP-galactose + glucose —> lactose + UDP 


Lactose breakdown by lactase in the small intestine provides 
newborn mammals with essential galactose for many purposes, 
including the synthesis of gangliosides in the developing brain. 
Lactase is a B-galactosidase that cleaves lactose to yield galactose 


Breakdown of lactose to galactose and glucose by lactase. 


Portions adapted from Hill, R., and Brew, K., 1975. Lactose synthetase. 
Advances in Enzymology 43:411—-485; and Bloch, K., Blondes in Venetian 
Paintings, the Nine-Banded Armadillo, and Other Essays in Biochemistry. 
New Haven, CT: Yale University Press, 1994. 


and glucose—in fact, the only human enzyme that can cleave a 
B-glycosidic linkage: 

Lactase is an inducible enzyme in mammals, and it appears in 
the fetus during the late stages of gestation. Lactase activity peaks 
shortly after birth, but by the age of 3 to 5 years, it declines to a 
low level in nearly all children. Low levels of lactase make many 
adults lactose intolerant. Lactose intolerance occurs commonly in 
most parts of the world (with the notable exception of some parts 
of Africa and northern Europe; see table). The symptoms of lac- 
tose intolerance, including diarrhea and general discomfort, can be 
relieved by eliminating milk from the diet. Alternatively, products 
containing B-galactosidase are available commercially. 

Certain bacteria, including several species of Lactobacillus, thrive 
on the lactose in milk and carry out lactic acid fermentation, con- 
verting lactose to lactate via glycolysis. This is the basis of produc- 
tion of yogurt, which is now popular in the Western world but 
of Turkish origin. Other cultures also produce yogurtlike foods. 
Nomadic Tatars in Siberia and Mongolia used camel milk to make 
koumiss, which was used for medicinal purposes. In the Caucasus, 
kefir is made much like yogurt, except that the starter culture con- 
tains (in addition to Lactobacillus) Streptococcus lactis and yeast, 
which convert some of the glucose to ethanol and CO;, producing 
an effervescent and slightly intoxicating brew. 


Percentage of Population with Lactase Persistence 


Country Lactase Persistence (%) 
Sweden 99 
Denmark 97 
United Kingdom (Scotland) 95 
Germany 88 
Australia 82 
United States (Iowa) 81 
Spain 72 
Jordan 21 
Greece 12 
Japan 10 
Native Americans 5 


Adapted from Bloch, K., Blondes in Venetian Paintings, the Nine-Banded Armadillo, and 
Other Essays in Biochemistry. New Haven, CT: Yale University Press, 1994. 


that result in increased angiogenesis (the growth of new blood vessels), increased syn- 
thesis of red blood cells, and increased levels of some glycolytic enzymes (and thus a 


higher rate of glycolysis). 


What is the molecular basis for the increased expression of glycolytic enzymes? 
One of the triggers for this expression is a DNA-binding protein called hypoxia induc- 
ible factor (HIF). HIF is induced by inflammatory and oxidative stresses, and is also 
induced in tumors. HIF is a heterodimer of a constitutive, stable nuclear B subunit 
(HIF-1) and an inducible, unstable a-subunit. Both subunits are basic helix-loop- 
helix transcription factors that bind to hypoxia-inducible genes, and both subunits 
exist as a series of isoforms (for example, HIF-la, HIF-2a, and HIF-3a). HIF-a 
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subunit regulation is a multistep process that includes gene splicing, phosphorylation, 
acetylation, and hydroxylation. HIF-la is the best-studied HIF-a@ isoform. Under 
normal oxygen levels, the a-subunits are continually synthesized but quickly degraded. 
When oxygen is plentiful, HIF-la is hydroxylated by oxygen-dependent prolyl hydrox- 
ylases (PHDs) at Pro*” and Pro. These hydroxylations ensure its binding to ubiqui- 
tin E3 ligase, which leads to rapid proteolysis by the 26S proteasome (see Chapter 31). 
HIF-la binding to the ligase is also promoted by acetylation of Lys5? by the ARD1 
acetyltransferase. In addition, the presence of oxygen induces the hydroxylation of 
HIF-1la Asn’ by the hydroxylase factor—inhibiting HIF (FIH-1). Figure 18.27 shows 
the structure of FIH bound to a fragment of HIF-la. 

Because PHDs and FIH-1 both are oxygen-dependent, lowering oxygen concentra- 
tion means that HIF-la@ avoids degradation and is available to promote gene transcrip- 
tion (Figure 18.28). Phosphorylation of HIF-la by a protein kinase promotes binding 
of HIF-la to HIF-18, which enhances transcription. HIF-la—HIF-18 dimers bind to 
hypoxia responsive elements (HREs), activating transcription of HRE-regulated genes, 
including hexokinase and lactate dehydrogenase, and the genes for glucose and lactate 
transporters (Figure 18.28). Pasteur observed more than 100 years ago that fermenta- 
tion amounted to “life without air.” The Pasteur effect depends on HIF-mediated acti- 
vation of the genes encoding glycolytic enzymes in the absence of oxygen. The linking 
of glycolytic activity to oxygen level is the result of an exquisite dance of oxygen- 
sensitive enzymes with proteins, which undergo covalent modifications that control 
protein-protein and protein-DNA interactions, a dance that Pasteur could hardly have 
anticipated. 


Figure 18.27 FIH (green) bound to HIF. 
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Figure 18.28 The HIF transcription factor is composed of two subunits: a ubiquitous HIF-18 
subunit and a hypoxia-responsive HIF-la@ subunit. In response to hypoxia, inactivation of the PHDs 
allows HIF-1la stabilization, dimerization with HIF-18, binding of the dimer to the hypoxia 
responsive element (HRE) of HIF target genes, and activation of the transcription of these genes. 
VHL is the von Hippel Lindau subunit of the ubiquitin E3 ligase that targets proteins for proteasome 
degradation. 
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Problems 


Foundational Biochemistry Things You Should Know after Reading Chapter 18. 


e Glycolysis is an anaerobic pathway of 10 reactions that converts ® 
glucose to pyruvate. 


e Glucose processed through glycolysis is normally the sole source of © 
metabolic energy for brain, kidney medulla, and rapidly contracting 
skeletal muscles. 


e Four of the reactions of glycolysis involve coupling to ATP. e 


e Inthe first phase of glycolysis, glucose is phosphorylated at C-1 and 
C-6 and the six-carbon skeleton is cleaved to yield two, three-carbon ®© 
molecules of glyceraldehyde-3-phosphate. 


e  Hexokinase and phosphofructokinase are priming reactions that con- ® 
sume ATP to make more later in the pathway. 


e Hexokinase is not the most important site of regulation in glycolysis 
because its product, glucose-6-phosphate, is a metabolic branch point. 


e The phosphoglucoisomerase reaction plays two roles in glycolysis: it 


e 
converts C-1 to a primary hydroxyl, which is easier to phosphorylate 
in the next reaction, and it activates C-3, facilitating C-C bond cleav- 
age in the fourth reaction of glycolysis. hd 
e The mechanisms of phosphoglucoisomerase and triose phosphate 
isomerase both involve similar enediol intermediates. 
e Phosphofructokinase is the committed step in glycolysis; it is alloster- — e 
ically activated by fructose-2,6-bisphosphate and AMP and inhibited 
by ATP and citrate. 
e 
e Fructose bisphosphate aldolase cleaves fructose-1,6-bisphosphate 
to yield two triose phosphates, dihydroxyacetone phosphate and 
glyceraldehyde-3-phosphate. bd 


Glyceraldehyde-3-phosphate dehydrogenase converts glyceraldehyde- 
3-phosphate into a high-energy intermediate, 1,3-bisphosphoglycerate. 


Phosphoglycerate kinase, the break-even reaction of glycolysis, is the 
first ATP-producing reaction of glycolysis, termed a substrate-level 
phosphorylation. 


Pyruvate kinase catalyzes the transfer of a phosphoryl group from 
PEP to ADP, producing ATP and pyruvate. 


Pyruvate kinase is allosterically activated by fructose-1,6-bisphosphate 
and inhibited by ATP and acetyl-CoA. 


Under anaerobic conditions, pyruvate produced in glycolysis is con- 
verted either to lactate (in animals) or ethanol (in yeast). These 
conversions are examples of fermentation—the production of ATP 
energy by pathways in which organic molecules function as donors 
and acceptors of electrons. 


Under aerobic conditions, pyruvate is directed into the TCA cycle, 
where it is oxidized to CO, with production of NADH and FADH). 


Under cellular conditions, the hexokinase, phosphofructokinase, and 
pyruvate kinase reactions operate with large, negative AG and repre- 
sent the sites of regulation of glycolysis. All the other reactions of 
glycolysis operate with AG near zero under cellular conditions. 


Fructose can be a substrate for glycolysis, but it is subject to unregu- 
lated catabolism in the liver and may be harmful. 


Galactose enters glycolysis via the Leloir pathway involving the inter- 
conversion of sugar nucleotides. 


In mammalian tissues, hypoxia leads to increased synthesis of glyco- 
lytic enzymes, triggered by hypoxia inducible factor (HIF). 


Answers to all problems are at the end of this book. Detailed solutions are 2. 


available in the Student Solutions Manual. 
1. Characterizing Glycolysis List the reactions of glycolysis that 
a. are energy consuming (under standard-state conditions). 


b. are energy yielding (under standard-state conditions). 3. 
c. consume ATP. 
d. yield ATP. 
e. are strongly influenced by changes in concentration of substrate 
and product because of their molecularity. 
f. are at or near equilibrium in the erythrocyte (see Table 18.2). 
4. 
i i 
om ameus ec 
HC— OH C=O C— OPO,” 
| | I 
CH,OPO;2— CHOH CHo 
Structure A Structure B Structure C 


Radiotracer Labeling of Pyruvate from Glucose Determine the 
anticipated location in pyruvate of labeled carbons if glucose 
molecules labeled (in separate experiments) with '4C at each position 
of the carbon skeleton proceed through the glycolytic pathway. 


Effects of Changing Metabolite Concentrations on Glycolysis In an 
erythrocyte undergoing glycolysis, what would be the effect of a 
sudden increase in the concentration of: 

a. ATP b. AMP 

c. fructose-1,6-bisphosphate d. fructose-2,6-bisphosphate 


e. citrate f.  glucose-6-phosphate 


Structures in the Glycolysis Pathway Consider the structures shown and 
organize them in order as they appear in the glycolysis pathway. 


7 1 
2-O,PO—C “Oo—C 
ae — OH ae — OPO,?— 
orot L 
Structure D Structure E 
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10. 


Tl. 


12. 


13. 


14. 


Chapter 18 Glycolysis 


Understanding NADH Cycling in Glycolysis Discuss the cycling of 
NADH and NAD* in glycolysis and the related fermentation reactions. 


Enzymes in the Glycolysis Pathway Consider the enzymes listed and 
organize them in the order that they appear in the glycolysis pathway: 
a. phosphoglycerate kinase 

b. enolase 

c. pyruvate kinase 

d. phosphoglycerate mutase 

e. triose phosphate isomerase 

Understanding the Reaction Mechanisms of Glycolysis For each of the 


following reactions, name the enzyme that carries out this reaction in 
glycolysis, and write a suitable mechanism for the reaction. 


CH,OPO,2- 
C=O 
CH,OPO,2- CHO 
HOCH | | 
== C=O + HCOH 
HCOH | | 
CHOH CH,OPO,2- 
HCOH 
CH ,OPO,2- 7 
O — OPO;% 
Y 
CHO C 
a — ngon 
CH ,OPO,2- CH OPO,- 


The Reactions and Mechanisms of the Leloir Pathway Write the 
reactions that permit galactose to be utilized in glycolysis. Write a 
suitable mechanism for one of these reactions. 


The Effect of lodoacetic Acid on the Glyceraldehyde-3-P Dehydrogenase 
Reaction (Integrates with Chapters 4 and 14.) How might iodoacetic 
acid affect the glyceraldehyde-3-phosphate dehydrogenase reaction in 
glycolysis? Justify your answer. 


Radiotracer Studies with *2P in Glycolysis If **P-labeled inorganic 
phosphate were introduced to erythrocytes undergoing glycolysis, 
would you expect to detect ?P in glycolytic intermediates? If so, 
describe the relevant reactions and the *P incorporation you would 
observe. 


Oxidation in the Glycolysis Pathway Although an oxidation of a 
carbon atom occurs in the glycolysis pathway, there is no oxygen 
involved. In which step does the oxidation occur and what is the 
oxidizing agent? 


Comparing Glycolysis Entry Points for Sucrose Sucrose can enter 
glycolysis by either of two routes: 


Sucrose phosphorylase: 
Sucrose + P; == fructose + glucose-1-phosphate 


Invertase: 
Sucrose + H,O = = fructose + glucose 


Would either of these reactions offer an advantage over the other in 
the preparation of hexose for entry into glycolysis? 


Assessing the Role of Mg** in Glycolysis What would be the conse- 
quences of a Mg’* ion deficiency for the reactions of glycolysis? 


D-glucose, D-fructose, and D-mannose (Integrates with Chapter 7) 
Consider the Fisher projections of D-glucose, D-mannose, D-galactose 
and D-fructose shown in Figures 7.2 and 7.3. Apply enediol isomerism 
(see Figure 18.19) to show the relationship between D-glucose, 
D-mannose, and D-fructose. Discuss how the application of enediol 


15. 


16. 


17. 


18. 


19. 


isomerism to D-galactose leads to an isomer different from D-fructose, 
D-glucose, or D-mannose. 


Energetics of the Triose Phosphate Isomerase Reaction (Integrates 
with Chapter 3.) Triose phosphate isomerase catalyzes the conversion 
of dihydroxyacetone-P to glyceraldehyde-3-P. The standard free 
energy change, A G°’, for this reaction is +7.6 kJ/mol. However, the 
observed free energy change (AG) for this reaction in erythrocytes is 
+2.4 kJ/mol. 
a. Calculate the ratio of [dihydroxyacetone-P]/[glyceraldehyde-3-P] 
in erythrocytes from AG. 


b. If [dihydroxyacetone-P] = 0.2 mM, what is [glyceraldehyde-3-P]? 


Energetics of the Enolase Reaction (Integrates with Chapter 3.) 
Enolase catalyzes the conversion of 2-phosphoglycerate to phospho- 
enolpyruvate + H20. The standard free energy change, A G°’, for this 
reaction is +1.8 kJ/mol. If the concentration of 2-phosphoglycerate 
is 0.045 mM and the concentration of phosphoenolpyruvate is 

0.034 mM, what is AG, the free energy change for the enolase 
reaction, under these conditions? 


Energetics of PEP Hydrolysis (Integrates with Chapter 3.) The 
standard free energy change (A G°’) for hydrolysis of phosphoenol- 
pyruvate (PEP) is —62.2 kJ/mol. The standard free energy change 
(A G°’) for ATP hydrolysis is —30.5 kJ/mol. 
a. What is the standard free energy change for the pyruvate kinase 
reaction: 
ADP + phosphoenolpyruvate —> ATP + pyruvate 


b. What is the equilibrium constant for this reaction? 


c. Assuming the intracellular concentrations of [ATP] and [ADP] 
remain fixed at 8 mM and 1 mM, respectively, what will be the 
ratio of [pyruvate]/[phosphoenolpyruvate] when the pyruvate 
kinase reaction reaches equilibrium? 


Energetics of Fructose-1,6-bisP Hydrolysis (Integrates with Chapter 3.) 
The standard free energy change (A G°’) for hydrolysis of fructose-1, 
6-bisphosphate (FBP) to fructose-6-phosphate (F-6-P) and P; is 
—16.7 kJ/mol: 


FBP + HO —> fructose-6-P + P; 
The standard free energy change (AG°’) for ATP hydrolysis is 
—30.5 kJ/mol: 
ATP + HO —> ADP + P; 
a. What is the standard free energy change for the phosphofructoki- 
nase reaction: 

ATP + fructose-6-P —> ADP + FBP 

b. What is the equilibrium constant for this reaction? 


c. Assuming the intracellular concentrations of [ATP] and [ADP] 
are maintained constant at 4 mM and 1.6 mM, respectively, in a 
rat liver cell, what will be the ratio of [FBP]/[fructose-6-P] when 
the phosphofructokinase reaction reaches equilibrium? 


Energetics of 1,3-Bisphosphoglycerate Hydrolysis (Integrates with 
Chapter 3.) The standard free energy change (A G°’) for hydrolysis of 
1,3-bisphosphoglycerate (1,3-BPG) to 3-phosphoglycerate (3-PG) and 
P; is —49.6 kJ/mol: 


1,3-BPG + H,O —> 3-PG + P; 


The standard free energy change (AG°’) for ATP hydrolysis is 
—30.5 kJ/mol: 


ATP + HO —> ADP + P; 


a. What is the standard free energy change for the phosphoglycerate 
kinase reaction: 


ADP + 1,3-BPG —> ATP + 3-PG 


b. What is the equilibrium constant for this reaction? 
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c. If the steady-state concentrations of [1,3-BPG] and [3-PG] in an 
erythrocyte are 1 uM and 120 uM, respectively, what will be the 
ratio of [ATP]/[ADP], assuming the phosphoglycerate kinase 
reaction is at equilibrium? 


20. Energetics of the Hexokinase Reaction The standard-state free energy 
change, A G°’, for the hexokinase reaction is — 16.7 kJ/mol. Use the 
values in Table 18.2 to calculate the value of AG for this reaction in 
the erythrocyte at 37°C. 


21. Analyzing the Concentration Dependence of the Adenylate Kinase 
Reaction Taking into consideration the equilibrium constant for the 
adenylate kinase reaction (Equation 18.7), calculate the change in 
concentration in AMP that would occur if 8% of the ATP in an 
erythrocyte (red blood cell) were suddenly hydrolyzed to ADP. In 
addition to the concentration values in Table 18.2, it may be useful 
to assume that the initial concentration of AMP in erythrocytes 
is 5 uM. 


22. Distinguishing the Mechanisms of Class | and Class | Aldolases 
Fructose bisphosphate aldolase in animal muscle is a class I 
aldolase, which forms a Schiff base intermediate between substrate 
(for example, fructose-1,6-bisphosphate or dihydroxyacetone 
phosphate) and a lysine at the active site (see Figure 18.12). The 
chemical evidence for this intermediate comes from studies with 
aldolase and the reducing agent sodium borohydride, NaBHg. 
Incubation of the enzyme with dihydroxyacetone phosphate and 
NaBH, inactivates the enzyme. Interestingly, no inactivation is 
observed if NaBH, is added to the enzyme in the absence of 
substrate. Write a mechanism that explains these observations and 
provides evidence for the formation of a Schiff base intermediate in 
the aldolase reaction. 


23. Understanding the Ping-Pong Mechanism of Gal-1-P Uridylyltransfer- 
ase As noted in Figure 18.25, the galactose-1-phosphate uridylyl- 
transferase reaction proceeds via a ping-pong mechanism. Consult 
Section 13.5c, to refresh your knowledge of ping-pong mechanisms, 
and draw a diagram to show how a ping-pong mechanism would 
proceed for the uridylyltransferase. 


24. Understanding the Mechanism of Hemolytic Anemia Genetic defects 
in glycolytic enzymes can have serious consequences for humans. 
For example, defects in the gene for pyruvate kinase can result in a 
condition known as hemolytic anemia. Consult a reference to learn 
about hemolytic anemia, and discuss why such genetic defects lead to 
this condition. 


25. The Consumer Can Be Fooled, but Not the Consumer’s Biochemis- 
try A fat-free food item contains the following nutritional facts: 
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Total fat: = 0 g (0%); Cholesterol = 0 mg (0%); Sodium = 80 mg 
(3%); Total carbohydrate = 40 g (30%), which includes 28 g added 
sugars; Protein = 3 g. The first item on the list of ingredients is high 
fructose corn syrup. Explain why technically the food item is fat free, 
but not to our biochemistry. 


26. Regarding phosphofructokinase, which of the following statements 


is true: 

a. Low [ATP] stimulates the enzyme, but fructose-2,6-bisphosphate 
inhibits. 

b. High [ATP] stimulates the enzyme, but fructose-2,6-bisphosphate 
inhibits. 

c. High [ATP] stimulates the enzyme, and fructose-2,6-bisphosphate 
activates. 


d. The enzyme is more active at low [ATP] than at high, and 
fructose-2,6-bisphosphate activates the enzyme. 


e. [ATP] and fructose-2,6-bisphosphate both inhibit the enzyme. 


27. Based on your reading of this chapter, what would you expect to be 
the most immediate effect on glycolysis if the steady-state concentra- 
tion of glucose-6-P were 8.3 mM instead of 0.083 mM? 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at 
http://www.psafeonline.net.) 


28. Examine the PSAFE model for alcohol dehydrogenase and describe 
the structure and function of the catalytic zinc center. 


29. Based on your knowledge of the structure of NAD* and an 
assumption that coenzyme dissociation is the rate limiting step of 
the alcohol dehydrogenase mechanism, hypothesize why a N249W 
mutation at the coenzyme binding site would increase the rate of 
catalysis. 


30. Using the PSAFE model for phosphofructokinase (Trypanosoma), 
describe the difference between the APO1, APO2, and holoenzyme 
conformations. 


Think-Pair-Share Question 


C-labeled D-glucose was used to study the alcoholic fermentation in 
Saccharomyces cerevisiae. Depending on the carbon that was labeled, 
either '4*CH3-CH,OH, CH;-'4CH,OH, or '*CO, were formed. Which 
carbon(s) in D-glucose were labeled to obtain these products? (Use the 
numbering system of the carbons as discussed in Chapter 7.) Would 
similar results be obtained if D-fructose or D-mannose were used as 
starting molecules? 
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The Tricarboxylic Acid Cycle 


< A time-lapse photograph of a Ferris 
wheel at night. Aerobic cells use a 
metabolic wheel—the tricarboxylic acid 
cycle—to generate energy by acetyl-CoA 
oxidation. 


Essential Question 


The glycolytic pathway converts glucose to pyruvate and produces two molecules of ATP 
per glucose—only a small fraction of the potential energy available from glucose. Under 
anaerobic conditions, pyruvate is reduced to lactate in animals and ethanol in yeast, and 
much of the potential energy of the glucose molecule remains untapped. In the presence of 
oxygen, however, a much more interesting and thermodynamically complete story unfolds. 
How is pyruvate oxidized to CO, under aerobic conditions, and what is the chemical logic 
that dictates how this process occurs? 


nder aerobic conditions, pyruvate from glycolysis is converted to acetyl-coenzyme 

A (acetyl-CoA) and oxidized to CO, in the tricarboxylic acid (TCA) cycle (also 
called the citric acid cycle). The electrons liberated by this oxidative process are passed 
via NADH and FADH, through an elaborate, membrane-associated electron-transport 
pathway to O,, the final electron acceptor. Electron transfer is coupled to creation of a 
proton gradient across the membrane. Such a gradient represents an energized state, 
and the energy stored in this gradient is used to drive the synthesis of many equivalents 
of ATP. 

ATP synthesis as a consequence of electron transport is termed oxidative phosphory- 
lation; the complete process is diagrammed in Figure 19.1. Aerobic pathways permit the 
production of 30 to 38 molecules of ATP per glucose oxidized. Although two molecules 
of ATP come from glycolysis and two more directly out of the TCA cycle, most of the 
ATP arises from oxidative phosphorylation. Specifically, reducing equivalents released 
in the oxidative reactions of glycolysis, pyruvate decarboxylation, and the TCA cycle 
are captured in the form of NADH and enzyme-bound FADH,, and these reduced 
coenzymes fuel the electron-transport pathway and oxidative phosphorylation. 

Complete oxidation of glucose to CO, involves the removal of 24 electrons—that is, 
it is a 24-electron oxidation. In glycolysis, four electrons are removed as NADH, and 
four more electrons exit as two more NADH in the decarboxylation of two molecules 


19 


Thus times do shift, each thing his turn does hold; 
New things succeed, as former things grow old. 


Robert Herrick 
Hesperides (1648), “Ceremonies for Christmas Eve” 


Key Questions 


19.1 


19.2 


19.3 


19.4 


19.5 


19.6 


19.7 


19.8 
19.9 


What Is the Chemical Logic 
of the TCA Cycle? 


How Is Pyruvate Oxidatively 
Decarboxylated to Acetyl-CoA? 
How Are Two CO, Molecules 
Produced from Acetyl-CoA? 

How Is Oxaloacetate Regenerated 
to Complete the TCA Cycle? 

What Are the Energetic Consequences 
of the TCA Cycle? 

Can the TCA Cycle Provide 
Intermediates for Biosynthesis? 
What Are the Anaplerotic, 

or “Filling Up,” Reactions? 

How Is the TCA Cycle Regulated? 


Can Any Organisms Use Acetate 
as Their Sole Carbon Source? 
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Chapter 19 The Tricarboxylic Acid Cycle 


Glycolysis N Figure 19.1 The pathways of catabolic metabolism, from glucose to ATP. 
(a) Pyruvate produced in glycolysis is oxidized in (b) the citric acid cycle. 
> (c) Electrons liberated in this oxidation flow through the electron-transport 
WARE chain and drive the synthesis of ATP in oxidative phosphorylation. 
X In eukaryotic cells, this overall process occurs in mitochondria. 
y æ> NADH 


Mitochondrial 
matrix 


(b) 
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GTP GDP + P, 


Oxidative 


(c) phosphorylation 


Electron transport 


Intermembrane space 
Proton 


gradient ic 


NADH 


NADH 


Mitochondrial matrix 


> [FADH] 


> NADH J 


of pyruvate to two acetyl-CoA (Figure 19.1). For each acetyl-CoA oxidized in the TCA 
cycle, eight more electrons are removed (as three NADH and one FADH;)): 


H;CCOO™ + 2H,O + H+ —> 2C0O, + 8H 


In the electron-transport pathway, these eight electrons combine with oxygen to form 
water: 


8H + 20, —> 4H,0 
So, the net reaction for the TCA cycle and electron-transport pathway is 
H3CCOO™ + 20; + Ht —> 2CO, + 2H,O0 


As German biochemist Hans Krebs showed in the 1930s, the eight-electron oxidation 
of acetate by the TCA cycle is accomplished with the help of oxaloacetate. (In his honor, 
the TCA cycle is often referred to as the Krebs cycle.) Beginning with acetate, a series of 
five reactions produces two molecules of CO, with four electrons extracted in the form of 
NADH and four electrons passed to oxaloacetate to produce a molecule of succinate. The 
pathway becomes a cycle with three additional reactions that accomplish the four-electron 
oxidation of succinate back to oxaloacetate; first, oxidation of a single bond to a double 
bond, then addition of the elements of water across the double bond, and finally oxidation of 
the resulting alcohol to a carbonyl. This special trio of reactions is used repeatedly in 
metabolism: in fatty acid oxidation, in reverse in fatty acid synthesis, and also in amino 
acid synthesis and breakdown. 


19.1 | What Is the Chemical Logic of the TCA Cycle? 


The entry of new carbon units into the cycle is through acetyl-CoA. This entry meta- 
bolite can be formed either from pyruvate (from glycolysis) or from oxidation of fatty 
acids. Transfer of the two-carbon acetyl group from acetyl-CoA to the four-carbon 
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19.1 What Is the Chemical Logic of the TCA Cycle? 


oxaloacetate to yield six-carbon citrate 1s catalyzed by citrate synthase. A dehydration— 
rehydration rearrangement of citrate yields isocitrate. Two successive decarboxylations 
produce a-ketoglutarate and then succinyl-CoA, a CoA conjugate of a four-carbon unit. 
Several steps later, oxaloacetate is regenerated and can combine with another two-carbon 
unit of acetyl-CoA. Thus, carbon enters the cycle as acetyl-CoA and exits as COs. In the 
process, metabolic energy is captured in the form of ATP, NADH, and enzyme-bound 
FADH; (symbolized as [FADH)]). 

The TCA cycle provides a chemically feasible way of cleaving a two-carbon compound. 
The cycle shown in Figure 19.2 at first appears to be a complicated way to oxidize acetate 
units to CO», but there is a chemical basis for the apparent complexity. Oxidation of an 
acetyl group to a pair of CO, molecules requires C—C cleavage: 


CH3COO” — > CO, + CO, 


In many instances, C—C cleavage reactions in biological systems occur between 
carbon atoms a and £ to a carbonyl group: 


i 


C—C, — Cg 
i 


Cleavage 


A good example of such a cleavage is the fructose bisphosphate aldolase reaction (see 
Figure 18.12). 
Another common type of C—C cleavage is a-cleavage of an a-hydroxyketone: 


Cleavage 


This type of cleavage occurs in the transketolase reaction in the pentose phosphate 
pathway. 

Neither of these cleavage strategies is suitable for acetate. It has no B-carbon, and the 
second method would require hydroxylation—not a favorable reaction for acetate. In- 
stead, living things have evolved the clever chemistry of condensing acetate with oxalo- 
acetate and then carrying out a B-cleavage. The TCA cycle combines this B-cleavage 
reaction with oxidation to form CO,, regenerate oxaloacetate, and capture the liberated 
metabolic energy in NADH and ATP. 
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Figure 19.2 The TCA cycle. 
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19.2 | How Is Pyruvate Oxidatively Decarboxylated 
to Acetyl-CoA? 


Pyruvate produced by glycolysis is a significant source of acetyl-CoA for the TCA 
cycle. Because, in eukaryotic cells, glycolysis occurs in the cytoplasm, whereas the 
TCA cycle reactions and all subsequent steps of aerobic metabolism take place in 
the mitochondria, pyruvate must first enter the mitochondria to enter the TCA cycle. 
The oxidative decarboxylation of pyruvate to acetyl-CoA 


Pyruvate + CoA + NAD* —> acetyl-CoA + CO, + NADH 


is the connecting link between glycolysis and the TCA cycle. The reaction is catalyzed 
by pyruvate dehydrogenase, a multienzyme complex. 

The pyruvate dehydrogenase complex (PDC) is formed from multiple copies of three 
enzymes: pyruvate dehydrogenase (E1), dihydrolipoamide acetyltransferase (E2), and 
dihydrolipoamide dehydrogenase (E3). All are involved in the conversion of pyruvate to 
acetyl-CoA. In addition, eukaryotic PDC contains a unique subunit, termed E3BP, that 
has no known enzymatic function. The active sites of all three enzymes are not far 
removed from one another, and the product of the first enzyme is passed directly to the 
second enzyme, and so on, without diffusion of substrates and products through the 
solution. Eukaryotic PDC, one of the largest-known multienzyme complexes (with a 
diameter of approximately 500 A), is a 9.5 MDa assembly. 

The central core of PDC consists of E2 subunits (and E3BP in eukaryotes), 
forming either a 24-meric octahedron (in gram-negative bacteria) or a 60-meric 
pentagonal dodecahedron (in the mitochondria of eukaryotes and gram-positive 
bacteria). In mitochondria, six E3BP dimers and 48 E2 subunits comprise the core 
structure, and binding of E3 is mediated by these substituted E3BPs. Thirty El a2B> 
heterotetramers and 12 E3 homodimers are bound to the E2/E3BP core. Eukaryotic 
E2 subunits consist of an inner catalytic (IC) domain and an outer globular do- 
main, itself composed of two lipoic-acid binding domains (L1 and L2) and a small 
El-binding domain, connected by Ala- and Pro-rich hinge regions 20 to 30 residues 
in length (Figure 19.3a). Human E3BP is composed of three linker-connected do- 
mains in a configuration similar to that of human E2, and its IC domain has 50% 
sequence identity with E2. The flexible linker segments in E2 and E3BP impart the 
flexibility that allows the lipoic acid groups to visit all three active sites during 
catalysis. Two regulatory enzymes, a pyruvate dehydrogenase kinase (PDK, a 
homodimer), and a pyruvate dehydrogenase phosphatase (PDP, a heterodimer) are 
bound to L1, L2, and/or L3. 

The pyruvate dehydrogenase reaction (Figure 19.4) is a tour de force of mechanistic 
chemistry, involving a total of three enzymes and five different coenzymes. The first step 
of this reaction, the decarboxylation of pyruvate and transfer of the acetyl group to 
lipoic acid, depends on accumulation of negative charge on the transferred two-carbon 
fragment, as facilitated by the quaternary nitrogen on the thiazolium group of thiamine 
pyrophosphate (TPP). As shown in Figure 19.5, this cationic imine nitrogen plays two 
distinct roles in TPP-catalyzed reactions: 


1. It provides electrostatic stabilization of the thiazole carbanion formed upon re- 
moval of the C-2 proton. (The sp” hybridization and the availability of vacant 
d orbitals on the adjacent sulfur probably also facilitate proton removal at C-2.) 

2. TPP attack on pyruvate leads to decarboxylation. The TPP cationic imine nitrogen 
can act as an effective electron sink to stabilize the negative charge that must 
develop on the carbon that has been attacked. This stabilization takes place by 
resonance interaction through the double bond to the nitrogen atom. 


This resonance-stabilized intermediate can be protonated to give hydroxyethyl-TPP. 
The reaction of hydroxyethyl-TPP with the oxidized form of lipoic acid yields the 
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Figure 19.3 Models of human pyruvate 
dehydrogenase. (a) Domain structures of human E2 
and E3BP. (b) CryoEM 3D reconstruction of the 
dodecahedral core of human E2 inner core domain, 
showing the view along the 5-fold axis of symmetry. 
(c) Model of the human E2/E3BP:E3 core complex. 
(d) Model of human pyruvate dehydrogenase 
complex. Six E3BP dimers (brown) substitute for 12 
E2 subunits (purple) to form the E2/E2BP core. 
Around each pentagonal opening, one E3 and two 
E1 subunits interact to form a shell connected to the 
E2/E3BP core through linker segments. (e) Crystal 
structures of El from E. coli (left—PDB id = 1L8A) 
and humans (right—PDB id = 1NI4). Both 
structures show bound thiamine coenzymes. (f) 
Crystal structures of E3 from E. coli (left—PDB id = 
4JDR) and humans (right—PDB id = 1ZY8). Both 
structures show bound FAD. (Adapted from 
Vijayakrishnan, S., et al., 2010. Solution structure 
and characterization of the human pyruvate 
dehydrogenase complex core assembly. Journal of 
Molecular Biology 399:71—93; Yu, X., et al., 2007. 
Structures of the human pyruvate dehydrogenase 
complex cores: A highly conserved catalytic center 
with flexible N-terminal domains. Structure 16: 
104-114; and Jiang, J., Baiesc, F.L., Hiromasa, Y., 
et al., 2018. Atomic structure of the E2 inner core of 
human pyruvate dehydrogenase complex. 
Biochemistry 57:2325—2334.) 
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Figure 19.4 The reaction mechanism of the pyruvate dehydrogenase complex. In step 1, 
decarboxylation of pyruvate occurs with formation of hydroxyethyl-TPP. Transfer of the two-carbon 
unit to lipoic acid in step 2 is followed by formation of acetyl-CoA in step 3. Lipoic acid is reoxidized 
in step 4 of the reaction. 
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Figure 19.5 The mechanistic details of the first three steps of the pyruvate dehydrogenase complex 
reaction. 
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energy-rich acetyl-thiol ester of reduced lipoic acid through oxidation of the hydroxyl- 
carbon of the two-carbon substrate unit. Nucleophilic attack by CoA on the carbonyl 
carbon (a characteristic feature of CoA chemistry) results in transfer of the acetyl group 
from lipoic acid to CoA. The subsequent oxidation of lipoic acid is catalyzed by the 
FAD-dependent dihydrolipoyl dehydrogenase, and NAD* is reduced. 


A Deeper Look 


The Coenzymes of the Pyruvate Dehydrogenase Complex 


Coenzymes are small molecules that bring unique chemistry to 
enzyme reactions. Five coenzymes are used in the pyruvate dehydro- 


genase reaction. 


Thiamine (vitamin B,) 


The Nicotinamide Coenzymes 

NAD*/NADH and NADP*/NADPH carry out 
hydride (H:~) transfer reactions. All reactions in- 
volving these coenzymes are two-electron transfers. 
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TPP assists in the decarboxylation of a-keto acids (here) and in the 


formation and cleavage of a-hydroxy ketones (as in the transketo- 
lase reaction). 
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(Continued) 
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The Flavin Coenzymes—FAD/FADH, 

Flavin coenzymes can exist in any of three redox states, and this 
allows flavin coenzymes to participate in one-electron and two- 
electron transfer reactions. Partly because of this, flavoproteins 
catalyze many reactions in biological systems and work with many 
electron acceptors and donors. 
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Flavin mononucleotide, FAN 


A 


o=P— O7 


Flavin adenine dinucleotide, FAD 
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Because the ribityl group is not a true pentose sugar (it is a sugar 
alcohol) and is not joined to the isoalloxazine group in a glycosidic 
bond, the molecule is not truly a nucleotide and the terms flavin 
mononucleotide and dinucleotide are incorrect. Nonetheless, these 
designations are so deeply ingrained in common biochemical usage 
that the erroneous nomenclature persists. 
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(Continued) 
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Coenzyme A The reactive sulfhydryl group on CoA mediates both of these 
The two main functions of CoA are: functions. The sulfhydryl group forms thioester linkages with acyl 
y groups. The two main functions of CoA are illustrated in the citrate 
1. Activation of acyl groups for transfer by nucleophilic attack synthase reaction (see Figure 19.6). 
2. Activation of the a-hydrogen of the acyl group for abstraction 
as a proton 
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transfer during oxidation and decarboxylation of a-keto acids. It is amide bond formation with the e-NH group of a lysine side chain. 
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19.3 | How Are Two CO, Molecules Produced 
from Acetyl-CoA? 
The TCA cycle involves entry and capture of an acetyl group, followed by a series of 


reactions that accomplish an eight-electron oxidation and, ultimately, departure of 
two CO, molecules. 


19.3a The Citrate Synthase Reaction Initiates the TCA Cycle 


The first reaction within the TCA cycle, the one by which carbon atoms are intro- 
duced, is the citrate synthase reaction (Figure 19.6). Here acetyl-CoA reacts with 


pro-S arm 


SCoA | HO CoA 


awe 


pro-Rarm 


Citryl-CoA Citrate 


Oxaloacetate 


Figure 19.6 Citrate is formed in the citrate synthase reaction from oxaloacetate and acetyl-CoA. 
The mechanism involves nucleophilic attack by the carbanion of acetyl-CoA on the carbonyl carbon 
of oxaloacetate, followed by thioester hydrolysis. The pro-S and pro-R groups around the central 
pro-chiral carbon are indicated. 
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| Table 19.1 e| icy) The Enzymes and Reactions of the TCA Cycle 


Reaction Enzyme AG” (kj/mol) AG (kJ/mol) 
1. Acetyl-CoA + oxaloacetate + H:O == CoASH + citrate Citrate synthase —31.4 —53.9 
2. Citrate —— isocitrate Aconitase +6.7 +0.8 
3. Isocitrate + NAD* =—— a-ketoglutarate + NADH + CO, Isocitrate dehydrogenase —8.4 -17.5 
4. a-Ketoglutarate + CoASH + NAD* == succinyl-CoA + NADH + CO,  a-Ketoglutarate —30 —43.9 
dehydrogenase complex 

5. Succinyl-CoA + GDP + P; == succinate + GTP + CoASH Succinyl-CoA synthetase —3.3 =0 

6. Succinate + [FAD] == fumarate + [FADH])] Succinate dehydrogenase +0.4 =0 

7. Fumarate + HO == L-malate Fumarase —3.8 =0 

8. L-Malate + NAD* == oxaloacetate + NADH + H* Malate dehydrogenase +29.7 =0 


AG values from Newsholme, E. A., and Leech, A. R., Biochemistry for the Medical Sciences, New York: Wiley, 1983. 


oxaloacetate in a Perkin condensation (a carbon—carbon condensation between a 
ketone or aldehyde and an ester). The acyl group is activated in two ways in an acyl- 
CoA molecule: The carbonyl carbon is activated for attack by nucleophiles, and the 
C, carbon is more acidic and can be deprotonated to form a carbanion. The citrate 
synthase reaction depends upon the latter mode of activation. As shown in Figure 19.6, 
a general base on the enzyme accepts a proton from the methyl group of acetyl-CoA, 
producing a stabilized a-carbanion of acetyl-CoA. This strong nucleophile attacks 
the a-carbonyl of oxaloacetate, yielding citryl-CoA. This part of the reaction has an 
equilibrium constant near 1, but the overall reaction is driven to completion by the 
subsequent hydrolysis of the high-energy thioester to citrate and free CoA. The over- 
all AG” is —31.4 kJ/mol, and under standard conditions the reaction is essentially ir- 
reversible. Although the mitochondrial concentration of oxaloacetate is very low 
(much less than 1 wM—see example in Section 19.4), the strong, negative AG?’ drives 
the reaction forward. 


Citrate Synthase Is a Dimer Citrate synthase in mammals is a dimer of 49 kDa subunits 
(Table 19.1). On each subunit, oxaloacetate and acetyl-CoA bind to the active site, 
which lies in a cleft between two domains and is surrounded mainly by a-helical seg- 
ments (Figure 19.7). Binding of oxaloacetate induces a conformational change that 
facilitates the binding of acetyl-CoA and closes the active site so that the reactive 
carbanion of acetyl-CoA is protected from protonation by water. 


NADH Is an Allosteric Inhibitor of Citrate Synthase Citrate synthase is the first step in 
the TCA cycle and as stated the reaction has a large negative AG°’. As might be 
expected, it is a highly regulated enzyme. NADH, a product of the TCA cycle, is an 
allosteric inhibitor of citrate synthase, as is succinyl-CoA, the product of the fifth step 
in the cycle (and an acetyl-CoA analog). 


19.3b Citrate Is lsomerized by Aconitase to Form Isocitrate 


In its role as an intermediate in this pathway, citrate itself poses a problem: It is a 
poor candidate for further oxidation because it contains a tertiary alcohol, which 
could be oxidized only by breaking a carbon-carbon bond. A simple solution to this 
problem is to isomerize the tertiary alcohol to a secondary alcohol, which the cycle 
proceeds to do in the next step. 

Citrate is isomerized to isocitrate by aconitase in a two-step process involving aconi- 
tate as an intermediate (Figure 19.8). In this reaction, the elements of water are first 
abstracted from citrate to yield aconitate, which is then rehydrated with H— and HO— 
adding back in opposite positions to produce isocitrate. The net effect is the conversion 
of a tertiary alcohol (citrate) to a secondary alcohol (isocitrate). Oxidation of the sec- 
ondary alcohol of isocitrate involves breakage of a C—H bond, a simpler matter than 
the C—C cleavage required for the direct oxidation of citrate. 


CoASH 


Figure 19.7 Citrate synthase. In the monomer 
shown here, citrate is blue, and CoA is red. (Top: 
PDB id = ICTS; bottom: PDB id = 2CTS.) 
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Figure 19.8 The aconitase reaction and the structure of the active site of aconitase from pig. 

(a) The aconitase reaction converts citrate to cis-aconitate and then to isocitrate. Aconitase is stereo- 
specific and removes the pro-R hydrogen from the pro-R arm of citrate. (b) The active site of 
aconitase. The iron-sulfur cluster (pink) is coordinated by cysteines (orange) and isocitrate (purple) 
(PDB id = 1BO0J). 


Inspection of the citrate structure shows a total of four chemically equivalent hydro- 
gens, but only one of these—the pro-R H atom of the pro-R arm of citrate—is ab- 
stracted by aconitase, which is quite stereospecific. Formation of the double bond of 
aconitate following proton abstraction requires departure of hydroxide ion from the 
C-3 position. Hydroxide is a relatively poor leaving group, and its departure is facili- 
tated in the aconitase reaction by coordination with an iron atom in an iron-sulfur 
cluster. 


Aconitase Utilizes an lron—-Sulfur Cluster Aconitase contains an iron-sulfur cluster 
consisting of three iron atoms and four sulfur atoms in a near-cubic arrangement 
(Figure 19.9). Cysteine residues from the enzyme coordinate the three iron atoms. In 
the inactive state of the enzyme, one corner of the cube is vacant. Binding of iron (as 
Fe**) to this position activates aconitase. The iron atom in this position can coordi- 
nate the C-3 carboxyl and hydroxyl groups of citrate. This iron atom thus acts as a 
Lewis acid, accepting an unshared pair of electrons from the hydroxyl, making it a 
better leaving group. The equilibrium for the aconitase reaction favors citrate, and an 
equilibrium mixture typically contains about 90% citrate, 4% cis-aconitate, and 6% 
isocitrate. The AG?’ is +6.7 kJ/mol. 


Fluoroacetate Blocks the TCA Cycle Fluoroacetate (which occurs naturally in plants in 
Australia, Brazil, and Africa) is an extremely poisonous agent that blocks the TCA 
cycle in vivo, although it has no apparent effect on any of the isolated enzymes. Its 
LDso, the lethal dose for 50% of animals consuming it, is 0.2 mg per kilogram of body 


-ooc ^ ~H 


Aconitate 


Citrate 


Figure 19.9 The iron-sulfur cluster of aconitase. Binding of Fe?* to the vacant position of the cluster 
activates aconitase. The added iron atom coordinates the C-3 carboxyl and hydroxyl groups of citrate 
and acts as a Lewis acid, accepting an electron pair from the hydroxyl group and making it a better 
leaving group. 
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weight; it has been used as a rodent poison. The action of fluoroacetate has been 
traced to aconitase. Fluoroacetate is converted in two steps in vivo to fluorocitrate as 
shown, and aconitase is inhibited by fluorocitrate. 


F 
H mC — COO 
Acetyl-CoA Citrate HO—C— COO- 
FCH,COO- =e SCoA sea ae —Coo- 
Fluoroacetate Fluoroacetyl-CoA (2R, 3S)-Fluorocitrate 


Fluoroacetate readily crosses both the cellular and the mitochondrial membranes, 
and in mitochondria it is converted to fluoroacetyl-CoA by acetyl-CoA synthetase. 
Fluoroacetyl-CoA is a substrate for citrate synthase, which condenses it with oxalo- 
acetate to form fluorocitrate. Fluoroacetate may thus be viewed as a Trojan horse 
inhibitor. Analogous to the giant Trojan horse of legend—which the soldiers of Troy 
took into their city, not knowing that Greek soldiers were hidden inside it and waiting 
to attack—fluoroacetate enters the TCA cycle innocently enough, in the citrate syn- 
thase reaction. Citrate synthase converts fluoroacetate to fluorocitrate, which inhib- 
its aconitase, thus blocking the TCA cycle. 


19.3c Isocitrate Dehydrogenase Catalyzes the First 
Oxidative Decarboxylation in the Cycle 


In the next step of the TCA cycle, isocitrate is oxidatively decarboxylated to yield 
a-ketoglutarate, with concomitant reduction of NAD* to NADH in the isocitrate 
dehydrogenase reaction (Figure 19.10). The reaction has a net AG?’ of —8.4 kJ/mol, 
and it is sufficiently exergonic to pull the aconitase reaction forward. This two-step 
reaction involves (1) oxidation of the C-2 alcohol of isocitrate to form oxalosuccinate, 
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Figure 19.10 The isocitrate dehydrogenase reaction. (a) The mechanism of the reaction. (b) The active 
site of isocitrate dehydrogenase. Isocitrate is blue, NADP* (as an NAD* analog) is gold, with Ca?* in 
red (PDB id = 1AI2). 
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followed by (2) a B-decarboxylation reaction that expels the central carboxyl group 
as COs, leaving the product a-ketoglutarate. Oxalosuccinate, the B-keto acid pro- 
duced by the initial dehydrogenation reaction, is unstable and thus is readily 
decarboxylated. 


Isocitrate Dehydrogenase Links the TCA Cycle and Electron Transport Isocitrate dehy- 
drogenase provides the first connection between the TCA cycle and the electron- 
transport pathway and oxidative phosphorylation, via its production of NADH. Asa 
connecting point between two metabolic pathways, isocitrate dehydrogenase is a regu- 
lated reaction. NADH and ATP are allosteric inhibitors, whereas ADP acts as an 
allosteric activator, lowering the K,, for isocitrate by a factor of 10. The enzyme is 
virtually inactive in the absence of ADP. Also, the product, a-ketoglutarate, is a cru- 
cial a-keto acid for aminotransferase reactions (see Section 13.5c), connecting the TCA 
cycle (that is, carbon metabolism) with nitrogen metabolism. 


19.3d a-Ketoglutarate Dehydrogenase Catalyzes the Second 
Oxidative Decarboxylation of the TCA Cycle 


A second oxidative decarboxylation occurs in the a-ketoglutarate dehydrogenase 
reaction. 


NAD+? CoASH 


NADH (GOH 


Be —coo- n — coo” 
HC HC 
-m Kenasi a 
c— a-Ketoglutarate C — SCoA 
ra dehydrogenase 
a-Ketoglutarate Succinyl-CoA 


Like the pyruvate dehydrogenase complex, a-ketoglutarate dehydrogenase is a multi- 
enzyme complex—consisting of a-ketoglutarate dehydrogenase, dihydrolipoyl trans- 
succinylase, and dihydrolipoyl dehydrogenase—that employs five different coenzymes 
(Table 19.2). The dihydrolipoyl dehydrogenase in this reaction is identical to that in 
the pyruvate dehydrogenase reaction. The mechanism is analogous to that of pyru- 
vate dehydrogenase. As with the pyruvate dehydrogenase reaction, this reaction pro- 
duces NADH and a thioester product—in this case, succinyl-CoA. Succinyl-CoA and 
NADH products are energy-rich species that are important sources of metabolic 
energy in subsequent cellular processes. 


19.4 | How Is Oxaloacetate Regenerated 
to Complete the TCA Cycle? 
The completion of the TCA cycle involves one reaction that generates a nucleoside 


triphosphate, followed by three reactions that convert a carbon in a C—C single 
bond to a carbonyl. These latter three reactions turn out to be nature’s preferred 


| Table 19.2 _| IEPA Composition of the a-Ketoglutarate Dehydrogenase Complex from Escherichia coli 


Enzyme Coenzyme 
a-Ketoglutarate dehydrogenase 
Dihydrolipoyl transsuccinylase 


Dihydrolipoyl dehydrogenase 


Thiamine pyrophosphate 
Lipoic acid, COASH 
FAD, NAD* 


Enzyme M, Number of Subunits Subunit M, Number of Subunits per Complex 
192,000 2 96,000 24 

1,700,000 24 70,000 24 
112,000 2 56,000 12 
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method for accomplishing this conversion, and the same chemistry is used in multiple 
pathways-fatty acid oxidation, fatty acid synthesis, amino acid breakdown, and 
amino acid synthesis, to name just a few. 


19.4a Succinyl-CoA Synthetase Catalyzes Substrate-Level 
Phosphorylation 


The NADH produced in the foregoing steps can be routed through the electron- 
transport pathway to make high-energy phosphates via oxidative phosphorylation. 
However, succinyl-CoA is itself a high-energy intermediate and is utilized in the next 
step of the TCA cycle to drive the phosphorylation of GDP to GTP (in mammals) or 
ADP to ATP (in plants and bacteria). 


xDP + P; STP + CoASH 
H.c—coo- © 2 ° 
H.C HC — COO~ 
| ` | 
Z C— SCoA Succinyl-CoA HC — COO 
o synthetase 
Succinyl-CoA Succinate 


The reaction is catalyzed by succinyl-CoA synthetase, sometimes called succinate thio- 
kinase. The free energies of hydrolysis of succinyl-CoA and GTP or ATP are similar, 
and the net reaction has a AG” of —3.3 kJ/mol. Succinyl-CoA synthetase provides 
another example of a substrate-level phosphorylation (see Section 18.4b), in which a 
substrate (rather than an electron-transport chain or proton gradient) provides the 
energy for phosphorylation. It is the only such reaction in the TCA cycle. The GTP 
produced by mammals in this reaction can exchange its terminal phosphoryl group 
with ADP via the nucleoside diphosphate kinase reaction: 


Nucleoside diphosphate 
kinase 


GTP + ADP ATP + GDP 


The Mechanism of Succinyl-CoA Synthetase Involves a Phosphohistidine The mecha- 
nism of succinyl-CoA synthetase is postulated to involve displacement of CoA by 
phosphate, forming succinyl phosphate at the active site, followed by transfer of the 
phosphoryl group to an active-site histidine (making a phosphohistidine intermediate) 
and release of succinate. The phosphoryl moiety is then transferred to GDP to form 
GTP (Figure 19.11). This sequence of steps preserves the energy of the thioester bond 
of succinyl-CoA in a series of high-energy intermediates that lead to a molecule of 
ATP: 


Thioester —> [succinyl-P] —> [phosphohistidine] —> GTP —> ATP 


The First Five Steps of the TCA Cycle Produce NADH, CO,, GTP (ATP), and Succinate This 
is a good point to pause in our trip through the TCA cycle and see what has happened. 
A two-carbon acetyl group has been introduced as acetyl-CoA and linked to oxaloac- 
etate, and two CO, molecules have been liberated. The cycle has produced two mole- 
cules of NADH and one of GTP or ATP and has left a molecule of succinate. 

The TCA cycle can now be completed by converting succinate to oxaloacetate. 
This latter process represents a net oxidation. The TCA cycle breaks it down into 
(consecutively) an oxidation step, a hydration reaction, and a second oxidation step. 
The oxidation steps are accompanied by the reduction of an [FAD] and an NAD*. 
The reduced coenzymes, [FADH;] and NADH, subsequently provide reducing 
power in the electron-transport chain. Virtually the same chemical strategy is used in 
B-oxidation of fatty acids. 


649 


Condensation A reaction between two or more 
molecules that results in formation of a larger 
molecule, with elimination of some simpler molecule, 
such as water (as in dehydration synthesis). 


Synthase An enzyme-catalyzed condensation reaction 
that does not require a nucleoside triphosphate as an 
energy source. 


Synthetase An enzyme-catalyzed condensation 
reaction that requires a nucleoside triphosphate (often 
ATP) as an energy source. 
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I 
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4 


O 
INS + 
ZOSP N “SNH 
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GTP 
Ow 
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Figure 19.11 The mechanism of the succinyl-CoA 
synthetase reaction. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


650 Chapter 19 The Tricarboxylic Acid Cycle 


Histidine FAD 


Figure 19.12 The covalent bond between FAD and 
succinate dehydrogenase involves the C-8a methylene 
group of FAD and the N-3 of a histidine residue on 
the enzyme. 


19.4b Succinate Dehydrogenase Is FAD-Dependent 


The oxidation of succinate to fumarate is carried out by succinate dehydrogenase, a 
membrane-bound enzyme that is actually part of the electron-transport chain. 


coo- FAD FADH, 

| H S poe 
CHo C 

| Succinate | | 
CHo dehydrogenase C 

| E a. 
(0010 OOC H 

Succinate FAD FE ADH, Fumarate 


Succinate dehydrogenase is also known as the succinate—coenzyme Q reductase of 
the electron-transport chain. In contrast with all of the other enzymes of the TCA 
cycle, which are soluble proteins found in the mitochondrial matrix, succinate 
dehydrogenase is an integral membrane protein tightly associated with the inner 
mitochondrial membrane. Succinate oxidation involves removal of H atoms across a 
C—C bond, rather than a C—O or C—N bond, and produces the trans-unsaturated 
fumarate. This reaction (the oxidation of an alkane to an alkene) is not sufficiently 
exergonic to reduce NAD*, but it does yield enough energy to reduce [FAD]. (By 
contrast, oxidations of alcohols to ketones or aldehydes are more energetically 
favorable and provide sufficient energy to reduce NAD*.) 

Succinate dehydrogenase is a dimeric protein, with subunits of molecular masses 
70 and 27 kDa. FAD is covalently bound to the larger subunit; the bond involves a 
methylene group of C-8a of FAD and N-3 of a histidine on the protein (Figure 19.12). 
Succinate dehydrogenase also contains three different iron-sulfur clusters: a 4Fe-4S 
cluster, a 3Fe-4S cluster, and a 2Fe-2S cluster, shown below. 


Ng Ne 
Ss oe ~e 
Cys Cys 


Viewed from either end of the succinate molecule, the reaction involves dehydrogena- 
tion a,8 to a carbonyl (actually, a carboxyl) group. The dehydrogenation is stereo- 
specific, with the pro-S hydrogen removed from one carbon atom and the pro-R 
hydrogen removed from the other. The electrons captured by [FAD] in this reaction 
are passed directly into the iron-sulfur clusters of the enzyme and on to coenzyme Q 
(UQ). The covalently bound FAD is first reduced to [FADH,] and then reoxidized to 
form [FAD] and the reduced form of coenzyme Q, UQH. Electrons captured by 
UQH,; then flow through the rest of the electron-transport chain. 

Note that flavin coenzymes can carry out either one-electron or two-electron transfers. 
The succinate dehydrogenase reaction represents a net two-electron reduction of FAD. 


19.4c Fumarase Catalyzes the Trans-Hydration 
of Fumarate to Form L-Malate 


Fumarate is hydrated in a stereospecific reaction by fumarase to give L-malate. 


COO 
H a, HO | 


C \ H— C — COO% 

| | 

C Fumarase HC — COOT 
-00c” “H 


Fumarate L-Malate 


The reaction involves trans-addition of the elements of water across the double 
bond. Recall that aconitase carries out a similar reaction and that trans-addition of 
—H and —OH occurs across the double bond of cis-aconitate. Although the exact 
mechanism is uncertain, it may involve protonation of the double bond to form an 
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19.5 What Are the Energetic Consequences of the TCA Cycle? 


(a) Carbonium ion mechanism 
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intermediate carbonium ion (Figure 19.13a) or possibly attack by water or OH- 
anion to produce a carbanion, followed by protonation (Figure 19.13b). 


19.4d Malate Dehydrogenase Completes the Cycle 
by Oxidizing Malate to Oxaloacetate 


In the last step of the TCA cycle, L-malate is oxidized to oxaloacetate by malate dehy- 
drogenase. This reaction is very endergonic, with a AG?’ of +30 kJ/mol. 


NADH 
NAD+ + Ht 
OH \ J Oo 
| N 
H—C—Coo- Malate C—COO- 
dehydrogenase | 
HC — COO7 N 7S H,C—COO- 
NAD+ NADH 
L-Malate + Ht Oxaloacetate 


Consequently, the concentration of oxaloacetate in the mitochondrial matrix is usually 
quite low. The reaction, however, is pulled forward by the favorable citrate synthase 
reaction. Oxidation of malate is coupled to reduction of yet another molecule of NAD*, 
the third one of the cycle. Counting the [FAD] reduced by succinate dehydrogenase, 
this makes the fourth coenzyme reduced through oxidation of a single acetate unit. 
Malate dehydrogenase is structurally and functionally similar to other dehydrogenases, 
notably lactate dehydrogenase (Figure 19.14). Both consist of alternating B-sheet and 
a-helical segments. Binding of NAD* causes a conformational change in the 20-residue 
segment that connects the D and E strands of the B-sheet. The change is triggered by an 
interaction between the adenosine phosphate moiety of NAD* and an arginine residue in 
this loop region. Such a conformational change is consistent with an ordered single- 
displacement mechanism for NAD*-dependent dehydrogenases (see Section 13.5b). 


19.5 | What Are the Energetic Consequences 
of the TCA Cycle? 


The net reaction accomplished by the TCA cycle, as follows, shows two molecules of 
CO,, one ATP, and four reduced coenzymes produced per acetate group oxidized. 
The cycle is exergonic, with a net AG°’ for one pass around the cycle of approximately 
—40 kJ/mol. Table 19.1 compares the AG?’ values for the individual reactions with the 
overall AG” for the net reaction. 


Figure 19.13 Two possible mechanisms for the 
fumarase reaction. (a) Carbonium ion mechanism. 
(b) Carbanion mechanism. 


Figure 19.14 The active site of malate 
dehydrogenase. Malate is yellow; NAD* is pink 
(PDB id = 1EMD). 
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Acetyl-CoA + 3 NAD* + [FAD] + ADP + P; + 2 H,O —> 
2 CO, + 3 NADH + 3 H+ + [FADH,] + ATP + CoASH 
AG” = —40 kJ/mol 


Glucose metabolized via glycolysis produces two molecules of pyruvate and thus two 
molecules of acetyl-CoA, which can enter the TCA cycle. Combining glycolysis and 
the TCA cycle gives the net reaction: 


Glucose + 2 H,O + 10 NAD+ + 2 [FAD] + 4 ADP + 4 P; —> 
6 CO, + 10 NADH + 10 H* + 2 [FADH,] + 4 ATP 


The stoichiometries of glycolysis and the TCA cycle can be summarized as: 
Glycolysis: 
CsH\206 —> 2 CH;COCOO™ + 2 H* + 4H = 2C;H,0O; + 4H 
Net: 2 ATP + 2 NADH + 2 H* from 4 H and the cleavage of 1 C—C bond. 
TCA Cycle (including pyruvate dehydrogenase): 
2 C3;H,O; + 6 H:O —> 6 CO, + 20 H 


Net: 2 ATP and 8 NADH + 8 H+ + 2 FADH, from 20 H and the cleavage of 4 
C—C bonds. 


All six carbons of glucose are liberated as CO;, and a total of four molecules of ATP 
are formed thus far in substrate-level phosphorylations. The 12 reduced coenzymes 
produced up to this point can eventually produce as many as 34 molecules of ATP 
in the electron-transport and oxidative phosphorylation pathways. A stoichiometric 
relationship for these subsequent processes would be 


NADH + H* +10, + 3 ADP + 3 P; — NAD+ + 3 ATP + 4 HO 
[FADH] + 40, + 2 ADP + 2 P; — [FAD] + 2 ATP + 3 H,O 


Thus, a total of three ATP per NADH and two ATP per FADH; may be produced 
through the processes of electron transport and oxidative phosphorylation. 


19.5a The Carbon Atoms of Acetyl-CoA Have Different Fates in the TCA Cycle 


It is instructive to consider how the carbon atoms of a given acetate group are routed 
through several turns of the TCA cycle. As shown in Figure 19.15, neither of the two 
carbon atoms of a labeled acetate unit is lost as CO in the first turn of the cycle. The 
CO; released in any turn of the cycle derives from the carboxyl groups of the oxaloac- 
etate acceptor (produced in the previous turn), not from incoming acetyl-CoA. On 
the other hand, succinate labeled on one end from the original labeled acetate forms 
two different labeled oxaloacetates. The carbonyl carbon of acetyl-CoA is evenly 
distributed between the two carboxyl carbons of oxaloacetate, and the labeled methyl 
carbon of incoming acetyl-CoA ends up evenly distributed between the methylene 
and carbonyl carbons of oxaloacetate. 

When these labeled oxaloacetates enter a second turn of the cycle, both of the carboxyl 
carbons are lost as CO , but the methylene and carbonyl carbons survive through the second 
turn. Thus, the methyl carbon of a labeled acetyl-CoA survives two full turns of the cycle. 
In the third turn of the cycle, one-half of the carbon from the original methyl group of 
acetyl-CoA has become one of the carboxyl carbons of oxaloacetate and is thus lost as COs. 
In the fourth turn of the cycle, further scrambling results in loss of half of the remaining 
labeled carbon (one-fourth of the original methyl carbon label of acetyl-CoA), and so on. 

It can be seen that the carbonyl and methyl carbons of labeled acetyl-CoA have very 
different fates in the TCA cycle. The carbonyl carbon survives the first turn intact but is 
completely lost in the second turn. The methyl carbon survives two full turns, then 
undergoes a 50% loss through each succeeding turn of the cycle. 

It is worth noting that the carbon-carbon bond cleaved in the TCA pathway entered 
as an acetate unit in the previous turn of the cycle. Thus, the oxidative decarboxylations 
that cleave this bond are just a cleverly disguised acetate C—C cleavage and oxidation. 
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(a) Fate of the carboxyl carbon of acetate unit 
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Figure 19.15 The fate of the carbon atoms of acetate in successive TCA cycles. Assume at the start, 
labeled acetate is added to cells containing unlabeled metabolites. The pink arrows lead from one turn 
of the TCA cycle to a subsequent one. (a) The carbonyl carbon of acetyl-CoA is fully retained 
through one turn of the cycle but is lost completely in a second turn of the cycle. (b) The methyl 
carbon of a labeled acetyl-CoA survives two full turns of the cycle but becomes equally distributed 
among the four carbons of oxaloacetate by the end of the second turn. In each subsequent turn of 
the cycle, one-half of this carbon (the original labeled methyl group) is lost. 
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A Deeper Look 


Steric Preferences in NAD*-Dependent Dehydrogenases 


The enzymes that require nicotinamide coenzymes are stereospe- 
cific and transfer hydride to either the pro-R or the pro-S positions 
selectively. 

What accounts for this stereospecificity? It arises from the fact 
that the enzymes (and especially the active sites of enzymes) are 
inherently asymmetric structures. The nicotinamide coenzyme (and 
the substrate) fit the active site in only one way. Malate dehydroge- 


nase, the citric acid cycle enzyme, transfers hydride to the Hp posi- 
tion of NADH, but glyceraldehyde-3-P dehydrogenase in the glyco- 
lytic pathway transfers hydride to the Hs position, as shown in the 
accompanying figure. Dehydrogenases are stereospecific with re- 
spect to the substrates as well. Note that alcohol dehydrogenase 
removes hydrogen from the pro-R position of ethanol and transfers 
it to the pro-R position of NADH. 


Glyceraldehyde- 
3-phosphate 


NAD* 


———————— 


Malate 
dehydrogenase 


Oxaloacetate R 


P, 
—=$$——_ 


Glyceraldehyde- 
3-phosphate 
dehydrogenase 


1,3-Bisphospho- 
glycerate 


—— 
<— 


Alcohol 
dehydrogenase 


Acetaldehyde 


Stereochemistry of dehydrogenase reactions: (a) the malate dehydrogenase reaction; (b) the glyceraldehyde-3-phosphate 
dehydrogenase reaction; (c) the alcohol dehydrogenase reaction. 


19.6 | Can the TCA Cycle Provide Intermediates 


for Biosynthesis? 


Until now we have viewed the TCA cycle as a catabolic process because it oxidizes 
acetate units to CO, and converts the liberated energy to ATP and reduced coen- 
zymes. The TCA cycle is, after all, the end point for breakdown of food materials, at 
least in terms of carbon turnover. However, as shown in Figure 19.16, four-, five-, and 
six-carbon species produced in the TCA cycle also fuel a variety of biosynthetic pro- 
cesses. a-Ketoglutarate, succinyl-CoA, fumarate, and oxaloacetate are all precursors 
of important cellular species. (To participate in eukaryotic biosynthetic processes, 
however, they must first be transported out of the mitochondria.) A transamination 
reaction converts a-ketoglutarate directly to glutamate, which can then serve as a 
versatile precursor for proline, arginine, and glutamine. Succinyl-CoA provides most 
of the carbon atoms of the porphyrins. Oxaloacetate can be transaminated to 
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Figure 19.16 The TCA cycle provides intermediates for numerous biosynthetic processes in the cell. 
Amino acids are highlighted in orange. 
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produce aspartate. Aspartic acid itself is a precursor of the pyrimidine nucleotides 
and, in addition, is a key precursor for the synthesis of asparagine, methionine, lysine, 
threonine, and isoleucine. Oxaloacetate can also be decarboxylated to yield PEP, 
which is a key element of several pathways, namely (1) synthesis (in plants and micro- 
organisms) of the aromatic amino acids phenylalanine, tyrosine, and tryptophan; 


Human Biochemistry 


Leber’s Hereditary Optic Neuropathy Arises from Mitochondrial DNA Mutations 


Diseases arising from defects in mitochondrial enzymes are quite rare, 
because major defects in the TCA cycle (and the respiratory chain) 
are incompatible with life and affected embryos rarely survive to 
birth. Even so, about 150 different hereditary mitochondrial diseases 
have been reported. Although mitochondria carry their own DNA, 
many of the reported diseases map to the nuclear genome, because 
most of the mitochondrial proteins are imported from the cytosol. 


An interesting disease linked to mitochondrial DNA mutations is 
Leber’s hereditary optic neuropathy (LHON), in which the genetic 
defects are located primarily in the mitochondrial DNA coding for 
the subunits of NADH-CoQ reductase, also known as Complex I 
of the electron-transport chain. Leber’s disease is the most common 
form of blindness in otherwise healthy young men and occurs less 
often in women. 
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Figure 19.17 Pyruvate carboxylase, phosphoenolpyr- 
uvate (PEP) carboxylase, and malic enzyme catalyze 
anaplerotic reactions, replenishing TCA cycle inter- 
mediates. 


Chapter 19 The Tricarboxylic Acid Cycle 


(2) formation of 3-phosphoglycerate and conversion to the amino acids serine, gly- 
cine, and cysteine; and (3) gluconeogenesis, the pathway that synthesizes new glucose 
and many other carbohydrates. 

Finally, citrate can be exported from the mitochondria then broken down by ATP- 
citrate lyase to yield oxaloacetate and acetyl-CoA, a precursor of fatty acids. 
Oxaloacetate produced in this reaction is rapidly reduced to malate, which can then be 
processed in either of two ways: It may be transported into mitochondria, where it is 
reoxidized to oxaloacetate, or it may be oxidatively decarboxylated to pyruvate by malic 
enzyme, with subsequent mitochondrial uptake of pyruvate. This cycle permits citrate 
to provide acetyl-CoA for biosynthetic processes, with return of the malate and pyruvate 
by-products to the mitochondria. 


19.7 | What Are the Anaplerotic, or Filling Up, Reactions? 


In a sort of reciprocal arrangement, the cell also feeds many intermediates back into 
the TCA cycle from other reactions. Because such reactions replenish the TCA cycle 
intermediates, Hans Kornberg proposed that they be called anaplerotic reactions (lit- 
erally, the filling-up reactions). For example, PEP carboxylase and pyruvate carboxylase 
synthesize oxaloacetate from pyruvate (Figure 19.17). 

Pyruvate carboxylase is the most important of the anaplerotic enzymes. It exists in the 
mitochondria of animal cells but not in plants, and it provides a direct link between 
glycolysis and the TCA cycle. Pyruvate carboxylase is tetrameric and contains covalently 
bound biotin and an Mg?* site on each subunit. Pyruvate carboxylase has an absolute 
allosteric requirement for acetyl-CoA. Thus, when acetyl-CoA levels exceed the oxalo- 
acetate supply, allosteric activation of pyruvate carboxylase by acetyl-CoA raises 
oxaloacetate levels, so the excess acetyl-CoA can enter the TCA cycle. 

PEP carboxylase occurs in yeast, bacteria, and higher plants, but not in animals. The 
enzyme is specifically inhibited by aspartate, which is produced by transamination of 
oxaloacetate. Thus, organisms using this enzyme control aspartate production by regu- 
lation of PEP carboxylase. Malic enzyme is found in the cytosol or mitochondria of 
many animal and plant cells and is an NADPH-dependent enzyme. 
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A Deeper Look 


Anaplerosis Plays a Critical Role in Insulin Secretion 


The B-cells of the islets of Langerhans in the pancreas release 
insulin in response to an increase of blood glucose. A rise in 
blood glucose induces an increase in B-cell glucose metabolism, 
leading to insulin release. But what cellular processes mediate this 
response? For many years, it was widely accepted that ATP (pro- 
duced from the processes of glycolysis, the TCA cycle, electron 
transport, and oxidative phosphorylation) activated K* channels 
in the plasma membrane, triggering insulin release. However, re- 
cent research has shown that anaplerotic enzymes feed alternative 
pathways that produce cytosolic signal molecules that also support 
insulin secretion. 

Pancreatic B-cells maintain high levels of the anaplerotic enzyme 
pyruvate carboxylase. In these cells, only about half the pyruvate 
from glycolysis enters the TCA cycle via pyruvate dehydrogenase, 
leading to oxidative processes that generate ATP. The rest of the 
pyruvate is converted by pyruvate carboxylase to oxaloacetate, 
much of which is converted to malate and exported to the cytosol, 
as part of a pyruvate/malate cycle (see figure). NADPH and other 
metabolites generated from this cycle act as intracellular messen- 
gers that appear to be as significant as ATP in provoking insulin 
secretion. 

In support of this new model for insulin secretion, Jamie Joseph 
and his colleagues have shown that B-cells that lack hypoxia- 
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inducible factor-1 beta (HIF-1f) display reduced anaplerotic activ- 
ity, reduced activities of pyruvate carboxylase and cytosolic malic 
enzyme, and reduced insulin secretion. 

Anaplerosis also appears to play a role in peripheral tissues. In 
insulin-resistant individuals, exercise increases anaplerotic activity, 
increases fatty acid oxidation, and restores insulin sensitivity. 
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The pyruvate/malate cycle in pancreatic B-cells. Oxaloacetate produced 
by high levels of pyruvate carboxylase is converted to malate, then 
exported to the cytosol. Cytosolic malic enzyme (MEc) converts malate 
to pyruvate, which is returned to the mitochondrial matrix. NADPH and 
other cytosolic signals produced by this and related cycles provoke 
insulin secretion. 
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It is worth noting that the reaction catalyzed by PEP carboxykinase could also 


function as an anaplerotic reaction, were it not for the particular properties of the 


enzyme. 
”=0;PO GDP GIP o 
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a a | 
CHo PEP carboxykinase HC — DDOR 
PEP Oxaloacetate 


CO, binds weakly to PEP carboxykinase, whereas oxaloacetate binds very tightly 
(Kp = 2 X 10~® M), and, as a result, the enzyme favors formation of PEP from 
oxaloacetate. 

The catabolism of amino acids provides pyruvate, acetyl-CoA, oxaloacetate, fuma- 
rate, a-ketoglutarate, and succinate, all of which may be oxidized by the TCA cycle. In 
this way, proteins as nutrients are excellent sources of energy. 
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A Deeper Look 


Fool’s Gold and the Reductive Citric Acid Cycle—The First Metabolic Pathway? 


How did life arise on the planet Earth? It was once supposed that a 
reducing atmosphere, together with random synthesis of organic 
compounds, gave rise to a prebiotic soup in which the first living 
things appeared. However, certain key compounds, such as arginine, 
lysine, and histidine; the straight-chain fatty acids; porphyrins; and 
essential coenzymes, have not been convincingly synthesized under 
simulated prebiotic conditions. This and other problems have led 
researchers to consider other models for the evolution of life. 

One of these alternative models, postulated by Giinter 
Wachtershauser, involves an archaic version of the TCA cycle run- 
ning in the reverse (reductive) direction. Reversal of the TCA cycle 
results in assimilation of CO, and fixation of carbon as shown. For 
each turn of the reversed cycle, two carbons are fixed in the formation 
of isocitrate and two more are fixed in the reductive transformation 
of acetyl-CoA to oxaloacetate. Thus, for every succinate that enters 
the reversed cycle, two succinates are returned, making the cycle 
highly autocatalytic. Because TCA cycle intermediates are involved 
in many biosynthetic pathways (see Section 19.6), a reversed TCA 
cycle would be a bountiful and broad source of metabolic substrates. 

A reversed, reductive TCA cycle would require energy input to 
drive it. What might have been the thermodynamic driving force 
for such a cycle? Wachtershauser hypothesizes that the anaerobic 
reaction of FeS and H3S to form insoluble FeS, (pyrite, also known 
as fool’s gold) in the prebiotic milieu could have been the driving 
reaction: 


FeS + HS —> Fes, (pyrite)| + Hz 


This reaction is highly exergonic, with a standard-state free energy 
change (AG°’) of —38 kJ/mol. Under the conditions that might 
have existed in a prebiotic world, this reaction would have been 
sufficiently exergonic to drive the reductive steps of a reversed TCA 
cycle. 


Wachtershauser has also suggested that early metabolic processes 
first occurred on the surface of pyrite and other related mineral 
materials. The iron-sulfur chemistry that prevailed on these mineral 
surfaces may have influenced the evolution of the iron-sulfur proteins 
that control and catalyze many reactions in modern pathways 
(including the succinate dehydrogenase and aconitase reactions of 
the TCA cycle). This reductive citric acid cycle has been shown to 
occur in certain extant archaea and bacteria, where it serves all their 
carbon needs. 
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19.8 | How Is the TCA Cycle Regulated? 


Situated as it is between glycolysis and the electron-transport chain, the TCA cycle 
must be carefully controlled. If the cycle were permitted to run unchecked, large 
amounts of metabolic energy could be wasted in overproduction of reduced coenzymes 
and ATP; conversely, if it ran too slowly, ATP would not be produced rapidly enough 
to satisfy the needs of the cell. Also, as just seen, the TCA cycle is an important source 
of precursors for biosynthetic processes and must be able to provide them as needed. 

What are the sites of regulation in the TCA cycle? Based on our experience with 
glycolysis (see Figure 18.22), we might anticipate that some of the reactions of the TCA 
cycle would operate near equilibrium under cellular conditions (with A G = 0), whereas 
others—the sites of regulation—would be characterized by large negative AG values. 
Estimates for the values of A G in mitochondria, based on mitochondrial concentrations 
of metabolites, are summarized in Table 19.1. Three enzyme reactions of the cycle— 
citrate synthase, isocitrate dehydrogenase, and a-ketoglutarate dehydrogenase—operate 
with large negative AG values under mitochondrial conditions and are thus the primary 
sites of regulation in the cycle. 

The regulatory actions that control the TCA cycle are shown in Figure 19.18. As one 
might expect, the principal regulatory signals are the concentrations of acetyl-CoA, 
ATP, NAD*, and NADH, with additional effects provided by several other metabolites. 
The main sites of regulation are pyruvate dehydrogenase, citrate synthase, isocitrate 
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Figure 19.18 Regulation of the TCA cycle. Green plus symbols represent activation; red circles with 
slashes indicate inhibition. 


dehydrogenase, and a-ketoglutarate dehydrogenase. All of these enzymes are inhibited 
by NADH, so when the cell has produced all the NADH that can conveniently be 
turned into ATP, the cycle shuts down. For similar reasons, ATP is an inhibitor of 
pyruvate dehydrogenase and isocitrate dehydrogenase. The TCA cycle is turned on, 
however, when either the ADP/ATP or NAD*/NADH ratio is high, an indication that 
the cell has run low on ATP or NADH. Regulation of the TCA cycle by NADH, NAD‘, 
ATP, and ADP thus reflects the energy status of the cell. On the other hand, succinyl- 
CoA is an intracycle regulator, inhibiting citrate synthase and a-ketoglutarate dehydro- 
genase. Acetyl-CoA acts as a signal to the TCA cycle that glycolysis or fatty acid 
breakdown is producing two-carbon units. Acetyl-CoA activates pyruvate carboxylase, 
the anaplerotic reaction that provides oxaloacetate, the acceptor for increased flux of 
acetyl-CoA into the TCA cycle. 


19.8a Pyruvate Dehydrogenase Is Regulated 
by Phosphorylation/Dephosphorylation 
Most organisms can synthesize sugars such as glucose from pyruvate. However, 


animals cannot synthesize glucose from acetyl-CoA. For this reason, the pyruvate 
dehydrogenase complex, which converts pyruvate to acetyl-CoA, plays a pivotal role 
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Figure 19.19 Regulation of the pyruvate 
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Figure 19.20 Structure of human SIRT3 with a 
bound peptide substrate analog (shown in ball-and- 
stick). The small domain of SIRT3 is at the top; the 
large domain is at the bottom (PDB id = 3GLT). 


in metabolism. Conversion to acetyl-CoA commits nutrient carbon atoms either to 
oxidation in the TCA cycle or to fatty acid synthesis. Because this choice is so crucial 
to the organism, pyruvate dehydrogenase is a carefully regulated enzyme. It is sub- 
ject to product inhibition and is further regulated by nucleotides. Finally, activity of 
pyruvate dehydrogenase is regulated by phosphorylation and dephosphorylation of 
the enzyme complex itself. 

High levels of either product, acetyl-CoA or NADH, allosterically inhibit the 
pyruvate dehydrogenase complex. Acetyl-CoA specifically blocks dihydrolipoyl trans- 
acetylase, and NADH acts on dihydrolipoyl dehydrogenase. The mammalian pyruvate 
dehydrogenase is also regulated by covalent modifications. As shown in Figure 19.19, a 
Mg?*-dependent pyruvate dehydrogenase kinase is associated with the enzyme in 
mammals. This kinase is allosterically activated by NADH and acetyl-CoA, and when 
levels of these metabolites rise in the mitochondrion, they stimulate phosphorylation of 
three residues (Ser?", Ser, and Ser?7!) on the pyruvate dehydrogenase a-subunit, 
blocking the first step of the pyruvate dehydrogenase reaction, the decarboxylation of 
pyruvate. Inhibition of the dehydrogenase in this manner eventually lowers the levels of 
NADH and acetyl-CoA in the matrix of the mitochondrion. Reactivation of the 
enzyme is carried out by pyruvate dehydrogenase phosphatase, a Ca**-activated enzyme 
that binds to the dehydrogenase complex and hydrolyzes the phosphoserine moieties on 
the dehydrogenase subunit. At low ratios of NADH to NAD+ and low acetyl-CoA 
levels, the phosphatase maintains the dehydrogenase in an activated state, but a high level 
of acetyl-CoA or NADH once again activates the kinase and leads to the inhibition of 
the dehydrogenase. Insulin and Ca* ions activate dephosphorylation, and pyruvate 
inhibits the phosphorylation reaction. 

Pyruvate dehydrogenase is also sensitive to the energy status of the cell. AMP acti- 
vates pyruvate dehydrogenase, whereas GTP inhibits it. High levels of AMP are a sign 
that the cell may be energy-poor. Activation of pyruvate dehydrogenase under such 
conditions commits pyruvate to energy production. 


19.8b Isocitrate Dehydrogenase Is Strongly Regulated 


The mechanism of regulation of isocitrate dehydrogenase (Figure 19.18) is in some 
respects the reverse of pyruvate dehydrogenase. The mammalian isocitrate dehydro- 
genase is subject only to allosteric activation by ADP and NAD* and to inhibition by 
ATP and NADH. Thus, high NAD*/NADH and ADP/ATP ratios stimulate isocitrate 
dehydrogenase and TCA cycle activity. 

It may seem surprising that isocitrate dehydrogenase is strongly regulated, because it is 
not an apparent branch point within the TCA cycle. However, the citrate/isocitrate ratio 
controls the rate of production of cytosolic acetyl-CoA, because acetyl-CoA in the cyto- 
sol is derived from citrate exported from the mitochondrion. (Breakdown of cytosolic 
citrate produces oxaloacetate and acetyl-CoA, which can be used in a variety of biosyn- 
thetic processes.) Thus, isocitrate dehydrogenase activity in the mitochondrion favors 
catabolic TCA cycle activity over anabolic utilization of acetyl-CoA in the cytosol. 


19.8c Regulation of TCA Cycle Enzymes by Acetylation 


Lysine acetylation is a dynamic, reversible, and regulatory post-translational modifica- 
tion that enables cells to sense energy status and respond to environmental changes and 
stresses (Chapter 15). All of the TCA cycle enzymes are acetylated in organisms from 
bacteria to plants to mammals, and acetylation inhibits TCA enzyme activity in the 
cases that have been studied, particularly isocitrate dehydrogenase, succinate 
dehydrogenase, and malate dehydrogenase (see Figure 15.11). SIRT3 is a mitochondrial 
NAD*-dependent deacetylase of the sirtuin family (Chapter 15) that can deacetylate 
and activate all these TCA enzymes (Figure 19.20). SIRT3 expression is a function of 
exercise status in humans. Adults who exercise vigorously for an hour six days a week 
express SIRT3 at levels nearly three times those of sedentary adults. Elevated levels of 
SIRT3 presumably activate TCA cycle enzymes, and studies have shown that increased 
levels of SIRT3 are correlated with extended lifespans in humans. 
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TCA Metabolites Play Roles in Many Pathways via Post-Translational Modifications 


Post-translational modifications (PTMs) of enzymes and other 
proteins play important roles in most if not all biological pathways. 
Several TCA metabolites in particular modify and regulate the 
activities of metabolic enzymes. For example, Matthias Mann and 
colleagues have shown that acetyl-CoA is the substrate for acetyla- 
tion of 3600 lysine residues on 1750 proteins in human cell lines (see 
figure a)—lysine acetylation was first observed with histone proteins 
in the cell nucleus; thus lysine acetylation enzymes are referred to as 
histone acetyltransferases or HATs. Nearly every enzyme in glycoly- 
sis, the TCA cycle, gluconeogenesis, and glycogen metabolism, fatty 
acid oxidation, and the urea cycle is acetylated in this way. 


o- 
me) 
(c) SH OO S O 
+ m a o —> 
Some post-translational modifications that involve TCA metabolites as 


substrates. (a) Acetylation of lysine by acetyl-CoA. (b) Succinylation of 
lysine by succinyl-CoA. (c) Succination of cysteine by fumarate. 


Similarly, succinyl-CoA is the substrate for succinylation of lysine 
residues on many proteins (figure b). For example, David Lombard 
and Yingming Zhao have identified 2565 lysine succinylation sites 
on 779 proteins in mammalian cells. This reaction is reversible, with 
the desuccinylation reaction catalyzed by a sirtuin referred to as 
SIRTS. 


19.8d Two Covalent Modifications Regulate 
E. coli |socitrate Dehydrogenase 


Interestingly, fumarate can also serve as a PTM substrate in a 
reaction with Cys residues on proteins that is irreversible and is 
termed succination because the final product is a covalently bound 
succinate group (figure c). This is an example of a Michael addition. 

How could these modifications affect protein or enzyme func- 
tion? Steric factors and alteration of surface charge may well play 
a role. Acetylation neutralizes the positive charge of the involved 
lysine residue, whereas succinylation changes a lysine e-amino posi- 
tive charge to a negative charge on the associated carboxyl group. 
On the other hand, succination changes the cysteine-SH group into 
a pair of adjacent negatively charged carboxyl groups. 

All these PTMs with TCA metabolites are involved with multiple 
alterations of major cellular functions. Increases in cellular fumarate 
concentration are observed in diabetes and certain cancers. Obese 
mice (an animal model of type 2 diabetes), with high serum glucose 
levels, exhibit elevated levels of fumarate and greatly increased suc- 
cination of proteins in several types of fat cells (adipocytes). In dia- 
betic rats, glyceraldehyde-3-phosphate dehydrogenase (in glycolysis) 
is succinated and irreversibly inhibited in muscle tissue. Similarly, 
kidney tumors are characterized by mutations in fumarase that 
lead to elevated mitochondrial fumarate concentrations and mas- 
sive increases in succination of cellular proteins, decreased activity 
of aconitase in the TCA cycle, and metabolic reprogramming that 
contributes to cancer cell proliferation. 
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Interestingly, the E.coli isocitrate dehydrogenase is regulated by two different 
covalent modifications. Serine residues on each subunit of the dimeric enzyme are 
phosphorylated by the protein kinase activity of AceK, a bifunctional kinase/phos- 
phatase, causing inhibition of the isocitrate dehydrogenase activity. Activity is 
restored by the action of the phosphatase activity of AceK. When TCA cycle and 
glycolytic intermediates—such as isocitrate, 3-phosphoglycerate, pyruvate, PEP, 
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and oxaloacetate—are high, the kinase is inhibited, the phosphatase is activated, 
and the TCA cycle operates normally. When levels of these intermediates fall, the 
kinase is activated, isocitrate dehydrogenase is inhibited, and isocitrate is diverted to 
the glyoxylate pathway, as explained in the next section. Interestingly, the kinase and 
phosphatase activities of AceK depend on acetylation of AceK by lysine acetyltrans- 
ferases and removal of these acetyl groups by deacetylases. Acetylated AceK dis- 
plays a reduced ability to inactivate isocitrate dehydrogenase (by phosphorylation) 
but at the same time an increased ability to activate isocitrate dehydrogenase (via 
dephosphorylation by the phosphatase). 


19.8e TCA Metabolites Can Act as Cellular Signals 


In addition to their role in catabolic metabolism, some TCA cycle metabolites also 
participate in cell signaling. Succinate accumulates in macrophages (white blood cells 
that engulf and digest dead cells and cellular debris) during inflammation, acting as 
a signal to activate pro-inflammatory gene expression. Succinate also increases in 
tissues during ischemia (inadequate blood supply to an organ or part of the body). 
In its signaling role, succinate is exported from mitochondria to the cytosol, inhibit- 
ing prolyl hydroxylases and activating HIF-la, thus enhancing expression of pro- 
inflammatory genes (Section 18.8). It also can be exported from cells where it binds to 
SUCNRI, a G-protein-coupled receptor, either on adjacent cells or on the cells from 
which it was released, provoking pro-inflammatory responses. 


: -O O 
(0) oO 
O O : O 
— CO, + 
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cis-Aconitate Itaconate 


By contrast, itaconate, derived from cis-aconitate in the aconitate decarboxylase reac- 
tion, acts as an anti-inflammatory signal, counteracting the pro-inflammatory pathways 
activated by succinate. Itaconate acts by inhibiting succinate dehydrogenase and also can 
be exported from cells, where it alkylates cysteine residues on the protein KEAP1, en- 
hancing the expression of antioxidant and anti-inflammatory genes. Itaconate also in- 
hibits the induction of interferons, immune system proteins that stimulate or inhibit 
several hundred genes involved in antiviral defense mechanisms, inflammation, adaptive 
immunity, angiogenesis, and other processes. 


19.8f The TCA Cycle Operates as a Metabolon 


The enzymes of the TCA cycle associate as a metabolon (Section 17.2). Selfassembly of 
the TCA enzymes into a supercomplex allows the channeling of substrate from one 
enzyme to another, enabling a much higher metabolic flux than would be the case for 
separate enzymes distributed throughout the mitochondrial matrix. Cross-linking ex- 
periments in vivo and mass spectrometry studies confirm that the seven TCA enzymes 
in the matrix form a complex that associates with SDH of the inner mitochondrial 
membrane, and this entire eight-enzyme complex apparently functions as one unit 
(Figure 19.21). Fei Wu and Shelley Minteer have shown that computer simulations and 
analysis of surface electrostatic potential in models are consistent with a continuous 
positively charged surface across the assembled supercomplex, which could facilitate 
transport of the negatively charged TCA substrates from one active site to the next. 


19.9 | Can Any Organisms Use Acetate 
as Their Sole Carbon Source? 
Plants (particularly seedlings, which cannot yet accomplish efficient photosynthesis) 


as well as some bacteria and algae can use acetate as the only source of carbon for all 
the carbon compounds they produce. Although we saw that the TCA cycle can supply 
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intermediates for some biosynthetic processes, the cycle gives off two CO, for every 
two-carbon acetate group that enters and cannot effect the net synthesis of TCA cycle 
intermediates. Thus, it would not be possible for the cycle to produce the massive 
amounts of biosynthetic intermediates needed for acetate-based growth unless alter- 
native reactions were available. In essence, the TCA cycle is geared primarily to 
energy production, and it wastes carbon units by giving off CO. Modification of the 
cycle to support acetate-based growth would require eliminating the CO)-producing 
reactions and enhancing the net production of four-carbon units (that is, oxaloace- 
tate). Plants and bacteria employ a modification of the TCA cycle called the glyoxylate 
cycle to produce four-carbon dicarboxylic acids (and eventually even sugars) from 
two-carbon acetate units. The glyoxylate cycle bypasses the two oxidative decarbox- 
ylations of the TCA cycle and instead routes isocitrate through the isocitrate lyase 
and malate synthase reactions (Figure 19.22). Glyoxylate produced by isocitrate lyase 
reacts with a second molecule of acetyl-CoA to form L-malate. The net effect is to 
conserve carbon units, using two acetyl-CoA molecules per cycle to generate 
oxaloacetate. Some of this is converted to PEP and then to glucose by pathways 
of carbohydrate metabolism. 


19.9a The Glyoxylate Cycle Operates in Specialized Organelles 


The enzymes of the glyoxylate cycle in plants are contained in glyoxysomes, organ- 
elles devoted to this cycle. Yeast and algae carry out the glyoxylate cycle in the 
cytoplasm. The enzymes common to both the TCA and glyoxylate pathways exist as 
isozymes, with spatially and functionally distinct enzymes operating independently 
in the two cycles. 
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Figure 19.21 A schematic model for the TCA cycle 
metabolon, emphasizing the relative sizes of the 
cycle components and the circular path of the cycle 
itself, but not the precise arrangements of the 
enzymes with respect to each other. Abbreviations: 
PDH = pyruvate dehydrogenase complex; CS = 
citrate synthase; Aco = aconitase; IDH = isocitrate 
dehydrogenase; aK DH = alpha-ketoglutarate 
dehydrogenase; Succ-CoA synthetase = succinyl- 
CoA synthetase; SDH = succinate dehydrogenase; 
MDH = malate dehydrogenase. Adapted from 
Benit, P., et al., 2014. Unsuspected task for an old 
team: succinate, fumarate and other Krebs cycle 
acids in metabolic remodeling. Biochimica et 
Biophysica Acta 1837:1330-1337; and Wu, F., and 
Minteer, S., 2015. Krebs cycle metabolon: structural 
evidence of substrate channeling revealed by 
crosslinking and mass spectrometry. Angewandte 
Chemie, International Edition 54: 1851—1854. 
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Figure 19.22 The glyoxylate cycle. The first two steps 
are identical to TCA cycle reactions. The third step 
bypasses the CO>-evolving steps of the TCA cycle to 
produce succinate and glyoxylate. The malate 
synthase reaction forms malate from glyoxylate and 
another acetyl-CoA. The result is that one turn of 
the cycle consumes one oxaloacetate and two acetyl- 
CoA molecules but produces two molecules of 
oxaloacetate. (Succinate produced in the isocitrate 
lyase reaction is converted to oxaloacetate by TCA 
cycle reactions.) The net for this cycle is one 
oxaloacetate from two acetyl-CoA molecules. 
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19.9b Isocitrate Lyase Short-Circuits the TCA Cycle 


by Producing Glyoxylate and Succinate 


The isocitrate lyase reaction (Figure 19.23) produces succinate, a four-carbon 
product of the cycle, as well as glyoxylate, which can then combine with a second 
molecule of acetyl-CoA. Isocitrate lyase catalyzes an aldol cleavage and is similar 
to the reaction mediated by aldolase in glycolysis. The malate synthase reaction 
(see Figure 19.22), a Claisen condensation of acetyl-CoA with the aldehyde of gly- 
oxylate to yield malate, is quite similar to the citrate synthase reaction. Compared 
with the TCA cycle, the glyoxylate cycle (1) contains only five steps (as opposed to 
eight), (2) lacks the CO,-liberating reactions, (3) consumes two molecules of acetyl- 
CoA per cycle, and (4) produces four-carbon units (oxaloacetate) as opposed to 


one-carbon units. 


2R, 3S-Isocitrate 


Figure 19.23 The isocitrate lyase reaction. 
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19.9 Can Any Organisms Use Acetate as Their Sole Carbon Source? 


19.9¢ The Glyoxylate Cycle Helps Plants Grow in the Dark 


The existence of the glyoxylate cycle explains how certain seeds grow underground 
(or in the dark), where photosynthesis is impossible. Many seeds (peanuts, soybeans, 
and castor beans, for example) are rich in lipids, and most organisms degrade the 
fatty acids of lipids to acetyl-CoA. Glyoxysomes form in seeds as germination begins, 
and the glyoxylate cycle uses the acetyl-CoA produced in fatty acid oxidation to 
provide large amounts of oxaloacetate and other intermediates for carbohydrate syn- 
thesis. Once the growing plant begins photosynthesis and can fix CO) to produce 
carbohydrates, the glyoxysomes disappear. 


19.9d Glyoxysomes Must Borrow Three Reactions from Mitochondria 


Glyoxysomes do not contain all the enzymes needed to run the glyoxylate cycle: 
Succinate dehydrogenase, fumarase, and malate dehydrogenase are absent. Conse- 
quently, glyoxysomes must cooperate with mitochondria to run their cycle 
(Figure 19.24). Succinate travels from the glyoxysomes to the mitochondria, where it 
is converted to oxaloacetate. Transamination to aspartate follows because oxaloace- 
tate cannot be transported out of the mitochondria. Aspartate formed in this way 
then moves from the mitochondria back to the glyoxysomes, where a reverse trans- 
amination with a-ketoglutarate forms oxaloacetate, completing the shuttle. Finally, to 
balance the transaminations, glutamate shuttles from glyoxysomes to mitochondria. 


lutamate Oxaloacetate 


A 


Malate 


Fumarate 


=oc=oc; S, > Succinate 


Malate ~Oxaloacetate 


Phosphoenolpyruvate 


Carbohydrate 


Figure 19.24 Glyoxysomes lack three of the enzymes needed to run the glyoxylate cycle. Succinate 
dehydrogenase, fumarase, and malate dehydrogenase are all borrowed from the mitochondria in a 
shuttle in which succinate and glutamate are passed to the mitochondria and a-ketoglutarate and 
aspartate are passed to the glyoxysome. 
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Chapter 19 The Tricarboxylic Acid Cycle 


Foundational Biochemistry Things You Should Know after Reading Chapter 19. 


Pyruvate is converted to acetyl-CoA and oxidized to CO, in the 
tricarboxylic acid cycle (also known as the citric acid cycle or Krebs 


cycle). 


Reducing equivalents released in the oxidative reactions of glycolysis, 
pyruvate decarboxylation, and the TCA cycle are captured in the 


form of NADH and enzyme-bound FADH). 


Glycolysis in one equation: 
C6H 1206 ——> 2 CH;COCOO- + 2H* + 4 H = 2 C3H,4O; + 4 H 


Net: 2 ATP + 2 NADH + 2 H+ from 4 H and the cleavage of 1 C—C 


bond 


TCA Cycle (including pyruvate dehydrogenase) in one equation: 
2 C;3H,O; + 6 HO —_ 6 CO; F 20 H 


Net: 2 ATP and 8 NADH + 8 H* + 2 FADH, from 20 H and the 


cleavage of 4 C—C bonds 
Carbon enters the TCA cycle as acetyl-CoA and exits as CO). 


Pyruvate is oxidatively decarboxylated to acetyl-CoA by the pyruvate 


dehydrogenase complex. 


Acetyl-CoA reacts with oxaloacetate in the citrate synthase reaction 


of the TCA cycle to produce citrate. 


Citrate is isomerized to form isocitrate by aconitase in the TCA cycle. 


Isocitrate dehydrogenase catalyzes the first oxidative decarboxylation 


of the TCA cycle, the conversion of isocitrate to a-ketoglutarate. 


a-Ketoglutarate is oxidatively decarboxylated to succinyl-CoA by 


a-ketoglutarate dehydrogenase. 


Succinyl-CoA synthetase catalyzes a substrate-level phosphorylation, 
the conversion of succinyl-CoA to succinate, and the attendant con- 


version of GDP and P; to GTP. 


Succinate dehydrogenase oxidizes succinate to fumarate, a reaction 


that is part of the mitochondrial electron-transport chain. 


Fumarase catalyzes the trans-hydration of fumarate to form L-malate 
in the TCA cycle. 


Malate is oxidized to oxaloacetate by malate dehydrogenase, the final 
reaction of the TCA cycle. 


The AG” for one pass around the TCA cycle is —40 kJ/mol. 


The carbonyl carbon of acetyl-CoA is fully retained through one turn 
of the TCA cycle but is lost completely in a second turn of the cycle. 


The methyl carbon of acetyl-CoA survives two full turns of the TCA 
cycle. However, in each subsequent turn of the cycle, one-half of the 
remaining carbon is lost. 


The four-, five-, and six-carbon species produced in the TCA cycle fuel a 
variety of biosynthetic processes. a-Ketoglutarate, succinyl-CoA, fuma- 
rate, and oxaloacetate are all precursors of important cellular species. 


Several cellular reactions (termed anaplerotic reactions), including 
pyruvate carboxylase, PEP carboxylase, and malic enzyme, replenish 
TCA cycle intermediates. 


The main sites of regulation in the TCA cycle are pyruvate dehydro- 
genase, citrate synthase, isocitrate dehydrogenase, and a-ketoglutarate 
dehydrogenase. All of these are inhibited by NADH. ATP is an inhibi- 
tor of pyruvate dehydrogenase and isocitrate dehydrogenase. 


Pyruvate dehydrogenase is inhibited by phosphorylation and acti- 
vated by dephosphorylation. 


The activities of TCA cycle enzymes are modulated by covalent 
modifications including acetylation, phosphorylation, succinylation, 
and succination. 


The enzymes of the TCA cycle associate and operate as a metabolon. 


Plants as well as some bacteria and algae employ a modification of 
the TCA cycle called the glyoxylate cycle, and thus can use acetate as 
their sole carbon source. Humans and other animals cannot. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


COO- aia COO- 

a COO- CHo p~ (0010 G 

ta le a —COO- HC—OH ie A 

l =0 i a —OH la 4 =O 

Lai das A lar dor l OO- 
Structure A Structure B Structure C Structure D Structure E Structure F 


1. Radiolabeling with “C-Glutamate Describe the labeling pattern that 
would result from the introduction into the TCA cycle of glutamate 
labeled at C, with 4C. 


2. Structures in the TCA Cycle Consider the structures shown and 
organize them in the order in which they appear in the TCA cycle. 


3. Enzymes in the TCA Cycle Consider the enzymes listed and organize 
them in the order in which they appear in the TCA cycle: (a) 


fumarase, (b) a-ketoglutarate dehydrogenase, (c) succinate dehydroge- 


nase, (d) aconitase, (e) malate dehydrogenase, and (f) isocitrate 
dehydrogenase. 


4. Assessing the Effects of Substrate Concentrations on the TCA 
Cycle Describe the effect on the TCA cycle of (a) increasing the 


concentration of NAD*, (b) reducing the concentration of ATP, or 
(c) increasing the concentration of isocitrate. 


. Assessing the Effect of Active-Site Phosphorylation on Enzyme 


Activity (Integrates with Chapter 15.) The serine residue of isocitrate 
dehydrogenase that is phosphorylated by protein kinase lies within 
the active site of the enzyme. This situation contrasts with most other 
examples of covalent modification by protein phosphorylation, where 
the phosphorylation occurs at a site remote from the active site. What 
direct effect do you think such active-site phosphorylation might have 
on the catalytic activity of isocitrate dehydrogenase? (See Barford, D., 
1991. Molecular mechanisms for the control of enzymic activity by 
protein phosphorylation. Biochimica et Biophysica Acta 1133:55-62.) 


. Understanding the Mechanism of the a-Ketoglutarate Dehydrogenase 


Reaction The first step of the a-ketoglutarate dehydrogenase reaction 
involves decarboxylation of the substrate and leaves a covalent TPP 
intermediate. Write a reasonable mechanism for this reaction. 


. Understanding the Action of Fluoroacetate on the TCA Cycle 


In a tissue where the TCA cycle has been inhibited by fluoroacetate, 
what difference in the concentration of each TCA cycle metabolite 
would you expect, compared with a normal, uninhibited tissue? 


. Designing an Assay for Succinate Dehydrogenase On the basis of the 


descriptions of the physical properties of FAD and FADH), suggest a 
method for the measurement of the enzyme activity of succinate 
dehydrogenase. 
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10. 


T. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


Understanding the Oxidative Processes in the TCA Cycle Starting with 
citrate, isocitrate, a-ketoglutarate, and succinate, state which of the 
individual carbons of the molecule undergo oxidation in the next step 
of the TCA cycle. Which molecules undergo a net oxidation? 


The Mechanism of Citrate Synthase Deuterated acetyl-coenzyme A 
(-CD; instead of -CH3) was used in an experiment to study the TCA 
cycle. Upon analysis of the metabolites, it was found that deuterium 
(D) was present in a wide variety of compounds; even compounds not 
associated with the citric acid cycle. Explain how this could be. 


Designing Substrate Analog Inhibitors for the TCA Cycle In addition 
to fluoroacetate, consider whether other analogs of TCA cycle 
metabolites or intermediates might be introduced to inhibit other, 
specific reactions of the cycle. Explain your reasoning. 


Understanding the Mechanism of the Pyruvate Dehydrogenase 
Reaction Based on the action of thiamine pyrophosphate in catalysis 
of the pyruvate dehydrogenase reaction, suggest a suitable chemical 
mechanism for the pyruvate decarboxylase reaction in yeast: 


Pyruvate —> acetaldehyde + CO, 


Regulation by Ca?* (Integrates with Chapter 15) Revisit Sections 16.1 
and 16.2 on the molecular mechanism of muscle contraction. In 
contracting muscles, Ca* is released from the sarcoplasmic reticulum. 
Would you predict that Ca?* inhibits or activates the pyruvate 
dehydrogenase complex and the TCA cycle in muscle cells? Explain 


your answer. 


Understanding the Energetics of the Aconitase Reaction (Integrates 
with Chapter 3.) Aconitase catalyzes the citric acid cycle reaction: 


Citrate == isocitrate 
The standard free energy change, A G°’, for this reaction is +6.7 kJ/ 
mol. However, the observed free energy change (AG) for this reac- 
tion in pig heart mitochondria is +0.8 kJ/mol. What is the ratio of 
[isocitrate]/[citrate] in these mitochondria? If [isocitrate] = 0.03 mM, 
what is [citrate]? 


Radiolabeling with “CO, in the Pyruvate Carboxylase Describe the 
labeling pattern that would result if '*CO, were incorporated into the 
TCA cycle via the pyruvate carboxylase reaction. 


Consequences of Radiolabeling With “CO, in the Reductive TCA 
Cycle Describe the labeling pattern that would result if the 
reductive, reversed TCA cycle (see A Deeper Look in Section 19.7) 
operated with CO). 


Consequences of Radiolabeling with “CH;-Acetyl-CoA in the 
Glyoxylate Cycle Describe the labeling pattern that would result in 
the glyoxylate cycle if a plant were fed acetyl-CoA labeled at 

the —CH; carbon. 


Understanding the Mechanism of the Malate Synthase Reaction 

The malate synthase reaction, which produces malate from 
acetyl-CoA and glyoxylate in the glyoxylate pathway, involves 
chemistry similar to the citrate synthase reaction. Write a mechanism 
for the malate synthase reaction, and explain the role of CoA in 

this reaction. 


Understanding How Cells Provide Acetate Units for Fatty Acid 
Synthesis in the Cytosol In most cells, fatty acids are synthesized from 
acetate units in the cytosol. However, the primary source of acetate 
units is the TCA cycle in mitochondria, and acetate cannot be 
transported directly from the mitochondria to the cytosol. Cells solve 
this problem by exporting citrate from the mitochondria and then 
converting citrate to acetate and oxaloacetate. Then, because cells 
cannot transport oxaloacetate into mitochondria directly, they must 
convert it to malate or pyruvate, both of which can be taken up by 
mitochondria. Draw a complete pathway for citrate export, conver- 
sion of citrate to malate and pyruvate, and import of malate and 
pyruvate by mitochondria. 


a. Which of the reactions in this cycle might require energy input? 
b. What would be the most likely source of this energy? 


Problems 667 


c. Do you recognize any of the enzyme reactions in this cycle? 
d. What coenzymes might be required to run this cycle? 


. Assessing the Equilibrium Concentrations in the Malate Dehydrogenase 


Reaction A typical intramitochondrial concentration of malate 

is 0.22 mM. If the ratio of NAD* to NADH in mitochondria is 20, 
and if the malate dehydrogenase reaction is at equilibrium, calculate 
the concentration of oxaloacetate in the mitochondrion at 20°C. A 
typical mitochondrion can be thought of as a cylinder 1 um in 
diameter and 2 um in length. Calculate the number of molecules of 
oxaloacetate in a mitochondrion. In analogy with pH (the negative 
logarithm of [H*]), what is pOoAA? 


. Understanding the Oxidation of Glucose and Its Products in the TCA 


Cycle Glycolysis, the pyruvate dehydrogenase reaction, and the TCA 
cycle result in complete oxidation of a molecule of glucose to CO). 
Review the calculation of oxidation numbers for individual atoms in 
any molecule, and then calculate the oxidation numbers of the 
carbons of glucose, pyruvate, the acetyl carbons of acetyl-CoA, and 
the metabolites of the TCA cycle to convince yourself that complete 
oxidation of glucose involves removal of 24 electrons and that each 
acetyl-CoA through the TCA cycle gives up 8 electrons. 


. A Simple Way to Understand and Remember the Reactions of the TCA 


Cycle Recalling all reactions of the TCA cycle can be a challenging 
proposition. One way to remember these is to begin with the simplest 
molecule—succinate, which is a symmetric four-carbon molecule. 
Begin with succinate, and draw the eight reactions of the TCA cycle. 
Remember that succinate —> oxaloacetate is accomplished by a 
special trio of reactions: oxidation of a single bond to a double bond, 
hydration across the double bond, and oxidation of an alcohol to a 
ketone. From there, a molecule of acetyl-CoA is added. If you 
remember the special function of acetyl-CoA (see A Deeper Look, in 
Section 19.2), this is an easy reaction to draw. From there, you need 
only isomerize, carry out the two oxidative decarboxylations, and 
remove the CoA molecule to return to succinate. 


. Understanding the Mechanism-Based Inactivation of Aconitase by 


Fluoroacetate Aconitase is rapidly inactivated by 2R, 3R- 
fluorocitrate, which is produced from fluoroacetate in the citrate 
synthase reaction. Interestingly, inactivation by fluorocitrate is 
accompanied by stoichiometric release of fluoride ion (i.e., one 

F- ion is lost per aconitase active site). This observation is consistent 
with mechanism-based inactivation of aconitase by fluorocitrate. 
Suggest a mechanism for this inactivation, based on formation of 
4-hydroxy-trans-aconitate, which remains tightly bound at the active 
site. To assess your answer, consult this reference: Lauble, H., et al., 
1996. The reaction of fluorocitrate with aconitase and the crystal 
structure of the enzyme-inhibitor complex. Proceedings of the 
National Academy of Sciences, USA 93:13699-13703. 


. The post-translational modification of cysteine residues on proteins 


by fumarate (Human Biochemistry—TCA Metabolites Play 

Roles in Many Pathways via Post-Translational Modifications, in 
Section 19.8) is termed a succination and proceeds via a mechanism 
known as Michael addition. Write a detailed mechanism for this 
Michael addition reaction. 


. Fatty Acid Metabolism in the TCA Cycle Complete oxidation of a 


16-carbon fatty acid can yield about 114 molecules of ATP. Study 
Figure 19.2 and determine how many ATP molecules would be 
generated if a 16-carbon fatty acid were metabolized solely by the 
TCA cycle, in the form of 8 acetyl-CoA molecules. 


. TCA Cycle Study Figure 19.18 and decide which of the following 


statements is false. 

a. Pyruvate dehydrogenase is inhibited by NADH. 
Pyruvate dehydrogenase is inhibited by ATP. 
Citrate synthase is inhibited by NADH. 
Succinyl-CoA activates citrate synthase. 


spo sf 


Acetyl-CoA activates pyruvate carboxylase. 
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668 Chapter 19 The Tricarboxylic Acid Cycle 
PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at 
http://www. psafeonline.net.) 


27. Examine the PSAFE structure for isocitrate dehydrogenase, and 
identify the a-helices, 3;9 helices, and B-sheets in this structure. Locate 
the enzyme residues involved in catalysis. 


Think-Pair-Share Question 


Consider the isotopically labeled oxaloacetate structure shown. Use 
it as a starting point for one round of the TCA cycle and predict 


the intermediates that could contain the isotopes. Draw the labeled 
intermediates. 
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Electron Transport and 20 
Oxidative Phosphorylation 


In all things of nature there is something of the 
marvelous. 


Aristotle (384-322 B.C.) 


< Wall Piece #IV (1985), a kinetic sculpture 
by George Rhoads. This complex mechan- 
ical art form can be viewed as a metaphor 
for the molecular apparatus underlying 
electron transport and ATP synthesis by 
oxidative phosphorylation. 


Wall Piece #IV (1985), a kinetic sculpture by George Rhoads 


Living cells save up metabolic energy predominantly in the form of fats and carbohydrates, 20.1 Where in the Cell Do Electron 
and they spend this energy for biosynthesis, membrane transport, and movement. In both Transport and Oxidative 
directions, energy is exchanged and transferred in the form of ATP. Most of the metabolic Phosphorylation Occur? 
energy that is obtainable from substrates entering glycolysis and the TCA cycle is funneled 20.2 How Is the Electron-Transport 
via oxidation—reduction reactions into NADH and reduced flavoproteins, the latter Chain Organized? 

symbolized by [FADH]. How do cells oxidize NADH and [FADH,] and convert their 20.3 What Are the Thermodynamic 
reducing potential into the chemical energy of ATP? Implications of Chemiosmotic 

Coupling? 


20.4 How Does a Proton Gradient Drive 
the Synthesis of ATP? 


20.5 What Is the P/O Ratio for 


hereas ATP made in glycolysis and the TCA cycle is the result of substrate-level Mitochondrial Oxidative 

phosphorylation, NADH-dependent ATP synthesis is the result of oxidative Phosphorylation? 
phosphorylation. Electrons stored in the form of the reduced coenzymes, NADH or 20.6 How Are the Electrons of Cytosolic 
[FADH,], are passed through an elaborate and highly organized chain of proteins and NADH Fed into Electron Transport? 
coenzymes, the electron-transport chain, finally reaching O, (molecular oxygen), the 20.7 How Do Mitochondria Mediate 


: i ose . is? 
terminal electron acceptor. Each component of the chain can exist in (at least) two oxi- Apoptosis: 


dation states, and each component is successively reduced and reoxidized as electrons 
move through the chain from NADH (or [FADH])]) to Oy. In the course of electron 
transport, a proton gradient is established across the inner mitochondrial membrane. 
It is the energy of this proton gradient that drives ATP synthesis. 


20.1 | Where in the Cell Do Electron Transport 
and Oxidative Phosphorylation Occur? 
The processes of electron transport and oxidative phosphorylation are membrane 


associated. Prokaryotes are the simplest life form, and prokaryotic cells typically con- 
sist of a single cellular compartment surrounded by a plasma membrane and a more 
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(a) 


rigid cell wall. In such a system, the conversion of energy from NADH and [FADH)] 
to the energy of ATP via electron transport and oxidative phosphorylation is carried 
out at (and across) the plasma membrane. 

In eukaryotic cells, electron transport and oxidative phosphorylation are localized in 
mitochondria, which are also the sites of TCA cycle activity and fatty acid oxidation. 
Mammalian cells contain 800 to 2500 mitochondria; other types of cells may have as 
few as one or two or as many as half a million mitochondria. Human erythrocytes, 
whose purpose is simply to transport oxygen to tissues, contain no mitochondria at all. 
The typical mitochondrion is about 0.5 + 0.3 um in diameter and from 0.5 to several 
microns long; its overall shape is sensitive to metabolic conditions in the cell, as will be 
described in the next section. 


20.1a Mitochondrial Functions Are Localized in Specific Compartments 


Mitochondria are surrounded by a simple outer membrane and a more complex inner 
membrane (Figure 20.1). The space between the inner and outer membranes is referred 
to as the intermembrane space. Several enzymes that utilize ATP (such as creatine ki- 
nase and adenylate kinase) are found in the intermembrane space. The smooth outer 
membrane is about 30% to 40% lipid and 60% to 70% protein and has a relatively high 
concentration of phosphatidylinositol. The outer membrane contains significant 
amounts of porins—transmembrane proteins, rich in B-sheets, that form large chan- 
nels across the membrane, permitting free diffusion of molecules with molecular 
weights of about 10,000 or less. The outer membrane plays a prominent role in main- 
taining the shape of the mitochondrion. The inner membrane is richly packed with 
proteins, which account for nearly 80% of its weight; thus, its density is higher than 
that of the outer membrane. The fatty acids of inner membrane lipids are highly un- 
saturated. Cardiolipin and diphosphatidylglycerol (see Chapter 8) are abundant. The 
inner membrane lacks cholesterol and is quite impermeable to molecules and ions. 
Species that must cross the mitochondrial inner membrane—ions, substrates, fatty 
acids for oxidation, and so on—are carried by specific transport proteins in the mem- 
brane. Notably, the inner membrane is extensively folded (Figure 20.1). The folds, 
known as cristae, provide the inner membrane with a large surface area in a small 
volume. During periods of active respiration, the inner membrane appears to shrink 
significantly, leaving a comparatively large intermembrane space. 


(b) 


Outer membrane 


Inner membrane 


Intermembrane 
space 


Cristae 


Figure 20.1 Mitochondria are the energy factories of animal cells. (a) A drawing of a mitochondrion 
with components labeled. (b) Tomography of a rat liver mitochondrion. The tubular structures in 
red, yellow, green, purple, and aqua represent individual cristae formed from the inner mitochondrial 
membrane. (b, Frey, T. G., and Mannella, C. A., 2000. The internal structure of mitochondria. 
Trends in Biochemical Sciences 25:319-324.) 
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Human Biochemistry 


Mitochondrial Dynamics in Human Diseases 


Mitochondria are mobile and versatile organelles that exist in 
dynamic networks. They continuously divide by fission and join 
by the process of fusion. The dynamic cycle of fission and fusion 
modulates numerous cellular processes, including the interactions 
between mitochondria and the endoplasmic reticulum, biogenesis 
of mitochondria, the programmed death of mitochondria (termed 
mitophagy), the dynein- and kinesin-mediated transport of mito- 
chondria along microtubule tracks in the cell, and sensing of and 
response to oxygen in the cell. 

Both fission and fusion are driven by proteins that hydrolyze GTP 
(GTPases). Fission of mitochondria is mediated by dynamin-related 
protein 1 (DRP1), a cytosolic protein that, when activated, migrates 
to the outer mitochondrial membrane and multimerizes to form a 
ringlike structure that constricts and divides the organelle (figure a). 
Fusion is accomplished by two GTPases in the outer mitochondrial 
membrane known as mitofusin-1 and mitofusin-2 and by an inner 


(a) (b) 


membrane protein, OPA1 (figure b). Mitofusins initiate fusion by 
forming homodimeric or heterodimeric, antiparallel, coiled-coil 
linkages between adjacent mitochondria, thus joining the two outer 
membranes. On the other hand, proteolytic cleavage of OPA1 
stimulates inner membrane fusion. Fusion and fission are both 
modulated by lipids generated by phospholipase D, particularly 
phosphatidic acid. The small, negatively charged head group of 
phosphatidic acid causes curvature of lipid bilayers and recruits 
adaptor proteins, promoting fusion. In contrast, hydrolysis of 
phosphatidic acid by lipin-1 produces diacylglycerol, which promotes 
fission. 

Abnormalities of mitochondrial dynamics are involved in car- 
diovascular disease, as well as neurodegenerative, endocrine, and 
neoplastic diseases, including cancer. For example, impaired fusion 
and enhanced fission result in fragmentation of the mitochondrial 
network in pulmonary arterial hypertension (bottom left). 


Fission 


Movement of DRP1 from 
the cytosol to the outer 
mitochondrial membrane 


Mitosis, or 
membrane 
depolarization, 
or oxidative 
stress 


Fragmented mitochondrial 
network in pulmonary artery 
smooth muscle cells of a 
patient with pulmonary 
arterial hypertension 


Mitochondria 


Fusion 


Inner 
mitochondrial 


membrane , - 
mitochondrial 


membrane 


Mitochondrial network in 
normal human pulmonary 
artery smooth muscle cells 


Courtesy of Dr. Zhigang Hong and Dr. Stephen Archer, Professor, Head, Department of Medicine, Queen's University, Tier 1 CRC in Mitochondrial Dynamics and 


Translational Medicine. 


Some of the events of mitochondrial (a) fission and (b) fusion. DRP1 is the dynamin-related protein 1. MMF s are mitochondrial fission 


factors. OPAI is a dynamin-related GTPase involved in mitochondrial fusion. 


References 
Archer, S. L., 2013. Mitochondrial Dynamics—Mitochondrial fission and 
fusion in human diseases. New England Journal of Medicine 369:2236-2251. 


Mishra, P., Carelli, V., Manfredi, G., and Chan, D. C., 2014. Proteolytic 
cleavage of Opal stimulates mitochondrial inner membrane fusion and 
couples fusion to oxidative phosphorylation. Cell Metabolism 19:630-641. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


674 Chapter 20 Electron Transport and Oxidative Phosphorylation 


20.1b The Mitochondrial Matrix Contains 
the Enzymes of the TCA Cycle 


The space inside the inner mitochondrial membrane is called the matrix, and it con- 
tains most of the enzymes of the TCA cycle and fatty acid oxidation. (An important 
exception, succinate dehydrogenase of the TCA cycle, is located in the inner mem- 
brane itself.) In addition, mitochondria contain circular DNA molecules, along with 
ribosomes and the enzymes required to synthesize proteins coded within the mito- 
chondrial genome. Although some of the mitochondrial proteins are made this way, 
most are encoded by nuclear DNA and synthesized by cytosolic ribosomes. 


20.2 | How Is the Electron-Transport Chain Organized? 


As we have seen, the metabolic energy from oxidation of food materials—sugars, fats, 
and amino acids—is funneled into formation of reduced coenzymes (NADH) and 
reduced flavoproteins ([FADH)]). The electron-transport chain reoxidizes the coen- 
zymes and channels the free energy obtained from these reactions into the creation of 
a proton gradient. This reoxidation process involves the removal of both protons and 
electrons from the coenzymes. Electrons move from NADH and [FADH,] to molecu- 
lar oxygen, O2, which is the terminal acceptor of electrons in the chain. The reoxida- 
tion of NADH, 


NADH (reductant) + H+ + O; (oxidant) —> NAD* + H,O (20.1) 

involves the following half-reactions: 
NAD+ + 2 Ht + 2 e —> NADH + H? @,' = —0.32 V (20.2) 
10, +2 H* + 2e —- H,O ə = +0.816 V (20.3) 


Here, half-reaction 20.3 is the electron acceptor and half-reaction 20.2 is the electron 
donor. Then 


AZ,’ = 0.816 


(—0.32) = 1.136 V (20.4) 


and, according to Equation 3.22c, the standard-state free energy change, AG*’, is 
—219 kJ/mol. Molecules along the electron-transport chain have reduction potentials 
between the values for the NAD*/NADH couple and the oxygen/H,O couple, so 
electrons move down the energy scale toward progressively more positive reduction 
potentials (Figure 20.2). 

Although electrons move from more negative to more positive reduction potentials 
in the electron-transport chain, it should be emphasized that the electron carriers do not 
operate in a simple linear sequence. This will become evident when the individual com- 
ponents of the electron-transport chain are discussed in the following paragraphs. 


Careers in Chemistry 
Dr. Vincent Wu 


Dr. Vincent Wu is a research scientist at Triplebar, 
a company dedicated to creating products that 
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Dr.Wu’s career began with a bachelor’s degree 
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genetics and involved studies of transcriptional 
regulation of plant cell wall degrading enzymes 
in Neurospora crassa. Dr. Wu believes that a 
strong grasp of the principles of biochemistry is 
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fermentation with genetically engineered fungi will 
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and development of vegan and lactose-free milk 
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20.2a The Electron-Transport Chain Can Be Isolated in Four Complexes 


The electron-transport chain involves several different molecular species, including: 


1. Flavoproteins, which contain tightly bound FMN or FAD as prosthetic groups and 
which may participate in one- or two-electron transfer events. 

2. Coenzyme Q, also called ubiquinone (and abbreviated CoQ or UQ) (see Figure 20.4), 
which can function in either one- or two-electron transfer reactions. 

3. Several cytochromes (proteins containing heme prosthetic groups [see Chapter 5], 
which function by carrying or transferring electrons), including cytochromes b, c, 
cı, a, and a3. Cytochromes are one-electron transfer agents in which the heme iron 
is converted from Fe** to Fe** and back. 

4. A number of iron-sulfur proteins, which participate in one-electron transfers in- 
volving the Fe* and Fe** states. 

5. Protein-bound copper, a one-electron transfer site that converts between Cu* and Cu?*. 


All these intermediates except for cytochrome c are membrane associated (either in the 
mitochondrial inner membrane of eukaryotes or in the plasma membrane of prokaryotes). 
Three types of proteins involved in this chain—flavoproteins, cytochromes, and iron-sulfur 
proteins—possess electron-transferring prosthetic groups. 

The components of the electron-transport chain can be purified from the mitochon- 
drial inner membrane. Solubilization of the membranes containing the electron-transport 
chain results in the isolation of four distinct protein complexes, and the complete chain 
can thus be considered to be composed of four parts: (I) NADH-coenzyme Q reductase, 
(II) succinate-coenzyme Q reductase, (III) coenzyme Q-cytochrome c reductase, and 
(IV) cytochrome c oxidase (Figure 20.3). Complex I accepts electrons from NADH, 
serving as a link between glycolysis, the TCA cycle, fatty acid oxidation, and the 
electron-transport chain. Complex II includes succinate dehydrogenase and thus forms 
a direct link between the TCA cycle and electron transport. Complexes I and II produce 


Figure 20.2 @,' and € values for the components 
of the mitochondrial electron-transport chain. 
Values indicated are consensus values for animal 
mitochondria. Black bars represent @,'; red bars, 6. 
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Fatty acyl-CoA dehydrogenase 
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Electron-transferring 
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Fe-S centers 


1 
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NADH-coenzyme Q | Complex III Complex IV 


oxidoreductase 


Cyto hrome galcompla Cytochrome aag complex, 


2 btype hemes, 
Rieske Fe-S center, 2 a-type hemes, 


C-type heme (cyt c) Cujon: 


Coenzyme Q-cytochrome c Cytochrome c oxidase 


Complex II oxidoreductase 


Flavoprotein 2 


Succinate dehydrogenase, 
FAD (covalent), 
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b-type heme 


Figure 20.3 An overview of the complexes and pathways in the mitochondrial 
Succinate-coenzyme Q electron-transport chain. (Adapted from Nicholls, D. G., and Ferguson, S. J., 
oxidoreductase Bioenergetics 3, London: Academic Press, 2022.) 


a common product, reduced coenzyme Q (UQH;), which is the substrate for coenzyme 
Q-cytochrome c reductase (Complex II). As shown in Figure 20.3, there are two other 
ways to feed electrons to UQ: the electron-transferring flavoprotein, which transfers elec- 
trons from the flavoprotein-linked step of fatty acyl-CoA dehydrogenase, and 
sn-glycerophosphate dehydrogenase, which converts dihydroxyacetone to sn-glycerol 
3-phosphate. Complex II oxidizes UQH) while reducing cytochrome c, which in turn is 
the substrate for Complex IV, cytochrome c oxidase. Complex IV is responsible for 
reducing molecular oxygen. Each of the complexes shown in Figure 20.3 is a large 
multisubunit complex embedded within the inner mitochondrial membrane. 


20.2b Complex | Oxidizes NADH and Reduces Coenzyme Q 


Complex I, also known as NADH dehydrogenase, transfers a pair of electrons from 
NADH to coenzyme Q, a small, hydrophobic, yellow compound. Another common 
name for this enzyme complex is NADH dehydrogenase. The complex from mammalian 
heart mitochondria (with an estimated mass of 980 kDa) involves 45 polypeptide chains, 
one molecule of flavin mononucleotide (FMN), and nine Fe-S clusters, together con- 
taining a total of 20 to 26 iron atoms (Table 20.1). Bacterial complex I is much simpler, 
with just 14 subunits and a mass of 500 kDa, but the cofactors in mitochondrial and 
bacterial complex I are the same. By virtue of its dependence on FMN, NADH-UQ 
reductase is a flavoprotein. 

Although the precise mechanism of the NADH-UQ reductase is unknown, the first 
step involves binding of NADH to the enzyme on the matrix side of the inner mito- 
chondrial membrane and transfer of electrons from NADH to tightly bound FMN: 


NADH + [FMN] + H+ —> [FMNH)] + NAD* (20.5) 


The second step involves the transfer of electrons from the reduced [FMNH)] to a 
series of Fe-S proteins, including both 2Fe-2S and 4Fe-4S clusters (see Figure 20.6). The 
versatile redox properties of the flavin group of FMN are important here. NADH is a 
two-electron donor, whereas the Fe-S proteins are one-electron transfer agents. The fla- 
vin of FMN has three redox states—the oxidized, semiquinone, and reduced states. It 
can act as either a one-electron or a two-electron transfer agent and may serve as a criti- 
cal link between NADH and the Fe-S proteins. 
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| Table 20.1 | ZA) Protein Complexes of the Mitochondrial Electron-Transport Chain 


Mass Prosthetic 
Complex (kDa) Subunits Group Binding Site for: 
NADH-UQ reductase 980 45 FMN NADH (matrix side) 
Fe-S UQ (lipid core) 
Succinate-UQ reductase 140 4 FAD Succinate (matrix side) 
Fe-S UQ (lipid core) 
UQ-Cyt c reductase 250 9-10 Heme b; Cyt c (intermembrane 
Heme by space side) 
Heme cı 
Fe-S 
Cytochrome c 13 1 Heme c Cyt c 
Cyta 
Cytochrome c oxidase 162 13 Heme a Cyt c (intermembrane 
Heme a; space side) 
Cu, 
Cug 


The final step of the reaction involves the transfer of two electrons from iron-sulfur 
clusters to coenzyme Q. Coenzyme Q is a mobile electron carrier. Its isoprenoid tail 
makes it highly hydrophobic, and it diffuses freely in the hydrophobic core of the inner 
mitochondrial membrane. As a result, it shuttles electrons from Complexes I and II to 
Complex II. The redox cycle of UQ is shown in Figure 20.4. The structural and func- 
tional organization of Complex I is shown in Figure 20.5. 


Complex | Transports Protons from the Matrix to the Cytosol The oxidation of one 
NADH and the reduction of one UQ by NADH-UQ reductase results in the net 
transport of protons from the matrix side to the cytosolic side of the inner membrane. 
The cytosolic side, where H* accumulates, is referred to as the P (for positive) face; 
similarly, the matrix side is the N (for negative) face. Some of the energy liberated by 
the flow of electrons through this complex is used in a coupled process to drive the 
transport of protons across the membrane. (This is an example of active transport, a 


(a) © O- OH 
—— 0O + H @ + Ht 
HCO i CH; ie | CHO CH; HCO T S-— CHs 
& F : Teg s+. ee = = 
HCO — (CHy— CH= C — CHo) — H CHO R H3CO—-\_ ZR 
| 
O OH OH 
Coenzyme Q, oxidized form Semiquinone Coenzyme Q, 
(Q, ubiquinone) intermediate reduced form 
(QH>) (QH,, ubiquinol) 
(b) 


Figure 20.4 Coenzyme Q, also known as ubiquinone, is a lipid-soluble carrier 
of electrons in the electron transport pathway. (a) The three oxidation states of 
coenzyme Q. (b) A space-filling model of coenzyme Q. 
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Figure 20.5 Structural organization of mammalian 
Complex I, based on cryo-electron microscopy, 
showing functional relationships within the L-shaped 
complex. (a) All redox centers of Complex I are 
located in the peripheral arm (PA), except ubiquinone, 
which is located in the hydrophobic membrane 
domain (MD). Electron flow from NADH to UQH, 
in the membrane pool is indicated. (b) Structure of 
the entire Complex I from mouse (Mus musculus) 
(PDB id = 6G2J). The 14 core subunits are in color 
and labeled accordingly, and the 31 supernumerary 
subunits are in white. The positions of the flavin 
mononucleotide (FMN) at the NADH binding site 
and of FeS cluster N>, the immediate electron donor 
to ubiquinone, are indicated with circles. The 
structure of the cofactor chain, together with the 
modeled structure of ubiquinone-10, is also shown 
separately in the inset. The distance from one center 
to the next is about 1 nm. Leonard Sazanov has 
proposed a coupling mechanism for complex I in all 
organisms, in which protons are taken up from the 
matrix exclusively by subunits ND4 and NDS. 
Protons then move back and forth along the hydrated, 
hydrophilic axis (part a), involving all the protein 
subunits of the inner membrane (part b) in a series of 
electrostatically driven ‘domino effect’ events that 
result in release of protons to the intermembrane 
space only through NDS. (Part b adapted from Agip, 
A., Blaza, J., Bridges, H., et al., 2018. Cryo-EM 
structures of complex I from mouse heart 
mitochondria in two biochemically defined states. 
Nature Struct. Mol. Biol. 25:548-556.) 
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phenomenon examined in detail in Chapter 9.) Available experimental evidence sug- 
gests a stoichiometry of four H* transported per two electrons passed from NADH 
to UQ. 


Electrostatic Interactions Along a Hydrophilic Axis Drive Proton Transport The struc- 
ture of Complex I is elegantly suited to its biological function. This multi-subunit 
complex consists of a large hydrophilic domain extending into the mitochondrial 
matrix (termed the peripheral arm-PA, Figure 20.5a) and a large hydrophobic do- 
main in the inner mitochondrial membrane (the membrane domain-MD). The core 
of this structure is composed of 14 subunits, plus a varying number of so-called 
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Human Biochemistry 


Solving a Medical Mystery Revolutionized Our Treatment of Parkinson’s Disease 


A tragedy among illegal drug users was the impetus for a revolution- 
ary treatment of Parkinson’s disease. In 1982, several mysterious 
cases of paralysis came to light in southern California. The victims, 
some of them teenagers, were frozen like living statues, unable to 
talk or move. The case was baffling at first, but it was soon traced 
to a batch of synthetic heroin that contained MPTP (1-methyl- 
4-phenyl-1,2,3,6-tetrahydropyridine) as a contaminant. MPTP is rap- 
idly converted in the brain to MPP* (1-methyl-4-phenylpyridine) by 


the enzyme monoamine oxidase B. MPP* is a potent inhibitor of 
mitochondrial Complex I (NADH-—UQ reductase), and it acts pref- 
erentially in the substantia nigra, an area of the brain that is essential 
to movement and also the region of the brain that deteriorates slowly 
in Parkinson’s disease. 

Parkinson’s disease results from the inability of the brain to pro- 
duce sufficient quantities of dopamine, a neurotransmitter. Neurolo- 
gist J. William Langston, asked to consult on the treatment of some 


of these patients, recognized that the symptoms of this drug-induced 
disorder were in fact similar to those of parkinsonism. He began 
treatment of the patients with L-dopa, which is decarboxylated in 
the brain to produce dopamine. The treated patients immediately 
ZN regained movement. Langston then took a bold step. He implanted 
fetal brain tissue into the brains of several of the affected patients, 
prompting substantial recovery from the Parkinson’s-like symptoms. 
Langston’s innovation sparked a revolution in the use of tissue im- 
plantation for the treatment of neurodegenerative diseases. 

Other toxins may cause similar effects in neural tissue. Timothy 
Greenamyre at Emory University has shown that rats exposed to the 
MPP* pesticide rotenone (see Figure 20.26) over a period of weeks experi- 

ence a gradual loss of function in dopaminergic neurons and then 

| develop symptoms of parkinsonism, including limb tremors and 

rigidity. This finding supports earlier research that links long-term 
pesticide exposure to Parkinson’s disease. 


— > 


Monoamine | 
oxidase B SS 
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MPTP 


Cell death 
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“supernumerary” subunits, for example 31 in mammals, that contribute to assem- 
bly, stability or regulation, or fulfill independent metabolic roles. These subunits 
are essentially identical in all species that have them (Figure 20.5b). The hydrophilic 
domain is essentially a superhighway for electrons, from the FMN cofactor site 
through a chain of seven Fe-S clusters that extends approximately 95 Å through the 
protein to a UQ site deep in the membrane (Figure 20.5a). Electronic structure cal- 
culations by Tomoyuki Hayashi and Alexei Stuchebrukhov have shown that elec- 
tron transfer between the redox centers in Complex I occurs by quantum mechani- 
cal tunneling. (See Section 14.5 for a discussion of quantum mechanical tunneling.) 
The distance from one redox center to the next in Complex I is roughly 1 nm. All 
redox centers of the enzyme except the UQ site are located outside the membrane 
domain. Therefore, in contrast to other proton-pumping respiratory complexes, the 
redox centers of Complex I cannot participate directly in proton translocation. The 
highly hydrophobic ubiquinone fits into a narrow channel at the junction between 
the peripheral arm (PA) and the membrane domain (MD). 

The membrane domain contains a novel feature, rarely found in other membrane 
proteins—a highly hydrated central axis containing 23 hydrophilic residues (8 Lys, 
7 Glu, 2 Asp, and 6 His) (Figure 20.5a). High resolution cryo-EM studies have 
identified about 90 water molecules in this central axis within 6 Å of the hydrophilic 
residues. Leonid Sazanov and others have postulated that these waters and hydro- 
philic residues are involved in the pumping of protons across Complex I from the 
mitochondrial matrix to the intermembrane space. 

A crevice in subunit ND1, termed the E-channel, connects the site of ubiquinone 
reduction to the first putative proton pump, consisting of subunits ND1, ND3, ND6, 
and ND4L. The other three likely proton channels are located in subunits ND2, ND4, 
and NDS. These latter three are homologous to a well-characterized class of antiporter 
channel proteins and are thus termed the antiporter subunits of Complex I. To drive 
proton pumping, the energy released during ubiquinone reduction is converted into 
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conformational changes in the proton channel subunits. Different conceptual schemes 
have been proposed, but all these models involve a wave of electrostatic interactions 
propagating from the ubiquinone site to the proton pumping channels. These interac- 
tions are presumed to involve reversible protonation of the hydrophilic residues and 
water molecules in the central axis. 


20.2c Complex II Oxidizes Succinate and Reduces Coenzyme Q 


Complex II is perhaps better known by its other name—succinate dehydrogenase, 
the only TCA cycle enzyme that is an integral membrane protein in the inner mito- 
chondrial membrane. This complex (Figure 20.6) has a mass of 124 kDa and is 


(a) 


Intermembrane 
space 


Complex II 


Succinate Fumarate 


(b) (c) 


Heme b 


Fe-S 
centers 


Figure 20.6 Complex II of the electron transport pathway. (a) A scheme for electron flow in Complex 
II. Oxidation of succinate occurs with reduction of [FAD]. Electrons are then passed to Fe-S centers 
and then to coenzyme Q (UQ). Proton transport does not occur in this complex. (b) The structure of 
Complex II from pig heart (PDB id = 1ZOY). (c) The arrangement of redox centers. Electron flow is 
from bottom to top. 
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composed of two hydrophilic subunits, (a flavoprotein (Fp, 68 kDa) and an iron- 
sulfur protein (Ip, 29 kDa)) and two hydrophobic, membrane-anchored subunits 
(15 kDa and 11 kDa), which contain one heme b and provide the binding site for UQ. 
Fp contains an FAD covalently bound to a His residue (see Figure 19.12), and Ip 
contains three Fe-S centers: a 4Fe-4S cluster, a 3Fe-4S cluster, and a 2Fe-2S cluster. 
When succinate is converted to fumarate in the TCA cycle, concomitant reduction 
of bound [FAD] to [FADH)] occurs in succinate dehydrogenase. This [FADH)] 
transfers its electrons immediately to Fe-S centers, which pass them on to UQ. Elec- 
tron flow from succinate to UQ: 


Succinate —> fumarate + 2 H* + 2 e7 (20.6) 
UQ +2 H* +2 e —~+UQH, (20.7) 
Net rxn: Succinate + UQ —> fumarate + UQH, Ag., = 0.029 V (20.8) 


This yields a net reduction potential of 0.029 V. (Note that the first half-reaction is writ- 
ten in the direction of the e~ flow. As always, A@,’ is calculated according to 
Equation 3.23.) The small free energy change of this reaction does not contribute to the 
transport of protons across the inner mitochondrial membrane. 

This is a crucial point because proton transport is coupled with ATP synthesis. 
Oxidation of one [FADH,] in the electron-transport chain results in synthesis of 
approximately two molecules of ATP, compared with the approximately three ATPs 
produced by the oxidation of one NADH. Other flavoproteins can also supply elec- 
trons to UQ, including mitochondrial sn-glycerophosphate dehydrogenase, an inner 
membrane-bound shuttle enzyme, and the fatty acyl-CoA dehydrogenases, three 
soluble matrix enzymes involved in fatty acid oxidation (Figure 20.7). The path of 
electrons from succinate to UQ is shown in Figure 20.6. 


20.2d Complex III Mediates Electron Transport from Coenzyme Q 
to Cytochrome c 


In Complex III of the electron-transport chain, reduced coenzyme Q (UQH)) passes its 
electrons to cytochrome c via a unique redox pathway known as the Q cycle. Referring 
to the reaction it catalyzes, Complex III is called UQ—cytochrome c reductase (UQ-— 
cyt c reductase), and it involves three different cytochromes and an Fe-S protein. In the 
cytochromes of these and similar complexes, the iron atom at the center of the porphy- 
rin ring cycles between the reduced Fe?* (ferrous) and oxidized Fe** (ferric) states. 
Cytochromes were first named and classified on the basis of their absorption spec- 
tra (Figure 20.8), which depend upon the structure and environment of their heme 
groups. The b cytochromes contain iron protoporphyrin LX (Figure 20.9), the same heme 
found in hemoglobin and myoglobin. The c cytochromes contain heme c, derived from 
iron protoporphyrin IX by the covalent attachment to cysteine residues from the as- 
sociated protein. (One other heme variation, heme a, is found in cytochrome a, a com- 
ponent of Complex IV of the electron transport chain. Heme a contains a 15-carbon 
isoprenoid chain on a modified vinyl group and a formyl group in place of one of the 


(0) 
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Figure 20.7 The fatty acyl-CoA dehydrogenase reaction, emphasizing that the reaction involves 
reduction of enzyme-bound FAD (indicated by brackets). 
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Figure 20.9 The structures of iron protoporphyrin 
TX, heme c, and heme a. Porphyrin rings shown in 
gray. Covalent linkages of heme c are shown in pink. 
Hydroxyethyl farnesyl group of heme a is shown in 
pink. 
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Figure 20.8 Visible absorption spectra of cytochrome c. 


methyls [see Figure 20.9].) UQ—cyt c reductase (Figure 20.10) contains a b-type cyto- 
chrome, of 30 to 40 kDa, with two different heme sites and one c-type cytochrome. The 
two hemes on the b cytochrome polypeptide in UQ-cyt c reductase are distinguished 
by their reduction potentials and the wavelength (A max) of the so-called a-band. One of 
these hemes, known as bz or bs, has a standard reduction potential, €,', of —0.100 V 
and a wavelength of maximal absorbance (A max) of 566 nm. The other, known as by or 
bse, has a standard reduction potential of +0.050 V and a Àmax of 562 nm. (H and L 
here refer to high and low reduction potential.) 

The structure of the UQ-cyt c reductase, also known as the cytochrome bc, 
complex, was determined by Johann Deisenhofer and his colleagues. (Deisenhofer 
was a co-recipient of the Nobel Prize in Chemistry for his work on the structure of a 


Figure 20.10 The structure of UQ-cyt c reductase, also known as the cytochrome bc, complex. The a-helical 
bundle near the top of the structure defines the transmembrane domain of the protein (PDB id = 1BCC). 
Heme groups are shown in red, Fe/S centers in green, and associated phospholipids in blue. 
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photosynthetic reaction center). The complex is a dimer, with each monomer 
consisting of 11 protein subunits and 2165 amino acid residues (monomer mass, 248 
kDa). Each monomer consists of three domains: a large domain that extends 75 A 
into the mitochondrial matrix, a transmembrane domain consisting of 13 transmem- 
brane a-helices in each monomer, and a small domain that extends 38 A into the in- 
termembrane space (Figure 20.10). Most of the Rieske protein (an Fe-S protein 
named for its discoverer) is mobile in the crystal (only 62 of its 196 residues are shown 
in the structure in Figure 20.10), and Deisenhofer has postulated that mobility of this 
subunit could be required for electron transfer in the function of this complex. 


Complex III Drives Proton Transport As with Complex I, passage of electrons through 
the Q cycle of Complex III is accompanied by proton transport across the inner mi- 
tochondrial membrane. The postulated pathway for electrons in this system is shown 
in Figure 20.11. A large pool of UQ and UQH) exists in the inner mitochondrial 
membrane. The Q cycle is initiated when a molecule of UQH) from this pool diffuses 
to a site (called Q,) on Complex HI near the cytosolic face of the membrane. 

Oxidation of this UQH; occurs in two steps (Figure 20.1 1a). First, an electron from 
UQH; is transferred to the Rieske protein and then to cytochrome cı. This releases 
two H* to the cytosol and leaves UQ : ~, a semiquinone anion form of UQ, at the Q, 
site. The second electron is then transferred to the b; heme, converting UQ : ~ to UQ. 
The Rieske protein and cytochrome cı are similar in structure; each has a globular 
domain and is anchored to the inner mitochondrial membrane by a hydrophobic seg- 
ment. However, the hydrophobic segment is N-terminal in the Rieske protein and 
C-terminal in cytochrome cı. 
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Figure 20.11 The Q cycle in mitochondria. (a) The 
electron-transfer pathway following oxidation of the 
first UQH, at the Q, site near the cytosolic face of 
the membrane. (b) The pathway following oxidation 
of a second UQH3. 
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Figure 20.12 The structure of mitochondrial 
cytochrome c. The heme is shown at the center of 
the structure. It is covalently linked to the protein via 
two sulfur atoms. A third sulfur from a methionine 
residue coordinates the iron (PDB id = 2B4Z). 


Figure 20.13 Bovine cytochrome c oxidase consists 
of 13 subunits. The 3 largest subunits (shown 
here)—I (purple), II (yellow), and III (b/ue)—contain 
the proton channels and the redox centers 

(PDB id = 2EIJ). 


The electron on the bz heme facing the cytosolic side of the membrane is now passed 
to the by heme on the matrix side of the membrane. This electron transfer occurs against 
a membrane potential of 0.15 V and is driven by the loss of redox potential as the elec- 
tron moves from bz (€,' = —0.100 V) to by (o = +0.050 V). The electron is then 
passed from by to a molecule of UQ at a second quinone-binding site, Q,,, converting this 
UQ to UQ: ~. The resulting UQ - ~ remains firmly bound to the Q, site. This completes 
the first half of the Q cycle (Figure 20.1 1a). 

The second half of the cycle (Figure 20.11b) is similar to the first half, with a second 
molecule of UQH; oxidized at the Q, site, one electron being passed to cytochrome cı 
and the other transferred to heme b; and then to heme by. In this latter half of the Q 
cycle, however, the by electron is transferred to the semiquinone anion, UQ - ~, at the Q, 
site. With the addition of two H* from the mitochondrial matrix, this produces a mol- 
ecule of UQHb, which is released from the Q, site and returns to the coenzyme Q pool, 
completing the Q cycle. Thus, two UQH); are taken from the UQ/UQH); pool and one is 
returned; the Q cycle accomplishes the net oxidation of one UQH>. 


The Q Cycle Is an Unbalanced Proton Pump Why has nature chosen this rather convo- 
luted path for electrons in Complex II? First of all, Complex II takes up two protons 
on the matrix side of the inner membrane and releases four protons on the cytoplasmic 
side for each pair of electrons that passes through the Q cycle. Thus, it is referred to 
as an unbalanced proton pump. The other significant feature of this mechanism is that 
it offers a convenient way for a two-electron carrier, UQH:, to interact with the bz and 
by hemes, the Rieske protein Fe-S cluster, and cytochrome cı, all of which are 
one-electron carriers. 


Cytochrome c Is a Mobile Electron Carrier Electrons traversing Complex III are passed 
through cytochrome c; to cytochrome c. Cytochrome c is the only one of the mitochon- 
drial cytochromes that is water soluble. Its structure (Figure 20.12) is globular; the 
planar heme group lies near the center of the protein, surrounded predominantly by 
hydrophobic amino acid residues. The iron in the porphyrin ring is coordinated both to 
a histidine nitrogen and to the sulfur atom of a methionine residue. Coordination with 
ligands in this manner on both sides of the porphyrin plane precludes the binding of 
oxygen and other ligands, a feature that distinguishes cytochrome c from hemoglobin 
(see Chapter 15). 

Cytochrome c, like UQ, is a mobile electron carrier. It associates loosely with the in- 
ner mitochondrial membrane (in the intermembrane space on the cytosolic side of the 
inner membrane) to acquire electrons from the Fe-S-—cyt cı aggregate of Complex III, 
and then it migrates along the membrane surface in the reduced state, carrying electrons 
to cytochrome c oxidase, the fourth complex of the electron-transport chain. 


20.2e Complex IV Transfers Electrons from Cytochrome c 
to Reduce Oxygen on the Matrix Side 


Complex IV is called cytochrome c oxidase because it accepts electrons from cyto- 
chrome c and directs them to the four-electron reduction of O, to form H,0: 


4 cyt c (Fe**) + 4 H* + O, — 4 cyt c (Fe**) + 2 H2O (20.9) 


Thus, cytochrome c oxidase and O, are the final destination for the electrons derived 
from the oxidation of food materials. In concert with this process, cytochrome c oxi- 
dase also drives transport of protons across the inner mitochondrial membrane. The 
combined processes of oxygen reduction and proton transport involve a total of 8H* 
in each catalytic cycle—four H* for O, reduction and four H* transported from the 
matrix to the intermembrane space. 

The total number of subunits in cytochrome c oxidase varies from 2 to 4 (in bacteria) 
to 13 (in mammals). Three subunits (I, I, and III) are common to most organisms 
(Figure 20.13). This minimal complex, which contains two hemes (termed a and a3) and 
three copper ions (two in the Cu, center and one in the Cug site), is sufficient to carry 
out both oxygen reduction and proton transport. 
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The total mass of the protein in mammalian Complex IV is 204 kDa. All 13 subunits are 
shown in Figure 20.14. In mammals, subunits I through III, the largest ones, are encoded 
by mitochondrial DNA, synthesized in the mitochondrion, and inserted into the inner 
membrane from the matrix side. The 10 smaller subunits are coded by nuclear DNA, are 
synthesized in the cytosol, and are presumed to play regulatory roles in the complex. 

In the bovine structure, subunit I is cylindrical in shape and consists of 12 trans- 
membrane helices, without any significant extramembrane parts. Hemes a and a3, 
which lie perpendicular to the membrane plane, and Cug are cradled by the helices of 
subunit I (Figure 20.15). Subunits II and III lie on opposite sides of subunit I and do 
not contact each other (see Figure 20.13). Subunit II has an extramembrane domain on 
the outer face of the inner mitochondrial membrane. This domain consists of a 
10-strand B-barrel that holds the two copper ions of the Cua site 7 A from the nearest 
surface atom of the subunit. Subunit III consists of seven transmembrane helices with 
no significant extramembrane domains. 


Electron Transfer in Complex IV Involves Two Hemes and Two Copper Sites Electron 
transfer through Complex IV begins with binding of cytochrome c to the B-barrel of 
subunit II. Four electrons are transferred sequentially (one each from four molecules of 
cytochrome c) first to the Cu, center, next to heme a, and finally to the Cup/heme a3 
active site, where O, is reduced to HO (Figure 20.15): 


Cyt c —> Cu, —> heme a —> Cup/heme a; — O, (20.10) 


The Cu, site (Figure 20.16a) is the entry point for electrons from cytochrome c. It lies 
in a hydrophobic patch on subunit II, surrounded by a ring of negatively charged 
Asp and Glu residues. Electrons flow rapidly from Cu, to heme a, which is coordi- 
nated by a pair of His residues (Figure 20.16b), and then to the Cug/heme a; complex. 
The Fe atom in heme a; is five coordinate (Figure 20.16c), with four ligands from the 


2 H* 


2 H* 


Figure 20.15 The electron-transfer pathway for cytochrome oxidase. Cytochrome c binds on the 
cytosolic side, transferring electrons through the copper and heme centers to reduce O on the matrix 
side of the membrane (PDB id = 2EIJ). 


Figure 20.14 The complete structure of bovine 
cytochrome c oxidase The 3 largest subunits—I 
(purple), TI (yellow), and II (blue)—contain the 
proton channels and the redox centers. 

(PDB id = 2EIJ). 
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Figure 20.16 Structures of the redox centers of 
bovine cytochrome c oxidase. (a) The Cua site, 


(b) the heme a site, and (e) the binuclear Cug/heme heme plane and one from His*”. This leaves a sixth position free, and this is the cata- 

arste (PDB 1d TZEI): lytic site where O, binds and is reduced. Cug is about 5A from the Fe atom of heme 
a3 and is coordinated by three histidine ligands, including His**°, His??°, and His?! 
(Figure 20.16c). An unusual crosslink between His?* and Tyr*“4 lowers the pK, of the 
Tyr hydroxyl so that it can participate in proton transport across the membrane. 


20.2f Proton Transport Across Cytochrome c Oxidase 
Is Coupled to Oxygen Reduction 


Proton transport in Rhodobacter sphaeroides cytochrome c oxidase takes place via 
two channels denoted the D- and K-pathways (Figure 20.17a). Both these channels 
contain water molecules, and they are lined with polar residues that can either proton- 
ate and deprotonate or form hydrogen bonds. The D-pathway is named for Asp!” at 
the channel opening, and the K-pathway is named for Lys*”, a critical residue located 
midway in the channel. These two channels converge at the binuclear Cup/heme a; site 


(b) 


D-pathway K-pathway 


Figure 20.17 The proton channels of cytochrome c oxidase from R. sphaeroides. (a) Functional 
residues in the D- and K-pathways are indicated. The shaded D- and K-pathways converge at the 
Cup/heme a; center. The proton exit channel is lined by residues 320 to 340 of subunit I (PDB id = 
1M56). (b) Protons are presumed to hop along arrays of water molecules in the proton transport 
channels of cytochrome c oxidase. Two steps in the hopping model are shown. Such a chain of 
protonation and deprotonation events means that the proton eventually released from the exit channel 
is far removed from the proton that entered the D-pathway and initiated the cascade. 
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midway across the complex and the membrane. Here, Glu** serves as a branch point, 
shuttling protons either to the catalytic site for O, reduction (to form H20) or to the 
exit channel (residues 320 to 340) that leads protons to the intermembrane space (Fig- 
ure 20.17a). In each catalytic cycle, two H* pass through the K-pathway and six H* 
traverse the D-pathway. The K-pathway protons and two of the D-pathway protons 
participate in the reduction of one O, to two H20, and the remaining four D-pathway 
protons are passed across the membrane and released to the intermembrane space. 

How are protons driven across cytochrome c oxidase? The mechanism involves three 
key features: 


e The pK, values of protein side chains in the proton channels are shifted (by the local 
environment) to make them effective proton donors or acceptors during transport. 
For example, the pK, of Glu?® is unusually high at 9.4. (This is similar to the behav- 
ior of Asp* and Asp” in bacteriorhodopsin; see Figure 9.58.) 

e Electron-transfer events induce conformation changes that control proton trans- 
port. For example, redox events at the Cug/heme a; site are sensed by Glu? and an 
adjacent proton-gating loop (residues 169 to 175), controlling H* binding and re- 
lease by Glu?®ć and proton movement through the exit channel. 

e Protons are transported via chains of hydrogen-bonded water molecules in the pro- 
ton channels (Figure 20.17b). Sequential hopping of protons along these proton 
wires essentially transfers a positive charge between distant residues in the channel. 
(Note that the H* that arrives at an accepting residue is not the same proton that left 
the donating residue.) 


20.2g Reversal of Complex II May Short-Circuit 
the Electron Transfer Pathway 


Oxygen produced by Complex IV is usually the terminal acceptor of electrons in the 
electron transport pathway. However, in certain tissues and especially when oxygen is 
limiting, fumarate may act as an alternate terminal electron acceptor via reversal of 
Complex II. Fumarate reduction to succinate by such a reversal has been observed 
under low-oxygen conditions in multiple tissues, but kidney, liver, and brain show 
constitutive production of succinate from fumarate regardless of oxygen availability. 
Reversal of Complex II in this way may allow tissues to cope with reductive stress—a 
condition in which flow of electrons through the electron transport pathway may 
exceed the capacity of the pathway to reduce oxygen. 


20.2h The Complexes of Electron Transport 
May Function as Supercomplexes 


For many years, the complexes of the electron-transport chain were thought to exist 
and function independently in the mitochondrial inner membrane because each 
complex could be isolated in a pure yet functional state. Growing experimental 
evidence, however, supports the existence of multimeric supercomplexes of the four 
electron-transport complexes. These supercomplexes, also known as respirasomes, may 
represent functional states, and association of two or more of the individual complexes 
may be advantageous for the organism. Supercomplexes can be identified by single 
particle electron microscopy, and kinetic measurements support the operation of the 
respiratory chain as one functional unit. Mutation studies show that Complex III 
is required to maintain Complex I in the mitochondria of human and mouse cells in 
culture, and Complex IV is necessary for proper assembly of Complex I in mouse 
fibroblasts. 

As shown in Figure 20.18, the CI,HI,IV> supercomplex (containing two units each 
of Complexes I, II, and IV) forms a circular structure with the dimeric CII located in 
the center, where it is surrounded by two copies each of CI and CIV. Two cytochrome c 
(Cyt.c) molecules are positioned to accept electrons on the surface of the CIII dimer. 

Substantial evidence points to a requirement for cardiolipin (see Figure 8.6) in the 
assembly and stability of mitochondrial supercomplexes. For example, association 
of Complexes III and IV requires the presence of cardiolipin. Studies by Carola 
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Figure 20.18 The complexes of the electron 
transport chain operate as supercomplexes 
in the inner mitochondrial membrane. 

(a) Cryo-EM images of a supercomplex 
formed from two units each of Complex I, 
Complex II, and Complex IV. The structure 
is shown in three differently rotated views 
(left, side view along the inner membrane; 
right, top view from the matrix side; bottom, 
bottom view from the intermembrane side). 
The transmembrane region is indicated by 
two dashed lines. M, matrix; IM, inner 
membrane; IMS, intermembrane space. The 
cofactors are shown in spheres. (b) A model 
for the electron-transport pathway in the 
mitochondrial inner membrane. UQ/UQH2 
and cytochrome c are mobile electron 
carriers and function by transferring 
electrons between the complexes. The 
proton transport driven by Complexes I, 
III, and IV is indicated. (Images in part a 
adapted from Gou, R., Zong, S., Wu, M. 
et al., 2017. Architecture of human 
mitochondrial respiratory megacomplex 
LUILIV>. Cell 170:1247-1257. Images 
kindly provided by Maojun Yang.) 
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Hunte and colleagues show that cardiolipin stabilizes supercomplexes by neutraliz- 
ing the charges of lysine residues at the binding interface between Complex III and 
Complex IV. 


The Model of Electron Transport Is a Dynamic One The model that emerges for electron 
transport is shown in Figure 20.18b. Coenzyme Q collects electrons from NADH-UQ 
reductase and succinate—UQ reductase and delivers them (by diffusion through the 
membrane core) to UQ-cyt c reductase. Cytochrome c is water soluble and moves 
freely in the intermembrane space, carrying electrons from UQ-cyt c reductase to 
cytochrome c oxidase. In the process of these electron transfers, protons are driven 
across the inner membrane (from the matrix side to the intermembrane space). The 
proton gradient generated by electron transport represents an enormous source of 
potential energy. As seen in the next section, this potential energy is used to synthesize 
ATP as protons flow back into the matrix. 


20.21 Electron Transfer Energy Stored in a Proton 
Gradient: The Mitchell Hypothesis 


In 1961, Peter Mitchell, a British biochemist, proposed that the energy stored in a 
proton gradient across the inner mitochondrial membrane by electron transport 
drives the synthesis of ATP in cells. The proposal became known as Mitchell’s 
chemiosmotic hypothesis. In this hypothesis, protons are driven across the membrane 
from the matrix to the intermembrane space and cytosol by the events of electron 
transport. This mechanism stores the energy of electron transport in an 
electrochemical potential. As protons are driven out of the matrix, the pH rises and 
the matrix becomes negatively charged with respect to the cytosol (Figure 20.19). 
Electron transport-driven proton pumping thus creates a pH gradient and an 
electrical gradient across the inner membrane, both of which tend to attract protons 
back into the matrix from the cytoplasm. Flow of protons down this electrochemical 
gradient, an energetically favorable process, drives the synthesis of ATP. 


Intermembrane space 
(high [H*], low pH) 


Figure 20.19 The proton and electrochemical gradients existing across the inner mitochondrial 
membrane. The electrochemical gradient is generated by the transport of protons across the 
membrane by Complexes I, III, and IV in the inner mitochondrial membrane. 
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The ratio of protons transported per pair of electrons passed through the chain—the 
so-called H+/2e~ ratio—has been an object of great interest for many years. Nevertheless, 
the ratio has remained extremely difficult to determine. The consensus estimate for the 
electron-transport pathway from succinate to O, is 6H*/2e~. The ratio for Complex I by 
itself remains uncertain, but recent best estimates place it as high as 4H*/2e~. On the 
basis of this value, the stoichiometry of transport for the pathway from NADH to O; is 
10H*/2e~. Although this is the value assumed in Figure 20.18, it is important to realize 
that this value is tentative and represents a consensus drawn from many experiments. 


20.3 | What Are the Thermodynamic Implications 
| of Chemiosmotic Coupling? 


Mitchell’s chemiosmotic hypothesis revolutionized our thinking about the energy 
coupling that drives ATP synthesis by means of an electrochemical gradient. How 
much energy is stored in this electrochemical gradient? For the transmembrane flow 
of protons across the inner membrane (from inside [matrix] to outside), we could write 


AF in > Hout (20.1 1) 
The free energy difference for protons across the inner mitochondrial membrane in- 


cludes a term for the concentration difference and a term for the electrical potential. 
This is expressed as 


AG = RT In É! + ZFAW (20.12) 
1 


[ci] 

where c; and c, are the proton concentrations on the two sides of the membrane, Z is 
the charge on a proton, F is Faraday’s constant, and Ay is the potential difference 
across the membrane. For the case at hand, this equation becomes 
[Hou] 
Hin | 
In terms of the matrix and cytoplasm pH values, the free energy difference is 

AG = —2.303 RT(pHou — pHin) + FAW (20.14) 


AG = RTIln 


+ ZFAY (20.13) 


Reported values for Ay and ApH vary, but the membrane potential is always found to be 
positive outside and negative inside, and the pH is always more acidic outside and more 
basic inside. Taking typical values of Ay = 0.18 V and ApH = 1 unit, the free energy 
change associated with the movement of one mole of protons from inside to outside is 


AG = 2.3 RT + (0.18 V) (20.15) 
With F = 96.485 kJ/V - mol, the value of AG at 37°C is 
AG = 5.9 kJ + 17.4 kJ = 23.3 kJ (20.16) 


which is the free energy change for movement of a mole of protons across an inner 
membrane. Note that the free energy terms for both the pH difference and the potential 
difference are unfavorable for the outward transport of protons, with the latter term 
making the greater contribution. On the other hand, the AG for inward flow of protons 
is —23.3 kJ/mol. It is this energy that drives the synthesis of ATP, in accord with 
Mitchell’s model. Peter Mitchell was awarded the Nobel Prize in Chemistry in 1978. 


20.4 | How Does a Proton Gradient Drive 
| the Synthesis of ATP? 


The great French chemist Antoine Lavoisier showed in 1777 that foods undergo 
combustion in the body. Since then, chemists and biochemists have wondered how 
energy from food oxidation is captured by living things. Mitchell paved the way by 
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20.4 How Does 


suggesting that a proton gradient across the inner mitochondrial membrane could 
drive the synthesis of ATP. But how could the proton gradient be coupled to ATP 
production? The answer lies in a mitochondrial complex called F,Fo-ATP synthase, 
or sometimes F;Fyo—ATPase (for the reverse reaction it catalyzes). The F; portion 
of the ATP synthase was first identified in early electron micrographs of 
mitochondrial preparations as spherical, 8.5 nm projections or particles on the 
inner membrane. The purified particles catalyze ATP hydrolysis, the reverse 
reaction of the ATP synthase. Stripped of these particles, the membranes can still 
carry out electron transfer but cannot synthesize ATP. In one of the first 
reconstitution experiments with membrane proteins, Efraim Racker showed that 
adding the particles back to stripped membranes restored electron transfer- 
dependent ATP synthesis. 


20.4a ATP Synthase Is Composed of F; and Fy Complexes 


ATP synthase is a remarkable molecular machine. It is an enzyme, a proton pump, 
and a rotating molecular motor. Nearly all the ATP that fuels our cellular processes 
is made by this multifaceted molecular superstar. The spheres observed in electron 
micrographs make up the F, unit, which catalyzes ATP synthesis (Figure 20.20a, b). 
These F; spheres are attached to an integral membrane protein aggregate called the 
Fo unit. F, consists of five polypeptide chains named a, B, y, 5, and e, with a subunit 


(a) 


ATP8 (AGL) 
DAPIT (k) 
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(b) 


Figure 20.20 The ATP synthase, a rotating molecular 
motor. (a) Cryo-EM structure of the mammalian (ovine) 
mitochondrial F;Fy ATP synthase. Side view, color- 
coded by subunits, as indicated. Alternate names for 
some of the subunits are shown in parentheses. (b) Top 
view of the ATP synthase, with a- and B-subunits 
indicated. Bound ADP molecules are shown as spheres. 
(c) Cryo-EM structure of bovine mitochondrial ATP 
synthase dimer with labeled subunits. (Parts (a) and (b) 
adapted from Pinke, G., Zhou, L., and Sazanov, L. A., 
2020. Cryo-EM structure of the entire mammalian 
F-type ATP synthase. Nat. Struct. Mol. Biol. 27: 
1077-1085. Part (c): PDB id = 7AJF.) 
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Figure 20.21 Two views of the F, unit of the 
F,Fo-ATP synthase. (a) An axial view of the F, 

unit of the F, Fp-ATP synthase, showing alternating 
a- and B-subunits in a hexameric array, with the 
y-subunit (purple) visible in the center of the 
structure. (b) A side view of the F1 unit, with one 
a-subunit and one B-subunit removed to show how 
the y-subunit (red) extends through the center of the 
a3; hexamer. Also shown are the 6-subunit (aqua) 
and the e-subunit (pink), which link the y-subunit to 
the Fo unit (PDB id = 1E79). 


LEGPA Human F,F,-ATP Synthase Subunit Organization 
Complex Protein Subunit Residues Stoichiometry Stator/Rotor 
Fi a 510 3 Stator 
B 510 3 Stator 
y 273 1 Rotor 
ô 146 1 Rotor 
€ 75 1 Rotor 
Fo alATP6 226 1 Stator 
b/B1 214 1 Stator 
c 76 8-15* Rotor 
d 161 1 Stator 
e 71 1 Stator 
f 88 1 Stator 
g 103 1 Stator 
Fo/h 76 1 Stator 
6.8pl/i 60 1 Stator 
OSCP 190 1 Stator 
DAPIT 58 1 Stator 
ATP8/A6L 66 1 Stator 


*The number of c-subunits varies among organisms: bovine mitochondria, 8; yeast mitochondria 
and E coli, 10; Ilyobacter tartaricus, 11; spinach chloroplasts, 14; Spirulina platensis, 15. 

iThe subunit nomenclature can be confusing. E. coli ATP synthase lacks a 6-subunit in its rotor; its 
6-subunit is analogous structurally and functionally to the mitochondrial OSCP. Similarly, subunit F6 
in the bovine ATP synthase is weakly homologous to subunit h in yeast. 


stoichiometry a363yde (Table 20.2 and Figure 20.21). Fo includes 12 different subunits 
denoted as a, b, c, d, e, f, g, F6, 6.8pl, OSCP, DAPIT, and ATP8. Fy forms the trans- 
membrane pore or channel through which protons move to drive ATP synthesis. The 
Fo structures in prokaryotic organisms and plants have 10 to 15 c-subunits, whereas 
c-rings of only 8 subunits are found in the Fy structures of vertebrates and all or most 
invertebrates (Table 20.2). 

All of the subunits of Fo except c form part of the stator—a stationary component 
anchored in the membrane—and a ring of 8 to 15 c-subunits (see Table 20.3) constitutes 
a major component of the rotor of the motor. Protons flowing through the a—c complex 
cause the c-ring to rotate in the membrane. Each c-subunit is a folded pair of a-helices 
joined by a short loop, whereas the a-subunit is composed of a cluster of a-helices. The 
b-subunit, together with the d- and F,-subunits and the oligomycin sensitivity-conferring 
protein (OSCP), form a long, slender peripheral stalk (PS) that connects Fo in the mem- 
brane with F;, which extends out into the matrix. The b-, d-, and F¢-subunits form long 
a-helical segments that comprise the stalk, and OSCP adds a helical bundle cap that sits 
at the top of an a-subunit of F, (Figure 20.20). The stalk is a stable link between Fy and 
F, essentially joining the two, both structurally and functionally. 

Structural accommodations in the mitochondrial membrane suit different numbers of 
subunits in the circular ring of c-subunits. Nobelist John Walker and colleagues have 
shown that Lys* of the c-subunits in bovine mitochondria are completely trimethylated, 
forming a bulky quaternary amino group on each subunit, adjacent to the phospholipid 
headgroups of the inner membrane. This bulky group would clash with phospho- 
lipid headgroups, but Walker has postulated that cardiolipin, which is essential to ATP 
synthase activity and which contains two phosphatidylglycerol groups but no headgroup, 
can accommodate the trimethylamino groups and link adjacent c-subunits in the relatively 
small 8-subunit c-ring. No organism with c-rings of more than 8 subunits are trimethylated 
at Lys*. 
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The ATP synthase assembles into V-shaped dimers in the mitochondrial inner mem- 
brane (Figure 20.20c). Long ribbons of these dimers bend the mitochondrial inner 
membrane and form folds or cristae, the highly folded membrane structures that maxi- 
mize the surface area available for respiration and ATP formation. 


20.4b The Catalytic Sites of ATP Synthase Adopt 
Three Different Conformations 


The a- and B-subunits of the F; structure are similar, each consisting of an N-terminal 
domain with six B-strands, a central nucleotide-binding domain made of both 
a-helices and B-strands, and an a-helical C-terminal domain containing six a-helices 
in the B-subunits and seven in the a-subunits. The F; structure appears at first to be 
asymmetric hexamer of a- and B-subunits. However, it is asymmetric in several ways. 
The a- and B-subunits, arranged in an alternating pattern in the hexamer, are similar 
but not identical. The hexamer contains six ATP-binding sites, each of them arranged 
at the interface of adjacent subunits. Three of these, each located mostly on a 
B-subunit but with some residues contributed by an a-subunit, are catalytic sites for 
ATP synthesis. The other three, each located mostly on an a-subunit but with resi- 
dues contributed by a B-subunit, are noncatalytic and inactive. The noncatalytic 
a-sites have similar structures. Significantly, the single y-subunit of F, introduces 
asymmetry to the a383; hexamer that forces the three catalytic B-sites to have three quite 
different conformations. In the crystal structure first characterized by John Walker, 
one of the B-subunit ATP sites contains AMP-PNP (a nonhydrolyzable analog of 
ATP), another contains ADP, and the third site is empty. 


NH, 
Z N a B y 
N | \ (0) (0) (0) 
K i ial 
N N CHO — P—O— P—N—P—O- 
| | | 
2 o7 OF o7 
H H 
H H Nonhydrolyzable 
OH OH B-y bond 


B, y-Imidoadenosine 5'-triphosphate (AMP-PNP) 


Walker’s work provided structural verification for a novel hypothesis first advanced 
by Paul Boyer, the binding change mechanism for ATP synthesis. Walker and Boyer, 
whose efforts provided complementary insights into the workings of this molecular 
motor, shared in the Nobel Prize for Chemistry in 1997. 


20.4c Boyer’s O Exchange Experiment Identified the Energy-Requiring Step 


The elegant studies by Boyer of '8O exchange in ATP synthase provided important insights 
into the mechanism of the enzyme. Boyer and his colleagues studied the ability of the 
synthase to incorporate labeled oxygen from H,'*O into P;. This reaction (Figure 20.22) 
occurs via synthesis of ATP from ADP and P;, followed by hydrolysis of ATP with 
incorporation of oxygen atoms from the solvent. Although net production of ATP requires 
coupling with a proton gradient, Boyer observed that this exchange reaction occurs 
readily, even in the absence of a proton gradient. The exchange reaction was so efficient 
that, eventually, all four oxygens of phosphate were labeled with 18O. This important 


In the absence of a proton gradient: 


ADP + P; 


Ht HO H Ht mf 
A> vt nok sor + -ron 


Enzyme 
bound 180 


Figure 20.22 ATP-ADP exchange in the absence 

of a proton gradient. Exchange leads to incorporation 
of '8O in phosphate as shown. Boyer’s experiments 
showed that '*O could be incorporated into all 

four positions of phosphate, demonstrating that the 
free energy change for ATP formation from 
enzyme-bound ADP + P; is close to zero. 

(From Parsons, D. F., 1963. Science 140:985.) 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


694 


Chapter 20 Electron Transport and Oxidative Phosphorylation 


Cycle repeats 


observation indicated that the formation of enzyme-bound ATP does not require energy. 
The experiments that followed, by Boyer, Harvey Penefsky, and others, showed clearly 
that the energy-requiring step in the ATP synthase was actually the release of newly 
synthesized ATP from the enzyme (Figure 20.23). Eventually, it would be shown that flow 
of protons through Fo drives the enzyme conformational changes that result in the binding 
of substrates on ATP synthase, ATP synthesis, and the release of products. 


20.4d The Binding Change Mechanism Describes 
the Events of Rotational Catalysis 


Boyer proposed that these conformation changes occurred in a rotating fashion. His ro- 
tational catalysis model, the binding change mechanism (Figure 20.23), suggested that at 
any instant the three 8 subunits of F; existed in three different conformations, that these 
different states represented the three steps of ATP synthesis, and that each site stepped 
through the three states to make ATP. A site beginning with ADP and phosphate bound 
(the first state) would synthesize ATP (producing the second state) and then release ATP, 
leaving an empty site (the third state). In the binding change mechanism, the three cata- 
lytic sites thus cycle concertedly through the three intermediate states of ATP synthesis. 


20.4e Proton Flow Through Fo Drives Rotation 
of the Motor and Synthesis of ATP 


How might the cycling proposed by Boyer’s binding change mechanism occur? Im- 
portant clues have emerged from several experiments that show that the y-subunit 
rotates with respect to the aß complex. How such rotation might be linked to trans- 
membrane proton flow and ATP synthesis is shown in Figure 20.24. The ring of 
c-subunits is a rotor that turns with respect to the a-subunit, a stator component 
consisting of five transmembrane a-helices with proton access channels on either 
side of the membrane. The y-subunit is the link between the functions of F; and Fo. 
In one complete rotation, the y-subunit drives conformational changes in each 
B-subunit that lead to ATP synthesis. Each 360° rotation of the y-subunit drives each 
B-subunit through all three conformational states of the binding change mechanism: 
ATP-bound (T), ADP- and P;-bound (L), and open (O). Thus, three ATPs are synthe- 
sized per turn. 

But how does the Fy complex couple the events of proton transport and ATP synthe- 
sis? The a-subunit contains two half-channels, a proton inlet channel that opens to the 
intermembrane space, and a proton outlet channel that opens to the matrix. The 
c-subunits are proton carriers that transfer protons from the inlet channel to the outlet 
channel only by rotation of the c-ring. Each c-subunit contains a protonatable residue, 
Asp*!. Protons flowing from the intermembrane space through the inlet half-channel 
protonate the Asp“! of a passing c-subunit and ride the rotor around the ring until they 
reach the outlet channel and flow out into the matrix. 

The molecular details of this process are shown in Figure 20.24. Each c-subunit in 
the c-ring has an inner helix and an outer helix. Asp“! is located midway along the outer 
a-helix. When protonated, the Asp carboxyl faces into the adjacent subunit. Rotation 
of the entire outer a-helix exposes Asp“! to the outside when it is deprotonated. Arg?!®, 
located midway on a transmembrane helix of the a-subunit, forms hydrogen bonds with 
Asp“! residues on two adjacent c-subunits. The half-channels of the a-subunit extend up 
and down from Arg”!®, The inlet channel terminates in Asn'4, whereas the outlet chan- 
nel terminates at Ser. 


Figure 20.23 The binding change mechanism for ATP synthesis by ATP synthase. This model 
assumes that F; has three interacting and conformationally distinct active sites: an open (O) 
conformation with almost no affinity for ligands, a loose (L) conformation with low affinity for 
ligands, and a tight (T) conformation with high affinity for ligands. Synthesis of ATP is initiated 
(step 1) by binding of ADP and P; to an L site. In the second step, an energy-driven conformational 
change converts the L site to a T conformation and converts T to O and O to L. In the third step, 
ATP is synthesized at the T site and released from the O site. Two additional passes through this 
cycle produce two more ATPs and return the enzyme to its original state. 
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b 
(a) ATP a __OSCP ` (b) 
ADP + P; 
Fı 
i , aArg?10 
Rotating stalk Peripheral 
stalk (PA) 
Matrix 
Fo 


Intermembrane 
space 


(c) 


c-subunits 


Figure 20.24 Details of proton transfer 
between the a-subunit and c-subunits. 

(a) Protons entering the inlet half-channel in 
the a-subunit are transferred to binding sites 
on c-subunits. Rotation of the c-ring delivers 
protons to the outlet half-channel in the 
a-subunit. Flow of protons through the 
structure turns the rotor and drives the cycle 
of conformational changes in B that 
synthesize ATP. (b) Arg?!® on the a-subunit 
lies between the end of the inlet half-channel 
(Asn?!*) and the end of the outlet 
half-channel (Ser?) (PDB id = 1C17). 

(c) A view looking down into the plane of 
the membrane. Transported protons flow 
from the inlet half-channel to Asp“! residues 
on the c-ring, around the ring, and then into 
the outlet half-channel. When Asp“! is 
protonated, the outer helix of the c-subunit 
rotates clockwise to bury the protonated 
carboxyl group for its trip around the c-ring. 
Counterclockwise ring rotation then brings 
another protonated Asp“! to the a-subunit, 
where an exiting proton is transferred to the 
outlet half-channel. 


a-subunit 
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Bacteriorhodopsin 


Lipid 


i Mitochondrial 
vesicle 


F|F)-ATP synthase 


= 


Figure 20.25 The reconstituted vesicles containing 
ATP synthase and bacteriorhodopsin used by 
Stoeckenius and Racker to confirm the Mitchell 
chemiosmotic hypothesis. (Note that both proteins 
are in reverse inside-out orientation relative to their 
normal orientation in biological membranes.) 


The structure of the c-subunit complex is exquisitely suited for proton transport. 
When a proton enters the a-subunit inlet channel and is transferred from a-subunit 
Asn*!4 to c-subunit Asp®°!, the a-helix of that c-subunit rotates clockwise to bury the 
Asp carboxyl group (Figure 20.24c). Each Asp“! remains protonated once it leaves the 
a-subunit interface because the hydrophobic environment of the membrane interior 
makes deprotonation (and charge formation) highly unfavorable. However, when a pro- 
tonated Asp residue approaches the a-subunit outlet channel, the proton is transferred 
to Ser? and exits through the outlet channel. The a-subunit Arg?!’ side chain orients 
adjacent Asp“! groups and promotes transfers of entering protons from a-subunit 
Asn?'4 to Asp“! and transfers of exiting protons from Asp“! to a-subunit Ser?°°. Arg?!°, 
because it is protonated, also prevents direct proton transfer from Asn?'4 to Ser, 
which would circumvent ring rotation and motor function. 

ATP synthesis occurs in concert with rotation of the c-ring, because the y-subunit is 
anchored to the rotating c-ring and rotates with it. Rotation causes the y-subunit to turn 
relative to the three B-subunit nucleotide sites of F,, changing the conformation of each 
in sequence, so ADP is first bound, then phosphorylated, then released, according to 
Boyer’s binding change mechanism. 


20.4f How Many Protons Are Required to Make an ATP? 
It Depends on the Organism 


The bioenergetic cost of making an ATP varies widely among organisms. In prokarya, 
chloroplasts, and fungi, ATP synthases have evolved a range of c-ring sizes from 10 to 
15 subunits. Since one turn of the c-ring produces 3 ATP, the cost of making an ATP 
varies from 3.3 (10/3) to 5 (15/3) protons. On the other hand, evidence indicates that all 
vertebrates and all or most invertebrates contain c-rings with only eight subunits. 
Thus, the approximately 50,000 vertebrates and two million invertebrates on earth 
require 8/3 or 2.7 protons to make one ATP. Vertebrate and invertebrate ATP syn- 
thases are the most efficient that have been found. 

Why might the number of c subunits vary? It is logical to assume an inverse relationship 
between the number of c subunits in an organism’s F,;Fo-ATP synthase and the strength of 
the proton-motive force. (Proton-motive force is a term used to describe the free energy 
difference, AG, for protons across a membrane [see equation 20.12].) If the proton-motive 
force is small, more Ht would have to traverse the membrane via the c-subunit ring to 
deliver enough energy to drive ATP synthesis, and of course, if the proton-motive force is 
large, fewer protons (and fewer c subunits) would be needed. The highly evolved human 
species needs only eight protons to accomplish one turn of the ATP synthase rotor, 
whereas most other organisms require more than eight. 


20.4g The Mitchell Model Has Been Confirmed 
in a Reconstitution Experiment 


When Mitchell first described his chemiosmotic hypothesis in 1961, little evidence 
existed to support it and it was met with considerable skepticism by the scientific 
community. Eventually, however, considerable evidence accumulated to support this 
model. It is now clear that the electron-transport chain generates a proton gradient, 
and careful measurements have shown that ATP is synthesized when a pH gradient is 
applied to mitochondria that cannot carry out electron transport. Even more relevant 
is a simple but crucial experiment reported in 1974 by Efraim Racker and Walther 
Stoeckenius, which provided specific confirmation of the Mitchell hypothesis. In this 
experiment, the bovine mitochondrial ATP synthase was reconstituted in simple lipid 
vesicles with bacteriorhodopsin, a light-driven proton pump from Halobacterium halo- 
bium. As shown in Figure 20.25, upon illumination, bacteriorhodopsin pumped pro- 
tons into these vesicles, and the resulting proton gradient was sufficient to drive ATP 
synthesis by the ATP synthase. Because the only two kinds of proteins present were 
one that produced a proton gradient and one that used such a gradient to make ATP, 
this experiment essentially verified Mitchell’s chemiosmotic hypothesis. 
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20.4h Inhibitors of Oxidative Phosphorylation 
Reveal Insights About the Mechanism 


Many details of electron transport and oxidative phosphorylation mechanisms have 
been gained from studying the effects of particular electron transport and oxidative 
phosphorylation inhibitors (Figure 20.26). The sites of inhibition by these agents are 
indicated in Figure 20.27. 


Inhibitors of Complexes I, II, and III Block Electron Transport Rotenone is a common 
insecticide that strongly inhibits the NADH—UQ reductase. Rotenone is obtained from 
the roots of several species of plants. Indigenous communities from certain parts of the 
world have made a practice of beating the roots of trees along riverbanks to release 
rotenone into the water, where it paralyzes fish and makes them easy prey. Amytal and 
other barbiturates also inhibit Complex I. All these substances appear to inhibit reduc- 
tion of coenzyme Q and the oxidation of the Fe-S clusters of NADH—UQ reductase. 


Cyanide, Azide, and Carbon Monoxide Inhibit Complex IV Complex IV, the cytochrome 
c oxidase, is specifically inhibited by cyanide, azide, and carbon monoxide (Figure 
20.27). Cyanide and azide bind tightly to the ferric form of cytochrome a3, whereas 


H zoh 
aC 
Jac, ea 
CH; H 
N. 
O a N 
CoH; 
25! NH 
(CHs) 9 CHCHgCHg3 
I CsHs ~COOCH; 
CHO 
OCH; Amytal Demerol 
` Rotenone (amobarbital) (meperidine) 


Figure 20.26 The structures of several inhibitors of electron transport and oxidative phosphorylation. 
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Uncouplers: 
2,4-Dinitrophenol 
Dicumarol Figure 20.27 The sites of action of several inhibitors of 
BOCH electron transport and/or oxidative phosphorylation. 
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Figure 20.28 Structures of several uncouplers, 
molecules that dissipate the proton gradient across 
the inner mitochondrial membrane and thereby 
destroy the tight coupling between electron 
transport and the ATP synthase reaction. 

(a) Dinitrophenol; (b) Dicumarol; (c) Carbonyl 
cyanide-p-trifluoromethoxyphenyl hydrazone—best 
known as FCCP; for Fluoro carbonyl cyanide 
phenylhydrazone. 


carbon monoxide binds only to the ferrous form. The inhibitory actions of cyanide 
and azide at this site are very potent, whereas the principal toxicity of carbon monox- 
ide arises from its affinity for the iron of hemoglobin. Herein lies an important distinc- 
tion between the poisonous effects of cyanide and carbon monoxide. Because animals 
(including humans) carry a large number of hemoglobin molecules, they must inhale 
a large quantity of carbon monoxide to die from it. These same organisms, however, 
possess comparatively few molecules of cytochrome a3. Consequently, a limited expo- 
sure to cyanide can be lethal. The sudden action of cyanide attests to the organism’s 
constant and immediate need for the energy supplied by electron transport. 


Oligomycin Is an ATP Synthase Inhibitor One of the best-studied inhibitors of ATP 
synthase is oligomycin. Oligomycin is a polyketide antibiotic that acts directly on ATP 
synthase by binding at the interface between the a-subunit and the c-subunit rotor. 
Oligomycin also blocks the movement of protons through Fo. 


20.41 Uncouplers Disrupt the Coupling of Electron 
Transport and ATP Synthase 


Another important class of reagents affects ATP synthesis, but in a manner that does 
not involve direct binding to any of the proteins of the electron-transport chain or the 
F|Fo-ATPase. These agents are known as uncouplers because they disrupt the tight 
coupling between electron transport and the ATP synthase. Uncouplers act by 
dissipating the proton gradient across the inner mitochondrial membrane created by 
the electron-transport system. Typical examples include 2,4-dinitrophenol, dicuma- 
rol, and carbonyl cyanide-p-trifluoro-methoxyphenyl hydrazone (perhaps better 
known as fluorocarbonyl cyanide phenylhydrazone, or FCCP) (Figure 20.28). These 
compounds share two common features: hydrophobic character and a dissociable 
proton. As uncouplers, they function by carrying protons across the inner mem- 
brane. Their tendency is to acquire protons on the cytosolic surface of the membrane 
(where the proton concentration is high) and carry them to the matrix side, thereby 
destroying the proton gradient that couples electron transport and the ATP synthase. 
In mitochondria treated with uncouplers, electron transport continues and protons 
are driven out through the inner membrane. However, they leak back in so rapidly 
via the uncouplers that ATP synthesis does not occur. Instead, the energy released in 
electron transport is dissipated as heat. 


20.45 ATP-ADP Translocase Mediates the Movement of ATP 
and ADP Across the Mitochondrial Membrane 


ATP, the cellular energy currency, must exit the mitochondria to carry energy thro- 
ughout the cell, and ADP must be brought into the mitochondria for reprocessing. 
Neither of these processes occurs spontaneously because the highly charged ATP 
and ADP molecules do not readily cross biological membranes. Instead, these pro- 
cesses are mediated by a single transport system, the ATP—ADP translocase. This 
protein tightly couples the exit of ATP with the entry of ADP so that the mitochon- 
drial nucleotide levels remain approximately constant. For each ATP transported 
out, one ADP is transported into the matrix. The translocase is the most abundant 
protein in the inner mitochondrial membrane and accounts for approximately 14% 
of the total mitochondrial membrane protein. It is a homodimer of 30 kDa subunits. 
The structure of the bovine translocase consists of six transmembrane a-helices. 
The helices are all tilted with respect to the membrane, and the first, third, and fifth 
helices are bent or kinked at proline residues in the middle of the membrane 
(Figure 20.29). 

Transport occurs via a single nucleotide-binding site, which alternately faces the 
matrix and the intermembrane space. It binds ATP on the matrix side, reorients to face 
the outside, and exchanges ATP for ADP, with subsequent rearrangement to face the 
matrix side of the inner membrane. Transport of ATP and ADP is regulated by phos- 
phorylation of Tyr!** on transmembrane helix number 5 (Figure 20.29). The relatively 
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Human Biochemistry 


Endogenous Uncouplers—Novel Proteins with Many Beneficial Effects 


Certain cold-adapted animals, hibernating animals, and newborn 
animals generate large amounts of heat by uncoupling oxidative 
phosphorylation. These organisms have a type of fat known as 
brown adipose tissue, so called for the color imparted by the many 
mitochondria this adipose tissue contains. The inner membrane of 
brown adipose tissue mitochondria contains large amounts of an 
endogenous protein called thermogenin (literally, “heat maker”) or 
uncoupling protein 1 (UCP1). UCP1 creates a passive proton chan- 
nel through which protons flow from the cytosol to the matrix and 


acid oxidation. It also appears to be essential for adaptation to 
fasting. 

Certain plants use the heat of uncoupled proton transport for 
a special purpose. Skunk cabbage and related plants contain floral 
spikes that are maintained as much as 20° above ambient tem- 
perature in this way. The warmth of the spikes serves to vaporize 
odiferous molecules, which attract insects that fertilize the flowers. 
Red tomatoes have very small mitochondrial membrane proton 
gradients compared with green tomatoes—evidence that uncouplers 


UCPI is a universal temperature regulator for animals. Mice that are more active in red tomatoes. 
lack UCP1 cannot maintain their body temperature in cold condi- 
tions, whereas normal animals produce larger amounts of UCP1 
when they are cold adapted. 

Two other uncoupling proteins, UCP2 and UCP3, show 60% se- 
quence identity with UCP1 and, like UCP1, they can also dissipate 
the mitochondrial proton gradient, but they do so in the service of 
very different physiological functions. UCP2 is found mainly in kid- 
ney, pancreas, spleen, immune cells, and the central nervous system, 
and it is involved in a variety of processes, including regulation of 
food intake, insulin secretion, and immunity. In particular, it plays 
a role in regulation of the generation of reactive oxygen species 
(ROS), which include hydrogen peroxide, hypochlorite ion, hydroxyl 
radical, and superoxide anion (see figure). 
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Mitochondrial ROS production correlates exponentially with 
mitochondrial proton-motive force (Equation 20.12 and Section 
20.3). (The larger the proton-motive force, the more likely that 
the respiratory electron carriers will donate electrons to oxygen to 
form ROS.) Dissipation of the proton-motive force by UCP2 thus 
attenuates ROS formation and reduces oxidative stress, which has 
been linked to diabetes and atherosclerosis. On the other hand, 
UCP3 may play a similar role in attenuation of ROS during fatty 
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large size and charge of the ATP and ADP substrates, compared to the relatively small 
size of the translocase itself, may require large conformation changes during trans- 
port. Export of ATP by the translocase is coupled to the ATP/ADP ratio and the 
proton-motive force across the inner mitochondrial membrane, ensuring that the 
energy needs of the cell will be met. 


Outward Movement of ATP Is Favored over Outward ADP Movement The charge on 
ATP at pH 7.2 or so is about —4, and the charge on ADP at the same pH is about —3. 
Thus, net exchange of an ATP (out) for an ADP (in) results in the net movement of one 
negative charge from the matrix to the cytosol. (This process is equivalent to the move- 
ment of a proton from the cytosol to the matrix.) Recall that the inner membrane is 
positive outside (see Figure 20.19), and it becomes clear that outward movement of 
ATP is favored over outward ADP transport, ensuring that ATP will be transported 
out (Figure 20.29). Inward movement of ADP is favored over inward movement of 
ATP for the same reason. Thus, the membrane electrochemical potential itself controls 
the specificity of the ATP-ADP translocase. However, the electrochemical potential is 
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Figure 20.29 The bovine ATP-ADP 
translocase (PDB id = 2C3E). (a) a 
ribbon structure and the path of the 
protein across the membrane. 

(b) Outward transport of ATP (via the 
ATP-ADP translocase) is favored by the 
membrane electrochemical potential. 
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diminished by the ATP-ADP translocase cycle and therefore operates with an energy 
cost to the cell. The cell must compensate by passing yet more electrons down the 
electron-transport chain. 

What is the cost of ATP-ADP exchange relative to the energy cost of ATP synthesis 
itself? We already noted that moving one ATP out and one ADP in is the equivalent of 
one proton moving from the cytosol to the matrix. Synthesis of an ATP results from the 
movement of approximately three protons from the cytosol into the matrix through Fo. 
Altogether, this means that approximately four protons are transported into the matrix 
per ATP synthesized. Thus, approximately one-fourth of the energy derived from the 
respiratory chain (electron transport and oxidative phosphorylation) is expended as the 
electrochemical energy devoted to mitochondrial ATP—ADP transport. 


20.5 | What Is the P/O Ratio for Mitochondrial 
Oxidative Phosphorylation? 


The P/O ratio is the number of molecules of ATP formed in oxidative phosphorylation 
per two electrons flowing through a defined segment of the electron-transport chain. 
In spite of intense study of this ratio, its actual value remains a matter of contention. 

The P/O ratio depends on the ratio of H* transported out of the matrix per 2 e~ 
passed from NADH to O; in the electron-transport chain and on the number of H* that 
pass through the ATP synthase to synthesize an ATP. The latter number depends on the 
number of c-subunits in the Fo ring of the synthase. As noted in Table 20.2, the number 
of c-subunits in the ATP synthase ranges from 8 to 15, depending on the organism. This 
would correspond to ratios of H* consumed per ATP from about 2.7 to 5, respectively, 
since each rotation of the ATP synthase rotor drives the formation of three ATP. Add- 
ing one H* for the action of the ATP-ADP translocase raises these values to about 3.7 
and 6, respectively. 
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20.6 How Are the Electrons of Cytosolic NADH Fed into Electron Transport? 


Assuming 10 H* are transported out of the matrix per 2 e~ passed from NADH to 
O, through the electron-transport chain, and 3.7 H* are transported into the matrix per 
ATP synthesized (and translocated), then the mitochondrial P/O ratio is 10/3.7, or 2.7, 
for the case of electrons entering the electron-transport chain as NADH. 


rR 10 H+ )= a2 
O \3.7H*/\2e [NADH >50,]/ 37 ` 


This is somewhat lower than earlier estimates, which placed the P/O ratio at three for 
mitochondrial oxidation of NADH. For the portion of the chain from succinate to O», 
the H*/2e™ ratio is 6 (as noted previously), and the P/O ratio in this case would be 6/3.7, 
or 1.6; earlier estimates placed this number at 2. The consensus of more recent experi- 
mental measurements of P/O ratios for these two cases has been closer to the values of 
2.7 and 1.6. Many chemists and biochemists, accustomed to the integral stoichiometries 
of chemical and metabolic reactions, were once reluctant to accept the notion of nonin- 
tegral P/O ratios. At some point, as we learn more about these complex coupled pro- 
cesses, it may be necessary to reassess the numbers. 


20.6 | How Are the Electrons of Cytosolic NADH 
Fed into Electron Transport? 


Most of the NADH used in electron transport is produced in the mitochondrial 
matrix, an appropriate site because NADH is oxidized by Complex I on the matrix 
side of the inner membrane. Furthermore, the inner mitochondrial membrane is 
impermeable to NADH. Recall, however, that NADH is produced in glycolysis by 
glyceraldehyde-3-P dehydrogenase in the cytosol. If this NADH were not oxidized to 
regenerate NAD+, the glycolytic pathway would cease to function due to NAD* limi- 
tation. Eukaryotic cells have a number of shuttle systems that harvest the electrons of 
cytosolic NADH for delivery to mitochondria without actually transporting NADH 
across the inner membrane. 


20.6a The Glycerophosphate Shuttle Ensures Efficient Use of Cytosolic NADH 


In the glycerophosphate shuttle pathway, two different glycerophosphate dehydrogenases, 
one in the cytosol and one on the outer face of the mitochondrial inner membrane, 
work together to carry electrons into the mitochondrial matrix (see Figure 20.30). 
NADH produced in the cytosol transfers its electrons to dihydroxyacetone phos- 
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Figure 20.30 The glycerophosphate shuttle (also known as the glycerol phosphate shuttle) couples 
the cytosolic oxidation of NADH with mitochondrial reduction of [FAD]. 
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phate, thus reducing it to glycerol-3-phosphate. This metabolite is reoxidized by the 
FAD*-dependent mitochondrial membrane enzyme to reform dihydroxyacetone 
phosphate and enzyme-bound FADH>. The two electrons of [FADH,] are passed 
directly to UQ, forming UQH). Thus, via this shuttle, cytosolic NADH can be used to 
produce mitochondrial [FADH)] and, subsequently, UQH). As a result, cytosolic 
NADH oxidized via this shuttle route yields only 1.6 molecules of ATP. The cell pays 
with a potential ATP molecule for the convenience of getting cytosolic NADH into the 
mitochondria. Although this may seem wasteful, there is an important payoff. The 
glycerophosphate shuttle is essentially irreversible, and even when NADH levels are 
very low relative to NAD‘, the cycle operates effectively. 


20.6b The Malate—Aspartate Shuttle Is Reversible 


The second electron shuttle system, called the malate—aspartate shuttle, is shown in 
Figure 20.31. Oxaloacetate is reduced in the cytosol, acquiring the electrons of 
NADH (which is oxidized to NAD*). Malate is transported across the inner mem- 
brane, where it is reoxidized by malate dehydrogenase, converting NAD* to NADH 
in the matrix. This mitochondrial NADH readily enters the electron-transport chain. 
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Figure 20.31 The malate (oxaloacetate)—aspartate shuttle, which operates across the inner 
mitochondrial membrane. In this shuttle process, electrons from cytosolic NADH are carried 
by malate across the inner mitochondrial membrane to produce NADH in the matrix. 
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20.6 How Are the Electrons of Cytosolic NADH Fed into Electron Transport? 


The oxaloacetate produced in this reaction cannot cross the inner membrane and 
must be transaminated to form aspartate, which can be transported across the mem- 
brane to the cytosolic side. Transamination in the cytosol recycles aspartate back to 
oxaloacetate. In contrast to the glycerol phosphate shuttle, the malate—aspartate 
cycle is reversible, and it operates as shown in Figure 20.31 only if the NADH/NAD* 
ratio in the cytosol is higher than the ratio in the matrix. Because this shuttle pro- 
duces NADH in the matrix, the full 2.7 ATPs per NADH are recovered. 


20.6c The Net Yield of ATP from Glucose Oxidation 
Depends on the Shuttle Used 


The complete route for the conversion of the metabolic energy of glucose to ATP has 
now been described in Chapters 18 through 20. Assuming appropriate P/O ratios, the 
number of ATP molecules produced by the complete oxidation of a molecule of glu- 
cose can be estimated. Keeping in mind that P/O ratios must be viewed as approxi- 
mate, for all the reasons previously cited, this text assumes the values of 2.7 and 1.6 for 
the mitochondrial oxidation of NADH and succinate, respectively. In eukaryotic 
cells, the combined pathways of glycolysis, the TCA cycle, electron transport, and 
oxidative phosphorylation then yield a net of approximately 32 to 34 molecules of ATP 
per molecule of glucose oxidized, depending on the shuttle route used (Table 20.3). 


| Table 20.3 | ye} Yield of ATP from Glucose Oxidation 


ATP Yield per Glucose 
Glycerol- Malate- 


Phosphate Aspartate 
Pathway Shuttle Shuttle 
Glycolysis: glucose to pyruvate (cytosol) 
Phosphorylation of glucose =l =] 
Phosphorylation of fructose-6-phosphate =f =1 
Dephosphorylation of 2 molecules of 1,3-BPG +2 +2 
Dephosphorylation of 2 molecules of PEP +2 +2 
Oxidation of 2 molecules of glyceraldehyde-3-phosphate 
yields 2 NADH 
Pyruvate conversion to acetyl-CoA (mitochondria) 
2 NADH 
Citric acid cycle (mitochondria) 
2 molecules of GTP from 2 molecules of succinyl-CoA +2 +2 


Oxidation of 2 molecules each of isocitrate, a-ketoglutarate, 
and malate yields 6 NADH 


Oxidation of 2 molecules of succinate yields 2 [FADH)] 


Oxidative phosphorylation (mitochondria) 


2 NADH from glycolysis yield 1.6 ATPs each if NADH +3.2 +5.4 
is oxidized by glycerol-phosphate shuttle; 2.7 ATP by 
malate—aspartate shuttle 


Oxidative decarboxylation of 2 pyruvate to 2 acetyl-CoA: 


2 NADH produce 2.7 ATPs each +5.4 +5.4 

2 [FADH,] from each citric acid cycle produce +3.2 +3.2 

1.6 ATPs each 

6 NADH from citric acid cycle produce 2.7 ATPs each +16.2 +16.2 
Net Yield 32.0 34.2 


Note: These P/O ratios of 2.7 and 1.6 for mitochondrial oxidation of NADH and [FADH,] are consensus values. 
Because they may not reflect actual values and because these ratios may change depending on metabolic conditions, 
these estimates of ATP yield from glucose oxidation are approximate. 
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The net stoichiometric equation for the oxidation of glucose, using the glycerol phos- 
phate shuttle, is 


Glucose + 6 O, + ~32 ADP +~32 P; —> 
6 CO, +~32 ATP + ~38 H,O (20.18) 


Because the 2 NADH formed in glycolysis are transported by the glycerol phosphate 
shuttle in this case, they each yield only 1.6 ATP, as already described. On the other 
hand, if these 2 NADH take part in the malate—aspartate shuttle, each yields 2.7 
ATP, giving a total (in this case) of 34.2 ATP formed per glucose oxidized. Most of 
the ATP—28 out of 32 or 30 out of 34—is produced by oxidative phosphorylation; 
only 4 ATP molecules result from direct synthesis during glycolysis and the TCA 
cycle. 

The situation in bacteria is somewhat different. Prokaryotic cells need not carry out 
ATP-ADP exchange. Thus, bacteria have the potential to produce as many as 38 ATP 
per glucose. 


20.6d 3.5 Billion Years of Evolution Have Resulted 
in a Very Efficient System 


How much energy does a eukaryotic cell extract from the glucose molecule? Taking a 
value of 50 kJ/mol for the hydrolysis of ATP under cellular conditions (see Chapter 3), 
the production of 34 ATPs per glucose oxidized yields 1700 kJ/mol of glucose. The 
cellular oxidation (combustion) of glucose yields AG = —2937 kJ/mol. The overall ef- 
ficiency for the pathways of glycolysis, the TCA cycle, electron transport, and oxidative 
phosphorylation is 1700/2937 Xx 100% = 58%. 


20.7 | How Do Mitochondria Mediate Apoptosis? 


Mitochondria are not only the home of the TCA cycle and oxidative phosphorylation 
but also are a crossroads for several cell-signaling pathways. Mitochondria take up 
Ca?’ ions released from the endoplasmic reticulum, thus helping control intracellular 
Ca? signals. They produce reactive oxygen species (ROS), including oxygen ions, free 
radicals and peroxides, that play signaling roles in cells, although ROS can also cause 
cellular damage. Mitochondria also participate in the programmed death of cells, a 
process known as apoptosis (the second “p” is silent in this word). 

Apoptosis is a mechanism through which certain cells are eliminated from higher 
organisms. It is central to the development and homeostasis of multicellular organisms, 
and it is the route by which unwanted or harmful cells are eliminated. Under normal 
circumstances, apoptosis is suppressed through compartmentation of the involved 
activators and enzymes. Mitochondria play a major role in this subcellular partitioning 
of the apoptotic activator molecules. One such activator is cytochrome c, which 
normally resides in the intermembrane space, bound tightly to a lipid chain of cardiolipin 
in the membrane (Figure 20.32). A variety of triggering agents, including Ca?*+, ROS, 
certain lipid molecules, a family of caspases (proteases with a catalytic Cys residue that 
cleave next to Asp residues in targets), and certain protein kinases, can induce the 
opening of pores in the mitochondrial membrane. For example, using hydrogen peroxide 
as a substrate, cytochrome c can oxidize its bound cardiolipin chain, releasing itself 
from the membrane. When the outer membrane is made permeable by other apoptotic 
signals, cytochrome c can enter the cytosol. This is termed mitochondrial outer membrane 
permeabilization (MOMP). 

Permeabilization events, which occur at points where outer and inner mitochondrial 
membranes are in contact, involve association of the ATP—ADP translocase in the 
inner membrane and the voltage-dependent anion channel (VDAC) in the outer mem- 
brane. This interaction leads to the opening of protein-permeable pores. Cytochrome 
c, as well as several other proteins, can pass through these pores. Pore formation is 
carefully regulated by the BCL-2 family of proteins, which includes both proapoptotic 
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members (proteins known as BAX, BID, and BAD) and antiapoptotic members 
(BCL-2 itself, as well as BCL-X; and BCL-W). The BCL-2 proteins also activate a se- 
ries of events that trigger autophagy, a catabolic process involving the degradation of 
the cell’s own components through the lysosomal machinery. The cell’s choice of 
apoptosis or autophagy is regulated in part by a membrane protein called RAGE, the 


receptor for advanced glycation endproducts. 
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Figure 20.32 The role of mitochondria in apoptosis. 
(a) Cytochrome c is anchored at the inner 
mitochondrial membrane by association with 
cardiolipin (diphosphatidylglycerol). The peroxidase 
activity of cytochrome c oxidizes a cardiolipin lipid 
chain, releasing cytochrome c from the membrane. 
(b) The opening of pores in the outer membrane, 
induced by a variety of triggering agents, releases 
cytochrome c to the cytosol, where it initiates the 
events of apoptosis. 
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20.7a Cytochrome c Triggers Apoptosome Assembly 


How is the release of cytochrome c translated into the activation of the death cascade, 
a point of no return for the cell? The answer lies in the assembly, in the cytosol, of a 
signaling platform called the apoptosome (Figures 20.32 and 20.33). The function of the 
apoptosome is to activate a cascade of proteases called caspases. (The “c” in “caspase” 
is for cysteine and “asp” is for aspartic acid. Caspases have Cys at the active site and 
cleave their peptide substrates after Asp residues.) The apoptosome is a wheel-shaped, 
heptameric platform that looks like (and in some ways behaves like) an earth-orbiting 
space station. It is assembled from seven subunits of the apoptotic protease-activating 
factor 1 (Apaf-1), a multidomain protein. Apaf-1 contains an ATPase domain (which 
prefers dATP over ATP in some organisms), a caspase-recruitment domain (CARD), 
and a WD40 repeat domain. Normally (before the death-signaling cytochrome c is 
released from mitochondria), these three domains are folded against each other 
(Figure 20.33b), with dATP tightly bound, and Apaf-1 is locked in an inactive mono- 
meric state. Binding of cytochrome c to the WD40 domain, followed by dATP hydro- 
lysis, converts Apaf-1 to an extended conformation. Then, exchange of dADP for a 
new molecule of dATP prompts assembly of the heptameric platform (Figure 20.33), 
which goes on to activate the death-dealing caspase cascade. 

Mitochondria-mediated apoptosis is the mode of cell death for many neurons in the 
brain during strokes and other brain-trauma injuries. When a stroke occurs, the neurons 
at the site of oxygen deprivation die within minutes by a nonspecific process of necrosis, 
but cells adjacent to the immediate site of injury die more slowly by apoptosis. These 
latter cells have been saved by a variety of therapeutic interventions that suppress apop- 
tosis in laboratory studies, raising the hope that strokes and other neurodegenerative 
conditions may someday be treated clinically in similar ways. 
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Figure 20.33 (a) Apaf-1 is a multidomain protein, consisting of an N-terminal CARD, a 
nucleotide-binding and oligomerization domain (NOD), and several WD40 domains. (b) Binding 

of cytochrome c to the WD40 domains and ATP hydrolysis unlocks Apaf-1 to form the semiopen 
conformation. Nucleotide exchange leads to oligomerization and apoptosome formation. (c) A model 
of the apoptosome, a wheellike structure with molecules of cytochrome c bound to the WD40 
domains, which extend outward like spokes. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 20. 


e Electrons from glycolysis and the TCA cycle, stored in the form of 
reduced coenzymes (NADH and FADH, pass through the electron 
transport chain, generating a proton gradient that is used to make ATP. 


e Electron transport and oxidative phosphorylation take place in the 
inner membranes of mitochondria and in the plasma membranes of 
prokaryotes. 


e Mitochondria are surrounded by a simple and porous outer mem- 
brane as well as a more complex inner membrane. 


e Mitochondria are constantly remodeled by fission and fusion of the 
mitochondrial membranes. 


e Electrons move spontaneously from more negative to more positive 
reduction potentials, passing through the four complexes of the elec- 
tron transport chain. 


e Complex I of the electron transport chain collects a pair of electrons 
from NADH and passes them to ubiquinone, inducing conforma- 
tional changes that pump four protons from the matrix to the inter- 
membrane space. 


e Complex II of the electron transport chain oxidizes succinate, collect- 
ing a pair of electrons from FADH, and passing them to ubiquinone, 
with no attendant transport of protons across the inner membrane. 


e Complex III of the electron transport chain mediates electron trans- 
port from hydrophobic ubiquinone to water-soluble cytochrome c, in 
a multi-step pathway called the Q cycle. Two electrons through this 
cycle pump four protons across the mitochondrial inner membrane. 


e Complex IV of the electron transport chain transfers electrons from 
cytochrome c to oxygen, reducing it to water. One pair of electrons 
passing through this complex transport two protons across the mi- 
tochondrial inner membrane. Oxygen is the terminal acceptor of 
electrons in the electron transport chain. 


e The complexes of the electron transport chain function as super- 
complexes (also known as respirasomes) in the inner mitochondrial 
membrane. 


e Peter Mitchell’s chemiosmotic hypothesis proposed that protons are 
driven across the inner mitochondrial membrane by electron trans- 
port and that the resulting proton gradient would drive the synthesis 
of ATP. 


The free energy change for movement of protons across the inner 
mitochondrial membrane is sufficient to drive the synthesis of ATP 
by the ATP synthase. 


The F,Fo-ATP synthase is an enzyme, a proton pump, and a rotating 
molecular motor that drives the synthesis of ATP. 


The three catalytic B-sites of the F; complex of the ATP synthase nor- 
mally exist in three different conformations that represent the steps of 
ATP synthesis—one with ATP bound, one with ADP bound and one 
empty, as predicted by Paul Boyer’s binding change mechanism and 
as confirmed by John Walker’s structural studies. 


Proton flow through the Fo unit drives rotation of the c-ring of the 
ATP synthase, inducing conformational changes that drive the syn- 
thesis of ATP in the F; unit. 


Reconstituted vesicles containing ATP synthase and bacteriorhodop- 
sin were used by Walter Stoeckenius and Efraim Racker to synthesize 
ATP upon exposure to light, confirming the Mitchell hypothesis. 


Inhibitors of Complexes I-IV of the electron transport chain block 
electron transport. 


Uncouplers, whether proteins, or small hydrophobic molecules with 
dissociable protons, disrupt the coupling of electron transport and 
ATP synthesis. 


The ATP—ADP translocase mediates the movement of ATP and ADP 
across the inner mitochondrial membrane. 


The P/O ratio for electrons entering the electron transport chain from 
NADH is approximately 2.7. The P/O ratio for electrons entering as 
succinate/FADH) is approximately 1.6. 


Electrons from cytosolic NADH can be fed into the electron trans- 
port chain via the glycerophosphate shuttle or the malate-aspartate 
shuttle, the latter being more efficient. 


The combined pathways of glycolysis, the TCA cycle, electron trans- 
port and oxidative phosphorylation yield 32 to 34 ATP per glucose 
oxidized in eukaryotes. 


Apoptosis is triggered by oxidation of cardiolipin in the mitochon- 
drial membrane, which leads to release of cytochrome c to the cyto- 
sol. This triggers assembly of apoptosomes and activation of caspases 
that lead to apoptosis, the programmed death of cells. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Understanding Redox Couples For the following reaction, 
[FAD] + 2 cyt c (Fe?*) + 2 H+ —> [FADH,] + 2 cyt c (Fe?*) 


determine which of the redox couples is the electron acceptor and 
which is the electron donor under standard-state conditions. 
Calculate the value of AG,', and determine the standard-state free 
energy change for the reaction in the thermodynamically favored 
direction. 


2. Reduction/Oxidation Potentials (Integrates with Chapter 3) Refer to 
Table 3.3 listing the standard reduction potentials of some biochemi- 
cal compounds. 

a. For each pair listed, indicate which compound is most likely to 
be reduced. 
al. UQ or fumarate 
a2. NAD* or FAD 
a3. pyruvate or cytochrome c (Fe**) 


b. For each pair listed, indicate which compound is most likely to 
be oxidized. 
bl. FMNH; or succinate 
b2. HO or UQH> 
b3. malate or a-ketoglutarate 


3. Determining the Redox Potential of the Glyceraldehyde-3-Phosphate 


Dehydrogenase Reaction Calculate the value of A€,’ for the 
glyceraldehyde-3-phosphate dehydrogenase reaction, and calculate the 
free energy change for the reaction under standard-state conditions. 


. Understanding the pH Dependence of NAD* Reduction For the 


following redox reaction, 
NAD* + 2 H* + 2e —> NADH + H+ 
suggest an equation (analogous to Equation 3.33) that predicts the 


pH dependence of this reaction, and calculate the reduction 
potential for this reaction at pH 8. 


. Understanding an Antidote for Cyanide Poisoning Sodium nitrite 


(NaNO, ) is used by emergency medical personnel as an antidote for 
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cyanide poisoning (for this purpose, it must be administered immedi- 
ately). Based on the discussion of cyanide poisoning in Section 20.4, 
suggest a mechanism for the lifesaving effect of sodium nitrite. 


. Assessing the Mechanism of Uncoupler Action A wealthy investor has 


come to you for advice. She has been approached by a biochemist 
who seeks financial backing for a company that would market 
dinitrophenol and dicumarol as weight-loss medications. The 
biochemist has explained to her that these agents are uncouplers and 
that they would dissipate metabolic energy as heat. The investor wants 
to know if you think she should invest in the biochemist’s company. 
How do you respond? 


. Assessing the [ATP]/[ADP][P;] Ratio and ATP Synthesis Assuming that 


3 H* are transported per ATP synthesized in the mitochondrial 
matrix, the membrane potential difference is 0.18 V (negative inside), 
and the pH difference is | unit (acid outside, basic inside), calculate 
the largest ratio of [ATP]/[ADP][P;] under which synthesis of ATP 
can occur. 


. Analyzing the Succinate Dehydrogenase Reaction Of the dehydroge- 


nase reactions in glycolysis and the TCA cycle, all but one use NAD* 
as the electron acceptor. The lone exception is the succinate dehydro- 
genase reaction, which uses covalently bound FAD of a flavoprotein 
as the electron acceptor. The standard reduction potential for this 
bound FAD is in the range of 0.003 to 0.091 V (see Table 3.3). 
Compared with the other dehydrogenase reactions of glycolysis and 
the TCA cycle, what is unique about succinate dehydrogenase? Why is 
bound FAD a more suitable electron acceptor in this case? 


. Analyzing the Succinate Dehydrogenase Reaction (Integrates with 


Chapter 3) The conversion of succinate into fumarate uses FAD as 
the oxidizing agent. Use the @,’ values given in Table 3.3 to calculate: 


a. AG” for this reaction. 
b. AG” for this reaction if NAD* was used as the oxidizing agent. 


Analysis of the NADH-CoQ Reductase Reaction 


a. What is the standard free energy change (A G°’) for the reduction 
of coenzyme Q by NADH as carried out by Complex I (NADH-— 
coenzyme Q reductase) of the electron-transport pathway if @,' 
(NAD*/NADH) = —0.320 V and €,’ (CoQ/CoQH;) = +0.060 V. 

b. What is the equilibrium constant (Keq) for this reaction? 

c. Assume that (1) the actual free energy release accompanying the 
NADH-coenzyme Q reductase reaction is equal to the amount 
released under standard conditions (as calculated in part a), 

(2) this energy can be converted into the synthesis of ATP with 
an efficiency = 0.75 (that is, 75% of the energy released upon 
NADH oxidation is captured in ATP synthesis), and (3) the 
oxidation of one equivalent of NADH by coenzyme Q leads 
to the phosphorylation of one equivalent of ATP. 


Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [P;] = 
1 mM? (Assume AG” for ATP synthesis = +30.5 kJ/mol.) 


Determining the Impact of the Succinate Dehydrogenase Reaction on 
Oxidative Phosphorylation Consider the oxidation of succinate by 
molecular oxygen as carried out via the electron-transport pathway 


Succinate + 50, — fumarate + H,O 


a. What is the standard free energy change (A G°’) for this reaction 
if £,’ (Fum/Succ) = +0.031 V and €,' (;0./H2O) = +0.816 V? 

b. What is the equilibrium constant (Keq) for this reaction? 

c. Assume that (1) the actual free energy release accompanying 
succinate oxidation by the electron-transport pathway is equal to 
the amount released under standard conditions (as calculated in 
part a), (2) this energy can be converted into the synthesis of 
ATP with an efficiency = 0.7 (that is, 70% of the energy released 
upon succinate oxidation is captured in ATP synthesis), and 


12. 


13. 


14. 


15. 


(3) the oxidation of one succinate leads to the phosphorylation 
of two equivalents of ATP. 


Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [P;] = 
1 mM? (Assume AG” for ATP synthesis = +30.5 kJ/mol.) 


Determining the Effects of NADH Oxidation on Oxidative 
Phosphorylation Consider the oxidation of NADH by molecular 
oxygen as carried out via the electron-transport pathway 


NADH + H+ + 0, — NAD* + H,O 


a. What is the standard free energy change (A G°’) for this reaction if 
&€o' (NAD*/NADH) = —0.320 V and €,' (O2/H2O) = +0.816 V. 


b. What is the equilibrium constant (Keq) for this reaction? 


c. Assume that (1) the actual free energy release accompanying 
NADH oxidation by the electron-transport pathway is equal to 
the amount released under standard conditions (as calculated in 
part a), (2) this energy can be converted into the synthesis of 
ATP with an efficiency = 0.75 (that is, 75% of the energy 
released upon NADH oxidation is captured in ATP synthesis), 
and (3) the oxidation of one NADH leads to the phosphoryla- 
tion of three equivalents of ATP. 

Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [P;] = 
2 mM? (Assume AG” for ATP synthesis = +30.5 kJ/mol.) 


Determining the Effect of the Cytochrome Oxidase Reaction on 
Oxidative Phosphorylation Write a balanced equation for the 
reduction of molecular oxygen by reduced cytochrome c as carried 
out by Complex IV (cytochrome oxidase) of the electron-transport 
pathway. 


a. What is the standard free energy change (AG°’) for this reaction if 


AZo cyt c(Fe**)/cyt c(Fe?*) = +0.254 volts and 
g,’ (40,/H,0) = 0.816 volts 


b. What is the equilibrium constant (Keq) for this reaction? 

c. Assume that (1) the actual free energy release accompanying 
cytochrome c oxidation by the electron-transport pathway is 
equal to the amount released under standard conditions (as 
calculated in part a), (2) this energy can be converted into the 
synthesis of ATP with an efficiency = 0.6 (that is, 60% of the 
energy released upon cytochrome c oxidation is captured in ATP 
synthesis), and (3) the reduction of one molecule of O, by 
reduced cytochrome c leads to the phosphorylation of two 
equivalents of ATP. 


Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [Pi] = 3 mM? 
(Assume A G” for ATP synthesis = +30.5 kJ/mol and T = 298 K.) 


Understanding the Energetics of the Lactate Dehydrogenase Reaction 
The standard reduction potential for (NAD*/NADH) is —0.320 V, 
and the standard reduction potential for (pyruvate/lactate) is —0.185 V. 


a. What is the standard free energy change (A G°’) for the lactate 
dehydrogenase reaction: 


NADH + H* + pyruvate —> lactate + NAD* 


b. What is the equilibrium constant (Keq) for this reaction? 


c. If [pyruvate] = 0.05 mM and [lactate] = 2.9 mM and AG for the 
lactate dehydrogenase reaction = —15 kJ/mol in erythrocytes, 
what is the [NAD*]/[NADH] ratio under these conditions? 


Analyzing the Lactate Dehydrogenase Action (Integrates with Chapter 3) 
The conversion of pyruvate into lactate uses NAD+ as the oxidizing 
agent. Use the @,’ values given in Table 3.3 to calculate: 

a. AG” for this reaction. 

b. AG” for this reaction if FAD was used as the oxidizing agent. 
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Assessing Proton Transport and Free Energy Change in Bacterial ATP 
Synthesis Assume that the free energy change (AG) associated 
with the movement of | mole of protons from the outside to the 
inside of a bacterial cell is —23 kJ/mol and 3 H+ must cross the 
bacterial plasma membrane per ATP formed by the bacterial 
F,Fo-ATP synthase. ATP synthesis thus takes place by the coupled 
process: 


3 Hou + ADP + P; ——3 H*i, + ATP + H2O 


a. If the overall free energy change (A Goveran) associated with ATP 
synthesis in these cells by the coupled process is —21 kJ/mol, 
what is the equilibrium constant (Keq) for the process? 

b. What is A Gyynthesis, the free energy change for ATP synthesis, in 
these bacteria under these conditions? 

c. The standard free energy change for ATP hydrolysis 
(A G?’ hydrolysis) iS —30.5 kJ/mol. If [Pj] = 2 mM in these bacterial 
cells, what is the [ATP]/[ADP] ratio in these cells? 


b Cytochromes in Complex III As discussed in Section 20.2d, the two b 
cytochromes of Complex II contain the same heme molecule but 
with very different reduction potentials (—0.1 V versus +0.05 V). 
Explain what causes this difference in reduction potential. 


Describing the Path of Electrons Through the Q Cycle of Complex 
Ill Describe in your own words the path of electrons through the Q 
cycle of Complex III. 


Describing the Path of Electrons Through Complex IV Describe in your 
own words the path of electrons through the copper and iron centers 
of Complex IV. 


Oxidation of Water Because of its high reduction potential, the 
reduction of O, to H,O serves as a potent terminal electron acceptor 
system for the electron transport chain. Is it possible for H,O to be 
reoxidized to O; or, in other words, is it possible for oxygen in water 
to lose its electrons? 


Understanding Oxidation of Unsaturated Lipids In the course of 
events triggering apoptosis, a fatty acid chain of cardiolipin under- 
goes peroxidation to release the associated cytochrome c. Lipid 
peroxidation occurs at a double bond. Suggest a mechanism for the 
reaction of hydrogen peroxide with an unsaturation in a lipid chain, 
and identify a likely product of the reaction. 


Determining the Number of Protons in a Typical Mitochondrion In 
problem 20 at the end of Chapter 19, you might have calculated the 
number of molecules of oxaloacetate in a typical mitochondrion. 
What about protons? A typical mitochondrion can be thought of as 
a cylinder 1 um in diameter and 2 wm in length. If the pH in the 
matrix is 7.8, how many protons are contained in the mitochondrial 
matrix? 


Understanding the Unique Requirement for FAD in the Succinate 
Dehydrogenase Reaction Considering that all other dehydrogenases 
of glycolysis and the TCA cycle use NADH as the electron donor, 
why does succinate dehydrogenase, a component of the TCA cycle 
and the electron-transfer chain, use FAD as the electron acceptor 
from succinate, rather than NAD*? Note that there are two 
justifications for the choice of FAD here—one based on energetics 
and one based on the mechanism of electron transfer for FAD 
versus NAD*. 


Anaerobic Respiration Anaerobic organisms do not use oxygen as the 
terminal acceptor for electron transport. Other reduction reactions, 
shown below, drive metabolism in anaerobic organisms. Write 
stoichiometric equations for these reduction reactions: 

a. Fet > Fe” b. SO; — HS- 

c. NO;—> N, d. S— HS- 


25; 


26. 
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Problems 


Understanding the Mechanism of the ATP Synthase Based on your 

reading in this chapter on the F; Fy-ATPase, what would you 

conclude about the mechanism of ATP synthesis: 

a. The reaction proceeds by nucleophilic substitution via the Sy2 
mechanism. 

b. The reaction proceeds by nucleophilic substitution via the Sy1 
mechanism. 

c. The reaction proceeds by electrophilic substitution via the El 
mechanism. 

d. The reaction proceeds by electrophilic substitution via the E2 
mechanism. 


Relating ProtonTransport and Free Energy Changes in Mitochondria 
Imagine that you are working with isolated mitochondria and you 
manage to double the ratio of protons outside to protons inside. In 
order to maintain the overall AG at its original value (whatever it is), 
how would you have to change the mitochondria membrane potential? 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


27. 


28. 


Using the PSAFE model for Complex I of the electron transport 
chain: Describe the interactions of different arms of the L-shape. 
Describe Nqol’s Rossman fold. Determine the specific interactions of 
each residue of the catalytic binding site with NADH. 


Using the PSAFE model for ADP-ATP translocase, explain the 
formation of the kinked a-helices. 


Think-Pair-Share Question 


The electron transport/oxidative phosphorylation pathway may seem like 
a drawn-out and overly complicated process with room for improvement 
or simplification. Consider this set of questions to argue that the system 

as it exists may be the best of all possible worlds. 


a. A single pair of electrons from NADH induces proton transport 
in complexes I, III and IV creating the proton motive force that 
drives the synthesis of ATP. Why not add more complexes in the 
electron transport system? Why could an expanded system 
consisting of more complexes that could induce much more 
proton transport leading to a stronger proton motor force and 
the synthesis of more ATP from a single NADH be catastrophic 
to the cell? (Hint: revisit “Human Biochemistry: Endogenous 
Uncouplers—Novel Proteins with Many Beneficial Effects”). 

b. Why do cells not use a direct reduction of oxygen by NADH at 
the active site of a complex IV-like protein according to the 
following equation: 2NADH + 2H* + O, > 2NAD* + 2H,0? 
Such a setup would eliminate the need for proton pumping or 
electron transport. 

c. Imagine an inverse electron/proton transport system in which the 
electron transport/oxidative phosphorylation process involves the 
movement of protons through multiple complexes inside the 
mitochondrial inner membrane and the transport of electrons 
across the mitochondrial inner membrane towards the intermem- 
brane space. What would be the driving force for this proton 
transport through the various complexes? What kind of features 
would such a system need to contain? Would it be as efficient and 
functional as the existing system? 
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Photosynthesis 


< Field of goldenrod. 


Essential Questions 


Photosynthesis is the primary source of energy for all life forms (except chemolithotrophic 
prokaryotes). Much of the energy of photosynthesis is used to drive the synthesis of organic 
molecules from atmospheric CO7. How is solar energy captured and transformed into meta- 
bolically useful chemical energy? How is the chemical energy produced by photosynthesis 
used to create organic molecules from carbon dioxide? 


he vast majority of energy consumed by living organisms stems from solar energy 

captured by the process of photosynthesis. Only chemolithotrophic prokaryotes 
are independent of this energy source. Of the 1.5 X 10 kJ of energy reaching the 
earth each day from the sun, 1% is absorbed by photosynthetic organisms and trans- 
duced into chemical energy.'! This energy, in the form of biomolecules, becomes 
available to other members of the biosphere through food chains. The transduction of 
solar, or light, energy into chemical energy is often expressed in terms of carbon diox- 
ide fixation, in which hexose is formed from carbon dioxide and oxygen is evolved: 


6 CO, + 6 H20 H CHRO. + 6 O (21.1) 


Estimates indicate that more than 2 x 10!! tons of carbon dioxide are fixed globally per 
year, of which one-third is fixed in the oceans, primarily by photosynthetic marine 
microorganisms. 

Although photosynthesis is traditionally equated with CO, fixation, light energy (or 
rather the chemical energy derived from it) drives all endergonic processes in phototrophic 
cells. The assimilation of inorganic forms of nitrogen and sulfur into organic molecules 
represents two other metabolic conversions closely coupled to light energy in green plants. 
Our previous considerations of aerobic metabolism (Chapters 18 through 20) presented 


'Of the remaining 99%, two-thirds is absorbed by the earth and oceans, thereby heating the planet; the 
remaining one-third is lost as light reflected back into space. 
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Ina sun-flecked lane, 

Beside a path where cattle trod, 

Blown by wind and rain, 

Drawing substance from air and sod; 
In ruggedness, it stands aloof, 

The ragged grass and puerile leaves, 
Lending a hand to fill the woof 

In the pattern that beauty weaves. 
What mystery this, hath been wrought; 
Beauty from sunshine, air, and sod! 
Could we thus gain the ends we sought- 
Tell us thy secret, Goldenrod. 


Rosa Staubus 
Oklahoma pioneer (1886-1966) 


Key Questions 


21.1 What Are the General Properties 
of Photosynthesis? 


21.2 How Is Solar Energy Captured 
by Chlorophyll? 
21.3 What Kinds of Photosystems Are 
Used to Capture Light Energy? 
21.4 What Is the Molecular Architecture 
of Photosynthetic Reaction Centers? 
21.5 What Is the Quantum Yield 
of Photosynthesis? 
21.6 How Does Light Drive 
the Synthesis of ATP? 
21.7 How Is Carbon Dioxide Used 
to Make Organic Molecules? 


21.8 How Does Photorespiration 
Limit CO, Fixation? 
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Figure 21.1 Electron micrograph of a representative 
chloroplast. 


cellular respiration (precisely the reverse of Equation 21.1) as the central energy-releasing 
process in life. It necessarily follows that the formation of hexose from carbon dioxide and 
water, the products of cellular respiration, must be endergonic. The necessary energy comes 
from light. Note that in the carbon dioxide fixation reaction described, light is used to drive 
a chemical reaction against its thermodynamic potential. 


21.1 | What Are the General Properties 
of Photosynthesis? 


Organisms capable of photosynthesis are very diverse, ranging from simple prokaryotic 
forms to the largest organisms of all, Sequoia gigantea, the giant redwood trees of 
California. Despite this diversity, we find certain generalities regarding photosynthesis. 


21.1la Photosynthesis Occurs in Membranes 


One universal attribute is that photosynthesis occurs in membranes. In photosynthetic 
prokaryotes, the photosynthetic membranes fill up the cell interior; in photosynthetic 
eukaryotes, the photosynthetic membranes are localized in large organelles known as 
chloroplasts (Figures 21.1 and 21.2). Chloroplasts are one member in a family of related 
plant-specific organelles known as plastids. Chloroplasts themselves show a range of 
diversity, from the single, spiral chloroplast that gives Spirogyra its name to the 
multitude of ellipsoidal plastids typical of higher plant cells (Figure 21.3). 

Characteristic of all chloroplasts, however, is the organization of the inner mem- 
brane system, the so-called thylakoid membrane. The thylakoid membrane is organized 
into paired folds that extend throughout the organelle, as shown in Figure 21.2. These 
paired folds, or lamellae, give rise to flattened sacs or discs, thylakoid vesicles (from the 
Greek thylakos, meaning “sack’’), which occur in stacks called grana. A single stack, or 
granum, may contain dozens of thylakoid vesicles, and different grana are joined by 
lamellae that run through the soluble portion, or stroma, of the organelle. Chloroplasts 
thus possess three membrane-bound aqueous compartments: the intermembrane 
space, the stroma, and the interior of the thylakoid vesicles, the so-called thylakoid 
space (also known as the thylakoid lumen). This third compartment serves an important 
function in the transduction of light energy into ATP formation. The thylakoid mem- 
brane has a highly characteristic lipid composition and, like the inner membrane of the 
mitochondrion, is impermeable to most ions and molecules. Chloroplasts, like their 
mitochondrial counterparts, possess DNA, RNA, and ribosomes and consequently 
display a considerable amount of autonomy. However, many critical chloroplast com- 
ponents are encoded by nuclear genes, so autonomy is far from absolute. 


James Dennis/Phototake 
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Figure 21.2 Schematic of an idealized chloroplast. 
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(b) 


Figure 21.3 Chloroplast structures. (a) The spiral chloroplasts (green) of Spirogyra, a fresh-water alga. 
(b) The chloroplasts of a higher plant cell are seen here as the dark, oval-shaped membrane-rich 
organelles distributed around the cell periphery. Note the multiple chloroplasts around the cell’s periphery. 


21.1b Photosynthesis Consists of Both Light Reactions 
and Dark Reactions 


If a chloroplast suspension is illuminated in the absence of carbon dioxide, oxygen is 
released. Furthermore, if the illuminated chloroplasts are now placed in the dark and 
supplied with CO;, net hexose synthesis can be observed (Figure 21.4). Thus, the 
release of oxygen can be temporally separated from CO, fixation and also has a light 
dependency that CO) fixation lacks. The light reactions of photosynthesis, of which 
O, evolution is only one part, are associated with the thylakoid membranes. In 
contrast, the light-independent reactions, or so-called dark reactions, notably CO, 
fixation, are located in the stroma. A concise summary of the photosynthetic process 
is that radiant electromagnetic energy (light) is transformed by a specific photochem- 
ical system located in the thylakoids to yield chemical energy in the form of reducing 
potential (NADPH) and high-energy phosphate (ATP). NADPH and ATP can then 
be used to drive the endergonic process of hexose formation from CO, by a series of 
enzymatic reactions found in the stroma (see Equation 21.3). 
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Chloroplast suspension 


Absence of CO, 


Og evolved CO, fixation 
into sugars 


Figure 21.4 The light-dependent and light-independent reactions of photosynthesis. 


21.1c Water Is the Ultimate Electron Donor 
for Photosynthetic NADP* Reduction 


In green plants, water serves as the ultimate electron donor for the photosynthetic 
generation of reducing equivalents. The reaction sequence 


2 H,O + 2 NADP* + x ADP + x P; ™ 
O, + 2 NADPH + 2H*+xATP+xH,O (21.2) 


describes the process, where nhv symbolizes light energy (n is some number of photons 
of energy hv, where A is Planck’s constant and v is the frequency of the light). Light 
energy is necessary to make the unfavorable reduction of NADP* by H,O (A@,’ = 
—1.136 V; AG” = +219 kJ/mol NADP") thermodynamically favorable. Thus, the light 
energy input, nhv, must exceed 219 kJ/mol NADP’. The stoichiometry of ATP forma- 
tion depends on the pattern of photophosphorylation operating in the cell at the time 
and on the ATP yield in terms of the chemiosmotic ratio, ATP/H‘, as we will see later. 
Nevertheless, the stoichiometry for the metabolic pathway of CO, fixation can be 


written with certainty: 
12 NADPH + 12 H+ + 18 ATP + 6 CO, + 11 H,O —> 
glucose-1-P + 12 NADP* + 18 ADP + 17 P; (21.3) 


A More Generalized Equation for Photosynthesis In 1931, a comparative study of pho- 
tosynthesis in bacteria led C. B. van Niel to a more general formulation of the overall 
reaction: 


co, + cHA 2 ý (CHO) + 2A + HO 214 
Hydrogen Hydrogen Reduced Oxidized 
acceptor donor acceptor donor 


In photosynthetic bacteria, HA is variously H2S (in photosynthetic green and purple 
bacteria (reaction (21.5)), isopropanol (in Rhodopseudomonas species, reaction (21.6)), 
or some similar oxidizable substrate. [(CH2O) symbolizes a carbohydrate unit.] 


CO, + 2 HS —> (CHO) + HO +28 (21.5) 


O 
| 
CO» + 2 CH;—CHOH—CH; —> (CH0) + HO + 2? CH,—C—CH; (21.6) 


In cyanobacteria and the eukaryotic photosynthetic cells of algae and higher plants, 
HbA is H,O, as implied earlier, and 2 A is O2. The accumulation of O, to constitute 21% 
of the earth’s atmosphere is the direct result of eons of global oxygenic photosynthesis. 
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21.2 | How Is Solar Energy Captured by Chlorophyll? 


Photosynthesis depends on the photoreactivity of chlorophyll. Chlorophylls are 
magnesium-containing substituted tetrapyrroles whose basic structure is reminiscent 
of heme, the iron-containing porphyrin (see Chapters 5 and 20). Chlorophylls differ 
from heme in a number of properties: Magnesium instead of iron is coordinated in the 
center of the planar conjugated ring structure; a long-chain alcohol, phytol, is esterified 
to a pyrrole ring substituent; and the methine bridge linking pyrroles HI and IV is sub- 
stituted and crosslinked to ring III, leading to the formation of a fifth five-membered 
ring, labeled V. The structures of chlorophyll a and b are shown in Figure 21.5a. 

Chlorophylls are excellent light absorbers because of their aromaticity. That is, 
they possess delocalized m electrons, whose orbitals lie above and below the planar 
ring structure. The energy differences between electronic states in these m orbitals cor- 
respond to the energies of visible light photons. When light energy is absorbed, an 
electron is promoted to a higher orbital, enhancing the potential for transfer of this 
electron to a suitable acceptor. Loss of such a photoexcited electron to an acceptor is 
an oxidation—-reduction reaction. The net result is the transduction of light energy 
into the chemical energy of a redox reaction. 


21.2a Chlorophylls and Accessory Light-Harvesting Pigments 
Absorb Light of Different Wavelengths 


The absorption spectra of chlorophylls a and b (Figure 21.5b) differ somewhat. Plants 
that possess both chlorophylls can harvest a wider spectrum of incident energy. Other 
pigments in photosynthetic organisms, so-called accessory light-harvesting pigments 
(Figure 21.6), increase the possibility for absorption of incident light (the light that 
strikes the chloroplast surface), of wavelengths not absorbed by the chlorophylls. 
Carotenoids and phycocyanobilins, like chlorophyll, possess many conjugated double 
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Figure 21.5 The (a) structures and (b) absorption 
spectra of chlorophyll a and b. The phytyl side chain 
of ring IV provides a hydrophobic tail to anchor the 
chlorophyll in membrane protein complexes. 
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Figure 21.6 Structures of representative accessory light-harvesting pigments in photosynthetic cells. 
(a) B-Carotene, an accessory light-harvesting pigment in leaves. (b) Phycocyanobilin, a blue pigment 
found in cyanobacteria. 


bonds and thus absorb visible light. Carotenoids have two primary roles in 
photosynthesis—light harvesting and photoprotection through destruction of reac- 
tive oxygen species that arise as byproducts of photoexcitation. 


21.2b The Light Energy Absorbed by Photosynthetic 
Pigments Has Several Possible Fates 


Each photon represents a quantum of light energy. A quantum of light energy 
absorbed by a photosynthetic pigment has four possible fates (Figure 21.7): 


1. Loss as heat. The energy can be dissipated as heat through redistribution into 
atomic vibrations within the pigment molecule. 

2. Loss as light. Energy of excitation reappears as fluorescence (light emission); a 
photon of fluorescence is emitted as the e~ drops down to a lower orbital. This 
fate is common only in saturating light intensities. For thermodynamic reasons, the 
photon of fluorescence has a longer wavelength and hence lower energy than the 
quantum of excitation. 

3. Resonance energy transfer. The excitation energy can be transferred by resonance 
energy transfer to a neighboring molecule if the energy level difference between the 
two corresponds to the quantum of excitation energy. In this process, the energy 
transferred raises an electron in the receptor molecule to a higher energy state as 
the photoexcited e~ in the original absorbing molecule returns to ground state. 
This so-called Förster resonance energy transfer is the mechanism whereby quanta 
of light falling anywhere within an array of pigment molecules can be transferred 
ultimately to specific photochemically reactive sites. 

4. Energy transduction. The energy of excitation, in raising an electron to a higher 
energy orbital, dramatically changes the standard reduction potential, @,’, of 
the pigment such that it becomes a much more effective electron donor. That 
is, the excited-state species, by virtue of having an electron at a higher energy level 
through light absorption, has become a more potent electron donor. Reaction of this 
excited-state electron donor with an electron acceptor situated in its vicinity leads 
to the transformation, or transduction, of light energy (photons) to chemical energy 
(reducing power, the potential for electron-transfer reactions). Transduction of light 
energy into chemical energy, the photochemical event, is the essence of photosynthesis. 
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Figure 21.7 Possible fates of the quantum of light energy absorbed by photosynthetic pigments. 


21.2c The Transduction of Light Energy into Chemical 
Energy Involves Oxidation—Reduction 


The diagram presented in Figure 21.8 illustrates the fundamental transduction of 
light energy into chemical energy (an oxidation—reduction reaction) that is the basis 
of photosynthesis. Chlorophyll (Chl) resides in a membrane in close association with 
molecules competent in e~ transfer, symbolized here as A and B. Chl absorbs a 


Figure 21.8 Model for light absorption by chlorophyll 
and transduction of light energy into an oxidation— 
reduction reaction. The scheme shows a photon of 
light initiating a series of one-electron transfer events. 
I: Photoexcitation of Chl creates Chl*. I: Electron 
transfer from Chl* to A yields oxidized Chl (Chl*) 
and reduced A (A`) MI: An electron-transfer pathway 
from A~ to NADP* leads to NADPH formation and 
restoration of oxidized A (A). IV: Chl* accepts an 
electron from B, restoring Chl and generating oxidized 
B (B+). V: B* is reduced back to B by an electron 
originating in H20. Water oxidation is the source of 
O; formation. 
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hemistry in History 


Photosynthesis 


Emmett Chapelle 


Dr. Emmett Chapelle (1925-2019) was an African 
American educated in the segregated public schools 
of Phoenix, AZ. After high school graduation 
in 1942, he enlisted in the U.S. Army and was 
wounded twice while serving with the 92nd Infantry 
Division in Italy. After the war, despite facing 
segregation and social adversity, he received an 
associate’s degree in electrical engineering from 
Phoenix College and a Bachelor of Science in 
Biology at the University of California, Berkeley 
in 1950. Following a teaching stint in Tennessee, he 
earned his Master of Biology from the University 
of Washington. 


During the early years of the space race, Dr. Chapelle 
was credited for the inclusion of single-celled algae on 
spacecrafts as highly efficient, potentially edible, and 
renewable oxygen sources. Dr. Chapelle also invented 
a method for the detection of ATP based on the 
light-emitting reaction of fireflies. In this reaction, the 
firefly enzyme luciferase acts on its substrate luciferin 
to produce light, provided ATP is present. All known 
organisms contain ATP, so he knew that he could use 
this reaction to test for the presence of life. His revo- 
lutionary method is still in use today for the detec- 
tion of microbes in food and water and the testing of 
extraterrestrial soils for microscopic life. 


hv 


Light-harvesting pigment 
(antenna molecules) 


Reaction 
center 


Figure 21.9 Schematic of a photosynthetic unit. 
The green disks represent individual chlorophyll 
molecules. 


photon of light, becoming activated to Chl* in the process. Electron transfer from 
Chl* to A leads to oxidized Chl (Chl-*, a cationic free radical) and reduced A (A7 in 
the diagram). Subsequent oxidation of A~ eventually culminates in reduction of 
NADP* to NADPH. The electron hole in oxidized Chl (ChI-*) is filled by transfer of 
an electron from B to Chl-*, restoring Chl and creating B*. B* is restored to B by an 
e donated by water. O, is the product of water oxidation. Note that the system is 
restored to its original state once NADPH is formed and H,O is oxidized. Proton 
translocations accompany these light-driven electron-transport reactions. Such H* 
translocations establish a chemiosmotic gradient across the photosynthetic mem- 
brane that can drive ATP synthesis. 


21.2d Photosynthetic Units Consist of Many Chlorophyll 
Molecules but Only a Single Reaction Center 


In the early 1930s, Robert Emerson and William Arnold investigated the relation- 
ship between the amount of incident light energy, the amount of chlorophyll pres- 
ent, and the amount of oxygen evolved by illuminated algal cells. Emerson and 
Arnold were seeking to determine the quantum yield of photosynthesis: the number 
of electrons transferred per photon of light. Their studies gave an unexpected result: 
When algae were illuminated with very brief light flashes that could excite every 
chlorophyll molecule at least once, only one molecule of O, was evolved per 2400 
chlorophyll molecules. This result implied that not all chlorophyll molecules are 
photochemically reactive, and it led to the concept that photosynthesis occurs in 
functionally discrete units. 

Chlorophyll serves two roles in photosynthesis. It is involved in light harvesting 
and the transfer of light energy to photoreactive sites by exciton transfer, and it par- 
ticipates directly in the photochemical events whereby light energy becomes chemical 
energy. A photosynthetic unit (Figure 21.9) can be envisioned as an antenna of several 
hundred light-harvesting chlorophyll molecules (green) plus a special pair of photo- 
chemically reactive chlorophyll a molecules called the reaction center (orange). The 
purpose of the vast majority of chlorophyll in a photosynthetic unit is to harvest 
light incident within the unit and funnel it, via resonance energy transfer, to the reac- 
tion center chlorophyll dimers that are photochemically active. Most chlorophyll 
thus acts as a large light-collecting antenna, and it is at the reaction centers that the 
photochemical event occurs. Oxidation of chlorophyll leaves a cationic free radical, 
Chl-*, whose properties as an electron acceptor have important consequences for 
photosynthesis. Note that the Mg?* ion does not change in valence during these 
redox reactions. 
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21.3 | What Kinds of Photosystems Are Used 


to Capture Light Energy? 


All photosynthetic cells contain some form of photosystem. Photosynthetic bacteria 
have only one photosystem; furthermore, they lack the ability to use light energy to 
split H2O and release O2. Cyanobacteria, green algae, and higher plants are oxygenic 
phototrophs because they can generate O, from water. Oxygenic phototrophs have two 
distinct photosystems: photosystem I (PSI) and photosystem II (PSII). Type I photo- 
systems use ferredoxins as terminal electron acceptors; type II photosystems use qui- 
nones as terminal electron acceptors. PSI is defined by reaction center chlorophylls 
with maximal red light absorption at 700 nm; PSII uses reaction centers that exhibit 
maximal red light absorption at 680 nm. The reaction center Chl of PSI is referred to 
as P700 because it absorbs light of 700 nm wavelength; the reaction center Chl of PSH 
is called P680 for analogous reasons. Both P700 and P680 are chlorophyll a dimers 
situated within specialized protein complexes. A distinct property of PSII is its role in 
light-driven O, evolution. Interestingly, the photosystems of photosynthetic bacteria 
are type II photosystems that resemble eukaryotic PSII more than PSI, even though 
these bacteria lack O.-evolving capacity. 


21.3a Chlorophyll Exists in Plant Membranes 
in Association with Proteins 


Isolated thylakoid membranes can be dissolved by detergent treatment, releasing 
complexes containing both chlorophylls and protein. These chlorophyll—protein com- 
plexes represent integral components of the thylakoid membrane, and their organiza- 
tion reflects their roles as either light-harvesting complexes (LHC), PSI complexes, or 
PSII complexes. All chlorophyll is apparently localized within these three macromo- 
lecular assemblies. 


21.3b PSI and PSII Participate in the Overall 
Process of Photosynthesis 


What are the roles of the two photosystems, and what is their relationship to each 
other? PSI provides reducing power in the form of NADPH. PSII splits water, produc- 
ing Op, and feeds the electrons released into an electron-transport chain that couples 
PSII to PSI. Electron transfer between PSII and PSI pumps protons for chemiosmotic 
ATP synthesis. As summarized by Equation 21.2, photosynthesis involves the reduc- 
tion of NADP* using electrons derived from water and activated by light, hv. ATP is 
generated in the process. The standard reduction potential for the NADPt/NADPH 
couple is —0.32 V. Thus, a strong reductant with an ©,’ more negative than —0.32 V is 
required to reduce NADP* under standard conditions. By similar reasoning, a very 
strong oxidant is required to oxidize water to oxygen because €5'(¢O,/H,0) is +0.82 V. 
Separation of the oxidizing and reducing aspects of Equation 21.2 is accomplished in 
nature by devoting PSI to NADP* reduction and PSII to water oxidation. PSI and PSII 
are linked via an electron-transport chain so that the weak reductant generated by PSII 
can provide an electron to reduce the weak oxidant side of P700 (Figure 21.10). Thus, 
electrons flow from HO to NADP*, driven by light energy absorbed at the reaction 


PSII PSI 
blue light < 680 nm red light 700 nm 
P680 P700 
Strong oxidant Weak reductant æ Weak oxidant Strong reductant 
AN >+0.8 V g. =0V é, =0.45V g, <-0.6V 
ie v J 
H,O Lo; ADP + P, —> ATP NADP* NADPH 


Figure 21.10 Roles of the two photosystems, 
PSI and PSII. 
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Ferredoxin (Fd) A generic term for small proteins 
possessing iron-sulfur clusters that participate in 
various electron-transfer reactions. 


centers. Oxygen is a by-product of the photolysis, literally “light-splitting,” of water. 
Accompanying electron flow is production of a proton gradient and ATP synthesis (see 
Section 21.6). This light-driven phosphorylation is termed photophosphorylation. 


21.3c The Pathway of Photosynthetic Electron 
Transfer Is Called the Z Scheme 


Photosystems I and II contain unique complements of electron carriers, and these 
carriers mediate the stepwise transfer of electrons from water to NADP*. When the 
individual redox components of PSI and PSII are arranged as an e~ transport chain 
according to their standard reduction potentials, the zigzag result resembles the letter 
Z laid sideways (Figure 21.11). The various electron carriers are indicated as follows: 
Mn complex is the manganese-containing oxygen-evolving complex; D is its e~ accep- 
tor and the immediate e~ donor to P680*; Q4 and Qs represent special plastoquinone 
molecules (see Figure 21.13), and PQ, the plastoquinone pool; Fe-S stands for the 
Rieske iron-sulfur center, and cyt f, cytochrome f. PC is the abbreviation for 
plastocyanin, the immediate e7 donor to P700*; and Fy, Fg, and Fy represent the 
membrane-associated ferredoxins downstream from Ap (a specialized Chl a) and A; 
(a specialized PSI quinone). Fd is the soluble ferredoxin pool that serves as the e~ 
donor to the flavoprotein (Fp), called ferredoxin-NADP* reductase (FNR), which 
catalyzes reduction of NADP* to NADPH. Cyt(b¢)y,(66)p symbolizes the cytochrome 
bs moieties of the cytochrome be f complex. 

The Z scheme of the linear flow of electrons from H,O to NADP* is accomplished 
by three membrane-spanning supramolecular complexes composed of intrinsic and 
extrinsic polypeptides (shown as shaded boxes bounded by solid black lines in Figure 
21.11). These complexes are the PSII complex, the cytochrome b,f complex, and the 
PSI complex. The PSII complex is aptly described as a light-driven water : plastoquinone 
oxidoreductase; it is the enzyme system responsible for photolysis of water, and as such, 
it contains the oxygen-evolving complex, or OEC. The OEC of PSII is a metal cluster 
containing four Mn atoms, one Ca atom, and five O atoms, with four associated H,O 
molecules. As P680 undergoes four cycles of light-induced oxidation, four protons and 
four electrons are removed from two of the water molecules and their O atoms are 
joined to form O,. A tyrosyl side chain of the PSII complex (see following discussion) 
mediates electron transfer between the MnyCaO; cluster and P680. The O>-evolving 
reaction requires Cl” ions in addition to the Mn,CaO; cluster. 

Protons (H*) accumulate in the chloroplast lumen when PSII catalyzes light-driven H,O 
oxidation. Further along the linear electron transfer pathway, additional protons are trans- 
located from the stroma into the lumen by the electron transfer reactions within the cyto- 
chrome baf complex. Light-driven accumulation of H* in the lumen is the driving force for 
ATP synthesis (Figure 21.11), and the process is thus termed photophosphorylation. 

An alternative to the Z-scheme linear electron flow (LEF) from water to NADP* occurs 
when electrons flow in a circle from photoexcited P700* back to itself via the cytochrome 
baf complex, filling the electron hole in P700*. In this cyclic electron flow (CEF; the dashed 
arrow in Figure 21.11), the electron from P700* passes primarily through Fd and FNR to 
reach the bef complex, which then catalyzes electron transfer into P700*. Of the three 
membrane-spanning supramolecular complexes, CEF requires just the participation of 
PSI and the cytochrome bgf complex. Since PSII is not involved, no O; is evolved, and since 
the electron from P700 returns to it, no NADP* is reduced. However, since electrons pass- 
ing through the baf complex cause the translocation of H* into the lumen, cyclic electron 
flow can drive ATP synthesis, a process called cyclic photophosphorylation. 


21.3d Oxygen Evolution Requires the Accumulation 
of Four Oxidizing Equivalents in PSII 


When isolated chloroplasts that have been held in the dark are illuminated with very 
brief flashes of light, O, evolution reaches a peak on the third flash and every fourth 
flash thereafter (Figure 21.12a). The oscillation in O, evolution dampens over repeated 
flashes and converges to an average value. These data are interpreted to mean that the 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


21.3 What Kinds of Photosystems Are Used to Capture Light Energy? 
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Figure 21.11 The Z scheme of photosynthesis. (a) The Z scheme is a diagrammatic representation of 
photosynthetic electron flow from H,O to NADP+. The energy relationships can be derived from the éo’ 
scale beside the Z diagram. Energy input as light is indicated by two broad arrows, one photon appearing 
in P680 and the other in P700. P680* and P700* represent photoexcited states. The three supramolecular 
complexes (PSI, PSII, and the cytochrome bef complex) are in shaded boxes. Proton translocations that 
establish the proton-motive force driving ATP synthesis are illustrated as well. (b) The functional 
relationships among PSII, the cytochrome bęf complex, PSI, and the photosynthetic CF;\CFy-ATP 
synthase within the thylakoid membrane. The functional relationships for photosynthetic electron 
transport through PSII, cytochrome bę f, and PSI are based on 4 photons (2 per photosystem) driving the 
passage of an electron pair from water to NADP+. ATP synthesis is based on an H*/ATP ratio of 14/3. 
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Figure 21.12 Oxygen evolution requires the 
accumulation of four oxidizing equivalents in PSII. 
(a) O: evolution after brief light flashes. (b) A simple 
model for the cycling of the PSII reaction center 
through five different oxidation states, Sọ to S4. One 
e` is removed photochemically at each light flash, 
moving the reaction center successively through S4, 
S2, S3, and S4. S4 decays spontaneously to Sp by 
oxidizing 2 H-O to Op. 
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Figure 21.13 The structures of plastoquinone A 
and its reduced form, plastohydroquinone (or 
plastoquinol). Plastoquinone A has nine isoprene 
units and is the most abundant plastoquinone in 
plants and algae. 


P680 reaction center complex cycles through five different oxidation states, numbered 
So to S4. One electron and one proton are removed photochemically in each step. 
When S, is attained, an O molecule is released (Figure 21.12b) as PSII returns to oxi- 
dation state Sọ and two new water molecules bind. (The reason the first pulse of Oy 
release occurred on the third flash [Figure 21.12a] is that the PSII reaction centers in 
the isolated chloroplasts were already poised at Sı reduction level.) 


21.3e Electrons Are Taken from H,O to Replace 
Electrons Lost from P680 


The events intervening between H2O and P680 involve D, the name assigned to a spe- 
cific protein tyrosine residue that mediates e~ transfer from H20 via the Mn complex 
to P680* (see Figure 21.11). The oxidized form of D is a tyrosyl free radical species, 
D-*. To begin the cycle, an exciton of energy excites P680 to P680*, whereupon P680* 
transfers an electron to a nearby Chl a molecule, which is the direct electron acceptor 
from P680*. This Chl a then reduces a molecule of pheophytin (Pheo in Figure 21.11). 
Pheophytin is like chlorophyll a, except two H+ replace the centrally coordinated 
Mg?" ion. Loss of an electron from P680* creates P680*, the electron acceptor for D. 
Electrons flow from Pheo via specialized molecules of plastoquinone (Q in Figure 
21.11), to a pool of plastoquinone (PQ) within the membrane. Because of its lipid 
nature, plastoquinone is mobile within the membrane and hence serves to shuttle elec- 
trons from the PSII supramolecular complex to the cytochrome bę f complex. Alter- 
nate oxidation-reduction of plastoquinone to its hydroquinone form involves the 
uptake of protons (Figure 21.13). The asymmetry of the thylakoid membrane is 
designed to exploit this proton uptake and release so that protons (H*) accumulate 
within the lumen of thylakoid vesicles, establishing an electrochemical gradient. Note 
that plastoquinone is an analog of coenzyme Q, the mitochondrial electron carrier 
(see Chapter 20). 


21.3f Electrons from PSII Are Transferred to PSI 
via the Cytochrome bsf Complex 


The cytochrome b¢f or plastoquinol: plastocyanin oxidoreductase is a large (210 kDa) 
multimeric protein possessing 26 transmembrane a-helices. This membrane protein 
complex is structurally and functionally homologous to the cytochrome bc; complex 
(Complex ITI) of mitochondria (see Chapter 20). It includes the two heme-containing 
electron transfer proteins for which it is named, as well as iron—sulfur clusters, which 
also participate in electron transport. The purpose of this complex is to mediate the 
transfer of electrons from PSII to PSI and to pump protons across the thylakoid 
membrane via a plastoquinone-mediated Q cycle, analogous to that found in mito- 
chondrial e~ transport (see Chapter 20). Cytochrome f (f from the Latin folium, 
meaning “foliage”) is a c-type cytochrome, with a reduction potential of +0.365 V. 
Cytochrome bę in two forms (low- and high-potential) participates in the oxidation 
of plastoquinol and the Q cycle of the bsf complex. Thus, the role of the cytochrome 
bf complex is directly analogous to the role of mitochondrial Complex III: Using 
the energy of an oxidation-reduction reaction to create a proton gradient across a 
membrane. The cytochrome bf complex also serves in the alternative cyclic electron 
flow pathway. 


21.3g Plastocyanin Transfers Electrons 
from the Cytochrome bsf Complex to PSI 


Plastocyanin (PC in Figure 21.11) is an electron carrier capable of diffusion along the 
inside of the thylakoid and migration in and out of the membrane, aptly suited to its 
role in shuttling electrons between the cytochrome bęf complex and PSI. Plastocyanin 
is a low-molecular-weight (10.4 kDa) protein containing a single copper atom. PC 
functions as a single-electron carrier (€,’ = +0.32 V) as its copper atom undergoes 
alternate oxidation—-reduction between the cuprous (Cu*) and cupric (Cu**) states. 
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Thus, the role of plastocyanin is directly analogous to the role of mitochondrial cyto- 
chrome c: It mediates electron transfer between two membrane protein complexes. 
PSI is a light-driven plastocyanin: ferredoxin oxidoreductase. When P700, the special- 
ized chlorophyll a dimer of PSI, is excited by light and oxidized by transferring its 
e` to an adjacent chlorophyll a molecule that serves as its immediate e~ acceptor, 
P700* is formed. (The standard reduction potential for the P700*/P700 couple is about 
+0.45 V.) P700* readily gains an electron from plastocyanin. 

The immediate electron acceptor for P700* is a special molecule of chlorophyll. This 
unique Chl a (Ao) rapidly passes the electron to a specialized quinone (4,), which in 
turn passes the e~ to the first in a series of membrane-bound ferredoxins. This Fd series 
ends with a soluble form of ferredoxin, Fd,, which serves as the immediate electron 
donor to the flavoprotein that catalyzes NADP* reduction, namely, ferredoxin; NADP* 
reductase. 


21.4 | What Is the Molecular Architecture 
of Photosynthetic Reaction Centers? 


What molecular architecture couples the absorption of light energy to rapid electron- 
transfer events, in turn coupling these e~ transfers to proton translocations so that 
ATP synthesis is possible? Part of the answer to this question lies in the membrane- 
associated nature of the photosystems. A major breakthrough occurred in 1984, when 
Johann Deisenhofer, Hartmut Michel, and Robert Huber reported the first X-ray 
crystallographic analysis of amembrane protein. To the great benefit of photosynthe- 
sis research, this protein was the reaction center from the photosynthetic purple bac- 
terium Rhodopseudomonas viridis. This research earned these three scientists the 1988 
Nobel Prize in Chemistry. 


21.4a The R. viridis Photosynthetic Reaction Center 
Is an Integral Membrane Protein 


R. viridis is a photosynthetic prokaryote with a single photosystem that resembles 
PSII (even though it lacks an OEC and the capacity to oxidize water). The reaction 
center (145 kDa) of the R. viridis photosystem is localized in the plasma membrane of 
these photosynthetic bacteria and is composed of four different polypeptides, desig- 
nated L (273 amino acid residues), M (323 residues), H (258 residues), and cytochrome 


Structure of the cyanobacterial cytochrome bef 
complex depicting the heme groups of cytochromes 
b6N, bep, and f (red) and the iron-sulfur clusters 
(blue; PDB id = 1BF5). The upper bundle of 
x-helices defines the transmembrane domain. 
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(a) (c) 


Cytochrome with 
4 heme groups 


Figure 21.14 The R. viridis reaction center (RC). (a) Diagram of the RC showing light activation and 
path of e~ transfer. (b) Molecular graphic of the R. viridis RC including M (yellow), L (blue), H 
(orange), and the cytochrome (green). (c) Deletion of the R. viridis RC protein chains reveals the 
spatial relationship between its heme, chlorophyll, and quinone prosthetic groups. The iron atom is 
represented by a sphere (PDB id = 1PRC). (d) The cytochrome subunit is membrane-associated via a 
diacylglycerol moiety on its N-terminal Cys residue. 


(333 amino acid residues) (Figure 21.14a). L and M each consist of five membrane- 
spanning a-helical segments; H has one such helix, the majority of the protein forming 
a globular domain in the cytoplasm (Figure 21.14b). The cytochrome subunit contains 
four heme groups; the N-terminal amino acid of this protein is cysteine. This cyto- 
chrome is anchored to the periplasmic face of the membrane via the hydrophobic 
chains of two fatty acid groups that are esterified to a glyceryl moiety joined via a 
thioether bond to the Cys (Figure 21.14d). L and M each bear two bacteriochlorophyll 
molecules (the bacterial version of Chl) and one bacteriopheophytin. L also has a 
bound quinone molecule, Q,. Together, L and M coordinate an Fe atom. The photo- 
chemically active species of the R. viridis reaction center, P870, is composed of two 
bacteriochlorophylls, one contributed by L and the other by M. 


21.4b Photosynthetic Electron Transfer by the R. viridis 
Reaction Center Leads to ATP Synthesis 


The prosthetic groups of the R. viridis reaction center (P870, BChl, BPheo, and the 
bound quinones) are fixed in a spatial relationship to one another that favors photo- 
synthetic e~ transfer (Figure 21.14a,c). Photoexcitation of P870 (creation of P870*) 
leads to e~ loss (P870*) via electron transfer to the nearby bacteriochlorophyll (BChl). 
Membrane The e is then transferred via the L bacteriopheophytin (BPheo) to Qa, which is also 

anchor an L prosthetic group. The corresponding site on M is occupied by a loosely bound 
quinone, Qg, and electron transfer from Qa, to Qg takes place. An interesting aspect of 
the system is that no electron transfer occurs through M, even though it has compo- 
nents apparently symmetric to and identical to the L e~ transfer pathway. 

The reduced quinone formed at the Qs site is free to diffuse to a neighboring cyto- 
chrome bc, membrane complex, where its oxidation is coupled to H* translocation and, 
hence, ultimately to ATP synthesis (Figure 21.15). 

Cytochrome c3, a periplasmic protein, serves to cycle electrons back to P870* via 
the four hemes of the reaction center cytochrome subunit. A specific tyrosine residue 
of L (Tyr'™) is situated between P870 and the closest cytochrome heme. This Tyr is the 
immediate e~ donor to P870* and completes the light-driven electron transfer cycle. 
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Figure 21.15 Photophosphorylation. Photoexcitation of the R. viridis RC leads to reduction of a 
quinone, Q, to form QH2. Oxidation of QH2 by the cytochrome bc; complex leads to H* 
translocation for ATP synthesis by the R. viridis F;)Fy-ATP synthase. 


21.4c The Molecular Architecture of PSII Resembles 
the R. viridis Reaction Center Architecture 


Some have argued that PSII is the most important enzyme system in the history of 
life. Its appearance in cyanobacteria about 2.5 billion years ago dramatically altered 
the trajectory of evolution. PSII accomplishes the light-driven formation of O, from 
two H,O, and it is directly responsible for all of the oxygen in the atmosphere. The 
availability of molecular oxygen for highly exergonic oxidative reactions allowed 
the evolution of energy-intensive organisms such as animals. So animals exist 
because of PSII. 

Type II photosystems of higher plants, green algae, and cyanobacteria contain 
more than 20 subunits and are considerably more complex than the R. viridis reaction 
center. Interestingly, both type II and type I photosystems show significant similarity 
to the R. viridis reaction center, thus establishing a strong evolutionary connection 
between reaction centers. 

PSII from the thermophilic cyanobacterium Thermosynechoccus elongatus is repre- 
sentative of Type II photosystems across all photosynthetic organisms, even higher 
plants. Such conservation fits well with the evolutionary origin of chloroplasts from 
cyanobacteria. PSII is a homodimeric structure. Each monomer has a mass of almost 
350 kDa and 23 different protein subunits, the 4 largest being the reaction center pair of 
subunits (D1 and D2) and 2 chlorophyll-containing inner antenna subunits (CP43 and 
CP47) that bracket D1 and D2 (Figure 21.16). Together, CP43 and CP47 have a total of 
26 Chl a molecules, and exciton energy is collected and transferred from them to P680. 
Collectively, the protein subunits in a PSII monomer have at least 34 transmembrane 
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Figure 21.16 Molecular architecture of the 

T. elongatus PSII dimer. (a) The arrow shows the 
path of electron transfer from P680* to Chlp; to 
Pheop; to Q4 on D2 and then, via the Fe atom, to 
Qr on D1. The Tyr!®! residue of D1, (Y,), is situated 
between P680 and the MnyCaO; cluster (OEC). 

(b) Structure of T. elongatus PSII (PDB id = 3BZ1), 
including chlorophylls of the reaction center and 
electron-transfer path (green), pheophytins (blue), 
and the OEC (brick red). (Adapted from Barber, J., 
2003. Photosystem II: The engine of life. Quarterly 
Review of Biophysics 36:71-89.) 


a-helical segments, 22 of which are found in the D1-D2-CP43-CP47 core structure. D1 
and D2 each have five membrane-spanning a-helices. Structurally and functionally, 
these two subunits are a direct counterpart of the L and M subunits of the R. viridis 
reaction center. P680 consists of a pair of Chl a molecules, with D1 and D2 each con- 
tributing one. D1 and D2 each have two other Chl a molecules, one near each P680 
(Chlp; and Chlp2, respectively) and another that interacts with CP43/CP47 (Chlz.p; and 
Chlz.p», respectively) (Figure 21.16). Two equivalents of pheophytin (Pheo) are located 
on D1 and D2. The tyrosine species D is Tyr'®! in the D1 amino acid sequence. Com- 
plexed to D2 is a tightly bound plastoquinone molecule, Q4. Electrons flow from P680* 
to Chlp; and on to Pheop;. Pheop, then transfers the electron to Q, on D2, where it then 
moves to a second plastoquinone situated in the Qg site on D1 (Figure 21.16). Electron 
transfer from Qa and Qs is assisted by the iron atom located between them. Each plas- 
toquinone that enters the Qg site accepts two electrons derived from water and two H* 
from the stroma before it is released into the membrane as the hydroquinone PQH). 
Thus, two photons are required to reduce each PQ that enters the Qg site. The stoichi- 
ometry of the electron transfer reaction catalyzed by PSII can be written as 


2H,O + 2 PQ + 4 hv —> O, + 2 PQH, 
Note, however, that the four protons added to the two PQ to form two PQH; arise 
from the stromal side of the photosynthetic membrane, as four protons released from 


the two water molecules appear on the lumenal side of the membrane. Net proton 
transfer across the membrane is | H*/Av for the PSII reaction. 


(a) 


Stroma 


Thylakoid 
lumen 


(b) 
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Electrons withdrawn from water are the source of the electrons needed to restore 
oxidized P680 (P680*) back to the P680 state for another cycle of light-driven electron 
transport. The oxidation of two HO molecules results in O, formation and is carried 
out by the oxygen evolving complex (OEC) located on the lumenal side of the PSII D1 
subunit. These electrons pass from the OEC to P680* via Tyr!®! (also known as Yz) of 
the D1 subunit. The water-binding site of the OEC is a cluster of four manganese, one 
calcium, and five oxygen atoms (Mn,4CaOs); an associated C17 ion is essential to water 
oxidation. Protons liberated upon oxidation of these H,O molecules are passed directly 
into the thylakoid lumen, where they contribute to the proton gradient generated by 
photosynthetic electron transport. 


21.4d How Does PSII Generate O, from H,O? 


PSII catalyzes what is probably the most thermodynamically difficult reaction in 
nature, the light-driven oxidation of water to molecular oxygen. The protons and 
electrons released in this reaction are used to reduce NADP* to NADPH and to 
establish a proton gradient across the photosynthetic membrane that can be tapped 
to drive chemiosmotic ATP synthesis (see Figure 21.20). Accumulation of molecular 
oxygen in the atmosphere as a byproduct of the photo-oxidation of water has trans- 
formed the planet since the evolutionary appearance of this reaction some 2 billion 
years ago in cyanobacteria. How does PSII evolve oxygen? 


The Structure of the Oxygen-Evolving Complex The architecture of the Thermo- 
synechococcus photosynthetic OEC reveals a large globular protein domain juxta- 
posed on the lumenal side of the D1 subunit of PSII (Figure 21.16). The active site of 
the OEC is identifiable by the Mn,CaO; cluster, as shown in Figure 21.17. This metal 
cluster is held by Glu!®, Asp*#?, His***, and His?” of the PSII D1 subunit and Glu**4 of 
CP43. The chloride ion (CI-) required for O, evolution is believed to be a Ca” ligand. 
Note also that Tyr'®! (Yz) is situated near the metal cluster, ideally poised to serve in 
electron transfer between HO and P680*. When four e~ have been removed from the 
cluster (one from each Mn atom) through e` transfer to PSII via Tyr'™, two H2O mol- 
ecules provide the four e~ needed to re-reduce the Mn atoms, and O; is evolved. One 
O atom in O, comes from the O5 atom of the cluster; the other one comes from either 
W2 or W3. The four H* released contribute to the proton gradient. 


Asp342 
His332 


Glu189 p 
His337 f 


His190 


wil 


Tyrl61 


Asp170 


729 


Figure 21.17 Structure of the PSII oxygen-evolving 
complex (OEC). Four Mn atoms (purple) and one 
Ca atom (green) form the water-splitting metal 
cluster of the OEC. Bridging O atoms are red. The 
four water molecules held by the MnyCaO; cluster 
(aqua) are labeled W1-W4. OS, the O atom shared 
by the Ca atom and Mn atoms 3 and 4, along with 
an O atom from either the W2 or W3 water 
molecule, give rise to the molecule of O, generated 
by the OEC. Those amino acid side chains of D1 
responsible for binding the MnyCaOs cluster 

are also shown (PDB id = 4UB6). (Adapted from 
Figure 2 in Suga, M., et al., 2015. Native structure 
of photosystem I at 1.95 A resolution viewed by 
femtosecond X-ray pulses. Nature 517:99-103.) 
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Figure 21.18 The molecular architecture of PSI. (a) Subunit organization. (Adapted from Golbeck, 
J. H., 1992. Structure and function of photosystem I. Annual Review of Plant Physiology and Plant 
Molecular Biology 43:293—324; and Fromme, P., Jordan, P., and Krausse, N., 2001. Structure of 
photosystem I. Biochimica et Biophysica Acta 1507:5—31.) (b) Molecular structure of PSI in a lipid 
(green) membrane, including PsaA (salmon), PsaB (white), chlorophylls (green), and iron-sulfur 
clusters (red; PDB id = 2WSC). 


21.4e The Molecular Architecture of PSI Resembles the R. viridis 
Reaction Center and PSII Architecture 


As with the T. elongatus PSII structure, the structure of T. elongatus PSI is representa- 
tive of Type I photosystems across all photosynthetic organisms that possess them, 
including representatives in the plant kingdom. These PSI structures complete our 
view of reaction center architecture and confirm the fundamental similarities in 
organization that exist in these energy-transducing integral membrane proteins. 
Because of direct correlations with eukaryotic PSI, this cyanobacterial PSI provides 
a general model for all P700-dependent photosystems. 

T. elongatus PSI exists as a cloverleaf-shaped trimeric structure. Each monomer 
(356 kDa) consists of 12 different protein subunits and 127 cofactors: 96 chlorophyll a 
molecules, 2 phylloquinones, 3 Fe,S, clusters, 22 carotenoids, and 4 lipids that are an 
intrinsic part of the protein complex (Figure 21.18). All of the electron-transferring 
prosthetic groups essential to PSI function are localized to just three polypeptides: 
PsaA, PsaB, and PsaC. PsaA and PsaB (83 kDa each) compose the reaction center 
heterodimer, a structural pattern now seen as universal in photosynthetic reaction cen- 
ters. PsaA and PsaB each have 11 transmembrane a-helices, with the 5 most C-terminal 
a-helices of each serving as the scaffold for the reaction center photosynthetic electron- 
transfer apparatus. PsaC interacts with the stromal face of the PsaA—PsaB heterodimer. 
PsaC carries two of the Fe,S, clusters, Fa and Fg, and interacts with PsaD. Together 
they provide a docking site for ferredoxin. The electron-transfer system of PSI consists 
of three pairs of chlorophyll molecules: P700 (a heterodimer of Chl a and an epimeric 
form, Chl a’) and two additional Chl a pairs (symbolized by Ap) that mediate e~ transfer 
to the quinone acceptor. The T. elongatus quinone acceptor (A) is phylloquinone (also 
known as vitamin K,). The Fx Fe,S, cluster bridges PsaA and PsaB; two of its four 
cysteine ligands come from PsaA, the other two from PsaB. Photochemistry begins with 
exciton absorption at P700, creating P700*, which has the most negative reduction 
potential known in nature. Almost instantaneous electron transfer and charge separa- 
tion (to form P700*:Ao_) is followed by transfer of the electron from Ap to Ay, then to 
F,, to Fg, and then on to a ferredoxin molecule located at the stromal side of the mem- 
brane. The positive charge at P700* and the e~ at F,/Fg represent a charge separation 
across the membrane, an energized condition created by light. Plastocyanin (or in cya- 
nobacteria, a lumenal cytochrome designated cytochrome cg) delivers an electron to fill 
the electron hole in P700*. 
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21.4f How Do Green Plants Carry Out Photosynthesis? 


Higher plant photosystems follow the structural and functional pattern first 
revealed in the bacterial RC and recapitulated in the cyanobacterial PSI and PSII. 
With new structural information for the higher plant PSI (from Pisum sativum, 
garden pea) and PSII (from spinach), the fundamental organization pattern for 
photosystems seen earlier is confirmed for higher plants. Further, the structure of 
a plant membrane protein supercomplex consisting of the PSI reaction center and 
its light-harvesting antenna LHC1 (light-harvesting complex 1) has been described. 
This supercomplex exists as a monomer composed of 16 distinct protein subunits 
and 232 prosthetic groups, including 156 chlorophylls, 2 phylloquinones, and 3 
Fe,S,4 clusters (Figure 21.19). The four LHC] subunits form an arc around one side 
of the PSI RC. A second light-harvesting complex (LHC2) binds to another side. 
This plant PSI system, like all photosystems, is remarkably efficient, showing a 
quantum efficiency of nearly one (one electron transferred per photon falling any- 
where within the supercomplex). The many Chl and other light-harvesting mole- 
cules of the supercomplex form an integrated network for highly efficient transfer 
of light energy into P700. Indeed, plant and algal photosystems I and II in nature 
always exist as supramolecular complexes, with the photosystem surrounded by its 
particular LHCs. The architecture of the LHCs around photosystems is variable 
from species to species. 


21.5 | What Is the Quantum Yield of Photosynthesis? 


The quantum yield of photosynthesis can be defined as the amount of product formed 
per equivalent of light input. In terms of exciton delivery to reaction center Chl dimers 
and subsequent e` transfer, the quantum yield of photosynthesis typically approaches 
the theoretical limit of one. The quantum yield of photosynthesis can also be expressed 


Figure 21.19 View of the plant PSI-LHC1 
supercomplex, from the stromal side of the thylakoid 
membrane, including ChI molecules (green) and 
carotenoids and lipids (red). The 16 protein subunits 
are shown as ribbon diagrams in the background, 
with the positions of the green plant-specific 
subunits PsaG, PsaH, and PsaK, as well as the 
LHC1-4 subunits indicated. (PDB id = 3LW5). 
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as the ratio of CO, fixed or O, evolved per photon absorbed. Interestingly, Figure 21.11b 
indicates an overall stoichiometry of seven H* contributed to the proton gradient 
across the thylakoid membrane for each pair of electrons moving from H,O to NADP*: 
two into the lumen upon oxidation of a water molecule, four into the lumen from oxi- 
dation of PQH, by the cytochrome bęf complex, and one H* disappearing from the 
stroma as NADP* is reduced to NADPH. More appropriately, 4 hv per photosystem 
(8 quanta) total would drive the evolution of 1 O2, the reduction of 2 NADP*, and a 
contribution of 14 H* to the proton gradient. 


21.5a The H*/ATP Ratio Determines ATP Yield per Quantum 


The H*/ATP ratio depends on the number of c-subunits in the F}Fy-ATP synthase pos- 
sessed by a cell or organism (chloroplast F;Fo-ATP synthase is termed CF\CFo-ATP 
synthase, C = chloroplast, see below). As we have seen, the number of c-subunits in 
ATP synthase can vary from organism to organism. Chloroplast CF\CFo-ATP syn- 
thase has 14 c-subunits; thus, 3 ATP are formed per 14 H*, so 3 ATP would be synthe- 
sized in chloroplasts from an input of 8 quanta. 

The energy of a photon depends on its wavelength, according to the equation E = hv = 
hclà, where E is energy, c is the speed of light, and A is its wavelength. Expressed in molar 
terms, an einstein is the amount of energy in Avogadro’s number of photons: E = Nhch. 
Light of 700 nm wavelength is the longest-wavelength and the lowest-energy light acting 
in the eukaryotic photosystems. An einstein of 700 nm light is equivalent in energy to 
approximately 170 kJ. Eight einsteins of this light, 1360 kJ, theoretically generate 2 moles 
of NADPH, 3 moles of ATP, and 1 mole of O2. 


21.5b Calculation of the Photosynthetic Energy Requirements 
for Hexose Synthesis Depends on H*/hv and ATP/H* Ratios 


The fixation of carbon dioxide to form hexose, the dark reactions of photosynthesis, 
requires considerable energy. The overall stoichiometry of this process (see Equation 21.3) 
involves 12 NADPH and 18 ATPs. Thus, the ATP/NADPH ratio for CO, fixation is 1.5. 
To generate 12 equivalents of NADPH and 18 equivalents of ATP necessitates the input 
of 48 einsteins of light, minimally 170 kJ each, or 8160 kJ, from which one mole of hexose 
would be synthesized. The standard free energy change, AG*’, for hexose formation from 
carbon dioxide and water (the exact reverse of cellular respiration) is +2870 kJ/mol. Note 
that many assumptions underlie these calculations, including assumptions about the 
ATP/H* ratio, the H*/e~ ratio, and ultimately, the relationship between quantum input 
and overall yields of NADPH and ATP. 


21.6 | How Does Light Drive the Synthesis of ATP? 


Light-driven ATP synthesis, or photophosphorylation, is a fundamental part of the 
photosynthetic process. The conversion of light energy to chemical energy results in 
electron-transfer reactions, which lead to the generation of reducing power (reduced 
quinones or NADPH). Coupled with these electron transfers, protons are driven 
across the thylakoid membranes from the stromal side to the lumenal side. These 
proton translocations occur in a manner analogous to the proton translocations 
accompanying mitochondrial electron transport that provide the driving force for 
oxidative phosphorylation (see Chapter 20). Figure 21.11 indicates that proton trans- 
locations can occur at a number of sites. For example, protons are produced in the 
thylakoid lumen upon photolysis of water by PSII. The oxidation—reduction events as 
electrons pass through the plastoquinone pool and the Q cycle are another source of 
proton translocations. The proton transfer accompanying NADP* reduction also can 
be envisioned as protons being taken from the stromal side of the thylakoid vesicle. 
The current view is that three protons are translocated for each electron that flows 
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from H2O to NADP‘. Because this electron transfer requires two photons, one falling 
at PSII and one at PSI, the overall yield is 1.5 protons per quantum of light. 


21.6a The Mechanism of Photophosphorylation Is Chemiosmotic 


The thylakoid membrane is asymmetrically organized, or sided, like the mitochon- 
drial membrane. It also shares the property of being a barrier to the passive 
diffusion of H* ions. Photosynthetic electron transport thus establishes an electro- 
chemical gradient, or proton-motive force, across the thylakoid membrane with the 
interior, or lumen, side accumulating H* ions relative to the stroma of the chloro- 
plast. Like oxidative phosphorylation, the mechanism of photophosphorylation is 
chemiosmotic. 

A proton-motive force of approximately —250 mV is needed to achieve ATP synthe- 
sis. This proton-motive force, Ap, is composed of a membrane potential, Aw, and a pH 
gradient, ApH (see Chapter 20). The proton-motive force is defined as the free energy 
difference, AG, divided by F, Faraday’s constant: 


Ap = AGIF = Ay — (2.3 RTIF)ApH (21.7) 


In chloroplasts, the value of Ay is typically —50 to —100 mV and the pH gradient is 
equivalent to about 3 pH units, so —(2.3 RT/#)ApH = —170 mV. This situation con- 
trasts with the mitochondrial proton-motive force, where the membrane potential 
contributes relatively more to Ap than does the pH gradient. 


21.6b CF,CF,—-ATP Synthase Is the Chloroplast Equivalent 
of the Mitochondrial FıFo-ATP Synthase 


The transduction of the electrochemical gradient into the chemical energy represented 
by ATP is carried out by the chloroplast ATP synthase, which is highly analogous to 
the mitochondrial F,Fy>-ATP synthase. The chloroplast enzyme complex is called 
CF,CF)—ATP synthase. Like the mitochondrial complex, CF,CFy—ATP synthase is a 
heteromultimer of a-, B-, y-, 6-, €-, a-, b-, and c-subunits (see Chapter 20), consisting 
of a knoblike structure some 9 nm in diameter (CF;) attached to a stalked base (CF) 
embedded in the thylakoid membrane. The mechanism of action of CF;CFo—ATP syn- 
thase in coupling ATP synthesis to the collapse of the pH gradient is similar to that of 
the mitochondrial ATP synthase described in Chapter 20. However, spinach chloro- 
plast CF,CFy) ATP synthase has 14 c-subunits in its Fp rotor, whereas the ATP synthase 
from the cyanobacterium Spirulina platensis has 15 c-subunits. Thus, one turn of the 
spinach Fo c-subunit ring would require 14 H+, and one turn of the S. platenis rotor, 
15. All proton-translocating ATP synthases have 3 ATP-synthesizing sites, so the H*/ 
ATP ratio for spinach chloroplasts is 14/3 or 4.67; the H*/ATP ratio for the S. platensis 
enzyme is 5. The mechanism of photophosphorylation is summarized schematically in 
Figure 21.20. The structure of the spinach chloroplast CF,;CF)—ATP synthase c-subunit 
assembly is shown in Figure 21.21. 


21.6c Photophosphorylation Can Occur in Either 
a Noncyclic or a Cyclic Mode 


Photosynthetic electron transport, which pumps H* into the thylakoid lumen, can 
occur in two modes, linear or cyclic, both of which lead to the establishment of a 
transmembrane proton-motive force. Thus, both modes are coupled to ATP synthesis 
and are considered alternative mechanisms of photophosphorylation, even though 
they are distinguished by differences in their electron-transfer pathways. Noncyclic 
photophosphorylation is illustrated in Figure 21.20, where electrons activated by 
quanta at PSII and PSI flow from H2O to NADP", with concomitant establishment of 
the proton-motive force driving ATP synthesis. Note that in noncyclic photophos- 
phorylation, O, is evolved and NADP* is reduced. 
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Figure 21.20 The mechanism of photophosphorylation. Photosynthetic electron transport 
establishes a proton gradient that is tapped by the CF,CF)-ATP synthase to drive ATP synthesis. 
The value, 4.7 H*/ATP, is the estimated ratio for the spinach chloroplast CF,CF)-ATP synthase. 


Figure 21.21 The c-subunit assembly of the spinach 
chloroplast CF;CF)—ATP synthase consists of a 
cylindrical array of 14 identical subunits. Each 
subunit reversibly binds an H* through 
protonation/deprotonation of its Glu“! side 

chain (PDB id = 2W5J). 


21.6d Cyclic Photophosphorylation Generates 
ATP but Not NADPH or O, 


In cyclic photophosphorylation, the electron hole in P700* created by electron loss 
from P700 is filled not by an electron derived from H,O via PSII but by a cyclic pathway 
in which the photoexcited electron returns ultimately to P700*, as described earlier in 
Section 21.3c and illustrated in Figure 21.22. In a representative higher plant (barley 
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Plastocyanin 


[Hordeum vulgare}), the 29-subunit NADH dehydrogenase-like (NDH) counterpart of 
FNR mediates cyclic electron transfer through association with PSI to form a super- 
complex composed of two PSI-LHC1 complexes per NDH. 

Proton translocations accompany these cyclic electron transfer events so that ATP 
synthesis can be achieved. In cyclic photophosphorylation, ATP is the sole product of 
energy conversion. No NADPH is generated, and because PSII is not involved, no oxy- 
gen is evolved. Cyclic photophosphorylation theoretically yields two H* per e~ (2 H*/ 
hv) from the operation of the cytochrome bę f complex. Thus, cyclic photophosphoryla- 
tion provides a mechanism for supplementing ATP production. Estimates indicate that 
cyclic photophosphorylation may contribute about 10% of total chloroplast ATP syn- 
thesis and thereby elevate the ATP/NADPH ratio. 


21.7 | How Is Carbon Dioxide Used to Make 
Organic Molecules? 


As we began this chapter, we saw that photosynthesis traditionally is equated with the 
process of CO, fixation, that is, the net synthesis of carbohydrate from CO). Indeed, 
the capacity to perform net accumulation of carbohydrate from CO, distinguishes the 
phototrophic (and autotrophic) organisms from heterotrophs. Although animals pos- 
sess enzymes capable of linking CO, to organic acceptors, they cannot achieve a net 
accumulation of organic material by these reactions. For example, fatty acid biosyn- 
thesis is primed by covalent attachment of CO, to acetyl-CoA to form malonyl-CoA. 
Nevertheless, this “fixed CO,” is liberated in the very next reaction, so no net CO, 
incorporation occurs. 

Elucidation of the pathway of CO) fixation represents one of the earliest applica- 
tions of radioisotope tracers to the study of biology. In 1945 Melvin Calvin and his 
colleagues at the University of California, Berkeley, were investigating photosyn- 
thetic CO, fixation in Chlorella. Using '*COs, they traced the incorporation of radio- 


Figure 21.22 The pathway of cyclic 
photophosphorylation by PSI. See 
text for a discussion of the H*/hu 
ratio. (Adapted from Arnon, D. I., 
1984. The discovery of photosynthetic 
phosphorylation. Trends in 
Biochemical Sciences 9:258-262.) 
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Figure 21.23 Molecular graphic of ribulose 


bisphosphate carboxylase. The enzyme consists of 
8 equivalents each of two subunits. Clusters of four 
small subunits (orange and red) are located at each 


end of the symmetric octamer formed by the 


L subunits (/ight and dark green). The active sites are 
revealed in the ribbon diagram by bound ribulose- 


1,5-bisphosphate (yellow; PDB id = 1RXO). 


active '4C into organic products and found that the earliest labeled product was 
3-phosphoglycerate (see Figure 17.14). Although this result suggested that the CO, 
acceptor was a two-carbon compound, further investigation revealed that, in reality, 
two equivalents of 3-phosphoglycerate were formed following addition of CO, to a 
five-carbon (pentose) sugar: 


CO, + 5-carbon acceptor —> [6-carbon intermediate] —> 
two 3-phosphoglycerates 


21.7a Ribulose-1,5-Bisphosphate Is the CO, Acceptor in CO, Fixation 


The five-carbon CO, acceptor was identified as ribulose-1,5-bisphosphate (RuBP), 
and the enzyme catalyzing this key reaction of CO) fixation is ribulose bisphosphate 
carboxylase/oxygenase, or, in the jargon used by workers in this field, rubisco. The 
name ribulose bisphosphate carboxylaseloxygenase reflects the fact that rubisco cata- 
lyzes the reaction of either CO) or, alternatively, O) with RuBP. Rubisco is found in 
the chloroplast stroma. It is a very abundant enzyme, constituting more than 15% of 
the total chloroplast protein. Given the preponderance of plant material in the bio- 
sphere, rubisco is probably the world’s most abundant protein. Rubisco in higher 
plants is a 550 kDa heteromultimeric (agg) complex consisting of eight identical large 
subunits (55 kDa) and eight small subunits (15 kDa) (Figure 21.23). The large subunit 
is the catalytic unit of the enzyme. It binds both substrates (CO, and RuBP) and Mg?* 
(a divalent cation essential for enzymatic activity). The small subunit modulates the 
catalytic efficiency of the enzyme, increasing kea more than 100-fold. The rubisco 
large subunit is encoded by a gene within the chloroplast DNA, whereas the small 
subunit is encoded by a multigene family in the nuclear DNA. Assembly of active 
rubisco heteromultimers occurs within chloroplasts following transit of the small 
subunit polypeptide across the chloroplast membrane. Unlike most proteins, the 
large subunits of rubisco do not readily fold into structures that spontaneously 
assemble with the small subunits to form the functional holoenzyme. Instead, forma- 
tion of LgSg rubisco molecules requires specific chaperones to achieve its hexadecam- 
eric final state. 


21.7b 2-Carboxy-3-Keto-Arabinitol Is an Intermediate 
in the Ribulose-1,5-Bisphosphate Carboxylase Reaction 


The addition of CO, to ribulose-1,5-bisphosphate results in the formation of an 
enzyme-bound intermediate, 2-carboxy-3-keto-arabinitol (Figure 21.24, II). This 
intermediate arises when CO, adds to the enediol intermediate generated from 
ribulose-1,5-bisphosphate. Hydrolysis of the C,—C; bond of the intermediate gener- 
ates two molecules of 3-phosphoglycerate. The CO, ends up as the carboxyl group of 
one of the two molecules. 
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Figure 21.24 The ribulose bisphosphate carboxylase reaction. Mg?* at the active site aids in 


stabilizing the 2,3-enediol transition state (I) for CO, addition and in facilitating the carbon-carbon 
bond cleavage that leads to product formation. 
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21.7c _Ribulose-1,5-Bisphosphate Carboxylase 
Exists in Inactive and Active Forms 


Rubisco exists in three forms: an inactive form, designated E; a carbamylated, but 
inactive, form, designated EC; and an active form, ECM, which is carbamylated and 
has Mg?* at its active sites as well. Carbamylation of rubisco takes place by addition 
of CO, to its Lys*°! e-NH, groups (to give e—NH— COO- derivatives). The CO, mol- 
ecules used to carbamylate Lys residues do not become substrates. The carbamylation 
reaction occurs spontaneously at slightly alkaline pH (pH 8). Carbamylation of 
rubisco completes the formation of a binding site for the Mg** that participates in the 
catalytic reaction. Once Mg?* binds to EC, rubisco achieves its active ECM form. 
Activated rubisco displays a Km for CO, of 10 to 20 uM. The relative abundance of 
CO, in the atmosphere is low, about 0.04%. The concentration of CO, dissolved in 
aqueous solutions equilibrated with air is about 10 uM. 

Substrate RuBP binds much more tightly to the inactive E form of rubisco 
(Kp = 20 nM) than to the active ECM form (Km for RuBP = 20 uM). Thus, RuBP is 
also a potent inhibitor of rubisco activity. Release of RuBP from the active site of 
rubisco is mediated by rubisco activase, a member of the AAA+ ATPase family of pro- 
teins involved in conformational remodeling of other proteins. Rubisco activase is a 
regulatory protein; it binds to E-form rubisco near its catalytic site and, in an ATP- 
dependent reaction, promotes the release of RuBP. Rubisco then becomes activated by 
carbamylation and Mg?* binding. Rubisco activase itself is activated in an indirect 
manner by light. Thus, light is the ultimate activator of rubisco. Even when activated, 
rubisco is a remarkably inefficient enzyme, catalyzing the carboxylation of only 2 to 5 
RuBP molecules per second. Thus, plants must produce large amounts of it to meet 
their CO, fixation needs. 


21.7d CO, Fixation into Carbohydrate Proceeds 
via the Calvin—Benson Cycle 


The immediate product of CO, fixation, 3-phosphoglycerate, must undergo a series of 
transformations before the net synthesis of carbohydrate is realized. Among carbohy- 
drates, hexoses (particularly glucose) occupy center stage. Glucose-1-P units are the 
building blocks for both cellulose and starch synthesis. These plant polymers constitute 
the most abundant organic material in the living world, and thus, the central focus on 
glucose-I-P as the ultimate end product of CO, fixation is amply justified. Also, sucrose 
(a-D-glucopyranosyl-(1—>2)-8-D-fructofuranoside), the major carbon form translocated 
out of leaves to other plant tissues, is synthesized from G-1-P. In nonphotosynthetic tis- 
sues, sucrose is metabolized via glycolysis and the TCA cycle to produce ATP. 

The set of reactions that transforms 3-phosphoglycerate into hexose is named the 
Calvin—Benson cycle (often referred to simply as the Calvin cycle) for its discoverers. 
The reaction series is indeed cyclic because not only must carbohydrate appear as an 
end product, but the five-carbon acceptor, RuBP, must be regenerated to provide 
for continual CO), fixation. Balanced equations that schematically represent this 
situation are 


6(1) + 6(5) — 12(3) 
12(3) —> 1(6) + 6(5) 
Net: 6(1) — 1(6) 


Each number in parentheses represents the number of carbon atoms in a compound, 
and the number preceding the parentheses indicates the stoichiometry of the reaction. 
Thus, 6(1), or 6 CO, condense with 6(5) or 6 RuBP to give 12 3-phosphoglycerates. 
These twelve 3-PG are the primary products of CO, fixation and they can be used for 
a wide variety of purposes. The Calvin cycle describes their fate as precursors to hex- 
ose synthesis, whereby the 12(3)s are rearranged to form one hexose 1(6) and regener- 
ate the six 5-carbon (RuBP) acceptors. 
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21.7e Rubisco and the Enzymes of the Calvin Cycle 
Serve Three Metabolic Purposes 


Rubisco and the Calvin cycle enzymes serve three important ends: 


1. Rubisco constitutes the predominant CO) fixation enzyme in nature. 
2. The Calvin cycle enzymes accomplish the reduction of 3-phosphoglycerate, the pri- 
mary product of CO, fixation, to glyceraldehyde-3-phosphate so that carbohydrate 


synthesis becomes feasible. 
3. The Calvin cycle enzymes catalyze reactions that transform three-carbon com- 
pounds into four-, five-, six-, and seven-carbon compounds. 


Most of the enzymes mediating the reactions of the Calvin cycle also participate 
in either glycolysis (see Chapter 18) or the pentose phosphate pathway (see Chapter 
22). The aim of the Calvin scheme is to account for hexose formation from 
3-phosphoglycerate. In the course of this metabolic sequence, the NADPH and ATP 
produced in the light reactions are consumed, as indicated earlier in Equation 21.3. 

The overall series of reactions starts with ribulose bisphosphate carboxylase cata- 
lyzing formation of 3-phosphoglycerate from CO, and RuBP and concludes with 
ribulose-5-phosphate kinase (also called phosphoribulose kinase), which forms RuBP 
(Figure 21.25 and Table 21.1). The carbon balance is given at the right side of 
Table 21.1. Several features of the reactions in this table merit discussion. Note that a 
total of 18 equivalents of ATP consumed in hexose formation are expended (reactions 
2 and 15): 12 to form 12 equivalents of 1,3-bisphosphoglycerate from 3-phosphoglycerate 
by a reversal of the normal glycolytic reaction catalyzed by 3-phosphoglycerate kinase 
and six to phosphorylate Ru-5-P to regenerate 6 RuBP. All 12 NADPH equivalents are 
used in reaction 3. Plants possess an NADPH-specific glyceraldehyde-3-phosphate 
dehydrogenase, which contrasts with its glycolytic counterpart in its specificity for 


NADP over NAD and in the direction in which the reaction normally proceeds. 


| Table 21.1 e| The Calvin Cycle Series of Reactions 


Reactions | through 15 constitute the cycle that leads to the formation of one equivalent of glucose. The enzyme catalyzing each step, a concise 
reaction, and the overall carbon balance are given. Numbers in parentheses show the numbers of carbon atoms in the substrate and product 
molecules. Prefix numbers indicate in a stoichiometric fashion how many times each step is carried out in order to provide a balanced net reaction. 


1. 
2. 
3. 


oflu Ni ns 


Net: 


Ribulose bisphosphate carboxylase: 6 CO, + 6 H,O + 6 RuBP —> 12 3-PG 
3-Phosphoglycerate kinase: 12 3-PG + 12 ATP —> 12 1,3-BPG + 12 ADP 


NADP*-glyceraldehyde-3-P dehydrogenase: 
12 1,3-BPG + 12 NADPH + 12 Ht —> 12 NADP* + 12 G-3-P + 12 P; 


. Triose-P isomerase: 5 G-3-P —> 5 DHAP 

. Aldolase: 3 G-3-P + 3 DHAP —-> 3 FBP 

. Fructose bisphosphatase: 3 FBP + 3 HO —> 3 F-6-P + 3 P; 
. Phosphoglucoisomerase: 1 F-6-P —> 1 G-6-P 

. Phosphoglucomutase: 1 G-6-P —> GLUCOSE-1-P 


The remainder of the pathway involves regenerating six RuBP acceptors (= 30 C) 
from the leftover two F-6-P (12 C), four G-3-P (12 C), and two DHAP (6 C). 


. Transketolase: 2 F-6-P + 2 G-3-P —> 2 Xu-5-P + 2 E-4-P 

. Aldolase: 2 E-4-P + 2 DHAP —> 2 sedoheptulose-1,7-bisphosphate (SBP) 
. Sedoheptulose bisphosphatase: 2 SBP + 2 HO —> 2 S-7-P + 2 P; 

. Transketolase: 2 S-7-P + 2 G-3-P —> 2 Xu-S-P + 2 R-5-P 

. Phosphopentose epimerase: 4 Xu-5-P —> 4 Ru-5-P 

. Phosphopentose isomerase: 2 R-5-P —> 2 Ru-5-P 

. Phosphoribulose kinase: 6 Ru-5-P + 6 ATP —> 6 RuBP + 6 ADP 

6 CO, + 18 ATP + 12 NADPH + 12 H+ + 11 HO —> glucose-1-P + 18 ADP + 17 P; + 12 NADP+ 


6(1) + 6(5) — 12(3) 
128) — 12(3) 
128) — 12(3) 


5(3) — 5(3) 
3(3) + 3(3) — 3(6) 
3(6) — 3(6) 
1(6) —> 1(6) 
1(6) —> 1(6) 


2(6) + 2(3) — 2(5) + 2(4) 
2(4) + 2(3) —> 2(7) 
AD — 2(7) 
2(7) + 2(3) — 4(5) 
4(5) — 4(5) 
2(5) —> 2(5) 
6(5) — 6(5) 
6(1) — 1(6) 
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Figure 21.25 The Calvin—Benson cycle of reactions. The line thickness and number (in a blue circle) 
associated with the arrow at each step indicates the number of molecules reacting in a turn of the 
cycle that produces one molecule of glucose. Reactions are numbered as in Table 21.1. 
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Figure 21.26 Light regulation of CO) fixation 
prevents a substrate cycle between cellular 
respiration and hexose synthesis by CO, fixation. 
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Figure 21.27 Light-induced pH changes in 
chloroplast compartments. These pH changes 
modulate the activity of key Calvin cycle enzymes. 


21.7f The Calvin Cycle Reactions Can Account for Net Hexose Synthesis 


When carbon rearrangements are balanced to account for net hexose synthesis, five of 
the glyceraldehyde-3-phosphate molecules are converted to dihydroxyacetone phos- 
phate (DHAP). Three of these DHAPs then condense with three glyceraldehyde-3-P 
via the aldolase reaction to yield three hexoses in the form of fructose-1,6-bisphosphate 
(Figure 21.25). (Recall that the AG?’ for the aldolase reaction in the glycolytic direc- 
tion is +23.9 kJ/mol. Thus, the aldolase reaction running in reverse 1n the Calvin cycle 
would be thermodynamically favored under standard-state conditions.) Taking one 
FBP to glucose-1-P, the desired product of this scheme, leaves 30 carbons, distributed 
as 2 fructose-6-phosphates, 4 glyceraldehyde-3-phosphates, and 2 DHAP. These 30 
Cs are reorganized into 6 RuBP by reactions 9 through 15. Step 9 and steps 12 through 
14 involve carbohydrate rearrangements like those in the pentose phosphate pathway 
(see Chapter 22). Reaction 11 is mediated by sedoheptulose-1,7-bisphosphatase. This 
phosphatase is unique to plants; it generates sedoheptulose-7-P, the seven-carbon 
sugar serving as the transketolase substrate. Likewise, phosphoribulose kinase carries 
out the unique plant function of providing RuBP from Ru-5-P (reaction 15). The net 
conversion accounts for the fixation of 6 equivalents of carbon dioxide into 1 hexose 
at the expense of 18 ATP and 12 NADPH. 


21.7g The Carbon Dioxide Fixation Pathway 
Is Indirectly Activated by Light 


Plant cells contain mitochondria and can carry out cellular respiration (glycolysis, 
the citric acid cycle, and oxidative phosphorylation) to provide energy in the dark. 
Futile cycling of carbohydrate to CO, by glycolysis and the citric acid cycle in one 
direction and CO; to carbohydrate by the CO; fixation pathway in the opposite direc- 
tion is thwarted through regulation of the Calvin cycle (Figure 21.26). In this regula- 
tion the activities of key Calvin cycle enzymes are coordinated with the output of 
photosynthesis. In effect, these enzymes respond indirectly to light activation. Thus, 
when light energy is available to generate ATP and NADPH for CO) fixation, the 
Calvin cycle proceeds. In the dark, when ATP and NADPH cannot be produced by 
photosynthesis, fixation of CO, ceases. The light-induced changes in the chloroplast, 
which regulate key Calvin cycle enzymes, include (1) changes in stromal pH, (2) 
generation of reducing power, and (3) Mg°* efflux from the thylakoid lumen. 


Light Induces pH Changes in Chloroplast Compartments As discussed in Section 21.6, illu- 
mination of chloroplasts leads to light-driven pumping of protons into the thylakoid lumen, 
which causes pH changes in both the stroma and the thylakoid lumen (Figure 21.27). The 
stromal pH rises, typically to pH 8. Because rubisco and rubisco activase are more active at 
pH 8, CO, fixation is activated as stromal pH rises. Fructose-1,6-bisphosphatase, ribulose- 
5-phosphate kinase, and glyceraldehyde-3-phosphate dehydrogenase all have alkaline pH 
optima. Thus, their activities increase as a result of the light-induced pH increase in 
the stroma. 


Light Energy Generates Reducing Power Illumination of chloroplasts initiates photo- 
synthetic electron transport, which generates reducing power in the form of reduced 
ferredoxin. In turn, Fd,.g leads to reduced thioredoxin via ferredoxin—thioredoxin 
reductase (FTR; Figure 21.28). Thioredoxin is a small (12 kDa) protein possessing in 
its reduced state a pair of sulfhydryls (—SH HS—), which upon oxidation form a 
disulfide bridge (—S—S—). Several Calvin cycle enzymes have pairs of cysteine resi- 
dues that are involved in a disulfide—-sulfhydryl transition between an inactive 
(—S—S—) and an active (—SH HS—) form. These enzymes include fructose- 
1,6-bisphosphatase (residues Cys! and Cys!”), NA DP* -malate dehydrogenase (residues 
Cys! and Cys!>), and ribulose-5-phosphate kinase (residues Cys! and Cys>>). Thus, light 
activates these key enzymes through this ferredoxin-dependent, thioredoxin-mediated 
pathway (Figure 21.28). 
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Light Induces Movement of Mg?* lons from the Thylakoid Vesicles into the Stroma When 
light-driven proton pumping across the thylakoid membrane occurs, a concomitant 
efflux of Mg?* ions from vesicles into the stroma is observed. This efflux of Mg?* 
somewhat counteracts the charge accumulation due to H* influx and is one reason 
why the membrane potential change in response to proton pumping is less in chloro- 
plasts than in mitochondria (see Equation 21.7). Both ribulose bisphosphate carboxyl- 
ase and fructose-1,6-bisphosphatase are Mg?*-activated enzymes, and Mg?* flux into 
the stroma as a result of light-driven proton pumping stimulates the CO, fixation 
pathway at these key steps. Activity measurements have indicated that fructose bis- 
phosphatase may be the rate-limiting step in the Calvin cycle. The recurring theme of 
fructose bisphosphatase as the target of the light-induced changes in the chloroplasts 
implicates this enzyme as a key point of control in the Calvin cycle. 


21.7h Protein-Protein Interactions Mediated by an Intrinsically 
Unstructured Protein Also Regulate Calvin-Benson 
Cycle Activity 


The 8.5 kDa chloroplast protein CP12 is an intrinsically unstructured protein (see 
Chapter 6) that interacts with ribulose-5-P kinase (RS5PK) and glyceraldehyde-3-P 
dehydrogenase (GAPDH) to form a complex. When complexed with CP12, RSPK and 
GAPDH are relatively inactive. Note that the first of these enzymes is ATP-depen- 
dent and the second is NADPH-dependent. Thus, formation of this complex con- 
strains the use of ATP and NADPH, the principal products of the light reactions of 
photosynthesis. Reduced thioredoxin, formed when PSI is illuminated, interacts with 
CP12, leading to the dissociation of the GAPDH-—CP12—R5PK complex. Uncom- 
plexed RS5PK and GAPDH are inherently more active, and operation of the Calvin- 
Benson cycle is enhanced. 


21.8 | How Does Photorespiration Limit CO, Fixation? 


As indicated in section 21.7a, ribulose bisphosphate carboxylase/oxygenase catalyzes 
an alternative reaction in which O, replaces CO, as the substrate added to RuBP 
(Figure 21.29a). The ribulose-1,5-bisphosphate oxygenase reaction diminishes plant 
productivity because it leads to loss of RuBP, the essential CO, acceptor. The Km for Oy 
in this oxygenase reaction is about 200 uM. Given the relative abundance of CO, and 
O, in the atmosphere and their relative Km values in these rubisco-mediated reactions, 
the ratio of carboxylase to oxygenase activity in vivo is about three or four to one. 

The products of ribulose bisphosphate oxygenase activity are 3-phosphoglycerate 
and phosphoglycolate. Dephosphorylation and oxidation convert phosphoglycolate to 
glyoxylate, the a-keto acid of glycine (Figure 21.29b). Transamination yields glycine. In 
mitochondria, two glycines from photorespiration are converted into one serine and 
CO). This step is the source of the CO, evolved during photorespiration. Transamina- 
tion of glyoxylate to glycine by the product serine yields hydroxypyruvate; reduction of 
hydroxypyruvate yields glycerate, which can be phosphorylated to 3-phosphoglycerate. 
3-Phosphoglycerate can fuel resynthesis of ribulose bisphosphate by the Calvin cycle 
(see Figure 21.25). 


Figure 21.28 The pathway for light-regulated 
reduction of Calvin cycle enzymes. Light-generated 
reducing power (Fda = reduced ferredoxin) 
provides e~ for reduction of thioredoxin (T) by 
FTR. 
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Figure 21.29 The oxygenase reaction of rubisco. 
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Other fates of phosphoglycolate are also possible, including oxidation to CO», with 
the released energy being dissipated as heat. Obviously, agricultural productivity is 
dramatically lowered by this phenomenon, which, because it is a light-related uptake 
of O, and release of COs, is termed photorespiration. As we shall see, certain plants, 
particularly tropical grasses, have evolved means to circumvent photorespiration. 
These plants are more efficient users of light for carbohydrate synthesis. 


21.8a Tropical Grasses Use the Hatch-Slack Pathway 
to Capture Carbon Dioxide for CO, Fixation 


Tropical grasses are less susceptible to the effects of photorespiration, as noted ear- 
lier. Studies using “CO, as a tracer indicated that the first organic intermediate 
labeled in these plants was not a three-carbon compound but a four-carbon com- 
pound. Hal Hatch and Roger Slack, two Australian biochemists, first discovered this 
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C-4 product of CO; fixation, and the C-4 pathway of CO, incorporation is named the 
Hatch-—Slack pathway after them. The C-4 pathway is not an alternative to the Calvin 
cycle series of reactions or even a net CO, fixation scheme. Instead, it functions as a 
CO) delivery system, carrying CO) from the relatively oxygen-rich surface of the leaf 
to interior cells, where oxygen is lower in concentration and, hence, less effective in 
competing with CO) in the rubisco reaction. Thus, the C-4 pathway is a means of 
avoiding photorespiration by sheltering the rubisco reaction in a cellular compart- 
ment away from high [O,]. The C-4 compounds serving as CO, transporters are 
malate or aspartate. 

Compartmentation of these reactions to prevent photorespiration involves the 
interaction of two cell types: mesophyll cells and bundle sheath cells. The mesophyll 
cells take up CO; at the leaf surface, where O, is abundant, and use it to carboxylate 
phosphoenolpyruvate to yield OAA in a reaction catalyzed by PEP carboxylase 
(Figure 21.30). This four-carbon dicarboxylic acid is then either reduced to malate by 
an NADPH-specific malate dehydrogenase or transaminated to give aspartate in the 
mesophyll cells. The 4-C CO, carrier (malate or aspartate) is then transported 
to the bundle sheath cells, where it is decarboxylated to yield CO, and a 3-C 
product. The CO; is then fixed into organic carbon by the Calvin cycle localized within 
the bundle sheath cells, and the 3-C product is returned to the mesophyll cells, where it 
is reconverted to PEP in preparation to accept another CO) (Figure 21.30). Plants that 
use the C-4 pathway are termed C4 plants, in contrast to those plants with the conven- 
tional pathway of CO, uptake (C3 plants). 


Intercellular Transport of Each CO, via a C-4 Intermediate Costs 2 ATPs The transport of 
each CO, requires the expenditure of two high-energy phosphate bonds. The energy 
of these bonds is expended in the phosphorylation of pyruvate to PEP (phosphoenol- 
pyruvate) by the plant enzyme pyruvate-P; dikinase; the products are PEP, AMP, and 
pyrophosphate (PP;). This represents a unique phosphotransferase reaction in that 
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Figure 21.30 Essential features of the 
compartmentation and biochemistry of the 
Hatch-Slack pathway of carbon dioxide uptake 
in C4 plants. 
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both the B- and y-phosphates of a single ATP are used to phosphorylate the two sub- 
strates, pyruvate and P;. The reaction mechanism involves an enzyme phosphohisti- 
dine intermediate. The y-phosphate of ATP is transferred to P;, whereas formation of 
E-His-P occurs by addition of the B-phosphate from ATP: 


E—His + AMP,—P,g—P, + P; —> E—His—P, + AMP, + P,P; 
E—His—P, + pyruvate —> PEP + E—His 
Net: ATP + pyruvate + P; —> AMP + PEP + PP; 


Pyruvate-P; dikinase is regulated by reversible phosphorylation of a threonine resi- 
due, the nonphosphorylated form being active. Interestingly, ADP is the phosphate 
donor in this interconvertible regulation. Despite the added metabolic expense of two 
phosphodiester bonds for each equivalent of carbon dioxide taken up, CO; fixation is 
more efficient in C4 plants, provided that light intensities and temperatures are both 
high. (As temperature rises, photorespiration in C3 plants rises and efficiency of CO, 
fixation falls.) Tropical grasses that are C4 plants include sugarcane, maize, and crab- 
grass. In terms of photosynthetic efficiency, cultivated fields of sugarcane represent 
the pinnacle of light-harvesting efficiency. Approximately 8% of the incident light 
energy on a sugarcane field appears as chemical energy in the form of CO) fixed into 
carbohydrate. This efficiency compares dramatically with the estimated photosyn- 
thetic efficiency of 0.2% for uncultivated plant areas. Research on photorespiration is 
actively pursued in hopes of enhancing the efficiency of agriculture by controlling this 
wasteful process. Synthetic biology research focused on dramatically increasing plant 
productivity would help provide enough food for the growing global population. Such 
initiatives include (1) improving rubisco’s catalytic activity and specificity for CO2»; (2) 
metabolic engineering plants to limit the deleterious effects of photorespiration; and 
(3) the transformation of C3 plants into C4 plants. Rice, a very important food source 
for much of humanity, is a C3 plant. Converting rice to a C4 plant would significantly 
enhance its yield. 


21.8b Cacti and Other Desert Plants Capture CO, at Night 


In contrast to C4 plants, which have separated CO, uptake and fixation into distinct cells 
to minimize photorespiration, succulent plants native to semiarid and tropical environ- 
ments separate CO, uptake and fixation in time. Carbon dioxide (as well as O3) enters 
the leaf through microscopic pores known as stomata, and water vapor escapes from 
plants via these same openings. In nonsucculent plants, the stomata are open during the 
day, when light can drive photosynthetic CO, fixation, and closed at night. Succulent 
plants, such as the Cactaceae (cacti) and Crassulaceae, cannot open their stomata during 
the heat of day because any loss of precious H,O in their arid habitats would doom 
them. Instead, these plants open their stomata to take up CO, only at night, when tem- 
peratures are lower and water loss is less likely. This carbon dioxide is immediately incor- 
porated into PEP to form OAA by PEP carboxylase; OAA is then reduced to malate by 
malate dehydrogenase and stored within vacuoles until morning. During the day, the 
malate is released from the vacuoles and decarboxylated to yield CO, and a 3-C product. 
The CO, is then fixed into organic carbon by rubisco and the reactions of the Calvin 
cycle. Because this process involves the accumulation of organic acids (OAA, malate) 
and is common to succulents of the Crassulaceae family, it is referred to as crassulacean 
acid metabolism, and plants capable of it are called CAM plants. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 21. 


The vast majority of energy used by living organisms can be traced to 
solar energy captured by photosynthesis. 


Photosynthesis is often expressed in terms of CO, fixation into 
carbohydrates. 


The capture of light energy in photosynthesis occurs in membranes. 


Chloroplasts are plant organelles devoted principally to photosyn- 
thesis. 


The capture of light energy during photosynthesis is localized to the 
inner membrane of chloroplasts, the thylakoid membrane. 


Chloroplasts possess an inner compartment not found in mitochon- 
dria, the thylakoid lumen. 


Photosynthesis consists of light-dependent (light) reactions and light- 
independent (dark) reactions. 


The light reactions are responsible for O, evolution, NADPH 
reduction, and ATP synthesis and are associated with the chloroplast 
thylakoid membranes. 


The dark reactions carry out the ATP- and NADPH-dependent 
process of CO) fixation and occur in the stroma of the chloroplast. 


Water is the ultimate electron donor for photosynthetic NADP* 
reduction. 


All of the Ozin the earth’s atmosphere was produced by photosynthesis. 
Photosynthesis depends on the photoreactivity of chlorophyll (Chl). 


When light energy is absorbed by Chl, an electron in this pigment 
is excited to a higher orbital, making the loss of this electron by an 
oxidation-reduction reaction more likely. 


Chlorophylls and accessory light-harvesting pigments absorb light of 
differing wavelengths, allowing photosynthetic organisms to capture 
energy across the visible light spectrum. 


Light energy absorbed by photosynthetic pigments has several pos- 
sible fates—toss as heat, loss through fluorescence, resonance energy 
transfer to a neighboring pigment molecule, or transduction into the 
chemical energy of an oxidation-reduction reaction. 


Photosynthetic units consist of many Chl molecules but only a single 
reaction-center Chl dimer. 


Oxygenic phototrophs have two Chl-based photosystems, PSI and 
PSII. 


P700 is the PSI reaction-center Chl dimer. 
P60 is the PSII reaction-center Chl dimer. 
Chl exists in photosynthetic membranes in association with proteins. 


The Chl-containing protein complexes are PSI, PSII, and light- 
harvesting complexes (LHCs). 


PSI and PSII participate in photosynthesis, with PSII catalyzing 
the light-driven oxidation of water to O, and the generation of a 
weak reductant, and PSI catalyzing the light-driven reduction of 
NADP". 


Together, PSI and PSII carry out the light-driven transfer of electrons 
from H20 to NADP", a process known as linear electron flow (LEF). 


Cyclic electron flow (CEF) in photosynthesis occurs when the electron 


lost from light-activated P700* returns to it via the cytochrome bef 


complex. 


Transfer of reducing equivalents through the cytochrome bef com- 
plex leads to proton translocations that contribute to ATP synthesis. 


Synthesis of ATP by means of a proton gradient established by light- 
driven electron transport is called photophosphorylation. 


The pathway of photosynthetic electron transfer is called the 
Z scheme. 


Three membrane-spanning supramolecular complexes achieve the 
Z scheme of electron transfer: PSII, cytochrome bsf and PSI. 


Plastoquinone and plastocyanin mediate the transfer of electrons 
from PSII to cytochrome bęf and from cytochrome bęf to PSI, 
respectively. 


O, evolution requires the accumulation of four oxidizing equivalents 
in PSII. 


Electrons taken from H,O replace electrons lost upon photo- 
oxidation of P680. 


X-ray crystallographic solution of the Rhodopseudomonas viridis 
reaction center (RC) established the molecular architecture of reac- 
tion centers. 


The R. viridis RC is an integral membrane protein containing a 
reaction-center bacteriochlorophyll dimer (P870), bacteriochloro- 
phyll, bacteriopheophytin, quinone, cytochromes, and a Fe atom. 


The molecular architectures of PSII and PSI resemble the molecular 
architecture of the R. viridis RC. 


The oxygen-evolving complex (OEC) of PSII possesses four Mn?*, 
one Ca?*, and five O atoms. 


The plant PSI complex has a quantum efficiency that is near-perfect: 
Essentially every photon of light absorbed leads to the transfer of an 
electron. 


The quantum yield of photosynthesis is expressed in terms of 
O- evolution, H* translocation, NADP” reduction and ATP synthesis. 


The photosynthetic energy requirements for hexose synthesis depend 
on the H*/hv and the ATP/H* ratios. 


The mechanism of photophosphorylation, like oxidative phosphory- 
lation, is chemiosmotic. 


CF,CFp-ATP synthase is the chloroplast equivalent of mitochondrial 
F,Fo-ATP synthase. 


The chloroplast ATP synthase c-subunit rotor has 14 c-subunits, so 
the H*/ATP ratio for ATP synthesis in chloroplasts = 14/3 = 4.67. 


Photophosphorylation can occur in either a noncyclic or a cyclic 
mode. 


Noncyclic photophosphorylation yields ATP, Oo, and NADPH. 
Cyclic photophosphorylation yields only ATP. 


Ribulose-1,5-bisphosphate is the CO, acceptor in the Calvin—Benson 
carbon dioxide fixation pathway. 


Ribulose bisphosphate carboxylase/oxygenase (rubisco) catalyzes the 
formation of two 3-phosphoglycerate molecules from ribulose-1,5- 
bisphosphate + CO2. 


2-Carboxy-3-keto-arabinitol is the key intermediate in the rubisco 
reaction. 


Rubisco exists in an inactive state that is activated through 
carbamoylation of Lys™%!, coordination of Mg?*, and removal of 
tightly bound substrate by rubisco activase. 
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The Calvin—Benson cycle accounts for the vast preponderance of CO 
fixation in the biosphere, carbohydrate synthesis, and the production 
of four-, five-, six-, and seven-carbon sugars. 


The conversion of 6 CO; to | hexose requires 12 NADPH and 18 ATP. 


The Calvin—Benson cycle is indirectly activated by light through light- 
driven pH changes in the chloroplast compartments, reduction of 
disulfides in key enzymes to pairs of —SH groups, and Mg?* move- 
ment from the thylakoid lumen into the stroma. 


Protein-protein interactions mediated by an intrinsically unstruc- 
tured protein (IUP) also regulate the Calvin—Benson cycle activity. 


Photorespiration limits CO; fixation through the wasteful consump- 
tion of RuBP by the oxygenase activity of rubisco. 


Tropical grasses use the Hatch—Slack pathway to capture carbon 
dioxide for CO; fixation. 


In tropical grasses, carbon dioxide capture occurs in mesophyll cells 
and fixation occurs in bundle sheath cells. 


Pyruvate-P; kinase is a key enzyme in PEP synthesis for CO, capture. 


Cacti and other succulent plants of arid climates capture CO, at 
night. 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. 


2+ 


Mg?* in Chlorophyll Give a possible explanation as to why Mg?*, and 


not Fe?*, is in the center of chlorophyll? 


. P700* Has the Most Negative Standard Reduction Potential Found 


in Nature In photosystem I, P700 in its ground state has an 

Éo = +0.4 V. Excitation of P700 by a photon of 700-nm light alters 
the €,’ of P700* to —0.6 V. What is the efficiency of energy capture in 
this light reaction of P700? 


. PSII Has the Most Positive Standard Reduction Potential Found in 


Nature What is the ©,’ for the light-generated primary oxidant of 
photosystem II if the light-induced oxidation of water (which leads to 
O- evolution) proceeds with a AG?’ of —25 kJ/mol? 


. Equating the Energy of an Oxidation-Reduction Potential Difference 


and the Energy for ATP Synthesis (Integrates with Chapters 3 and 20.) 
Assuming that the concentrations of ATP, ADP, and P; in chloro- 
plasts are 3, 0.1, and 10 mM, respectively, what is the AG for ATP 
synthesis under these conditions? Photosynthetic electron transport 
establishes the proton-motive force driving photophosphorylation. 
What redox potential difference is necessary to achieve ATP synthesis 
under the foregoing conditions, assuming 1.3 ATP equivalents are 
synthesized for each electron pair transferred? 


. The Q Cycle in Photosynthesis (Integrates with Chapter 20.) Write a 


balanced equation for the Q cycle as catalyzed by the cytochrome bef 
complex of chloroplasts. 


. The Relative Efficiency of ATP Synthesis in Noncyclic Versus Cyclic 


Photophosphorylation If noncyclic photosynthetic electron transport 
leads to the translocation of 7 H*/2e~ and cyclic photosynthetic 
electron transport leads to the translocation of 2 H*/e~, what is the 
relative photosynthetic efficiency of ATP synthesis (expressed as the 
number of photons absorbed per ATP synthesized) for noncyclic 
versus cyclic photophosphorylation? (Assume that the CF;CF)-ATP 
synthase yields 3 ATP/14 H+.) 


. ApH and Ax in the Chloroplast Proton-Motive Force (Integrates with 


Chapter 20.) In mitochondria, the membrane potential (Aw) 
contributes relatively more to Ap (proton-motive force) than does the 
pH gradient (ApH). The reverse is true in chloroplasts. Why do you 
suppose that the proton-motive force in chloroplasts can depend more 
on ApH than mitochondria can? Why is (Aw) less in chloroplasts than 
in mitochondria? 


. The Role of Tyr’ in PSII Predict the consequences of a Y161F 


mutation in the amino acid sequence of the D1 subunit of PSII. 


. H*/ATP Ratio in Photosynthetic Bacteria (Integrates with Chapter 20.) 


Calculate (in einsteins and in kJ/mol) how many photons would be 
required by the Rhodopseudomonas viridis photophosphorylation 
system to synthesize 3 ATPs. (Assume that the R. viridis FiF -ATP 


10. 


Tl. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


synthase c-subunit rotor contains 12 c-subunits and that the R. viridis 
cytochrome bc, complex translocates 2 H*/e7.) 


The Standard-State Free Energy Change for NADP-Glyceraldehyde-3-P 
Dehydrogenase (Integrates with Chapters 18 and 20.) Calculate A G°’ 
for the NADP*-specific glyceraldehyde-3-P dehydrogenase reaction 
of the Calvin—Benson cycle. 


Photosynthesis of Glucose Write a balanced equation for the 
synthesis of a glucose molecule from ribulose-1,5-bisphosphate and 
CO) that involves the first three reactions of the Calvin cycle and 
subsequent conversion of the two glyceraldehyde-3-P molecules into 
glucose. 


Tracing the Fate of CO, During Photosynthesis '*C-labeled carbon 
dioxide is administered to a green plant, and shortly thereafter the 
following compounds are isolated from the plant: 3-phosphoglycerate, 
glucose, erythrose-4-phosphate, sedoheptulose-1,7-bisphosphate, and 
ribose-5-phosphate. In which carbon atoms will radioactivity be 
found? 


Fate of C'8O, in the Calvin-Benson Cycle If the Calvin-Benson cycle 
proceeded in the presence of C!ŝO, (assuming both oxygens are 
labeled), in what intermediates or products would the oxygen isotopes 
appear? 


ROS and Photosynthesis Is it possible for reactive oxygen species 
(ROS) to be generated through the photosynthesis process? 


Regulation of CO, Fixation During Photosynthesis The photosynthetic 
CO, fixation pathway is regulated in response to specific effects 
induced in chloroplasts by light. What is the nature of these effects, 
and how do they regulate this metabolic pathway? 


Uncoupling in Chloroplasts? What would happen if the photosyn- 
thetically driven proton translocation was uncoupled from ATP 
synthesis in chloroplasts? 


The Formation of Phosphoglycerate During Photorespiration Write a 
balanced equation for the conversion of phosphoglycolate to 
glycerate-3-P by the reactions of photorespiration. Does this balanced 
equation demonstrate that photorespiration is a wasteful process? 


The Source of the Oxygen Atoms in Photosynthetic O, Evolution 
The overall equation for photosynthetic CO, fixation is 


6 CO; +6 H:O === C6H1206 +6 (07 


All the O atoms evolved as O2 come from water; none come from 
carbon dioxide. But 12 O atoms are evolved as 6 O», and only 

6 O atoms appear as 6 H,O in the equation. Also, 6 CO, have 12 O 
atoms, yet there are only 6 O atoms in C6H1206. How can you account 
for these discrepancies? (Hint: Consider the partial reactions of 
photosynthesis: ATP synthesis, NADP* reduction, photolysis of 
water, and the overall reaction for hexose synthesis in the Calvin— 
Benson cycle.) 
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19. Alternative Oxygen-Evolving Oxidation Reactions Consider the 
following alternative oxygen-evolving redox reactions: 
NO; ——> NO, + O, and S0, — SO;- + 0O: 
a. Balance these redox reactions. 
b. Explain why NO,- and SO,?~ would be poor substitutes for H»O 
in the photosynthesis process. 


20. The Influence of c-Subunit Stoichiometry on the Efficiency of ATP 
Synthesis The number of c-subunits in F|Fo-type ATP synthases 
shows some variation from organism to organism. For example, the 
yeast ATP synthase contains 10 c-subunits, the spinach CF;CFy-ATP 
synthase has 14, and the cyanobacterium Spirulina platensis enzyme 
apparently has 15. 

a. What is the consequence of c-subunit stoichiometry for the 
H‘*/ATP ratio? 

b. What is the relationship between c-subunit stoichiometry and the 
magnitude of Ap (the proton-motive force)? 


21. The Energetics of Hydroquinone Reduction by Photosynthetic Reaction 
Centers The reduction of membrane-associated quinones, such as 
coenzyme Q and plastoquinones, is a common feature of photosys- 
tems (see Figures 21.15, 21.20, and 21.22). Assume ©,’ for 
PQ/PQH, = 0.07 V and the potential of the ground-state chlorophyll 
molecule = 0.5 V, calculate AG for the reduction of plastoquinone by 
a. 870 nm light. 

b. 700 nm light. 


c. 680 nm light. 

22. The Energetics of Proton Translocation by the Cytochrome b; f Complex 
Plastoquinone oxidation by cytochrome bc, and cytochrome bef 
complexes apparently leads to the translocation of 4*/2e~. If €,’ for 
cytochrome f = 0.365 V (Table 3.3) and E,’ for PQ/PQH; = 0.07 V, 
calculate AG for the coupled reaction: 


2 hv + 4 Hn — 4 Hoot 
(Assume a value of 23 kJ/mol for the free energy change (AG) 
associated with moving protons from inside to outside.) 


23. The Overall Free Energy Change for Photosynthetic NADP* Reduction 
Calculate the overall free energy change (AG) for noncyclic photosyn- 
thetic electron transport. 


4 (700 nm photons) + 4 (680 nm photons) + 2 H,O + 2 NADP* —> 
O- + 2 NADPH + 2 H* 
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24. Tyrosine Radical as Primary Electron Acceptor from the OEC Why is a 
tyrosine radical, not a serine or threonine radical, better suited to be 
the primary electron acceptor from the oxygen evolving complex? 


25. Why can Water be Considered Food for Plants? In animals and 
humans, complex organic molecules like carbohydrates, lipids or 
proteins are considered food. For plants, water can be considered a 
food. What chemical functions do these two categories of food have 
in common? 


26. Light-Harvesting by Accessory Pigments From Figure 21.5, predict 
the spectral properties of accessory light-harvesting pigments found 
in plants. 


27. Mg?* Movements Upon Chloroplast Illumination Draw a figure 
analogous to Figure 21.27, plotting [Mg?*] in the stroma and 
thylakoid lumen on the y-axis and dark-light-dark on the x-axis. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


28. Using the ActiveModel for photosystem II, explain why dimerization 
occurs, and identify the residues that drive dimerization. 


29. Look at the Ramachandran plot in the ActiveModel for photosystem I, 
and determine the ¢ and y angles of the a-helices in photosystem II. 
What are typical ¢ and y angles for a-helices? 


Think-Pair-Share Question 


The transketolasee reaction in the Calvin-Benson cycle (Table 21.1 and 
Figure 21.25) involves the reaction between a ketose and an aldose to 
yield a different aldose and ketose. 

a. Draw the molecules in the transketolase reaction that uses 
D-ribose or D-arabinose as the aldose (see Figure 7.2) and 
D-fructose as the ketose (see Figure 7.3). Use colored pens to 
illustrate the fragments of the molecules that are exchanged. 

b. Fora transketolase reaction that generates D-ribose or 
D-fructose as a product, suggest a combination of an aldose and 
a ketose. Draw the molecules in this reaction and use colored 
pens to illustrate the fragments of the molecules that are 
exchanged. 

(Hint: Transketolases require the ketose to have the C3 and C4 
alcohol groups in threose configuration. See Figure 7.2). 
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Gluconeogenesis, Glycogen 22 
Metabolism, and the Pentose 
Phosphate Pathway P 


(With bread and wine you can walk your road.) 


Spanish proverb 


< A basket of fresh bread. Carbohydrates 
such as these provide a significant 
portion of human caloric intake. 


Essential Question Key Questions 

As shown in Chapters 18 and 19, the metabolism of sugars is an important source of energy 22.1 What Is Gluconeogenesis, 

for cells. Animals, including humans, typically obtain significant amounts of glucose and and How Does It Operate? 

other sugars from the breakdown of starch in their diets. Glucose can also be supplied via 22.2 How Is Gluconeogenesis Regulated? 
breakdown of cellular reserves of glycogen (in animals) or starch (in plants). What is the 22.3 How Are Glycogen and Starch 


nature of gluconeogenesis, the pathway that synthesizes glucose from noncarbohydrate Catabolized in Animals? 
precursors; how is glycogen synthesized from glucose; and how are electrons from 


ffs . 22.4 How Is Glycogen Synthesized? 
glucose used in biosynthesis? Wiis SU a 


22.5 How Is Glycogen Metabolism 
Controlled? 


- i : f 22.6 Can Glucose Provide Electrons 
he ability to synthesize glucose from common metabolites is very important for Biosynthesis? 


to most organisms. Human metabolism, for example, consumes about 160 
+ 20 grams of glucose per day, about 75% of this in the brain. Body fluids carry only 
about 20 grams of free glucose, and glycogen stores normally can provide only about 
180 to 200 grams of free glucose. Thus, the body carries only a little more than a 
one-day supply of glucose. If glucose is not obtained in the diet, the body must 
produce new glucose from noncarbohydrate precursors. The term for this activity is 
gluconeogenesis, which means the generation (genesis) of new (neo) glucose. 

Furthermore, muscles consume large amounts of glucose via glycolysis, producing 
large amounts of pyruvate. In vigorous exercise, muscle cells become anaerobic and 
pyruvate is converted to lactate. Gluconeogenesis salvages this pyruvate and lactate and 
reconverts it to glucose. 

Another pathway of glucose catabolism, the pentose phosphate pathway, is the primary 
source of NADPH, the reduced coenzyme essential to most reductive biosynthetic pro- 
cesses. For example, NADPH is crucial to the biosynthesis of fatty acids and amino acids. 
The pentose phosphate pathway also results in the production of ribose-5-phosphate, an 
essential component of ATP, NAD+, FAD, coenzyme A, and particularly DNA and 
RNA. This important pathway will also be considered in this chapter. 
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Chapter 22 Gluconeogenesis, Glycogen Metabolism, and the Pentose Phosphate Pathway 


22.1 | What Is Gluconeogenesis, 


| and How Does It Operate? 


This section describes the essential features of gluconeogenesis: the substrates that 
the pathway utilizes, which organs in animals carry out the pathway, and the four 
reactions that make the pathway energetically favorable and spontaneous. 


22.la The Substrates for Gluconeogenesis Include 
Pyruvate, Lactate, and Amino Acids 


As noted above, pyruvate and lactate are the principal substrates of gluconeogenesis. 
In addition to pyruvate and lactate, other noncarbohydrate precursors can be used 
as substrates for gluconeogenesis in animals. These include most of the amino acids, as 
well as glycerol and all the TCA cycle intermediates. On the other hand, fatty acids 
are not substrates for gluconeogenesis in animals, because most fatty acids yield only 
acetyl-CoA upon degradation, and animals cannot carry out net synthesis of sugars 
from acetyl-CoA. Lysine and leucine are the only amino acids that are not substrates 
for gluconeogenesis. These amino acids produce only acetyl-CoA upon degradation. 
Note also that acetyl-CoA can be a substrate for gluconeogenesis in plants when the 
glyoxylate cycle is operating (see Section 19.9). 


22.1b Nearly All Gluconeogenesis Occurs in the Liver 
and Kidneys in Animals 


Interestingly, the mammalian organs that consume the most glucose, namely, brain 
and muscle, carry out very little glucose synthesis. The major sites of gluconeogenesis 
are the liver and kidneys, which account for about 90% and 10% of the body’s gluco- 
neogenic activity, respectively. Glucose produced by gluconeogenesis in the liver and 


Human Biochemistry 


The Chemistry of Glucose Monitoring Devices 


Individuals with diabetes must measure their serum glucose con- 
centration frequently, often several times a day. The advent of 
computerized, automated devices for glucose monitoring has made 
this necessary chore easier, far more accurate, and more convenient 
than it once was. These devices all use a simple chemical scheme for 
glucose measurement that involves oxidation of glucose to gluconic 
acid by glucose oxidase. This reaction produces two molecules of 
hydrogen peroxide per molecule of glucose oxidized. The HO, is 
then used to oxidize a dye, such as o-dianisidine, to a colored product 
that can be measured: 


Glucose + 2 HO + O, —> gluconic acid + 2 H,O, 
o-dianisidine (colorless) + H:O, —> oxidized o-anisidine 
(colored) + HO 


The amount of colored dye produced is directly proportional to the 
amount of glucose in the sample. 

The patient typically applies a drop of blood (from a finger- 
prick*) to a plastic test strip that is then inserted into the glucose 
monitor. Within half a minute, a digital readout indicates the blood 
glucose value. Modern glucose monitors store several days of 
glucose measurements, and the data can be easily transferred to a 
computer for analysis and graphing. 


Charles M. Grisham 


*How does the monitor deal with getting just the right amount of blood? 
The blood flows up an absorbent wick by capillary action. It is impossible 
to overfill this device, but the monitor will give an error signal if not enough 
blood flows up the strip. 
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Dr. Kayunta Johnson-Winters 


Dr. Kayunta Johnson-Winters is an Associate 
Professor at the University of Texas at Arlington 
and a mechanistic enzymologist with a research 
focus on F4.9-dependent enzymes. She double 
majored in biology and chemistry at Alverno 
College in Milwaukee and went on to earn a Ph.D. 
in Biochemistry from the University of Wisconsin- 
Milwaukee. Dr. Johnson-Winters conducted 
postdoctoral research with John Enemark at the 
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Careers in Chemistry 


University of Arizona on the mechanisms of sulfite 
oxidizing enzymes. 

Her current research uses steady-state and rapid- 
mixing pre-steady state kinetic methods to investi- 
gate reaction intermediates of glucose-6-phosphate 
dehydrogenase. This enzyme and other F429 cofactor 
dependent enzymes in Mycobacterium tuberculosis 
hold promise as potential drug targets for treatment 
of tuberculosis. 


kidneys is released into the blood and is subsequently absorbed by brain, heart, 
muscle, and red blood cells to meet their metabolic needs. In turn, pyruvate and 
lactate produced in these tissues are returned to the liver and kidneys to be used as 
gluconeogenic substrates. 


22.1c Gluconeogenesis Is Not Merely the Reverse of Glycolysis 


In some ways, gluconeogenesis is the reverse, or antithesis, of glycolysis. Glucose is 
synthesized, not catabolized; ATP is consumed, not produced; and NADH is oxidized 
to NAD*, rather than the other way around. However, gluconeogenesis cannot be 
merely the reversal of glycolysis, for two reasons. First, glycolysis is exergonic, with a 
AG” of approximately —74 kJ/mol. If gluconeogenesis were merely the reverse, it 
would be a strongly endergonic process and could not occur spontaneously. The 
energetics of the process must be augmented so that gluconeogenesis can proceed 
spontaneously. Second, the processes of glycolysis and gluconeogenesis must be 
regulated in a reciprocal fashion so that when glycolysis is active, gluconeogenesis is 
inhibited, and when gluconeogenesis is proceeding, glycolysis is turned off. Both of 
these limitations are overcome by having unique reactions within the routes of 
glycolysis and gluconeogenesis, rather than a completely shared pathway. 


22.1d Gluconeogenesis—Something Borrowed, Something New 


The complete route of gluconeogenesis is shown in Figure 22.1, side by side with the 
glycolytic pathway. Gluconeogenesis employs four different reactions, catalyzed by 
four different enzymes, for the three steps of glycolysis that are highly exergonic (and 
highly regulated). In essence, seven of the ten steps of glycolysis are merely reversed 
in gluconeogenesis. The six reactions between fructose-1,6-bisphosphate and PEP are 
shared by the two pathways, as is the isomerization of glucose-6-P to fructose-6-P. 
The three exergonic, regulated reactions—the hexokinase (glucokinase), phospho- 
fructokinase, and pyruvate kinase reactions—are replaced by alternative reactions 
in the gluconeogenic pathway. 

The conversion of pyruvate to PEP that initiates gluconeogenesis is accomplished 
by two unique reactions. Pyruvate carboxylase catalyzes the first, converting pyruvate 
to oxaloacetate. Then, PEP carboxykinase catalyzes the conversion of oxaloacetate to 
PEP. Conversion of fructose-1,6-bisphosphate to fructose-6-phosphate is catalyzed 
by a specific phosphatase, fructose-1,6-bisphosphatase. The final step to produce 
glucose, hydrolysis of glucose-6-phosphate, is mediated by glucose-6-phosphatase. 
Each of these steps is considered in detail in the following paragraphs. The overall 
conversion of pyruvate to PEP by pyruvate carboxylase and PEP carboxykinase has 
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Figure 22.1 The pathways of gluconeogenesis and This reaction occurs in the ER 
glycolysis. Yellow boxes denote reactions unique to 
gluconeogenesis. Species in blue, green, and peach- 
colored shaded boxes indicate other entry points for 
gluconeogenesis (in addition to pyruvate). 


Glyceraldehyde-3-P ———~ Dihydroxyacetone-P 
NAD+ > a NAD+ 
NADH < N NADH 
1,3-Bisphosphoglycerate 


N 
ADP ADP 


m 


3-Phosphoglycerate 


j 


2-Phosphoglycerate 


acids 


a AG” close to zero but is pulled along by subsequent reactions. The conversion of 
fructose-1,6-bisphosphate to glucose in the last three steps of gluconeogenesis is 
strongly exergonic, with a AG?’ of about —30.5 kJ/mol. This sequence of two phos- 
phatase reactions separated by an isomerization accounts for most of the free energy 
release that makes the gluconeogenesis pathway spontaneous. 
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22.le Four Enzyme Reactions Are Unique to Gluconeogenesis 


1. Pyruvate Carboxylase—A Biotin-Dependent Enzyme—Converts Pyruvate to Oxalocetate 
Initiation of gluconeogenesis occurs in the pyruvate carboxylase reaction—the conversion 
of pyruvate to oxaloacetate. 


© O 
Il + HCO; + ATP —> Il + ADP + P; 
CH;C — COOR OKO S—(Cla (C= (COOK 
Pyruvate Bicarbonate Oxaloacetate 


The reaction takes place in two discrete steps, involves ATP and bicarbonate as sub- 
strates, and utilizes biotin as a coenzyme and acetyl-CoA as an allosteric activator. 
Pyruvate carboxylase is a tetrameric enzyme (with a molecular mass of about 500 kDa). 
Each monomer possesses a biotin covalently linked to the e-amino group of a lysine 


rapidly to form N-carboxybiotin, liberating inorganic phosphate. The third step involves 
abstraction of a proton from the C-3 of pyruvate, forming a carbanion that can attack 
the carbon of N-carboxybiotin to form oxaloacetate. 


Pyruvate Carboxylase Is Allosterically Activated by Acetyl-Coenzyme A Two particularly 
interesting aspects of the pyruvate carboxylase reaction are (i) allosteric activation of 
the enzyme by acyl-CoA derivatives and (ii) compartmentation of the reaction in the 
mitochondrial matrix. The carboxylation of biotin requires the presence (at an allo- 
steric site) of acetyl-CoA or other acylated CoA derivatives. The second half of the 
carboxylase reaction—the attack by pyruvate to form oxaloacetate—is not affected 
by CoA derivatives. 

Activation of pyruvate carboxylase by acetyl-CoA provides an important physiological 
regulation. Acetyl-CoA is the primary substrate for the TCA cycle, and oxaloacetate 
(formed by pyruvate carboxylase) is an important intermediate in both the TCA cycle and 
the gluconeogenesis pathway. If levels of ATP and/or acetyl-CoA (or other acyl-CoAs) 


In most organisms, pyruvate carboxylase is a 
homotetramer of 130-kDa subunits, with each 

, saan . . oo a subunit composed of three functional domains 
residue at the active site (Figure 22.2). The first step of the reaction involves nucleophilic named biotin carboxylase, carboxyl transferase, and 
attack of a bicarbonate oxygen at the y-P of ATP to form carbonylphosphate (an activated biotin carboxyl carrier protein. Shown here is the 


form of CO») and ADP (Figure 22.3). Reaction of carbonylphosphate with biotin occurs biotin carboxylase domain of pyruvate carboxylase 


from Bacillus thermodenitrificans (PDB id = 2DZD). 


CHCHCH;CH, = Lysine 
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Figure 22.2 Covalent linkage of biotin to an active- 


site lysine in pyruvate carboxylase. 
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Figure 22.3 A mechanism for the pyruvate carboxylase reaction. Bicarbonate must be activated for 
attack by the pyruvate carbanion. This activation is driven by ATP and involves formation of a 
carbonylphosphate intermediate—a mixed anhydride of carbonic and phosphoric acids. 
(Carbonylphosphate and carboxyphosphate are synonyms.) 
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Figure 22.4 Pyruvate carboxylase is a 
compartmentalized reaction. Pyruvate is converted 
to oxaloacetate in the mitochondria. Because 
oxaloacetate cannot be transported across the 
mitochondrial membrane, it must be reduced to 
malate, which is then transported to the cytosol and 
oxidized back to oxaloacetate before 


gluconeogenesis can continue. 


PEP carboxykinase from Escherichia coli with 
ADP (blue), pyruvate (purple), and Mg?* 


(orange; PDB id = 10S1). 


are low, pyruvate is directed primarily into the TCA cycle, which eventually promotes the 
synthesis of ATP. If ATP and acetyl-CoA levels are high, pyruvate is converted to oxalo- 
acetate and consumed in gluconeogenesis. Clearly, high levels of ATP and CoA deriva- 
tives are signs that energy is abundant and that metabolites will be converted to glucose 
(and perhaps even glycogen). If the energy status of the cell is low (in terms of ATP and 
CoA derivatives), pyruvate is consumed in the TCA cycle. Also, as noted in Chapter 19, 
pyruvate carboxylase is an important anaplerotic enzyme. Its activation by acetyl-CoA 
leads to oxaloacetate formation, replenishing the level of TCA cycle intermediates. 


Compartmentalized Pyruvate Carboxylase Depends on Metabolite Conversion and Trans- 
port The second interesting feature of pyruvate carboxylase is that it is found only in 
the matrix of the mitochondria. By contrast, the next enzyme in the gluconeogenic 
pathway, PEP carboxykinase, may be localized in the cytosol, in the mitochondria, 
or both. For example, rabbit-liver PEP carboxykinase is predominantly mitochon- 
drial, whereas the rat-liver enzyme is strictly cytosolic. In human liver, PEP car- 
boxykinase is found both in the cytosol and in the mitochondria. Pyruvate is 
transported into the mitochondrial matrix (Figure 22.4), where it can be converted to 
acetyl-CoA (for use in the TCA cycle) and then to citrate (for fatty acid synthesis). 
Alternatively, it may be converted directly to OAA by pyruvate carboxylase and used 
in gluconeogenesis. In tissues where PEP carboxykinase is found only in the mito- 
chondria, oxaloacetate is converted to PEP, which is then transported to the cytosol 
for gluconeogenesis. However, in tissues that must convert some oxaloacetate to PEP 
in the cytosol, a problem arises. Oxaloacetate cannot be transported directly across 
the mitochondrial membrane. Instead, it must first be transformed into malate or 
aspartate for transport across the mitochondrial inner membrane (Figure 22.4). 
Cytosolic malate and aspartate are reconverted to oxaloacetate before continuing 
along the gluconeogenic route. 


2. PEP Carboxykinase Converts Oxaloacetate to PEP The second reaction in the gluco- 
neogenic pyruvate—PEP bypass is the conversion of oxaloacetate to PEP. 


O 2— 
X4 oe Oz PO, : E 
C—COOT + GTP > C— COO” + GpP + xy 
[ PEP carboxykinase I| 
HC — E00 CH, 
Oxaloacetate PEP 


Production of a high-energy metabolite such as PEP requires energy. The energetic 
requirements are handled in two ways here. First, the CO, added to pyruvate in the 
pyruvate carboxylase step is removed in the PEP carboxykinase reaction. Decarboxyl- 
ation is a favorable process and helps drive the formation of the very high-energy enol 
phosphate in PEP. This decarboxylation drives a reaction that would otherwise be 
highly endergonic. Note the inherent metabolic logic in this pair of reactions: Pyruvate 
carboxylase consumed an ATP to drive a carboxylation so that the PEP carboxykinase 
could use the decarboxylation to facilitate formation of PEP. Second, another high- 
energy phosphate is consumed by the carboxykinase. Mammals and various other 
organisms use GTP in this reaction, rather than ATP. The use of GTP here is equiva- 
lent to the consumption of an ATP, due to the activity of the nucleoside diphosphate 
kinase (see Figure 19.2). The substantial free energy of hydrolysis of GTP is crucial to 
the synthesis of PEP in this step. The overall AG for the pyruvate carboxylase and PEP 
carboxykinase reactions under physiological conditions in the liver is —22.6 kJ/mol. 
Once PEP is formed in this way, the phosphoglycerate mutase, phosphoglycerate 
kinase, glyceraldehyde-3-P dehydrogenase, aldolase, and triose phosphate isomerase 
reactions act to eventually form fructose-1,6-bisphosphate, as shown in Figure 22.1. 


3. Fructose-1,6-Bisphosphatase Converts Fructose-1,6-Bisphosphate to Fructose-6-Phos- 
phate The hydrolysis of fructose-1,6-bisphosphate to fructose-6-phosphate is cata- 
lyzed by fructose-1,6-bisphosphatase. 
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0,POH,C ~~?\_ cH,OPOe *OsPOHSC On CHOH 
H HO + H0O > H HO + ® 
H OH Fructose-1,6- H OH 
bisphosphatase 
HO H HO H 
AG” = -16.7 kJ/mol 
Fructose-1,6-bisphosphate Fructose-6-phosphate 


Like all phosphate ester hydrolyses, this a thermodynamically favorable (exergonic) 
reaction under standard-state conditions (AG°’ = —16.7 kJ/mol). Under physiological 
conditions in the liver, the reaction is also exergonic (AG = —8.6 kJ/mol). Fructose- 
1,6-bisphosphatase is an allosterically regulated enzyme. Citrate stimulates bisphos- 
phatase activity, but fructose-2,6-bisphosphate is a potent allosteric inhibitor. AMP 
also inhibits the bisphosphatase; the inhibition by AMP is enhanced by fructose- 
2,6-bisphosphate. 


4. Glucose-6-Phosphatase Converts Glucose-6-Phosphate to Glucose The final step in 
the gluconeogenesis pathway is the conversion of glucose-6-phosphate to glucose by 
the action of glucose-6-phosphatase (Figure 22.5). This enzyme is present in the 
membranes of the endoplasmic reticulum of liver and kidney cells but is absent in 
muscle and brain. For this reason, gluconeogenesis is not carried out in muscle and 
brain. The glucose-6-phosphatase system includes the phosphatase itself and three 
transport proteins, T1, T2, and T3. The glucose-6-phosphate transporter (T1) takes 
glucose-6-phosphate into the endoplasmic reticulum, where it is hydrolyzed to 
glucose and P;. The T2 and T3 transporters export glucose and Pj, respectively, to the 
cytosol, and glucose is then exported (to the circulation) by the GLUT2 transporter. _ Fructose-1,6-bisphosphatase from pig with fructose- 
The glucose-6-phosphatase reaction involves a phosphorylated enzyme intermediate, © Phosphate (orange), AMP (blue), and Mg** (dark 
sgi A : ; blue; PDB id = 1FBP). 
phosphohistidine (Figure 22.6). The AG for the glucose-6-phosphatase reaction in 
liver is —5.1 kJ/mol. 


Coupling with Hydrolysis of ATP and GTP Drives Gluconeogenesis The net reaction for the 
conversion of pyruvate to glucose in gluconeogenesis is 


2 Pyruvate + 4 ATP + 2 GTP + 2 NADH + 2 H* + 6 H:O —> 
glucose + 4 ADP + 2 GDP + 6 P; + 2 NAD+ 


Glucose-6-phosphate Figure 22.5 Glucose-6-phosphatase is localized in the 
endoplasmic reticulum. Conversion of glucose-6- 
Glucose-6-phosphatase phosphate to glucose occurs following transport into 
aa the endoplasmic reticulum. Glucose-6-phosphatase 
N ITY M and the three transporters, T1, T2, and T3, are known 
Wij, collectively as the glucose-6-phosphatase system. 
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Figure 22.6 The glucose-6-phosphatase reaction involves formation of a phosphohistidine intermediate. 


The net free energy change, AG°’, for this conversion is —37.7 kJ/mol. The consump- 
tion of a total of six nucleoside triphosphates drives this process forward. If glycolysis 
were merely reversed to achieve the net synthesis of glucose from pyruvate, the net 
reaction would be 
2 Pyruvate + 2 ATP + 2 NADH + 2 H+ + 2 H,O —> 

glucose + 2 ADP + 2 P; + 2 NAD+ 


and the overall AG°’ would be about +74 kJ/mol. Such a process would be highly 
endergonic and, therefore, thermodynamically unfeasible. Hydrolysis of four addi- 
tional high-energy phosphate bonds makes gluconeogenesis thermodynamically 


Human Biochemistry 


Metformin—a Diabetes Drug with Multiple Actions 


Diabetes, the inability to assimilate and metabolize blood glucose, 
afflicts millions of people. People with type 1 diabetes are unable 
to synthesize and secrete insulin. On the other hand, people with 
type 2 diabetes make sufficient insulin, but the molecular pathways 
that respond to insulin are defective. Many type 2 diabetic people 
exhibit a condition termed insulin resistance even before the onset 
of diabetes. Metformin (see accompanying figure) is a drug that 
regulates glucose homeostasis and improves sensitivity to insulin. 
Metformin does not act on just one cellular target, but instead 
works through multiple mechanisms in the body. It acts preferen- 
tially in the liver, in part because it is taken up readily by transport 
channels in hepatocytes (liver cells). Metformin also stimulates glu- 
cose uptake by glucose transporters in peripheral tissues, increases 
insulin receptor expression, increases binding of insulin to insulin 
receptors, stimulates tyrosine kinase activity of the insulin receptor, 
and inhibits gluconeogenesis and glycogenolysis in the liver. The 
inhibition of gluconeogenesis involves reduction in liver uptake of 
gluconeogenic substrates and antagonism of glucagon signaling 
pathways. Metformin inhibits Complex I of the electron transport 
system (Section 20.2b), which suppresses ATP production, increases 
cellular AMP and ADP, and raises the cellular AMP/ATP ratio. 


CH, H 
| | HS Nig COO- 
I I FZ 
Metformin. 3-Mercaptopicolinate 


Metformin also inhibits AMP deaminase, which also raises 
AMP levels. Elevated AMP activates AMP kinase (AMPK), the 
cellular energy sensor. That, in turn, promotes catabolic reactions 
that generate ATP and inhibit anabolic reactions, including the 
fructose-1,6-bisphosphatase reaction of gluconeogenesis. 

Metformin also increases secretion from intestinal cells of 
glucagon-like peptide 1 (GLP-1), a glucose-lowering hormone that 
is released in response to food ingestion and that stimulates glucose- 
dependent secretion of insulin and inhibits glucagon release from the 
pancreas. Metformin also exerts antioxidant effects, reducing reac- 
tive oxygen species (ROS) in cells by several mechanisms. Finally, 
metformin exerts anti-inflammatory effects, preventing vascular 
damage and protecting against cardiovascular disease. 

Gluconeogenesis inhibitors may be the next wave in diabetes ther- 
apy. Drugs that block gluconeogenesis without affecting glycolysis 
would need to target one of the enzymes unique to gluconeogenesis. 
3-Mercaptopicolinate and hydrazine inhibit PEP carboxykinase, 
and chlorogenic acid, a natural product found in the skin of peaches, 
inhibits the transport activity of the glucose-6-phosphatase system 
(but not the glucose-6-phosphatase enzyme activity). Drugs of this 
type may be useful in the treatment of type 2 diabetes. 
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favorable. Under physiological conditions, however, gluconeogenesis is somewhat 
less favorable than at standard state, with an overall AG of —15.6 kJ/mol for the 
conversion of pyruvate to glucose. 


Lactate Formed in Muscles Is Recycled to Glucose in the Liver A final point on the 
redistribution of lactate and glucose in the body serves to emphasize the metabolic 
interactions between organs. Vigorous exercise can lead to oxygen shortage (anaero- 
bic conditions), and energy requirements must be met by increased levels of glycolysis. 
Under such conditions, glycolysis converts NAD* to NADH, yet O; is unavailable for 
regeneration of NAD* via cellular respiration. Instead, large amounts of NADH are 
reoxidized by the reduction of pyruvate to lactate. The lactate thus produced can be 
transported from muscle to the liver, where it is reoxidized by liver lactate dehydroge- 
nase to yield pyruvate, which is converted eventually to glucose. In this way, the liver 
shares in the metabolic stress created by vigorous exercise. It exports glucose to 
muscle, which produces lactate, and lactate from muscle can be processed by the liver 
into new glucose. This is referred to as the Cori cycle (Figure 22.7). Liver, with a typi- 
cally high NAD*/NADH ratio (about 700), readily produces more glucose than it can 
use. Muscle that is vigorously exercising will enter anaerobiosis and show a decreasing 
NAD*/NADH ratio, which favors reduction of pyruvate to lactate. 


22.2 | How Is Gluconeogenesis Regulated? 


Nearly all of the reactions of glycolysis and gluconeogenesis take place in the cytosol. 
If metabolic control were not exerted over these reactions, glycolytic degradation of 
glucose and gluconeogenic synthesis of glucose could operate simultaneously, with 
no net benefit to the cell and with considerable consumption of ATP. This is pre- 
vented by a sophisticated system of reciprocal control, which inhibits glycolysis when 
gluconeogenesis is active, and vice versa. Reciprocal regulation of these two pathways 
depends largely on the energy status of the cell. When the energy status of the cell is 
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Figure 22.7 The Cori cycle recycles lactate formed 
in muscles to glucose in the liver. 
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Critical Developments in Biochemistry 


The Pioneering Studies of Carl and Gerty Cori 


The Cori cycle is named for Carl and Gerty Cori, who received 
the Nobel Prize in Physiology or Medicine in 1947 for their stud- 
ies of glycogen metabolism and blood glucose regulation. Carl 
Ferdinand Cori and Gerty Theresa Radnitz were both born in 
Prague (then in Austria). They earned medical degrees from the 
German University of Prague in 1920 and were married later 
that year. They joined the faculty of the Washington University 
School of Medicine in St. Louis in 1931. Their remarkable col- 
laboration resulted in many fundamental advances in carbohydrate 


They also showed that glucose-6-phosphate was produced from 
glucose-1-P by the action of phosphoglucomutase. They isolated 
and crystallized glycogen phosphorylase and elucidated the path- 
way of glycogen breakdown. In 1952, they showed that absence of 
glucose-6-phosphatase in the liver was the enzymatic defect in von 
Gierke’s disease, an inherited glycogen-storage disease. Six eventual 
Nobel laureates received training in their laboratory. Gerty Cori was 
the first American woman to receive a Nobel Prize. Carl Cori said 
of their remarkable collaboration: “Our efforts have been largely 


and glycogen metabolism. They were credited with the discovery complementary and one without the other would not have gone so 
of glucose-1-phosphate, also known at the time as the Cori ester. far...” 


low, glucose is rapidly degraded to produce needed energy. When the energy status 
is high, pyruvate and other metabolites are utilized for synthesis (and storage) of 
glucose. Moreover, when blood glucose levels are low, gluconeogenesis is active. 

In glycolysis, the three regulated enzymes are those catalyzing the strongly exergonic 
reactions: hexokinase (glucokinase), phosphofructokinase, and pyruvate kinase. As 
noted earlier, the gluconeogenic pathway replaces these three reactions with corre- 
sponding reactions that are exergonic in the direction of glucose synthesis: glucose- 
6-phosphatase, fructose-1,6-bisphosphatase, and the pyruvate carboxylase-PEP 
carboxykinase pair, respectively. These are the three most appropriate sites of regulation 
in gluconeogenesis. 


22.2a Gluconeogenesis Is Regulated by Allosteric 
and Substrate-Level Control Mechanisms 


The mechanisms of regulation of gluconeogenesis are shown in Figure 22.8. Control is 
exerted at the same steps as for glycolysis, but in different ways. Glucose-6-phosphatase 
is not under allosteric control. However, the K,, for the substrate, glucose-6-phosphate, 
is considerably higher than the normal range of substrate concentrations. As a result, 
glucose-6-phosphatase displays a near-linear dependence of activity on substrate con- 
centrations and is thus said to be under substrate-level control by glucose-6-phosphate. 
Acetyl-CoA is a potent allosteric effector of glycolysis and gluconeogenesis. It alloste- 
rically inhibits pyruvate kinase (as noted in Chapter 18) and activates pyruvate carboxyl- 
ase. Because it also allosterically inhibits pyruvate dehydrogenase (the enzymatic link 
between glycolysis and the TCA cycle), the cellular fate of pyruvate is strongly dependent 
on acetyl-CoA levels. A rise in [acetyl-CoA] indicates that cellular energy levels are high 
and that carbon metabolites can be directed to glucose synthesis and storage. When ace- 
tyl-CoA levels drop, the activities of pyruvate kinase and pyruvate dehydrogenase 
increase and flux through the TCA cycle increases, providing needed energy for the cell. 
Fructose-1,6-bisphosphatase is another important site of gluconeogenic regula- 
tion. This enzyme is inhibited by AMP and activated by citrate. These effects by AMP 
and citrate are the opposites of those exerted on phosphofructokinase in glycolysis, 
providing another example of reciprocal regulatory effects. When AMP levels increase, 
gluconeogenic activity is diminished and glycolysis is stimulated. An increase in citrate 
concentration signals that TCA cycle activity can be curtailed and that pyruvate 


2i 2— : ae 
oo OPO3 should be directed to sugar synthesis instead. 
H HO 
H CH,OH ; > ; : 
eae Fructose-2,6-Bisphosphate—Allosteric Regulator of Gluconeogenesis Emile Van 


Schaftingen and Henri-Géry Hers demonstrated in 1980 that fructose-2,6-bisphos- 


Fructose-2,6-bisphosphate . g K k 
PRSE phate is a potent stimulator of phosphofructokinase (see Chapter 18). Cognizant of the 
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reciprocal nature of regulation in glycolysis and gluconeogenesis, Van Schaftingen and 
Hers also considered the possibility of an opposite effect—inhibition— for fructose-1,6- 
bisphosphatase. In 1981, they reported that fructose-2,6-bisphosphate was indeed a 
powerful inhibitor of fructose-1,6-bisphosphatase (Figure 22.9). Inhibition occurs in 
either the presence or absence of AMP, and the effects of AMP and fructose-2,6- 
bisphosphate are synergistic. 

Cellular levels of fructose-2,6-bisphosphate are controlled by phosphofructokinase-2 
(PFK-2), an enzyme distinct from the phosphofructokinase (often designated PFK-1) 
of the glycolytic pathway, and by fructose-2,6-bisphosphatase (F-2,6-BPase). (PFK-1 
and PFK-2 are numbered not only in terms of their order of discovery, but also because 
PFK-1 phosphorylates at the 1-position and PFK-2 phosphorylates at the 2-position.) 
Remarkably, these two enzymatic activities are both found in the same protein molecule, 
which is an example of a bifunctional, or tandem, enzyme (Figure 22.10). The opposing 
activities of this bifunctional enzyme are themselves regulated in two ways. First, 
fructose-6-phosphate, the substrate of phosphofructokinase and the product of 
fructose-1,6-bisphosphatase, allosterically activates PFK-2 and inhibits F-2,6-BPase. 
Second, the phosphorylation by cAMP-dependent protein kinase of a single Ser (Ser*?) 


22.2 
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Figure 22.8 The principal regulatory mechanisms 
in glycolysis and gluconeogenesis. Activators are 
indicated by plus signs and inhibitors by minus 
signs. Carbohydrate metabolism is also modulated 
by a variety of cellular signals, including the protein 
kinases PI3K, AKT, and AMPK, the tumor 
suppressor p53, and TIGAR (see A Deeper Look, 
“TIGAR-P53-Induced Enzyme That Mimics 
Fructose-2,6-Bisphosphatase’’). 
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Figure 22.9 Inhibition of fructose-1,6-bisphosphatase by fructose-2,6-bisphosphate in the 

(a) absence and (b) presence of 25 mM AMP. In (a) and (b), enzyme activity is plotted against 
substrate (fructose-1,6-bisphosphate) concentration. Concentrations of fructose-2,6-bisphosphate 
(in mM) are indicated above each curve. (c) The effect of AMP (0, 10, and 25 mM) on the inhibition 
of fructose-1,6-bisphosphatase by fructose-2,6-bisphosphate. Activity was measured in the presence 
of 10 mM fructose-1,6-bisphosphate. (Adapted from Van Schaftingen, E., and Hers, H-G., 1981. 
Inhibition of fructose-1,6-bisphosphatase by fructose-2,6-bisphosphate. Proceedings of the National 
Academy of Science, U.S.A. 78:2861-2863.) 


A Deeper Look 


TIGAR—A p53-Induced Enzyme That Mimics Fructose-2,6-Bisphosphatase 


Cellular stresses, such as oncogenesis and DNA damage, provoke 
the activation of target genes by the tumor suppressor p53. A prin- 
cipal target of p53 is TIGAR (TP53-induced glycolysis and apopto- 
sis regulator). TIGAR inhibits glycolysis, reduces cellular levels of 
reactive oxygen species (ROS), and protects cells from apoptosis. 
Hui Li and Gerwald Jogl have shown that TIGAR’s structure is 
nearly identical to that of the fructose-2,6-bisphosphatase domain 
of the bifunctional PFK-2/F-2,6-BPase (see figure). TIGAR is a 
fructose bisphosphatase that exerts its cellular effects by removing 


(b) 


the allosteric effector fructose-2,6-bisphosphate and also the glyco- 
lytic intermediate fructose-1,6-bisphosphate. 
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(a) Overlay of TIGAR (orange; PDB id = 53E9D) and F-2,6-BPase (blue, PDB id = 52BIF) (b) An active-site comparison that 


includes the phosphates (yellow/red). 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


(a) Glucagon 


v 
Vv 


cAMP-dep. PK (PKA) 


F-6-P ATP wm | anp H,O F2,6-BP 


F2,6-BP ADP P; O>F6P 


© 
> PFK-1 


> F1,6-BPase 


Figure 22.10 Synthesis and degradation of fructose-2,6-bisphosphate are catalyzed by the same 
bifunctional enzyme. (a) Details of the regulation of PFK-2 and F-1,6-BPase. (b) The structure of 
PFK-2/F-2,6-BPase from rat liver. PFK-2 activity resides in the N-terminal portion of the protein (left), 
and the C-terminal domain (right) contains F-2,6-BPase activity. The PFK-2 domain has a bound ATP 
analog; the F-2,6-BPase has two phosphates bound (PDB id = | BIF). 


residue on the liver enzyme exerts reciprocal control of the PFK-2 and F-2,6-BPase 
activities. Phosphorylation inhibits PFK-2 activity (by increasing the K,, for fructose- 
6-phosphate) and stimulates F-2,6-BPase activity. 


22.2b Substrate Cycles Provide Metabolic Control Mechanisms 


If fructose-1,6-bisphosphatase and phosphofructokinase acted simultaneously, they 
would constitute a substrate cycle in which fructose-1,6-bisphosphate and fructose- 
6-phosphate became interconverted with net consumption of ATP: 


Fructose-1,6-BP + HO —> fructose-6-P + P; 
Fructose-6-P + ATP —> fructose-1,6-BP + ADP 


Net: ATP + HO —> ADP + P; 


Because substrate cycles such as this appear to operate with no net benefit to the cell, 
they were once regarded as metabolic quirks and were referred to as futile cycles. More 
recently, substrate cycles have been recognized as important devices for controlling 
metabolite concentrations. 

The three steps in glycolysis and gluconeogenesis that differ constitute three such 
substrate cycles, each with its own particular metabolic raison d’étre. Consider, for 
example, the regulation of the fructose-1,6-BP-—fructose-6-P cycle by fructose-2,6- 
bisphosphate. As already noted, fructose-1,6-bisphosphatase is subject to allosteric inhi- 
bition by fructose-2,6-bisphosphate, whereas phosphofructokinase is allosterically 
activated by fructose-2,6-bisphosphate. The combination of these effects should permit 
either phosphofructokinase or fructose-1,6-bisphosphatase (but not both) to operate at 
any one time and should thus prevent futile cycling. For instance, in the fasting state, when 
food (that is, glucose) intake is zero, phosphofructokinase (and, therefore, glycolysis) is 
inactive due to the low concentration of fructose-2,6-bisphosphate. In the liver, 
gluconeogenesis operates to provide glucose for the brain. However, in the fed state, up to 
30% of fructose-1,6-bisphosphate formed from phosphofructokinase is recycled back to 
fructose-6-P (and then to glucose). Because the dependence of fructose-1,6-bisphosphatase 
activity on fructose-1,6-bisphosphate is sigmoidal in the presence of fructose-2,6- 
bisphosphate (see Figure 22.9), substrate cycling occurs only at relatively high levels of 
fructose-1,6-bisphosphate. Substrate cycling in this case prevents the accumulation of 
excessively high levels of fructose-1,6-bisphosphate. 


22.2 How Is Gluconeogenesis Regulated? 


© PFK-2 F2,6-BPase 
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22.3 | How Are Glycogen and Starch Catabolized in Animals? 


As noted earlier, well-fed adult human beings normally metabolize about 160 grams 
of carbohydrates each day. A balanced diet easily provides this amount, mostly in 
the form of starch. If too little carbohydrate is supplied by the diet, glycogen reserves 
in liver and muscle tissue can also be mobilized. 


22.3a Dietary Starch Breakdown Provides Metabolic Energy 


The reactions by which ingested starch and glycogen are digested are shown in 
Figure 22.11. The enzyme known as a-amylase is an important component of saliva 
and pancreatic juice. (8-Amylase is found in plants. The a- and -designations for 
these enzymes serve only to distinguish the two and do not refer to glycosidic 
linkage nomenclature.) a-Amylase is an endoglycosidase that hydrolyzes a(1—+4) 
linkages of amylopectin and glycogen at random positions, eventually producing a 
mixture of maltose, maltotriose [with three a(1—+4)-linked glucose residues], and 
other small oligosaccharides. a-Amylase can cleave on either side of an amylopectin 
branch point, but activity is reduced in highly branched regions of the polysaccha- 
ride and stops four residues from any branch point. 

The highly branched polysaccharides that are left after extensive exposure to a- 
amylase are called limit dextrins. These structures can be further degraded by the action 
of a debranching enzyme, which carries out two distinct reactions. The first of these, 
known as oligo(al,4—>a1,4) glucanotransferase activity, removes a trisaccharide unit 
and transfers this group to the end of another, nearby branch (Figure 22.12). This 
leaves a single glucose residue in a(1—+6) linkage to the main chain. The a(/—~>6) 
glucosidase activity of the debranching enzyme then cleaves this residue from the chain, 
leaving a polysaccharide chain with one branch fewer. Repetition of this sequence of 
events leads to complete degradation of the polysaccharide. 


Figure 22.11 Starch and glycogen are broken down by amylases and glucosidases. (a) The sites of 
hydrolysis of starch by a- and B-amylase are indicated. (b) Glycogenin is a glycosyltransferase that 
initiates eukaryotic glycogen synthesis from glucose. Bound UDP-glucose (blue) 

and Mn?* ion (purple) are shown (PDB id = 1LL2). 
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Figure 22.12 The reactions of debranching enzyme. 
Transfer of a group of three a(1—4)-linked glucose 
residues from a limit branch to another branch is 
followed by cleavage of the a(1—+6) bond of the 
residue that remains at the branch point. 


Limit branch 


CG Geer 


Limit dextrin 


Debranching enzyme 


a(1—6)-glucosidase activity 
of debranching enzyme cleaves 
this residue 


Further cleavage by a-amylase 


B-Amylase is an exoglycosidase that cleaves maltose units from the free, nonreducing 
ends of amylopectin branches, as in Figure 22.11. Like a-amylase, however, B-amylase 
does not cleave either the a(1—>6) bonds at the branch points or the a(1—>4) linkages 
near the branch points. 


22.3b Metabolism of Tissue Glycogen Is Regulated 


Digestion itself is a highly efficient process in which almost 100% of ingested food is 
absorbed and metabolized. Digestive breakdown of starch is an unregulated pro- 
cess. On the other hand, tissue glycogen represents an important reservoir of poten- 
tial energy, and it should be no surprise that the reactions involved in its degradation 
and synthesis are carefully controlled and regulated. Glycogen reserves in liver and 
muscle tissue are stored in the cytosol as granules exhibiting a molecular weight 
range from 6 X 10° to 1600 x 10°. These granular aggregates contain the enzymes 
required to synthesize and catabolize the glycogen as well as all the enzymes of 
glycolysis. 

The principal enzyme of glycogen catabolism is glycogen phosphorylase, a highly 
regulated enzyme that was discussed extensively in Chapter 15. The glycogen phosphor- 
ylase reaction (Figure 22.13) involves phosphorolysis at a nonreducing end of a glyco- 
gen polymer. The standard-state free energy change for this reaction is +3.1 kJ/mol, but 
the intracellular ratio of [P;] to [glucose-1-P] approaches 100, and thus the actual AG in 
vivo is approximately —6 kJ/mol. There is an energetic advantage to the cell in this phos- 
phorolysis reaction. If glycogen breakdown were hydrolytic and yielded glucose as a 
product, it would be necessary to phosphorylate the product glucose (with the expendi- 
ture of a molecule of ATP) to initiate its glycolytic degradation. 

The glycogen phosphorylase reaction degrades glycogen to produce limit dextrins, 
which are further degraded by debranching enzyme, as already described. 
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Figure 22.13 The glycogen phosphorylase reaction. 


22.4 | How Is Glycogen Synthesized? 


Animals synthesize and store glycogen when glucose levels are high, but the synthetic 
pathway is not merely a reversal of the glycogen phosphorylase reaction. High levels 
of phosphate in the cell favor glycogen breakdown and prevent the phosphorylase 
reaction from synthesizing glycogen in vivo, despite the fact that AG?’ for the phos- 
phorylase reaction actually favors glycogen synthesis. Hence, another reaction path- 
way must be employed in the cell for the net synthesis of glycogen. In essence, this 
pathway must activate glucose units for transfer to glycogen chains. 


22.4a Glucose Units Are Activated for Transfer 
by Formation of Sugar Nucleotides 


We are familiar with several examples of chemical activation as a strategy for 
group transfer reactions. Acetyl-CoA is an activated form of acetate; biotin and 
tetrahydrofolate activate one-carbon groups for transfer; and ATP is an activated 
form of phosphate. Luis Leloir, a biochemist in Argentina, showed in the 1950s 
that glycogen synthesis depended upon sugar nucleotides, which may be thought of 
as activated forms of sugar units. For example, formation of an ester linkage 
between the C-1 hydroxyl group and the B-phosphate of UDP activates the glucose 
moiety of UDP-glucose. 


O 
HOCH, HN | 
oO 
H H H 
OH H 
HO 
H OH 


Uridine diphosphate glucose 
(UDPG) 


22.4b UDP-Glucose Synthesis Is Driven by Pyrophosphate Hydrolysis 


Sugar nucleotides are formed from sugar-l-phosphates and nucleoside triphosphates 
by specific pyrophosphorylase enzymes (Figure 22.14). For example, UDP-glucose 
pyrophosphorylase catalyzes the formation of UDP-glucose from glucose-l-phosphate 
and uridine 5'-triphosphate: 
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Glucose-1-P + UTP —> UDP-glucose + pyrophosphate 


The reaction proceeds via attack by a phosphate oxygen of glucose-l-phosphate on 
the a-phosphorus of UTP, with departure of the pyrophosphate anion. The reaction is 
a reversible one, but—as is the case for many biosynthetic reactions—it is driven for- 
ward by subsequent hydrolysis of pyrophosphate: 


Pyrophosphate + H,0 —> 2 P; 


The net reaction for sugar nucleotide formation (combining the preceding two equa- 
tions) is thus 


Glucose-1-P + UTP + H:O —> UDP-glucose + 2 P; 


Sugar nucleotides of this type act as donors of sugar units in the biosynthesis of oligo- 
saccharides and polysaccharides. In animals, UDP-glucose is the donor of glucose 
units for glycogen synthesis, but ADP-glucose is the glucose source for starch synthesis 
in plants. 


22.4c Glycogen Synthase Catalyzes Formation 
of a(1—>4) Glycosidic Bonds in Glycogen 


The very large glycogen polymer is built around a tiny protein core. The first glucose 
residue is covalently joined to the protein glycogenin (see Figure 22.11b) via an acetal 
linkage to a tyrosine—OH group on the protein. Sugar units can then be added by the 
action of glycogen synthase. The reaction involves transfer of a glucosyl unit from 
UDP-glucose to the C-4 hydroxyl group at a nonreducing end of a glycogen strand. 
The mechanism proceeds by cleavage of the C—O bond between the glucose moiety 
and the B-phosphate of UDP-glucose, leaving an oxonium ion intermediate, which is 
rapidly attacked by the C-4 hydroxyl oxygen of a terminal glucose unit on glycogen 
(Figure 22.15). The reaction is exergonic and has a AG?’ of —13.3 kJ/mol. 
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Figure 22.14 The UDP-glucose pyrophosphorylase 
reaction is a phosphoanhydride exchange, with a 
phosphoryl oxygen of glucose-1-P attacking the 
a-phosphorus of UTP to form UDP-glucose and 
pyrophosphate. 


Glycogen synthase from Agrobacterium tumefaciens 
consists of two Rossman folds (see Chapter 16) 
separated by a deep cleft that includes the active site 
(shown here with bound ADP, purple; PDB id = 
IRZU). 
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Figure 22.15 The glycogen synthase reaction. 
Cleavage of the C—O bond of UDP-glucose yields 
an oxonium intermediate. Attack by the hydroxyl 

oxygen of the terminal residue of a glycogen CHOH 
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Glycogen is a branched polymer of glucose units. The branches arise from a(1—>6) 
linkages, which occur every 8 to 12 residues. As noted in Chapter 7, the branches pro- 
vide multiple sites for rapid degradation or elongation of the polymer and also increase 
its solubility. Glycogen branches are formed by amylo-(1,4—>1,6)-transglycosylase, 
also known as branching enzyme. The reaction involves the transfer of a 6 or 7 residue 
segment from the nonreducing end of a linear chain at least 11 residues in length to the 
C-6 hydroxyl of a glucose residue of the same chain or another chain (Figure 22.16). 
For each branching reaction, the resulting polymer has gained a new terminus at 
which growth can occur. 


o 


o 


22.5 | How Is Glycogen Metabolism Controlled? 


Synthesis and degradation of glycogen must be carefully controlled so that this 
important energy reservoir can properly serve the metabolic needs of the organism. 
This section describes the regulation of the synthesis and degradation of glycogen. 


lex 
v 
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22.5a Glycogen Metabolism Is Highly Regulated 


g Glucose is the principal metabolic fuel for the brain, and the concentration of glucose 
in circulating blood must be maintained at about 5 mM for this purpose. Glucose 
derived from glycogen breakdown is also a primary energy source for muscle 


o' 


o 
Ho 


Figure 22.16 Formation of glycogen branches by the branching enzyme. Six or seven residue 
segments of a growing glycogen chain are cleaved (Step 1) and transferred (Step 2) to the C-6 
hydroxyl group of a glucose residue on the same or a nearby chain. 
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Human Biochemistry 


Advanced Glycation End Products—A Serious Complication of Diabetes 


Covalent linkage of sugars to proteins to form glycoproteins 
normally occurs through the action of enzymes that use sugar 
nucleotides as substrates. However, sugars may also react non- 
enzymatically with proteins. The C-1 carbonyl group of glucose 
forms Schiff base linkages with lysine side chains of proteins. 
These Schiff base adducts undergo Amadori rearrangements and 
subsequent oxidations to form irreversible glycation products, 
including carboxymethyllysine and pentosidine derivatives (see 
Figure 1). These advanced glycation end products (AGEs) can alter 


the function of the protein. Such AGE-dependent changes are 
thought to contribute to circulation, joint, and vision problems in 
people with diabetes. 

Nonenzymatic glycation of hemoglobin is a better diagnostic 
yardstick for type 2 diabetes than serum glucose levels. Red blood 
cells have an average life expectancy of about four months. By 
measuring the concentration of glycated hemoglobin in a patient, 
it is possible to determine the average glucose concentration in the 
blood over the past several months. 
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Figure 1 Sugars react nonenzymatically with amine groups on proteins to form Schiff base adducts, which they rearrange to form a 


variety of advanced glycation end products. 


The cell surface receptor that binds advanced glycation end 
products (RAGE) is a multifunctional protein of the innate immune 
system that plays pivotal roles in diabetes, chronic inflammation, 
neurodegenerative diseases, T-lymphocyte proliferation, and can- 
cer. RAGE is a single-transmembrane segment receptor expressed 
mainly in immune cells, neurons, endothelial and vascular smooth 
muscle cells, bone-forming cells, and a variety of cancer cells. 
RAGE consists of three extracellular immunoglobulin domains 


(V, Cl, and C2; Figure 2a, a transmembrane segment and a 
small cytoplasmic domain). The first two extracellular domains of 
RAGE form a single structural unit, termed VC1, which contains 
a basic patch (Figure 2b) and a hydrophobic patch. Binding of its 
ligands (AGE-modified proteins, amyloid-B, and S100 proteins) 
depends on interactions with these surfaces, particularly the large 
basic patch (blue in Figure 2b) comprised of Arg and Lys side 
chains. Binding of ligands induces dimerization of RAGE and 

(Continued) 
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Figure 2 Structure of the receptor that binds advanced glycation end products (RAGE). (a) The domains of RAGE. 
(b) Space-filling structure of RAGE showing acidic (red) and basic (blue) surfaces. (c) A ribbon diagram of RAGE 
(PDB id = 303U). 


triggers a variety of intracellular responses related to inflammation Reference 
and tumorigenesis. Understanding these cellular responses will be Park, H., and Boyington, J. C., 2010. The 1.5A crystal structure of 


essential in light of the developing worldwide epidemic of type 2 human receptor for advanced glycation end products (RAGE) ectodomains 
diabetes reveals unique features determining ligand binding. The Journal of Biological 


Chemistry 285:40762-40770. 


contraction. Control of glycogen metabolism is effected via reciprocal regulation of 
glycogen phosphorylase and glycogen synthase. Thus, activation of glycogen 
phosphorylase is tightly linked to inhibition of glycogen synthase, and vice versa. 
Regulation involves both allosteric control and covalent modification, with the latter 
being under hormonal control. The regulation of glycogen phosphorylase has been 
discussed in detail in Section 15.5. 


22.5b Glycogen Synthase Is Regulated 
by Covalent Modification 


Glycogen synthase exists in two distinct forms that can be interconverted by the 
action of specific enzymes: active, dephosphorylated glycogen synthase I (glucose- 
6-P-independent) and less active, phosphorylated glycogen synthase D (glucose- 
6-P—dependent). The phosphorylated form can be allosterically activated by 
glucose-6-phosphate, but the unphosphorylated enzyme is insensitive to this allo- 
steric effector (Figure 22.17). The nature of phosphorylation is complex (Figure 
22.17a). At least nine serine residues on the enzyme appear to be subject to phos- 
phorylation, each site’s phosphorylation having some effect on enzyme activity. 
Four protein kinases are involved in phosphorylation of glycogen synthase: casein 
kinase, AMP-dependent protein kinase, protein kinase A, and glycogen synthase 
kinase 3 (GSK3). 

Dephosphorylation of both glycogen phosphorylase and glycogen synthase is car- 
ried out by phosphoprotein phosphatase-1 (PP1). The action of PP1 inactivates glycogen 
phosphorylase and activates glycogen synthase. 
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Figure 22.17 Binding of insulin to plasma 
membrane receptors in the liver and muscles 
triggers protein kinase cascades that stimulate 
glycogen synthesis. (a) Insulin’s effects include 
inactivation of GSK3 and stimulation of PP1, 
both actions activating glycogen synthase, as well 
as recruitment of GLUT4 to the plasma 
membrane. Glucose uptake through the GLUT1 
and GLUT4 transporters provides substrate for 
glycogen synthesis and glucose-6-phosphate, which 
allosterically activates the otherwise inactive form 
of glycogen synthase. (b) The structure of human 
GLUT! is shown in two orientations (PDB id = 
4PYP) (c) The metabolic effects of insulin are 
mediated through protein phosphorylation and 
second messenger modulation. 
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Portal vein 
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Figure 22.18 The portal vein system carries 
pancreatic secretions such as insulin and glucagon 
to the liver and then into the rest of the circulatory 
system. 
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22.5c Hormones Regulate Glycogen Synthesis and Degradation 


Storage and utilization of tissue glycogen, maintenance of blood glucose concentra- 
tion, and other aspects of carbohydrate metabolism are meticulously regulated by 
hormones, including insulin, glucagon, epinephrine, and the glucocorticoids. 


Insulin Is a Response to Increased Blood Glucose The primary hormone responsible for 
conversion of glucose to glycogen is insulin (see Figure 5.8). Insulin is secreted by the 
B-cells in the pancreas within the islets of Langerhans. Secretion of insulin is a 
response to increased glucose in the blood. When blood glucose levels rise (after a 
meal, for example), insulin is secreted from the pancreas into the pancreatic vein, 
which empties into the portal vein system (Figure 22.18), so insulin traverses the liver 
before it enters the systemic blood supply. Insulin acts to rapidly lower blood glucose 
concentration in several ways. Insulin stimulates glycogen synthesis and inhibits 
glycogen breakdown in liver and muscle. 


Insulin Triggers Glycogen Synthesis When Blood Glucose Rises The action of insulin 
when blood glucose rises is immediate and powerful. During periods between meals, 
typical human blood glucose levels are 70 to 90 mg/dL. Glucose levels normally rise to 
about 150 mg/dL within the first hour following a carbohydrate-rich meal (Figure 22.19) 
and then return to normal within 2 to 3 hours. (For diabetic subjects, whose insulin 
response is impaired, glucose levels rise after a meal to 250 mg/dL or even higher and 
remain high for much longer times.) 


Human Biochemistry 


Carbohydrate Utilization in Exercise 


Animals have a remarkable ability to shift gears metabolically during 
periods of strenuous exercise or activity. Metabolic adaptations allow 
the body to draw on different sources of energy (all of which produce 
ATP) for different types of activity. During periods of short-term, 
high-intensity exercise (such as a 100 m dash), most of the required 
energy is supplied directly by existing stores of ATP and creatine phos- 
phate (see figure, part a). Long-term, low-intensity exercise (such as a 
10 km run or a 42.2 km marathon) is fueled almost entirely by aero- 
bic metabolism. Between these extremes is a variety of activities (an 
800 m run, for example) that rely on anaerobic glycolysis—conversion 
of glucose to lactate in the muscles and utilization of the Cori cycle. 
For all these activities, breakdown of muscle glycogen provides 
much of the needed glucose. The rate of glycogen consumption 


(a) (b) 
100 100 


from phosphocreatine 


Light exercise 


depends on the intensity of the exercise (see figure, part b). By 
contrast, glucose derived from gluconeogenesis makes only small 
contributions to total glucose consumed during exercise. During 
prolonged mild exercise, gluconeogenesis accounts for only about 
8% of the total glucose consumed. During heavy exercise, this 
percentage becomes even lower. 

Choice of diet has a dramatic effect on glycogen recovery fol- 
lowing exhaustive exercise. A diet consisting mainly of protein 
and fat results in very little recovery of muscle glycogen, even after 
5 days (see figure, part c). On the other hand, a high-carbohydrate 
diet provides faster restoration of muscle glycogen. Even in this 
case, however, complete recovery of glycogen stores takes about 
2 days. 
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Carbohydrate utilization in exercise. (a) Contributions of the various energy sources to muscle activity during 
mild exercise. (b) Consumption of glycogen stores in fast-twitch muscles during light, moderate, and heavy 
exercise. (c) Rate of glycogen replenishment following exhaustive exercise. (a and c adapted from Rhodes, R., 
and Pflanzer, R. G., 1992. Human Physiology. Philadelphia: Saunders College Publishing; b adapted from 
Horton, E. S., and Terjung, R. L., 1988. Exercise, Nutrition and Energy Metabolism. New York: Macmillan.) 
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Insulin lowers blood glucose by triggering several cascades of reactions that result in 
glucose uptake and glycogen synthesis (see Figure 22.17a). An insulin-triggered protein 
kinase cascade increases glucose transport into muscle, liver, and adipose tissues by stimu- 
lating exocytotic processes that translocate GLUT4, a glucose transporter, from intracel- 
lular vesicles to the plasma membrane (see Figure 22.17a). Large amounts of glucose thus 
transported into the cell are converted to glucose-6-P, which can be directed to glycogen 
synthesis (by conversion to glucose-1-P). Also, glucose-6-P is the allosteric effector that 
activates the otherwise inactive, phosphorylated form of glycogen synthase. 

Binding of insulin to the plasma membrane, in either liver or muscle cells, triggers 
another protein kinase cascade (see Figure 22.17a) that results in phosphorylation and 
inactivation of glycogen synthase kinase 3 (GSK3). This kinase normally phosphorylates 
and inactivates glycogen synthase. Inhibition of GSK3 means that more of the cell’s 
glycogen synthase will remain in the unphosphorylated, active state (see Figure 22.17a). 
Insulin also stimulates PP1, which dephosphorylates (and activates) glycogen synthase. 

Several other physiological effects of insulin also serve to lower blood and tissue 
glucose levels (see Figure 22.17c). Insulin increases cellular utilization of glucose by 
inducing the synthesis of several important glycolytic enzymes, namely, glucokinase, 
phosphofructokinase, and pyruvate kinase. In addition, insulin acts to inhibit several 
enzymes of gluconeogenesis. These various actions enable the organism to respond 
quickly to increases in blood glucose levels. 


Glucagon and Epinephrine Stimulate Glycogen Breakdown Catabolism of tissue glycogen 
is triggered by the actions of the hormones epinephrine and glucagon (Figure 22.20). In 
response to decreased blood glucose, glucagon (a 29-residue peptide) is released from 
the a-cells in pancreatic islets of Langerhans. This peptide hormone travels through the 
blood to specific receptors on liver cell membranes. (Glucagon acts on liver and 
adipose tissue but not other tissues.) Similarly, signals from the central nervous system 
cause release of epinephrine—also known as adrenaline—from the adrenal glands into 
the bloodstream. Epinephrine acts on liver and muscles. When either hormone binds 
to its receptor on the outside surface of the cell membrane, a cascade is initiated that 
activates glycogen phosphorylase and inhibits glycogen synthase (Figure 22.20). The 
result of these actions is tightly coordinated stimulation of glycogen breakdown and 
inhibition of glycogen synthesis. 


The Phosphorylase Cascade Amplifies the Hormonal Signal Stimulation of glycogen 
breakdown involves consumption of molecules of ATP at three different steps in the 
hormone-sensitive adenylyl cyclase cascade (see Figure 15.18). Note that the cascade 
mechanism is a means of chemical amplification, because the binding of just a few 
molecules of epinephrine or glucagon results in the synthesis of many molecules of 
cyclic AMP, which, through the action of cAMP-dependent protein kinase, can acti- 
vate many more molecules of phosphorylase kinase and even more molecules of 
phosphorylase. For example, an extracellular level of 107! to 1078 M epinephrine 
prompts the formation of 1076 M cyclic AMP, and for each protein kinase activated 
by cyclic AMP, approximately 30 phosphorylase kinase molecules are activated; each 
of these in turn can activate some 800 molecules of phosphorylase. Each of these 
catalyzes the formation of many molecules of glucose-1-P. 


The Difference between Epinephrine and Glucagon Although both epinephrine and 
glucagon exert glycogenolytic effects, they do so for quite different reasons. Epineph- 
rine is secreted as a response to anger or fear, and may be viewed as an alarm or 
danger signal for the organism. Called the fight-or-flight hormone, it prepares the 
organism for mobilization of large amounts of energy. Among the many physiological 
changes elicited by epinephrine, one is the initiation of the enzyme cascade, as in 
Figure 15.18, which leads to rapid breakdown of glycogen, inhibition of glycogen 
synthesis, stimulation of glycolysis, and production of energy. The burst of energy 
produced is the result of a 2000-fold amplification of the rate of glycolysis. A fear or 
anger response must include generation of energy (in the form of glucose)—both 
immediately in localized sites (the muscles) and eventually throughout the organism 
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Figure 22.19 A glucose tolerance test involves 
ingestion of a glucose solution followed by 
measurements of blood glucose for about 3 hours. 
Normal subjects exhibit a rise in blood glucose to 
about 150 mg/dL, followed by a decline to normal 
values over a 3-hour period. In diabetic subjects, 
blood glucose rises to higher values and remains 
high for longer periods. 
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Figure 22.20 Glucagon and epinephrine each activate a cascade of reactions that stimulate glycogen 
breakdown and inhibit glycogen synthesis in liver and muscles, respectively. The effects of these 
hormones on other metabolic pathways depend on the tissue. In liver, glucagon inhibits glycolysis 
and stimulates gluconeogenesis, facilitating export of glucose into the bloodstream. In muscles, 
epinephrine stimulates glycolysis to provide energy for contraction. These effects all depend on 
protein phosphorylations by cAMP-dependent protein kinase. Note that the liver and heart isoforms 
of PFK-2/F-2,6-BPase respond oppositely to phosphorylation by PKA. The + and © symbols 
indicate activation and inhibition. 


(as supplied by the liver). Epinephrine must be able to activate glycogenolysis in both 
liver and muscles. 

Glucagon is involved in the long-term maintenance of steady-state levels of glucose 
in the blood. It performs this function by stimulating the liver to release glucose from 
glycogen stores into the bloodstream. To further elevate glucose levels, glucagon also 
stimulates liver gluconeogenesis by activating F-2,6-BPase activity (see Figure 22.10). It 
is important to note, however, that stabilization of blood glucose levels is managed 
almost entirely by the liver. Glucagon does not activate the phosphorylase cascade in 
muscle (since muscle membranes do not contain glucagon receptors). Muscle glycogen 
breakdown occurs only in response to epinephrine release, and muscle tissue does not 
participate in maintenance of steady-state glucose levels in the blood. 

Glucagon and epinephrine both trigger glycogen breakdown and inhibit glycogen 
synthesis (in liver and muscles, respectively), but their other effects on metabolic 
pathways are adapted exquisitely to the needs of the tissues involved. The liver must 
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O-GIcNAc Signaling and the Hexosamine Biosynthetic Pathway 


O-linked B-N-acetylglucosamine (O-GlcNAc) is a post-translational 
modification consisting of a single N-acetylglucosamine moiety 
attached in O-8-glycosidic linkage to Ser and Thr residues. 
Modification with a single O-GlcNAc is a signaling device similar 
to phosphorylation. Emerging research reveals that O-GlcNAc 
signaling and its crosstalk with phosphorylation modulate a wide 
array of cellular events. Such signaling and crosstalk are altered in 
diseases such as diabetes and cancer. 

Uridine diphosphate N-acetylglucosamine (UDP-GIcNAc), the 
high-energy donor substrate for O-GleNAcylation, is synthesized in 
the hexosamine biosynthetic pathway (HBP—-see figure), a nexus of 
glucose, nitrogen, fatty acid, and nucleic acid metabolic pathways 
which consumes approximately 2% to 3% of total cellular glucose. 
The amide N of glutamine is transferred to fructose-6-P to form 
glucosamine-6-P by glutamine:fructose-6-P aminotransferase (GFAT), 
the rate-limiting step in the HBP. O-GlcNAc-dependent glycosylation 
(O-GlcNAcylation) of proteins is catalyzed by O-GlcNAc transferase 
(OGT) and O-GlcNAc groups are removed from protein targets by 
O-GlcNAcase (OGA). 

O-GlcNAcylation is intertwined with cellular metabolism; sugars 
can be attached or removed dynamically in response to changes in 
the cellular environment triggered by stress, hormones, or nutrients. 
In some cases, the same Ser or Thr residues may be either phos- 
phorylated or modified by O-GlcNAc, provoking different cellular 
responses. Moreover, O-GlcNAcylated and phosphorylated residues 
can be in proximity to each other, and either can sterically impair 
the attachment of the other. 

O-GlcNAcylation plays a direct role in insulin signaling and cancer. 
Cellular stimulation by insulin provokes OGT association with the 


insulin receptor, followed by OGT phosphorylation, and an increase 
in OGT activity, resulting in O-GleNAcylation of several components 
of the insulin signaling pathway. In cancer cells, expression of 
the transcription factor c-Myc is typically enhanced. c-Myc is O- 
GlcNAcylated at a Thr residue that is also a phosphorylation site, and 
these modifications have opposing effects on the activity of c-Myc. 
Similar interaction between O-GlcNAcylation and phosphorylation, 
with consequent effects on activity, are observed for several other 
transcription factors, oncogenes, and tumor suppressors that play 
essential roles in tumorigenesis and cell proliferation. For example, 
in the fasting state, glucagon stimulates gluconeogenesis by activating 
the transcription factor called CREB  (cyclic-ATP-responsive 
element-binding protein). The pathway between glucagon and 
CREB begins with cAMP generation inside the cell in response 
to glucagon binding to its membrane receptor. cAMP activates 
protein kinase A (PKA), which phosphorylates CREB, enhancing 
expression of pyruvate carboxylase, PEP carboxykinase, and 
glucose-6-phosphatase, thus stimulating gluconeogenesis. cAMP 
also triggers O-GlcNAcylation of CRTC, which binds to CREB, 
further stimulating gluconeogenesis. Phosphorylated CREB also 
promotes expression of peroxisome proliferator-activated receptor-y 
co-activator la (PGCla@), which provides prolonged stimulation of 
gluconeogenic gene transcription. 
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Figure 22.21 The effects of cortisol on carbohydrate 
and protein metabolism in the liver. 
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export glucose to the bloodstream to support other tissues. Thus, in the liver, PFK-2 is 
phosphorylated and inhibited by protein kinase A (PKA), lowering [fructose-2,6-bispho- 
sphate], inhibiting glycolysis, and activating gluconeogenesis (see Figure 22.20). In mus- 
cles, the glucose provided by glycogen breakdown is used immediately and locally to 
provide ATP energy for contraction. Therefore, glycolysis should be activated in concert 
with glycogen breakdown in muscles. Activation of glycolysis is accomplished in 
different ways, depending on the muscle. Heart muscle PFK-2 is activated upon 
phosphorylation at Ser*® and Ser by PKA in response to epinephrine, thus activating 
glycolysis (see Figure 22.20). Unlike heart PFK-2, skeletal muscle PFK-2 is not a 
substrate for PKA. Instead, skeletal muscle PFK-1 is phosphorylated and activated by 
PKA (see Figure 22.20). 


Cortisol and Glucocorticoid Effects on Glycogen Metabolism Glucocorticoids are a 
class of steroid hormones that exert distinct effects on liver, skeletal muscle, and 
adipose tissue. The effects of cortisol, a typical glucocorticoid, are best described as 
catabolic because cortisol promotes protein breakdown and decreases protein 
synthesis in skeletal muscle. In the liver, however, it stimulates gluconeogenesis and 
increases glycogen synthesis. Cortisol-induced gluconeogenesis results primarily 
from increased conversion of amino acids into glucose (Figure 22.21). Specific effects 
of cortisol in the liver include increased expression of several genes encoding en- 
zymes of the gluconeogenic pathway, activation of enzymes involved in amino acid 
metabolism, and stimulation of the urea cycle, which disposes of nitrogen liberated 
during amino acid catabolism. 


22.6 | How Does Glucose Provide Electrons for Biosynthesis? 


Cells require a constant supply of NADPH for reductive reactions vital to biosynthetic 
purposes. Much of this requirement is met by a glucose-based metabolic sequence 
variously called the pentose phosphate pathway, the hexose monophosphate shunt, or the 
phosphogluconate pathway. In addition to providing NADPH for biosynthetic pro- 
cesses, this pathway produces ribose-5-phosphate, which is essential for nucleic acid 
synthesis. Several metabolites of the pentose phosphate pathway can also be shuttled 
into glycolysis. 


22.6a The Pentose Phosphate Pathway Operates 
Mainly in Liver and Adipose Cells 
The pentose phosphate pathway begins with glucose-6-phosphate, a six-carbon sugar, 


and produces three-, four-, five-, six-, and seven-carbon sugars (Figure 22.22). As will 
be seen, the pathway begins with, two successive oxidations that lead to the reduction 
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Figure 22.22 The pentose phosphate pathway. The numerals in the blue circles indicate the steps discussed 


in the text. The glucose-6-phosphate dehydrogenase reaction is inhibited by the tumor suppressor p53. 
Steps 1-3 involve two oxidations. Steps 4-9 interconvert 3-, 4-, 5-, 6-, and 7-carbon sugars. 
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Figure 22.23 The glucose-6-phosphate 
dehydrogenase reaction is the first step in the 
pentose phosphate pathway and is the committed 
step in the pentose phosphate pathway. 


Figure 22.24 The gluconolactonase reaction is the 
second pentose phosphate pathway reaction (step 2 
in Figure 22.22). 


of NADP* to NADPH and the release of CO). Five subsequent nonoxidative steps 
produce a variety of carbohydrates, some of which may enter the glycolytic pathway. 
The enzymes of the pentose phosphate pathway are particularly abundant in the 
cytoplasm of liver and adipose cells. These enzymes are largely absent in muscle, 
where glucose-6-phosphate is utilized primarily for energy production via glycolysis 
and the TCA cycle. These pentose phosphate pathway enzymes are located in the 
cytosol, which is the site of fatty acid synthesis, a pathway heavily dependent on 
NADPH for reductive reactions. 


22.6b The Pentose Phosphate Pathway Begins 
with Two Oxidative Steps 


The aldehyde group of glucose-6-P is oxidized to a carboxyl group and then lost as 
CO, in the first two steps of the pentose phosphate pathway. 


Step 1. Glucose-6-Phosphate Dehydrogenase The pentose phosphate pathway begins 
with the oxidation of glucose-6-phosphate. The products of the reaction are a cyclic 
ester (the lactone of phosphogluconic acid) and NADPH (Figure 22.23). Glucose-6- 
phosphate dehydrogenase (G6PDH), which catalyzes this reaction, is highly specific for 
NADPH. As the first step of a major pathway, the reaction is irreversible and highly 
regulated. Glucose-6-phosphate dehydrogenase is strongly inhibited by the product 
coenzyme, NADPH, and also by fatty acid esters of coenzyme A (which are intermedi- 
ates of fatty acid biosynthesis). Inhibition due to NADPH depends upon the cytosolic 
NADP*/NADPH ratio, which in the liver is about 0.015 (compared to about 725 for the 
NAD*/NADH ratio in the cytosol). G6PDH is the rate-limiting step of the pentose 
phosphate pathway. G6PDH is directly inhibited by the tumor suppressor p53. 


Step 2. Gluconolactonase The gluconolactone produced in step 1 is hydrolytically 
unstable and readily undergoes a spontaneous ring-opening hydrolysis, although an 
enzyme, gluconolactonase, accelerates this reaction (Figure 22.24). The linear 
product, the sugar acid 6-phospho-pD-gluconate, is further oxidized in step 3. 
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Step 3. 6-Phosphogluconate Dehydrogenase The oxidative decarboxylation of 6-phos- 
phogluconate by 6-phosphogluconate dehydrogenase yields D-ribulose-5-phosphate 
and another equivalent of NADPH. There are two distinct steps in this reaction 
(Figure 22.25): The initial NADP*t-dependent dehydrogenation yields a B-keto acid, 
3-keto-6-phosphogluconate, which is very susceptible to decarboxylation (the sec- 
ond step). The resulting product, D-ribulose-5-P, is the substrate for the nonoxidative 
reactions composing the rest of this pathway. 


22.6c There Are Four Nonoxidative Reactions 
in the Pentose Phosphate Pathway 


This portion of the pathway begins with an isomerization and an epimerization, and 
it leads to the formation of either D-ribose-5-phosphate or D-xylulose-5-phosphate. 
These intermediates can then be converted into glycolytic intermediates or directed 
to biosynthetic processes. 


Step 4. Phosphopentose Isomerase This enzyme interconverts ribulose-5-P and 
ribose-5-P via an enediol intermediate (Figure 22.26). The reaction (and mechanism) 
is quite similar to the phosphoglucoisomerase reaction of glycolysis, which intercon- 
verts glucose-6-P and fructose-6-P. The ribose-5-P produced in this reaction is uti- 
lized in the biosynthesis of coenzymes (including NADH, NADPH, FAD, and B,,), 
nucleotides, and nucleic acids (DNA and RNA). The net reaction for the first four 
steps of the pentose phosphate pathway is 


Glucose-6-P + 2 NADP* + H,0 —> ribose-5-P + 2 NADPH + 2 H* + CO; 


Step 5. Phosphopentose Epimerase This reaction converts ribulose-5-P to another ke- 
tose, namely, xylulose-5-P. This reaction also proceeds by an enediol intermediate but 
involves an inversion at C-3 (Figure 22.27). In the reaction, an acidic proton located 
a- to a carbonyl carbon is removed to generate the enediolate, but the proton is added 
back to the same carbon from the opposite side. Note the distinction in nomenclature 
here. Interchange of groups on a single carbon is an epimerization, and interchange of 
groups between carbons is an isomerization. 
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Figure 22.25 The 6-phosphogluconate 


dehydrogenase reaction is the third step in the 


779 


pentose phosphate pathway (Step 3 in Figure 22.22). 


Figure 22.26 The phosphopentose isomerase 
reaction is the fourth reaction of the pentose 
phosphate pathway. It involves an enediol 


intermediate. 
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HCOH 
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Enediolate Xylulose-5-P 


Figure 22.27 The phosphopentose epimerase reaction is the fifth step of the pentose phosphate 
pathway. It interconverts ribulose-5-P and xylulose-5-phosphate. The mechanism involves an enediol 
intermediate and occurs with inversion at C-3. 


Human Biochemistry 


Aldose Reductase and Diabetic Cataract Formation 


The complications of diabetes include a high propensity for cataract 
formation in later life, both in type 1 and type 2 diabetics. Hyper- 
glycemia is the suspected cause, but by what mechanism? Several 
lines of evidence point to the polyol pathway, in which glucose and 
other simple sugars are reduced in NADPH-dependent reactions. 
Glucose, for example, is reduced by aldose reductase to sorbitol (see 
accompanying figure), which accumulates in lens fiber cells, increas- 
ing the intracellular osmotic pressure and eventually rupturing the 


cells. The involvement of aldose reductase in this process is sup- 
ported by the fact that animals that have high levels of this enzyme 
in their lenses (such as rats and dogs) are prone to develop diabetic 
cataracts, whereas mice that have low levels of lens aldose reduc- 
tase activity are not. Moreover, aldose reductase inhibitors such as 
tolrestat and epalrestat suppress cataract formation. These drugs or 
derivatives from them may represent an effective preventive therapy 
against cataract formation in people with diabetes. 
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Cataract formation is a common complication of type 1 and type 2 diabetes. (a) The reduction of glucose to sorbitol. (b) Tolrestat and epalrestat are aldose 
reductase inhibitors that suppress cataract formation. 


To this point, the pathway has generated a pool of pentose phosphates. The A G°’ 
for each of the last two reactions is small, and the three pentose-5-phosphates coexist 
at equilibrium. The pathway has also produced two molecules of NADPH for each 
glucose-6-P converted to pentose-5-phosphate. The next three steps rearrange the five- 
carbon skeletons of the pentoses to produce three-, four-, six-, and seven-carbon units, 
which can be used for various metabolic purposes. Why should the cell do this? Very 
often, the cellular need for NADPH is considerably greater than the need for ribose- 
5-phosphate. The next three steps thus return some of the five-carbon units to glycer- 
aldehyde-3-phosphate and fructose-6-phosphate, which can enter the glycolytic 
pathway. The advantage of this is that the cell has met its needs for NADPH and 
ribose-5-phosphate in a single pathway, yet at the same time it can return the excess 
carbon metabolites to glycolysis. 
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Figure 22.28 The transketolase reaction of step 6 in the pentose phosphate pathway. 
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Figure 22.29 The transketolase reaction of step 8 in the pentose phosphate pathway. 


Steps 6 and 8. Transketolase The transketolase enzyme acts at both steps 6 and 8 of the 
pentose phosphate pathway. In both cases, the enzyme catalyzes the transfer of 
two-carbon units. In these reactions (and also in step 7, the transaldolase reaction, 
which transfers three-carbon units), the donor molecule is a ketose and the recipient 
is an aldose. In step 6, xylulose-5-phosphate transfers a two-carbon unit to ribose- 
5-phosphate to form glyceraldehyde-3-phosphate and sedoheptulose-7-phosphate 
(Figure 22.28). Step 8 involves a two-carbon transfer from xylulose-5-phosphate to 
erythrose-4-phosphate to produce another glyceraldehyde-3-phosphate and a fruc- 
tose-6-phosphate (Figure 22.29). Three of these products enter directly into the 
glycolytic pathway. (The sedoheptulose-7-phosphate is taken care of in step 7, as we 
shall see.) Transketolase is a thiamine pyrophosphate—dependent enzyme, and the 
mechanism (Figure 22.30) involves abstraction of the acidic thiazole proton of TPP, 
attack by the resulting carbanion at the carbonyl carbon of the ketose phosphate 
substrate, expulsion of the glyceraldehyde-3-phosphate product, and transfer of the 
two-carbon unit. Transketolase can process a variety of 2-keto sugar phosphates in a 
similar manner. It is specific for ketose substrates with the configuration shown but 
can accept a variety of aldose phosphate substrates. 


Step 7. Transaldolase The transaldolase functions primarily to make a useful glyco- 
lytic substrate from the sedoheptulose-7-phosphate produced by the first transketo- 
lase reaction. This reaction (Figure 22.31) is quite similar to the aldolase reaction of 
glycolysis, involving formation of a Schiff base intermediate between the sedoheptu- 
lose-7-phosphate and an active-site lysine residue (Figure 22.32). Elimination of the 
erythrose-4-phosphate product leaves an enamine of dihydroxyacetone, which re- 
mains stable at the active site (without imine hydrolysis) until the other substrate 
comes into position. Attack of the enamine carbanion at the carbonyl carbon of 
glyceraldehyde-3-phosphate is followed by hydrolysis of the Schiff base (imine) to 
yield the product fructose-6-phosphate. 
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Figure 22.30 The mechanism of the TPP-dependent 
transketolase reaction. Ironically, the group 
transferred in the transketolase reaction 

might best be described as an aldol, whereas the 
transferred group in the transaldolase reaction is 
actually a ketol. Despite the irony, these names 


persist for historical reasons. Sedoheptulose-7-P 
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Figure 22.31 The transaldolase reaction. 
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22.6d Utilization of Glucose-6-P Depends on the Cell’s 
Need for ATP, NADPH, and Ribose-5-P 


It is clear that glucose-6-phosphate can be used as a substrate either for glycolysis or 
for the pentose phosphate pathway. The cell makes this choice on the basis of its rela- 
tive needs for biosynthesis and for energy from metabolism. ATP can be produced in 
abundance if glucose-6-phosphate is channeled into glycolysis. On the other hand, if 
NADPH or ribose-5-phosphate is needed, glucose-6-phosphate can be directed to 
the pentose phosphate pathway. The molecular basis for this regulatory decision 
depends on the enzymes that metabolize glucose-6-phosphate in glycolysis and the 
pentose phosphate pathway. In glycolysis, phosphoglucoisomerase converts glu- 
cose-6-phosphate to fructose-6-phosphate, which is utilized by phosphofructo- 
kinase (a highly regulated enzyme) to produce fructose-1,6-bisphosphate. In the 
pentose phosphate pathway, glucose-6-phosphate dehydrogenase (also highly regu- 
lated) produces 6-phosphogluconolactone from glucose-6-phosphate. Thus, the fate 
of glucose-6-phosphate is determined to a large extent by the relative activities of 
phosphofructokinase and glucose-6-P dehydrogenase. Recall from Section 18.3c 
that PFK is inhibited when the ATP/AMP ratio increases and that it is inhibited by 
citrate but activated by fructose-2,6-bisphosphate. Thus, when the energy charge is 
high, glycolytic flux decreases. Glucose-6-P dehydrogenase, on the other hand, is 
inhibited by high levels of NADPH and also by the acyl-CoA intermediates of fatty 
acid biosynthesis. Both of these are indicators that biosynthetic demands have been 
satisfied. If that is the case, glucose-6-phosphate dehydrogenase and the pentose 


Figure 22.32 The transaldolase mechanism involves 
attack on the substrate by an active-site lysine. 
Departure of erythrose-4-P leaves the reactive 
enamine, which attacks the aldehyde carbon of 
glyceraldehyde-3-P. Schiff base hydrolysis yields the 
second product, fructose-6-P. 
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phosphate pathway are inhibited. If NADPH levels drop, the pentose phosphate 
pathway turns on and NADPH and ribose-S-phosphate are made for biosynthetic 
purposes. 

Even when the latter choice has been made, however, the cell must still be responsive 
to the relative needs for ribose-5-phosphate and NADPH (as well as ATP). Depending 
on these relative needs, the reactions of glycolysis and the pentose phosphate pathway 
can be combined in novel ways to emphasize the synthesis of needed metabolites. There 
are four principal possibilities. 


1. Both Ribose-5-P and NADPH Are Needed by the Cell In this case, the first four reactions 
of the pentose phosphate pathway predominate (Figure 22.33). NADPH is produced 
by the oxidative reactions of the pathway, and ribose-5-P is the principal product of 
carbon metabolism. As stated earlier, the net reaction for these processes is 


Glucose-6-P + 2 NADP* + HO —> ribose-5-P + CO, + 2 NADPH + 2 H* 


2. More Ribose-5-P Than NADPH Is Needed by the Cell Synthesis of ribose-5-P can be 
accomplished without production of NADPH if the oxidative steps of the pentose phos- 
phate pathway are bypassed. The key to this route is the withdrawal of fructose-6-P and 
glyceraldehyde-3-P, but not glucose-6-P, from glycolysis. The action of transketolase and 
transaldolase on fructose-6-P and glyceraldehyde-3-P produces three molecules of ribose- 
5-P from two molecules of fructose-6-P and one of glyceraldehyde-3-P. In this route, as in 
case 1, no carbon metabolites are returned to glycolysis. The net reaction for this route is 


5 Glucose-6-P + ATP —> 6 ribose-5-P + ADP + H* 


3. More NADPH Than Ribose-5-P Is Needed by the Cell Large amounts of NADPH can be 
supplied for biosynthesis without concomitant production of ribose-5-P if ribose-5-P 
produced in the pentose phosphate pathway is recycled to produce glycolytic intermedi- 
ates. This alternative involves a complex interplay between the transketolase and trans- 
aldolase reactions to convert ribulose-5-P to fructose-6-P and glyceraldehyde-3-P, which 
can be recycled to glucose-6-P via gluconeogenesis. The net reaction for this process is 


6 Glucose-6-P + 12 NADP* + 6 HO —> 
6 Ribulose-5-P + 6 CO, + 12 NADPH + 12 Ht 
6 Ribulose-5-P —> 5 glucose-6-P + P; 


Net: Glucose-6-P + 12 NADP* + 6 H:O —> 6 CO, + 12 NADPH + 12 H+ + P; 


Note that in this scheme, the six hexose sugars have been converted to six pentose 
sugars with release of six molecules of CO,, and the six pentoses are reconverted to 
five glucose molecules. 


4. Both NADPH and ATP Are Needed by the Cell, but Ribose-5-P Is Not Under some condi- 
tions, both NADPH and ATP must be provided in the cell. This can be accomplished 
in a series of reactions similar to case 3 if the fructose-6-P and glyceraldehyde-3-P 
produced in this way proceed through glycolysis to produce ATP and pyruvate, which 
itself can yield even more ATP by continuing on to the TCA cycle. The net reaction for 
this alternative is 


3 Glucose-6-P + 5 NAD+ + 6 NADP* + 8 ADP + 5 P; —> 
5 Pyruvate + 3 CO, + 5 NADH + 6 NADPH + 8 ATP + 2 H:O + 8 H+ 


Figure 22.33 When biosynthetic demands dictate, the first four reactions of the pentose phosphate 
pathway predominate and the principal products are ribose-5-P and NADPH. 
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Integrating the Warburg Effect—ATP Consumption 


Promotes Cancer Metabolism 


How and why do cells fine-tune the production of NADPH, ribose- 
5-phosphate, and ATP, as described in Section 22.6? The Warburg 
effect, described in Chapter 18, is a good example of these choices. 
It is exhibited by cancer cells, which metabolize glucose differ- 
ently than normal, differentiated cells. Normal cells generally 
have low rates of glycolysis, followed by oxidation of pyruvate in 
the mitochondria, leading to efficient generation of ATP. Rapidly 
proliferating cancer cells, on the other hand, use glycolytic inter- 
mediates for the synthesis of macromolecules and must balance 
their ATP requirements and biosynthetic needs. Moreover, cancer 
cells must fine-tune their metabolism to choose between synthesiz- 
ing NADPH, ribose-5-phosphate, or both. Tumor cells also face 
another problem: If too much ATP is produced by aerobic glycoly- 
sis, feedback inhibition of phosphofructokinase may limit glucose 
utilization altogether. 

How do tumor cells manage these needs? One piece of the puzzle 
has been provided by Xiaodong Wang and coworkers, who have 
shown that ENTPDS5, a UDPase in the endoplasmic reticulum, is 
associated with ATP consumption in tumor cells. UMP made in this 
reaction is transported into the cytosol, where it is converted back to 
UDP, then UTP, and then to UDP-glucose. These reactions are cou- 
pled to adenylate kinase (see figure) with net production of AMP. 
Consumption of ATP and production of AMP stimulates phospho- 
fructokinase, so that glycolysis proceeds to make glyceraldehyde- 
3-phosphate, which can, together with fructose-6-phosphate, yield 
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ribose-5-phosphate in the pentose phosphate pathway. Cancer cells 
use this strategy, together with other regulatory controls (green and 
red in the figure) to fine-tune the production of NADPH, ribose- 
5-phosphate, and ATP. 

Can cancer cells afford to sacrifice so much ATP in this way? 
Compare the possible yields from glucose with the needs of a prolif- 
erating cell. One glucose can generate up to 34 ATPs, or 2 NADPH 
plus about 28 ATP (if shunted through the pentose phosphate path- 
way), or six carbons for biosynthesis. By comparison, in fatty acid 
synthesis, one palmitate requires 16 carbons, 7 molecules of ATP, 
and 28 electrons from 14 NADPH. Thus, one glucose provides 
nearly five times the amount of ATP required to make a palmitate, 
but only one-seventh of the NADPH required. Clearly, proliferat- 
ing cells must devote much of their available glucose to NADPH 
production and biosynthesis, yet still provide sufficient ATP for 
cellular energy needs. 
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A variety of cellular signals enable proliferating cells to modulate the production of NADPH, 
ribose-5-phosphate, and ATP. ENTPDS expression is enhanced in tumors and promotes ATP 
consumption, stimulating aerobic glycolysis. The protein kinases P13K and AKT stimulate 
glucose uptake, as do hexokinase and phosphofructokinase. Phosphorylation by AMP kinase 
(AMPK) inhibits PFK, whereas p53 blocks synthesis of NADPH by inhibition of glucose-6- 
phosphate dehydrogenase. 
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Note that, except for the three molecules of CO,, all the other carbon from glucose- 
6-P is recovered in pyruvate. 

The pentose phosphate pathway plays a pivotal role in helping cancer cells meet their 
needs for ATP, NADPH, ribose-5-phosphate, and fatty acids. Activation of oncogenic 
proteins and inactivation of tumor suppressors provide a plethora of mechanisms that 
modulate and regulate the reactions of this versatile pathway. 


22.6e Xylulose-5-Phosphate Is a Metabolic Regulator 


In addition to its role in the pentose phosphate pathway, xylulose-5-phosphate 
serves as a signaling molecule. When blood glucose concentration rises (for example, 
following a carbohydrate-rich meal), glycolysis and the pentose phosphate pathway 
are activated in the liver, and xylulose-5-phosphate produced in the latter pathway 
stimulates protein phosphatase 2A (PP2A), which dephosphorylates the bifunc- 
tional enzyme PFK-2/F-2,6-BPase (Figure 22.34; see also Figure 22.10). Increases 
in [fructose-2,6-bisphosphate] stimulate glycolysis and inhibit gluconeogenesis. At 
the same time, PP2A dephosphorylates carbohydrate-responsive element-binding 
protein (ChREBP), a transcription factor which then activates expression of liver 
genes for lipid synthesis. These effects are a powerful combination. Increased gly- 
colysis produces substantial amounts of acetyl-CoA, the principal substrate for 
lipid synthesis. The pentose phosphate pathway produces NADPH, the source of 
electrons for lipid biosynthesis. Elevated expression of the appropriate genes sets 
the stage for lipid biosynthesis in the liver, an important consequence of ingestion 
of carbohydrates. 
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Figure 22.34 Xylulose-5-phosphate activation of PP2A affects multiple metabolic pathways. 
Activation of PP2A triggers dephosphorylation of the bifunctional PFK-2/F2,6-BPase, which raises 
fructose-2,6-BP levels, in turn activating glycolysis and inhibiting gluconeogenesis. Simultaneously, 
ChREBP is dephosphorylated, leading to elevated expression of genes encoding enzymes for 
lipogenesis. These effects combine to trigger lipid biosynthesis in response to a carbohydrate-rich 
meal. Plus signs indicate activation. 
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Problems 


Foundational Biochemistry Things You Should Know after Reading Chapter 22. 


Gluconeogenesis is the generation (genesis) of new (neo) glucose. 
Substrates for gluconeogenesis include pyruvate, lactate, and amino 
acids. 


Gluconeogenesis is not the reverse of glycolysis. It involves four new 
reactions—pyruvate carboxylase, PEP carboxykinase, fructose-1,6- 
bisphosphatase, and glucose-6-phosphatase. The resulting cellular 
free energy change for gluconeogenesis is strongly negative. 


Regulation of gluconeogenesis involves allosteric activation of pyru- 
vate carboxylase by acetyl-CoA, allosteric inhibition of fructose-1,6- 
bisphosphatase by F-2,6-bisP and AMP, and substrate level control 
of glucose-6-phosphatase by glucose-6-phosphate. 


Dietary starch breakdown is unregulated and provides metabolic 
energy. 


Tissue glycogen represents an important reservoir of potential energy 
and metabolism of tissue glycogen is carefully controlled and regulated. 


Glucose units are activated for transfer into complex polysaccharides 
by formation of sugar nucleotides. 


UDP-Glucose synthesis is driven by pyrophosphate hydrolysis. 


Glycogen metabolism is highly regulated. Glycogen synthase is regu- 
lated by covalent modifications and hormones regulate glycogen 
synthesis and degradation. 


Glucagon and epinephrine stimulate glycogen breakdown. Epineph- 
rine acts quickly, initiating a cascade of reactions that lead to rapid 
breakdown of glycogen, inhibition of glycogen synthesis, stimulation 
of glycolysis, and production of energy. 


Glucagon is involved in long-term maintenance of steady-state levels 
of glucose in the blood. 


The pentose phosphate pathway, operating mainly in liver and adi- 
pose tissue, provides electrons for biosynthesis. 


The pentose phosphate pathway begins with two oxidative reactions 
that convert glucose-6-phosphate to ribulose-5 phosphate, followed 
by four non-oxidative reactions. 


Utilization of glucose-6-phosphate depends on the cell’s need for 
ATP, NADPH, and ribose-5-P. 


Xylulose-5-phosphate is a metabolic signal that stimulates protein 
phosphatase 2A. 


Problems 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


Assessing the Overall Equation for Gluconeogenesis Consider the 
balanced equation for gluconeogenesis in Section 22.1. Account 
for each of the components of this equation (that is, explain where 
each component comes from) and the indicated stoichiometry. 


. NADH in Gluconeogenesis While other anabolic pathways use 


NADPH as the reducing agent, gluconeogenesis uses NADH. Provide 
three possible reasons for why this is advantageous for the cell. 


. Assessing the Energetics of Gluconeogenesis (Integrates with Chapters 3 


and 18.) Calculate AG°’ and AG for gluconeogenesis in the erythrocyte, 
using information from Tables 3.2, 18.1, 18.2, and Section 22.1 (assume 
NAD*/NADH = 20, [GTP] = [ATP], and [GDP] = [ADP]). Make sure 
that your values match those in Section 22.1. 


. Decarboxylation Reactions Explain why decarboxylation reactions 


like the conversion of oxaloacetate into PEP are highly favorable. 


. Understanding the Regulation of the Fructose-1,6-Bisphosphatase 


Reaction Use the data of Figure 22.9 to calculate the percent inhibition 
of fructose-1,6-bisphosphatase by 25 mM fructose-2,6-bisphosphate 
when fructose-1,6-bisphosphate is (a) 25 mM and (b) 100 mM. 


. Regulation of Fructose-2,6-Bisphosphate Glucagon is a hormone 


released when blood sugar levels are low. If this hormone had an effect 
on the activity of phosphofructokinase-2 (PFK-2) in liver cells, would 
you predict it to activate or inhibit PFK-2? Explain your answer. 


. Understanding the Energetics of the Glycogen Synthase Reaction 


(Integrates with Chapter 3.) Suggest an explanation for the exergonic 
nature of the glycogen synthase reaction (AG°’ = —13.3 kJ/mol). 
Consult Chapter 3 to review the energetics of high-energy phosphate 
compounds if necessary. 


. Glycogen in Muscles Skeletal muscles contain high levels of glycogen. 


However, this muscle glycogen cannot be used to raise blood sugar 
during times of fasting. Explain why this is the case. 


. Determining the Rate of Energy Consumption by Muscles During 


Exercise Using the values in Table 23.1 for body glycogen content 
and the data in part b of the illustration for A Deeper Look 
(“Carbohydrate Utilization in Exercise”), calculate the rate of energy 


10. 


14. 


15. 


16. 


consumption by muscles in heavy exercise (in J/sec). Use the data for 
fast-twitch muscle. 


Assessing the Action of Sodium Borohydride on the Pentose Phosphate 
Pathway Which reactions of the pentose phosphate pathway would 
be inhibited by NaBH,? Why? 


. Reversible Reactions in Gluconeogenesis For the reversible reactions 


of glycolysis/gluconeogenesis, what determines the direction of the 
reaction? 


. Determining the Extent of Branching in Glycogen (Integrates with 


Chapter 7.) Imagine a glycogen molecule with 8000 glucose residues. 
If branches occur every eight residues, how many reducing ends does 
the molecule have? If branches occur every 12 residues, how many 
reducing ends does it have? How many nonreducing ends does it have 
in each of these cases? 


. Describing the Regulation of Gluconeogenesis and Glycogen 


Metabolism Explain the effects of each of the following on the 

rates of gluconeogenesis and glycogen metabolism: 

a. Increasing the concentration of tissue fructose-1,6-bisphosphate. 
b. Increasing the concentration of blood glucose. 

c. Increasing the concentration of blood insulin. 

d. Increasing the amount of blood glucagon. 

e. Decreasing levels of tissue ATP. 

f. Increasing the concentration of tissue AMP. 

g. Decreasing the concentration of fructose-6-phosphate. 

Other Carbohydrates in Glycogen Metabolism (Integrates with 
Chapter 18.) Can D-mannose, D-fructose, or D-galactose be stored as 


glycogen or serve as sources of NADPH through the pentose phos- 
phate pathway? 


Fatty Acids and Gluconeogenesis Although fatty acids cannot serve 
as precursors for gluconeogenesis, could triacylglycerides play a role 
in gluconeogenesis? Explain. 


Assessing the Energetics of the Glycogen Phosphorylase Reaction (Inte- 

grates with Chapters 3 and 15.) The free energy change of the glycogen 
phosphorylase reaction is AG” = +3.1 kJ/mol. If [P] = 1 mM, what is 
the concentration of glucose-1-P when this reaction is at equilibrium? 
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17. 


(a) 


Understanding Enzyme Mechanisms Related to Pyruvate Carboxylase 
Based on the mechanism for pyruvate carboxylase (Figure 22.3), write 
reasonable mechanisms for the reactions that follow: 


(0) 


Chapter 22 Gluconeogenesis, Glycogen Metabolism, and the Pentose Phosphate Pathway 


CH3 o 


ATP + HCO,~ + Nc=CH—C—SCoA —> -O0OC—CHə—C=CH—C—SCoA + ADP + P; 


(b) 


7A 
H;C 


B-Methylcrotonyl-CoA 


B-Methylglutaconyl-CoA 


CH, CH; O CH; CH, O 
A + HCO," + owe ee a N ie 
HC SCoA HC SCoA + ADP + P; 


COO- 
Geranyl-CoA y-Carboxygeranyl-CoA 
(c) 
i i 
ATP + HCO,~ + HN —C—NH, —> ~OOC—N—C—NHp + ADP + P; 
H 
Urea N-Carboxyurea 
(d) 
Transcarboxylase 
TOOC O (0) O (0) 
| H | H 
H;C— C—C—SCoA + H3C—C—COO7 == H;C—C—C—SCoA + ~OOC—C—C—COO- 
H H H 
Methylmalonyl-CoA Pyruvate Propionyl-CoA Oxaloacetate 
18. Understanding the Mechanisms of Reactions Related to Transketolase The 
mechanistic chemistry of the acetolactate synthase and phosphoketolase 
reactions (shown here) is similar to that of the transketolase reaction 
(Figure 22.30). Write suitable mechanisms for these reactions. 
(0) (0) OH 
| Acetolactate 
CH3;— C— COO- + CH3—C—COO- > CH3;—C—C—COO™ + CO», 
synthase 
CH; 
HC — OH 
C=O HC—O 
HOCH O HCOH 
,___Phosphoketolase | 
HCOH + HOPO,;?” = = > HsC—C—OPO,?- + HCOH 
2- 
HCOH HO CH OPO; 
CHOPO;°- 
Fructose-6-P Acetyl-P Erythrose-4-P 


19. 


20. 


Malate/Aspartate Shuttle and Gluconeogenesis (Integrates with 
Chapter 20.) What is the contribution of the malate/aspartate shuttle 
to the gluconeogenesis pathway? (See Figure 20.31 and Figure 22.4 in 
Section 22.1e.) 


Understanding the Mechanisms of Action of a Popular Diabetes 
Medication Metaglip is a prescribed preparation (from Bristol- 
Myers Squibb) for treatment of type 2 diabetes. It consists of 
metformin (see Human Biochemistry, box “Metformin—A Diabetes 
Drug with Multiple Actions” in Section 22.1.) together with glipizide. 
The actions of metformin and glipizide are said to be complemen- 
tary. Suggest a mechanism for the action of glipizide. 


21. 


22. 


Assessing the Mechanism of Action of Aldose Reductase Inhibitors 
Study the structures of tolrestat and epalrestat in the Human 
Biochemistry box “Aldose Reductase and Diabetic Cataract Forma- 
tion” in Section 22.6, and suggest a mechanism of action for these 
inhibitors of aldose reductase. 


Understanding a Version of the Pentose Phosphate Pathway That 
Produces Mainly Ribose-5-P Based on the discussion in Section 22.6, 
draw a diagram to show how several steps in the pentose phosphate 
pathway can be bypassed to produce large amounts of ribose- 
5-phosphate. Begin your diagram with fructose-6-phosphate. 
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23. Assessing the Labeling Patterns in Versions of the Pentose Phosphate 
Pathway Consider the diagram you constructed in Problem 22. 
Which carbon atoms in ribose-5-phosphate are derived from carbon 
atoms in positions 1, 3, and 6 of fructose-6-phosphate? 


24. Understanding a Version of the Pentose Phosphate Pathway That 
Produces Mainly NADPH As described in Section 22.6, the pentose 
phosphate pathway may be used to produce large amounts of 
NADPH without significant net production of ribose-5-phosphate. 
Draw a diagram, beginning with glucose-6-phosphate, to show how 
this may be accomplished. 


25. Understanding a Version of the Pentose Phosphate Pathway That 
Produces Mainly NADPH and ATP Section 22.6 explains that the 
pentose phosphate pathway and the glycolytic pathway can be 
combined to provide both NADPH and ATP (as well as some 
NADH) without net ribose-5-phosphate synthesis. Draw a diagram 
to show how this may be accomplished. 


26. Assessing the Labeling Patterns in Versions of the Pentose Phosphate 
Pathway Consider the pathway diagram you constructed in Problem 25. 
What is the fate of carbon from positions 2 and 4 of glucose- 
6-phosphate after one pass through the pathway? 


27. Understanding the Initial Reaction in Formation of Glycogen 
Glycogenin catalyzes the first reaction in the synthesis of a glycogen 
particle, with Tyr'™ of glycogenin (Figure 22.11) combining with a 
glucose unit (provided by UDP-glucose) to produce a tyrosyl glucose. 
Write a mechanism to show how this reaction could occur. 


28. Assessing the Biochemistry of Exercise Study the graphs in the Deeper 
Look box “Carbohydrate Utilization in Exercise,” and explain the 
timing of the provision of energy from different metabolic sources 
during periods of heavy exercise. 
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29. Understanding the Chemistry and Energetics of Creatine Phosphate 
(Integrates with Chapters 3 and 14.) What is the structure of creatine 
phosphate? Write reactions to indicate how it stores and provides 
energy for exercise. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at 
http://www. psafeonline.net.) 


30. Examine the PSAFE model for fructose-1,6-bisphosphatase, then 
describe the role of Arg” in the conformational change between the 
R and T states of this enzyme. 


31. Using the PSAFE model for aldose reductase, describe the structure 
of the TIM barrel motif as well as the structure and location of the 
active site. 


Think-Pair-Share Question 


The product of the p53 tumor suppressor gene, the p53 protein, has 
multiple functions, including suppressing uncontrolled cell growth. 
Dysfunctional p53 has been identified as a strong contributing factor in 
the transformation of normal cells into cancerous cells for many different 
types of tumors. P53 has been shown to inhibit glycolysis and the pentose 
phosphate pathway (see Figures 22.8 and 22.22). What benefits may there 
be to a cancer cell from a dysfunctional p53 and a lesser inhibition of the 
glycolysis and pentose phosphate pathway? Discuss at least four products 
or intermediates from these pathways that benefit the growth of a normal 
cell and that would stimulate uncontrolled growth if these pathways are 
not properly regulated. 
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Fatty Acid Catabolism 


23 


The fat is in the fire. 


John Heywood 
Proverbs (1497-1580) 


|a The hummingbird’s tremendous capacity to store and 
use fatty acids enables it to make migratory journeys of 
remarkable distances. 


Essential Questions 


Fatty acids are the principal form of stored energy for many organisms. There are two 
important advantages to this (i) The carbon in fatty acids (mostly —CH,— groups) is 
almost completely reduced compared to the carbon in sugars and amino acids. Therefore, 
oxidation of fatty acids will yield more energy (in the form of ATP) than any other form of 
carbon. (ii) Fatty acids are not as hydrated as monosaccharides and polysaccharides, and 
thus pack more closely in storage tissues. How are fatty acids catabolized, and how is their 
inherent energy captured by organisms? 


he catabolism of fatty acids provides a rich source of energy for many organisms, 

primarily in the form of ATP. This chapter describes the mobilization of fats from 
dietary sources and from adipose tissue, and the pathways of fatty acid breakdown, 
including the oxidation pathway and the processing of unsaturated, odd-carbon and 
branched chain fatty acids. 


23.1 | How Are Fats Mobilized from Dietary Intake 


and Adipose Tissue? 


Fatty acids are acquired readily in the diet and can also be made from carbohydrates 
and the carbon skeletons of amino acids. Animals also store fatty acids (mainly as 
triglycerides) in adipose tissue, a central metabolic organ involved in the regulation of 
energy homeostasis in the body. 


23.la Modern Diets Are Often High in Fat 


Fatty acids provide 30% to 60% of the calories in the diets of most Americans. For our 
caveman and cavewoman ancestors, the figure was probably closer to 20%. Dairy 
products were apparently not part of their diet, and the meat they consumed (from 
fast-moving animals) was low in fat. In contrast, modern domesticated cows and pigs 


Key Questions 


23.1 


23.2 
23.3 


23.4 


23.5 


23.6 


How Are Fats Mobilized from Dietary 
Intake and Adipose Tissue? 


How Are Fatty Acids Broken Down? 


How Are Odd-Carbon Fatty Acids 
Oxidized? 

How Are Unsaturated Fatty Acids 
Oxidized? 

Are There Other Ways to Oxidize 
Fatty Acids? 


What Are Ketone Bodies, and What 
Role Do They Play in Metabolism? 
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SPL/Science Source 


Figure 23.1 Scanning electron micrograph of an 
adipose cell (fat cell). Globules of triacylglycerols 
occupy most of the volume of such cells. 


| Table 23.1 e| Stored Metabolic Fuel in a 70-kg Person 


Energy Dry Weight Available Energy 

Constituent (k}/g dry weight) (g) (kJ) 

Fat (adipose tissue) 37 15,000 555,000 
Protein (muscle) 17 6000 102,000 
Glycogen (muscle) 16 120 1920 
Glycogen (liver) 16 70 1120 
Glucose (extracellular fluid) 16 20 320 
Total 660,360 


Sources: Owen, O. E., and Reichard, G. A., Jr., 1971. Fuels consumed by man: The interplay between carbohydrates 
and fatty acids. Progress in Biochemistry and Pharmacology 6:177; and Newsholme, E. A., and Leech, A. R., Biochemistry 
for the Medical Sciences. New York: Wiley, 1983. 


are actually bred for high fat content (and better taste). However, woolly mammoth 
burgers and saber-toothed tiger steaks are hard to find these days—even in the gour- 
met sections of grocery stores—and so, by default, we consume (and metabolize) large 
quantities of fatty acids from both animal and non-animal sources. 


23.1b Triacylglycerols Are a Major Form of Stored Energy in Animals 


Although some of the fat in our diets is in the form of phospholipids, triacylglycerols 
are a major source of fatty acids. Triacylglycerols are also our principal stored energy 
reserve. As shown in Table 23.1, the energy available in stores of fat in the average per- 
son far exceeds the energy available from protein, glycogen, and glucose. Overall, fat 
accounts for approximately 83% of available energy, partly because more fat is stored 
than protein and carbohydrate and partly because of the substantially higher energy 
yield per gram for fat compared with protein and carbohydrate. Complete combustion 
of fat yields about 37 kJ/g, compared with about 16 to 17 kJ/g for sugars, glycogen, and 
amino acids. In animals, fat is stored mainly as triacylglycerols in specialized cells 
called adipocytes or adipose cells. As shown in Figure 23.1, triacylglycerols, aggregated 
to form large globules, occupy most of the volume of adipose cells. Much smaller 
amounts of triacylglycerols are stored as small, aggregated globules in muscle tissue. 


23.1c Hormones Trigger the Release of Fatty Acids 
from Adipose Tissue 


The pathways for liberation of fatty acids from triacylglycerols, either from adi- 
pose cells or from the diet, are shown in Figures 23.2 and 23.3. Fatty acids are 
mobilized from adipocytes in response to hormone messengers such as adrena- 
line, glucagon, and adrenocorticotropic hormone (ACTH). These signal mole- 
cules bind to receptors on the plasma membrane of adipose cells and lead to the 
activation of adenylyl cyclase, which forms cyclic AMP from ATP. In adipose 
cells, cAMP activates protein kinase A, which phosphorylates and activates a 
triacylglycerol lipase (also termed hormone-sensitive lipase) that hydrolyzes a fatty 
acid from C-1 or C-3 of triacylglycerols. Subsequent actions of diacylglycerol 
lipase and monoacylglycerol lipase yield fatty acids and glycerol. The cell then 
releases the fatty acids into the blood, where they are bound to serum albumin (the 
most abundant protein in blood serum). Serum albumin transports free fatty 
acids to sites of utilization. 


23.1d Degradation of Dietary Triacylglycerols Occurs 
Primarily in the Duodenum 
Dietary triacylglycerols are degraded to a small extent (via fatty acid release) by 


triacylglycerol lipases in the low-pH environment of the stomach, but mostly they 
pass untouched into the duodenum. Alkaline pancreatic juice secreted into the 
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Figure 23.2 Liberation of fatty acids from triacylglycerols in adipose tissue is hormone 
dependent. 


duodenum (Figure 23.3a) raises the pH of the digestive mixture, allowing hydrolysis 
of the triacylglycerols by pancreatic lipase and by nonspecific esterases, which hydro- 
lyze the fatty acid ester linkages. Pancreatic lipase cleaves fatty acids from the C-1 and 
C-3 positions of triacylglycerols, and other lipases and esterases attack the C-2 posi- 
tion (Figure 23.3b). These processes depend upon the presence of bile salts, a family 
of carboxylic acid salts with steroid backbones (see Chapter 8). These agents act as 
detergents to emulsify the triacylglycerols and facilitate the hydrolytic activity of the 
lipases and esterases. Short-chain fatty acids (10 or fewer carbons) released in this way 
are absorbed directly into the villi of the intestinal mucosa, whereas long-chain fatty 
acids, which are less soluble, form mixed micelles with bile salts and are carried in this 
fashion to the surfaces of the epithelial cells that cover the villi (Figure 23.4). The fatty 
acids pass into the epithelial cells, where they are condensed with glycerol to form 
new triacylglycerols. These triacylglycerols aggregate with lipoproteins to form par- 
ticles called chylomicrons, which are then transported into the lymphatic system and 
on to the bloodstream, where they circulate to the liver, lungs, heart, muscles, and 
other organs. At these sites, the triacylglycerols are hydrolyzed to release fatty acids, 
which can then be oxidized in a highly exergonic metabolic pathway known as 
B-oxidation. 
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Triacylglycerol 


(a) (b) 


Food containing 
triacylglycerols 


/ | N 


Pancreatic Pancreatic Pancreatic 
lipase lipase lipase 


B-Monoacylglycerol Diacylglycerol Diacylglycerol 


Duodenum 


| Stomach 


Pancreas 


Small 
intestine 


Pancreatic 


Epithelial cells 


2 fat id 2 CoA 
of intestinal wall ee z 


Entry of pancreatic juice 
into duodenum 


2 fatty acyl CoA 


Monoacylglycerol 


i Triacylglycerol iii 
Protein ees Chylomicrons 


Figure 23.3 Dietary triacylglycerols are degraded primarily in the duodenum. (a) The pancreatic duct 
secretes digestive fluids into the duodenum, the first portion of the small intestine. (b) Hydrolysis of 
triacylglycerols by pancreatic and intestinal lipases. Pancreatic lipases cleave fatty acids at the C-1 and 
C-3 positions. Resulting monoacylglycerols with fatty acids at C-2 are hydrolyzed by intestinal lipases. 
Fatty acids and monoacylglycerols are absorbed through the intestinal wall and assembled into 
lipoprotein aggregates termed chylomicrons. 
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Human Biochemistry 


Serum Albumin—Tramp Steamer of the Bloodstream 


Human serum albumin, at a concentration of 40 mg/mL, is the 
most abundant protein in the blood serum. It is a single polypeptide 
of 585 amino acids (66.4 kDa); the absence of any glycosylations 
makes it novel among extracellular proteins. Serum albumin has 
three principal functions: (i) It provides 80% of the osmotic pres- 
sure of blood plasma; (ii) As the major macromolecular anion in 
the blood, it has important buffering capacity; (iii) It is the major 
transport vehicle of hydrophobic substances. In this latter role, 
serum albumin merits the sobriquet “tramp steamer of the blood- 
stream” because of its ability to nonspecifically bind and transport 
numerous hydrophobic substances, including drugs taken for phar- 
macological benefit. (Tramp steamers are ships that operate on an 
as-needed basis, hauling any sort of cargo from place to place.) 
As much as 90% of some drugs are bound by serum albumin. 

The solubility of free fatty acids in blood plasma is limited, 
amounting to only 0.1 nM. However, the concentration of fatty acids, 
as bound to serum albumin, approaches 1 mM. Serum albumin pos- 
sesses 11 distinct fatty acid-binding sites. 


The heart-shaped tertiary structure of human serum albumin, with 
11 molecules of decanoic acid bound (PDB id = 1E7F). 
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Each of the villi of the small intestine consists of a 
layer of epithelial cells and a core of capillaries and 
connective tissuc. The outside (apical) face of each 
epithelial cell is covered by smaller projections called 
microvilli or the brush border. 
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Figure 23.4 In the small intestine, fatty acids combine with bile salts in mixed micelles, which deliver 
fatty acids to epithelial cells that cover the intestinal villi. Fatty acids diffuse across the brush border 
membrane and into the epithelial cells. There, the fatty acids combine with glycerol to form new 
triacylglycerols, which aggregate with lipoproteins to form chylomicrons. Taken up by the lymphatic 
system, chylomicrons circulate to tissues throughout the body. 
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A Deeper Look 


The Biochemistry of Obesity 


Obesity is a worldwide health problem. It is associated with insu- 
lin resistance, type 2 diabetes, atherosclerosis, and ischemic heart 
disease—conditions that reduce life expectancy and present signifi- 
cant societal and economic consequences. Evidence is accumulating 
that obesity causes chronic low-grade inflammation, which contrib- 
utes to the systemic metabolic dysfunction that is associated with 
obesity-linked disorders. 

Fat consumed in the diet traverses the circulatory system either 
as serum albumin-bound free fatty acids (FFA) or in triglyceride- 
rich lipoproteins, which release FFA to cells through the action of 
membrane-associated lipoprotein lipases (LPL). Circulating fatty acids 
are taken up by the endothelial cells lining blood vessel walls and bind 
to cytosolic fatty acid-binding proteins (Figure 1). 


Helix-turn-helix motif 


Oleic acid ligand 


B-barrel domain 


Figure 1 Human fatty acid-binding protein with bound oleic acid 
(PDB id = 2LKK). 


Binding to these proteins determines their destination and fate, such 
as serving as ligands for transcription factors in the nucleus. For 
example, peroxisome proliferator-activated receptors (PPARs) regu- 
late the expression of genes that control fatty acid oxidation and 
triglyceride metabolism (Figure 2). Another transcription factor, 
NHR-SO, regulates expression of desaturases, enzymes that convert 
saturated fatty acids into unsaturated fatty acids. Overexpression of 
NHR-80 in the roundworm C. elegans extends the organism’s life 
span significantly. 

High levels of circulating FFA lead to altered deposition and 
accumulation of triglycerides in liver, muscle, and adipose tissue. In 
each of these tissues, altered fat accumulation provokes a cascade 
of signals that leads to stress and inflammatory responses and meta- 
bolic dysfunction. In this context, adipose tissue is not only involved 
in energy storage but also functions as an endocrine organ that plays 
a key role in integration of system metabolism by secreting a variety 
of signaling proteins. 

Proteins secreted by adipose tissue are collectively called adipokines. 
These secreted proteins have either pro-inflammatory or anti- 
inflammatory activities. Altered production and/or secretion of these 
adipokines contributes to obesity-linked complications. The expres- 
sion of adipokines varies depending on the site of the adipose tissue 
depot (see Figure 3). The two most abundant depots are visceral and 
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Figure 2 Trafficking of fatty acids. Fatty acids delivered to the nucleus are 
ligands for transcription factors including PPAR and RXR. The PPAR- 
RXR transcriptional complex is a key regulator of proteins central to 
handling of energy resources in the cell, including transport (lipoprotein 
metabolism), release (lipases), cellular uptake (fatty acid transporters), 
energy release (beta oxidation), and storage (adipogenesis). 
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Figure 3 Locations of adipose tissues in the human body. 
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subcutaneous adipose tissues, but each of the adipose depots in the 
body produces a unique collection of adipokines. The various adi- 
pokines are produced either by adipose cells themselves or by other 
associated cells including macrophages and immune system cells. 

Pro-inflammatory adipokines include tumor necrosis factor 
(TNF), which induces systemic inflammation and apoptosis; resis- 
tin and IL-6, which induce insulin resistance; lipocalin, which binds 
and transports small lipophilic molecules like retinoids, arachidonic 
acid, and steroids; and ANGPTL2, which induces both inflam- 
mation and insulin resistance. Obese individuals also secrete high 
levels of /eptin, which normally inhibits eating and promotes energy 
expenditure; however, obese individuals have often become leptin- 
resistant. Leptin also increases the production of TNF and II-6 and 
is thus considered a pro-inflammatory adipokine. 


23.2 How Are Fatty Acids Broken Down? 797 


Anti-inflammatory adipokines include adiponectin, which pro- 
tects against obesity-linked metabolic dysfunction in several ways. 
Adiponectin enhances insulin sensitivity and reduces plasma levels 
of glucose, free fatty acids, and triglycerides. The beneficial effects of 
adiponectin derive in part from its activation of AMP-activated pro- 
tein kinase (AMPK), which stimulates glucose uptake and fatty acid 
oxidation in muscle and inhibits gluconeogenesis in the liver. 


References: 

Ouchi, N., Parker, J. L., Lugus, J. J., and Walsh, K., 2011. Adipokines in 
inflammation and metabolic disease. Nature Reviews Immunology 11:85-97. 
Plutzky, J., 2009. The mighty, mighty fatty acid. Nature Medicine 15:618-619. 
Robinson, R., 2011. One secret of longevity: The fatty acid pathway. PLoS 
Biology 9:e1001030. 

Storch, J., and Thumser, A. E., 2010.Tissue-specific functions in the fatty acid- 
binding protein family. The Journal of Biological Chemistry 285:32679-32683. 


23.2 | How Are Fatty Acids Broken Down? 


The earliest clue to the secret of fatty acid oxidation and breakdown came in the early 
1900s, when Franz Knoop carried out experiments in which he fed modified fatty 
acids to dogs. Knoop’s experiments showed that fatty acids must be degraded by 
oxidation at the B-carbon (Figure 23.5), followed by cleavage of the C,—Cg bond. 
Repetition of this process yielded two-carbon units, which Knoop assumed must be 
acetate. Much later, Albert Lehninger showed that this degradative process took 
place in the mitochondria, and Knoop and Lehninger showed that the two-carbon 
unit released is acetyl-CoA, not free acetate. Because the entire process begins with 
oxidation of the carbon that is “B” to the carboxyl carbon, the process has come to be 


known as B-oxidation. 


In mammalian cells, B-oxidation takes place primarily in mitochondria, but a similar 
pathway occurs in peroxisomes. In yeast and other lower eukaryotes, B-oxidation is 
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Figure 23.5 Fatty acids are degraded by repeated cycles of oxidation at the B-carbon and cleavage 
of the C,—Cg bond to yield acetate units, in the form of acetyl-CoA. Each cycle of B-oxidation 
yields four electrons, captured as FADH; and NADH, which drive electron transport and oxidative 


phosphorylation pathways to produce ATP. 
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confined exclusively to peroxisomes. Mitochondrial B-oxidation provides energy to the 
organism (Figure 23.6), whereas peroxisomal B-oxidation is responsible for shortening 
long-chain fatty acids that are poor substrates for mitochondrial B-oxidation. Such 
shortened fatty acids then become substrates for mitochondrial B-oxidation. 


23.2a Coenzyme A Activates Fatty Acids for Degradation 


The process of B-oxidation begins with the formation of a thiol ester bond between the 
fatty acid and the thiol group of coenzyme A. This reaction, shown in Figure 23.6, is 
catalyzed by acyl-CoA synthetase, which is also called acyl-CoA ligase or fatty acid 
thiokinase. This condensation with CoA activates the fatty acid for reaction in the 
B-oxidation pathway. For long-chain fatty acids in higher eukaryotes, this reaction nor- 
mally occurs at the outer mitochondrial membrane before entry of the fatty acid into 
the mitochondrion. However, it may also occur at the surface of the endoplasmic reticu- 
lum. Short- and medium-length fatty acids undergo this activating reaction in the mito- 
chondria. In all cases, the reaction is accompanied by the hydrolysis of ATP to form 
AMP and pyrophosphate. As shown in Figure 23.6, the overall reaction has a net AG’ 
of about —0.8 kJ/mol, so the reaction is favorable but easily reversible. However, there 
is more to the story. The pyrophosphate produced in this reaction is rapidly hydrolyzed 
by inorganic pyrophosphatase to two molecules of phosphate, with a net AG?’ of about 
—33.6 kJ/mol. Thus, pyrophosphate is maintained at a low concentration in the cell 
(usually less than 10 uM), and the synthetase reaction is strongly promoted. The mech- 
anism of the acyl-CoA synthetase reaction is shown in Figure 23.7 and involves attack 
of the fatty acid carboxylate on ATP to form an acyladenylate intermediate, which is 
subsequently attacked by CoA, forming a fatty acyl-CoA thioester. 
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Figure 23.6 The acyl-CoA synthetase reaction activates fatty acids for B-oxidation. The reaction is 
driven by hydrolysis of ATP to AMP and pyrophosphate as well as by the subsequent hydrolysis of 
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Figure 23.7 The mechanism of the acyl-CoA synthetase reaction involves fatty acid carboxylate 
attack on ATP to form an acyl-adenylate intermediate. The fatty acyl-CoA thioester product is 
formed by CoA attack on this intermediate. 
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23.2b Carnitine Carries Fatty Acyl Groups Across 
the Inner Mitochondrial Membrane 


All of the other enzymes of the B-oxidation pathway are located in the mitochondrial 
matrix. Short-chain fatty acids, as already mentioned, are transported into the 
matrix as free acids and form the acyl-CoA derivatives there. However, long-chain 
fatty acyl-CoA derivatives cannot be transported into the matrix directly. These 
long-chain derivatives must first be converted to acylcarnitine derivatives, as shown 
in Figure 23.8. Carnitine acyltransferase I, associated with the outer mitochondrial 
membrane, catalyzes the formation of the O-acylcarnitine, which is then transported 
across the inner membrane by a translocase. At this point, the acylcarnitine is passed 
to carnitine acyltransferase IT on the matrix side of the inner membrane, which trans- 
fers the fatty acyl group back to CoA to re-form the fatty acyl-CoA, leaving free 
carnitine, which can return across the membrane via the translocase. 

Several additional points should be made. First, although oxygen esters usually have 
lower group-transfer potentials than thiol esters, the O—acyl bonds in acylcarnitines 
have high group-transfer potentials, and the transesterification reactions mediated by 
the acyltransferases have equilibrium constants close to 1. Second, note that eukaryotic 
cells maintain separate pools of CoA in the mitochondria and in the cytosol. The cyto- 
solic pool is utilized principally in fatty acid biosynthesis, and the mitochondrial pool 
is important in the oxidation of fatty acids and pyruvate as well as some amino acids. 
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membrane. The process involves the coordinated 
actions of carnitine acyltransferases on both the 
inner and outer mitochondrial membranes and of a 
translocase that shuttles O-acylcarnitines across the L-Carnitine 
inner membrane. 
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23.2c B-Oxidation Involves a Repeated Sequence of Four Reactions 


For saturated fatty acids, the process of B-oxidation involves a recurring cycle of four 
steps, as shown in Figure 23.9. The overall strategy in the first three steps is to create 
a carbonyl group on the B-carbon by oxidizing the C,—Cg bond to form an olefin, 
with subsequent hydration and oxidation. In essence, this cycle is directly analogous 
to the sequence of reactions converting succinate to oxaloacetate in the TCA cycle. 
The fourth reaction of the cycle cleaves the B-keto ester in a reverse Claisen condensa- 
tion, producing an acetate unit and leaving a fatty acid chain that is two carbons 
shorter than it began. (Recall from Chapter 19 that Claisen condensations involve 
attack by a nucleophilic agent on a carbonyl carbon to yield a B-keto acid.) 
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Figure 23.9 The B-oxidation of saturated fatty acids 
involves a cycle of four enzyme-catalyzed reactions. 


NADH + H* Oxidation Hydration a HO Step 1: Oxidation of a single bond to a double 


bond. Step 2: Hydration across the double bond. 
L-Hydroxyacyl-CoA Enoyl-CoA Step 3: Oxidation of the hydroxyl to a carbonyl. 

dehydrogenase hydratase Step 4: a-B cleavage to produce an acetyl-CoA 
H H o product. Each cycle produces single molecules of 

NAD+ | l | FADH:, NADH, and acetyl-CoA; consumes a 

R— CH — C — C — C — S — CoA water; and yields a fatty acid shortened by two 
Le |* carbons. (The delta [A] symbol connotes a double 

a H bond, and its superscript indicates the lower- 
L-B-Hydroxyacyl-CoA numbered carbon involved.) 
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A Family of Acyl-CoA Dehydrogenases Carries Out the First Reaction of B-Oxidation 
The enzymes of mitochondrial B-oxidation are organized in two functional systems: 
a membrane-bound complex that is specific for long-chain fatty acids (14 carbons and 
longer) and a family of soluble enzymes in the matrix that is specific for short- and 
medium-chain fatty acids (Figure 23.10). As a fatty acyl chain is shortened in succes- 
sive cycles of B-oxidation, it moves from the membrane-bound complex to the family 
of soluble matrix enzymes. The first reaction of the B-oxidation cycle is catalyzed by 
one of four acyl-CoA dehydrogenases. These include the very long-chain acyl-CoA 
dehydrogenase (VLCAD), as well as acyl-CoA dehydrogenases specific for long-chain 
(LCAD), medium-chain (MCAD), and short-chain (SCAD) substrates. VLCAD is a 
membrane-bound homodimer of 67-kDa subunits (Figure 23.11), whereas the soluble 
LCAD, MCAD, and SCAD are homotetramers of 40 to 45 kDa subunits. 
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Figure 23.10 Very long-chain fatty acids proceed through several cycles of B-oxidation (left) via 
membrane-bound enzymes in mitochondria. A membrane-bound multifunctional complex includes 
the enoyl-CoA hydratase (ECH), hydroxyacyl-CoA dehydrogenase (HAD), and ketoacyl thiolase 
(KT) activities. As chains shorten progressively, they become substrates for the separate, soluble 
enzymes of B-oxidation (right). 


(b) 


Figure 23.11 The first reaction of beta-oxidation is catalyzed by one of four acyl-CoA 
dehydrogenases, including the very long-chain acyl-CoA dehydrogenases (VLCADs) and medium- 
chain acyl-CoA dehydrogenases (MCADs). (a) The VLCAD of mammalian mitochondria is a 

67 kDa membrane-bound homodimer with bound FAD (red) and myristoyl-CoA (blue; PDB id = 
3B96). The tertiary structure of the N-terminal 400 residues of VLCAD is similar to that of (b) the 
soluble MCAD (PDB id = 3MDE), shown with bound FAD (red) and octanoyl-CoA (blue). These 
similar structures each include an N-terminal a-helical domain (yellow), followed by a B-sheet domain 
(blue) and another a-helical domain (green). The acyl-CoA substrate lies in a long cleft between these 
three domains. The VLCAD also has a C-terminal a-helical domain (purple). 
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| || 1.6 ADP 
R—CH, = C —SCoA tien, 16 ATP 
| 
= Figure 23.12 The acyl-CoA dehydrogenase reaction. The two electrons removed in this oxidation 
trans-A*-Enoyl-CoA reaction are delivered to the electron-transport chain in the form of reduced coenzyme Q (UQHS). 


Figure 23.14 Structures of mitochondrial 

(a) enoyl-CoA hydratase trimer (PDB id = 2DUB), 
(b) hydroxyacyl-CoA dehydrogenase dimer 

(PDB id = 1FOY), and (c) ketoacyl thiolase dimer 
(PDB id = 1AFW). 


All acyl-CoA dehydrogenases carry noncovalently (but tightly) bound FAD, which is 
reduced during the oxidation of the fatty acid. As shown in Figure 23.12, FADH, trans- 
fers its electrons to an electron transfer flavoprotein (ETF). Reduced ETF is reoxidized 
by a specific oxidoreductase (an iron-sulfur protein), which in turn sends the electrons 
on to the electron-transport chain at the level of coenzyme Q. Recall from Chapter 20 
that mitochondrial oxidation of FAD in this way eventually results in the net formation 
of about 1.6 ATPs. The mechanism of the acyl-CoA dehydrogenase (Figure 23.13) 
involves deprotonation of the fatty acid chain at the a-carbon, followed by hydride 
transfer from the B-carbon to FAD. 


Enoyl-CoA Hydratase Adds Water Across the Double Bond The next step in B-oxidation 
is the addition of the elements of H2O across the new double bond in a stereospecific 
manner, yielding the corresponding hydroxyacyl-CoA. 
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The reaction is catalyzed by enoyl-CoA hydratase (Figure 23.14). A number of 
different enoyl-CoA hydratase activities have been detected in various tissues. Also 
called crotonases, these enzymes specifically convert trans-enoyl-CoA derivatives to 
L-B-hydroxyacyl-CoA. Enoyl-CoA hydratases will also metabolize cis-enoyl-CoA (at 
slower rates) to give specifically D-B-hydroxyacyl-CoA. In addition, there is a novel 
enoyl-CoA hydratase that converts trans-enoyl-CoA to D-B-hydroxyacyl-CoA. 
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Figure 23.13 The mechanism of acyl-CoA dehydrogenase. Removal of a proton from the a-C is 
followed by hydride transfer from the B-carbon to FAD. 
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L-Hydroxyacyl-CoA Dehydrogenase Oxidizes the B-Hydroxyl Group The third reac- 
tion of this cycle is the oxidation of the hydroxyl group at the B-position to pro- 
duce a B-ketoacyl-CoA derivative. This second oxidation reaction is catalyzed by 
L-hydroxyacyl-CoA dehydrogenase, an enzyme that requires NAD* as a coenzyme 
(see Figure 23.9). 


+4 
NAD i ii 
| 
R— CH, — C— CH, — C—SCoA > R—CH,— C— CH — C— SCoA 
NADH + H* 
L-B-Hydroxyacyl-CoA B-Ketoacyl-CoA 


NADH produced in this reaction represents metabolic energy. Each NADH produced 
in mitochondria by this reaction drives the synthesis of 2.7 molecules of ATP in the 
electron-transport pathway. L-Hydroxyacyl-CoA dehydrogenase shows absolute spec- 
ificity for the L-hydroxyacyl isomer of the substrate. (D-Hydroxyacyl isomers, which 
arise mainly from the metabolism of unsaturated fatty acids, are handled differently.) 


B-Ketoacyl-CoA Intermediates Are Cleaved in the Thiolase Reaction The final step in 
the B-oxidation cycle is the cleavage of the B-ketoacyl-CoA. This reaction, catalyzed 
by ketoacyl thiolase (also known as B-ketothiolase, Figure 23.9), involves the attack 
of a cysteine thiolate from the enzyme on the B-carbonyl carbon, followed by 
cleavage to give the enolate of acetyl-CoA and an enzyme-thioester intermediate 
(Figure 23.15). Subsequent attack by the thiol group of a second CoA and departure 
of the cysteine thiolate yields a new (shorter) acyl-CoA. If the reaction in Figure 23.15 
is read in reverse, it is easy to see that it is a Claisen condensation—an attack of the 
enolate anion of acetyl-CoA on a thioester. Despite the formation of a second 
thioester, this reaction has a very favorable K,,, and it drives the three previous reac- 
tions of B-oxidation. 


q? 


23.2d Repetition of the B-Oxidation Cycle Yields 
a Succession of Acetate Units 


In essence, this series of four reactions has yielded a fatty acid (as a CoA ester) that has 
been shortened by two carbons and one molecule of acetyl-CoA. The shortened fatty 
acyl-CoA can now go through another B-oxidation cycle, as shown in Figure 23.9. 
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Figure 23.15 The mechanism of the thiolase reaction. Attack by an enzyme cysteine thiolate group 
at the B-carbonyl carbon produces a tetrahedral intermediate, which decomposes with departure of 
acetyl-CoA, leaving an enzyme thioester intermediate. Attack by the thiol group of a second CoA 
yields a new (shortened) acyl-CoA. 


23.2 How Are Fatty Acids Broken Down? 
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Human Biochemistry 


Exercise Can Reverse the Consequences of Metabolic Syndrome 


Metabolic syndrome is a combination of disorders that increase 
the risk of diabetes and cardiovascular disease. The hallmarks of 
metabolic syndrome include high blood pressure, elevated serum 
triglycerides, reduced serum high-density lipoprotein (HDL), insu- 
lin resistance, and obesity. The prevalence of these conditions is 
increasing in the United States. By most estimates, more than 40% 
of Americans are obese, and rising obesity has contributed to an 
epidemic of type 2 diabetes. Insights into how the body deals with 
high fat and high sugar diets are emerging from a variety of studies, 
and evidence points clearly to the benefits of exercise and dietary 
restriction. Endurance training (such as distance running) and resis- 
tance training (with weights) are both beneficial. Endurance training 
increases the mass of slow-twitch muscle fibers, resistance training 
builds fast-twitch muscle fibers, and both types of exercise reduce 
body fat, but in quite different ways. 


Endurance training 


Slow-twitch 
muscle growth 


Local effects in muscle: 

Increased fatty acid 
oxidation 

Increased expression of 
insulin receptors and 
glucose transporters 


Heart: 


Increased fat 
uptake and 


oxidation 


Slow-twitch muscles depend on fatty acid oxidation and TCA 
cycle activity to support long periods of exercise and are termed ox- 
idative. Effects of endurance training include increased expression 
of (i) peroxisome proliferator-activated receptor 6, a transcription 
factor that builds slow-twitch muscle fiber, and (ii) insulin receptors 
and glucose transporters. Fast-twitch muscles are adapted for short 
bursts of energy, which can be supplied by glycolysis and thus are 
termed glycolytic. Effects of resistance training include activation 
of metabolic processes by the serine/threonine kinase Akt1. Induc- 
tion of the Aktl pathway results in growth of fast-twitch skeletal 
muscle fibers and subsequent effects on several other organs. These 
include increased fat uptake and oxidation by the liver and heart, 
reduction of adipose (fat cell) mass, and reduced blood glucose and 
insulin levels. 


Resistance training 


Fast-twitch 
muscle growth 


Muscle: 


Increased glucose uptake 
and glycolysis 


Circulating 
factors 


Adipose tissue: Circulation: Liver: 
Reduced Reduced glucose, Increased fat 
cell size insulin, and uptake and 
leptin levels oxidation 


Repetition of this cycle with a fatty acid with an even number of carbons eventually 
yields two molecules of acetyl-CoA in the final step. Complete -oxidation of palmitic 
acid yields eight molecules of acetyl-CoA, seven molecules of [FADH,], and seven 
molecules of NADH (Figure 23.16 and Table 23.2). The acetyl-CoA can be further 
metabolized in the TCA cycle (as we have already seen). Alternatively, acetyl-CoA can 
also be used as a substrate in amino acid biosynthesis. As noted in Chapter 22, 
however, acetyl-CoA cannot be used as a substrate for gluconeogenesis. 
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23.2 How Are Fatty Acids Broken Down? 
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Figure 23.16 Reduced coenzymes produced by B-oxidation and TCA cycle activity provide electrons 
that drive the synthesis of ATP in oxidative phosphorylation. Complete oxidation of palmitoyl-CoA 
yields a total of 115.7 ATP. Subtracting the 2 ATP equivalents consumed in forming the original CoA 
thioester, oxidation of palmitate produces 113.7 ATP. (Calculations are based on P/O ratios of 2.7 and 
1.6 for the mitochondrial oxidation of NADH and FADH, (succinate), respectively. See Chapter 20.) 


Equations for the Complete Oxidation of Palmitoyl-CoA to CO, and H,O 


8 Acetyl-CoA \ eR Oxidative a 107.7 ADP + 107.7P, 


805 


ATP Free Energy 
Yield Yield (kJ/mol) 


Equation 
CH,(CH)),4CO-CoA + 7 [FAD] + 7 H2O + 7 NAD* + 7 CoA —> 8 CH;CO-CoA + 7 [FADH,] + 7 NADH + 7 H* 
7 [FADH] + 11.2 P; + 11.2 ADP + 3.5 O —> 7 [FAD] + 18.2 H,O + 11.2 ATP 11.2 
7 NADH + 7 H* + 18.9 P; + 18.9 ADP + 3.5 O, —> 7 NAD+ + 25.9 H,O + 18.9 ATP 18.9 
8 Acetyl-CoA + 16 O2 + 85.6 ADP + 85.6 P; —> 8 CoA + 93.6 H2O + 16 CO: + 85.6 ATP 85.6 
CH3(CH2),4CO-CoA + 115.7 P; + 115.7 ADP + 23 O, —> 115.7 ATP + 16 CO, + 130.7 HO + CoA 115.7 
Energetic cost of forming palmitoyl-CoA from palmitate and CoA ~2 
Total 113.7 


23.2e Complete B-Oxidation of One Palmitic Acid Yields 
113.7 Molecules of ATP 


If the acetyl-CoA is directed entirely to the TCA cycle in mitochondria, it can even- 
tually generate approximately 10 high-energy phosphate bonds—that is, 10 mole- 
cules of ATP synthesized from ADP (Table 23.2). Including the ATP formed from 
[FADH,] and NADH, complete B-oxidation of a molecule of palmitoyl-CoA in 
mitochondria yields 115.7 molecules of ATP. Subtracting the two high-energy bonds 
needed to form palmitoyl-CoA, the substrate for B-oxidation, one concludes that 
B-oxidation of a molecule of palmitic acid yields 113.7 molecules of ATP. The AG?’ 
for complete combustion of palmitate to CO, is —9790 kJ/mol. The hydrolytic 
energy embodied in 113.7 ATPs is 113.7 X 30.5 kJ/mol = 3468 kJ/mol, so the overall 
efficiency of B-oxidation under standard-state conditions is approximately 35%. 
The large energy yield from fatty acid oxidation is a reflection of the highly reduced 
state of the carbon in fatty acids. Sugars, in which the carbon is already partially 
oxidized, produce less energy, carbon for carbon, than do fatty acids. The break- 
down of fatty acids is regulated by a variety of metabolites and hormones. 


23.2f Migratory Birds Travel Long Distances on Energy 
from Fatty Acid Oxidation 
Because they represent the most highly concentrated form of stored biological 


energy, fatty acids are the metabolic fuel of choice for sustaining the incredibly long 
flights of many migratory birds. Although some birds migrate over landmasses and 
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(a) 


Millard H. Sharp/Science Source 


Francois Gohier/Science Source 


Orca 


eat frequently, other species fly long distances without stopping to eat. The American 
golden plover (Figure 23.17) flies directly from Alaska to Hawaii, a 3300 km flight 
requiring 35 hours (at an average speed of nearly 60 miles/hr) and more than 250,000 
wing beats! The ruby-throated hummingbird, which winters in Central America and 
nests in eastern North America, often flies nonstop across the Gulf of Mexico. These 
and similar birds accomplish these prodigious feats by storing large amounts of fatty 
acids (as triacylglycerols) in the days before their migratory flights. The percentage of 
dry-weight body fat in these birds may be as high as 70% when migration begins (com- 
pared with values of 30% and less for nonmigratory birds). 


23.2g Fatty Acid Oxidation Is an Important Source 
of Metabolic Water for Some Animals 


Large amounts of metabolic water are generated by GB-oxidation (130.7 H,O per 
palmitoyl-CoA, see Table 23.2). For certain animals—including desert animals (such 
as gerbils) and killer whales (which do not drink seawater)—the oxidation of fatty 
acids can be a significant source of dietary water. A striking example is the camel 
(Figure 23.17), whose hump is essentially a large deposit of fat. Metabolism of fatty 
acids from this store provides needed water (as well as metabolic energy) during peri- 
ods when drinking water is not available. It might well be said that “the ship of the 
desert” sails on its own metabolic water! 


Eric and David Hosking/Corbis 


George Holton/Getty Images 


Camels Gerbil 


Figure 23.17 Animals whose existence is strongly dependent on fatty acid oxidation: The ruby- 
throated hummingbird and the golden plover (a) consume large amounts of fatty acids in long 
migrations. The orca (killer whale), camels, and gerbils (b) produce metabolic water from fatty acid 
oxidation to survive in conditions where fresh water is not available to them. 
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23.3 


23.3 | How Are Odd-Carbon Fatty Acids Oxidized? 


Fatty acids with odd numbers of carbon atoms are rare in mammals but fairly com- 
mon in plants and marine organisms. Humans and animals whose diets include these 
food sources metabolize odd-carbon fatty acids via the B-oxidation pathway. 


23.3a B-Oxidation of Odd-Carbon Fatty Acids 
Yields Propionyl-CoA 


The final product of B-oxidation for odd-carbon fatty acids is the three-carbon 
propionyl-CoA instead of acetyl-CoA. Three specialized enzymes then carry out the 
reactions that convert propionyl-CoA to succinyl-CoA, a TCA cycle intermediate. 
(Because propionyl-CoA is a degradation product of methionine, valine, and isoleu- 
cine, this sequence of reactions is also important in amino acid catabolism.) The path- 
way involves an initial carboxylation at the a-carbon of propionyl-CoA to produce 
pD-methylmalonyl-CoA (Figure 23.18). The reaction is catalyzed by a biotin-dependent 
enzyme, propionyl-CoA carboxylase. The mechanism involves ATP-driven carboxyl- 
ation of biotin at Nj, followed by nucleophilic attack by the a-carbanion of propionyl- 
CoA in a stereospecific manner. 

D-Methylmalonyl-CoA, the product of this reaction, is converted to the L-isomer by 
methylmalonyl-CoA epimerase (Figure 23.18). (This enzyme has often and incorrectly 
been called methylmalonyl-CoA racemase. It is not a racemase because the CoA moi- 
ety contains five other asymmetric centers.) The epimerase reaction involves a carban- 
ion at the a-position formed via a reversible dissociation of the acidic a-proton 
(Figure 23.19). The L-isomer is the substrate for methylmalonyl-CoA mutase. 
Methylmalonyl-CoA epimerase is an impressive catalyst. The pK, for the proton that 
must dissociate to initiate this reaction is approximately 21! If binding of a proton to 
the a-anion is diffusion limited, with kon = 10° M~! sec™!, then the initial proton dis- 
sociation must be rate limiting and the rate constant must be 


kott = Ka + Kon = (107?! M) - (10? M™! sec!) = 107 sec™! 


The turnover number of methylmalonyl-CoA epimerase is 100 sec™!, and thus the 
enzyme enhances the reaction rate by a factor of 10". 
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Figure 23.19 The methylmalonyl-CoA epimerase mechanism involves a resonance-stabilized 
carbanion at the a-position. 
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Figure 23.18 The conversion of propionyl-CoA 
(formed from B-oxidation of odd-carbon fatty acids) 
to succinyl-CoA is carried out by a trio of enzymes, 
as shown. Succinyl-CoA can enter the TCA cycle. 
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23.3b A B,2-Catalyzed Rearrangement Yields Succinyl-CoA 
from L-Methylmalonyl-CoA 


The third reaction, catalyzed by methylmalonyl-CoA mutase, is quite unusual because 
it involves a migration of the carbonyl-CoA group from one carbon to its neighbor 
(Figure 23.18). The mutase reaction is vitamin B,.-dependent and begins with homo- 
lytic cleavage of the Co**—C bond in 5’-deoxyadenosylcobalamin, reducing the 
cobalt to Co? (see A Deeper Look, “The Activation of Vitamin B,.”). Transfer of a 
hydrogen atom from the substrate to the deoxyadenosyl group produces a methylmal- 
onyl-CoA radical, which then can undergo a classic B,»-catalyzed rearrangement to 
yield a succinyl-CoA radical. Hydrogen transfer from the deoxyadenosyl group yields 
succinyl-CoA and regenerates the B,. coenzyme (see problem 20 at the end of the 
chapter). 


23.3c Net Oxidation of Succinyl-CoA Requires 
Conversion to Acetyl-CoA 


Succinyl-CoA derived from propionyl-CoA can enter the TCA cycle. Oxidation of 
succinate to oxaloacetate provides a substrate for glucose synthesis. Thus, although 
the acetate units produced in B-oxidation cannot be utilized in gluconeogenesis by 
animals, the occasional propionate produced from oxidation of odd-carbon fatty 
acids can be used for sugar synthesis. Alternatively, succinate introduced to the TCA 
cycle from odd-carbon fatty acid oxidation may be oxidized to CO,. However, all of 
the four-carbon intermediates in the TCA cycle are regenerated in the cycle and thus 


A Deeper Look 
The Activation of Vitamin B,, 


Conversion of inactive vitamin B; to active 5’-deoxyadenosylcobal-_ Because two electrons from Co* are donated to the Co—carbon 
amin involves three steps (see accompanying figure). Two flavopro- bond, the oxidation state of cobalt reverts to Co** in the active 
tein reductases sequentially convert Co** in cyanocobalamin to the coenzyme. This is one of only two known adenosy] transfers (that 
Co** state and then to the Co* state. Cot is an extremely powerful is, nucleophilic attack on the ribose 5'-carbon of ATP) in biological 
nucleophile. It attacks the C-5’ carbon of ATP as shown, expel- systems. 

ling the triphosphate anion to form 5’-deoxyadenosylcobalamin. 
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Formation of the active coenzyme 5’-deoxyadenosylcobalamin from inactive vitamin Bj» is initiated by the action of flavoprotein reductases. The 
resulting Co* species, dubbed a supernucleophile, attacks the 5’-carbon of ATP in an unusual adenosyl transfer. Homolytic cleavage of the Co3+—C 
bond produces a reactive free radical that facilitates rearrangements such as that in the methylmalonyl-CoA mutase reaction. 
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23.4 How Are Unsaturated Fatty Acids Oxidized? 
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Figure 23.20 The malic enzyme reaction proceeds by oxidation of malate to oxaloacetate, followed by 
decarboxylation to yield pyruvate. 


should be viewed as catalytic species. Net consumption of succinyl-CoA thus does 
not occur directly in the TCA cycle. Rather, the succinyl-CoA generated from 
B-oxidation of odd-carbon fatty acids must be converted to pyruvate and then to 
acetyl-CoA (which is completely oxidized in the TCA cycle). To follow this latter 
route, succinyl-CoA entering the TCA cycle must be first converted to malate in the 
usual way and then transported from the mitochondrial matrix to the cytosol, where 
it is oxidatively decarboxylated to pyruvate and CO, by malic enzyme, as shown in 
Figure 23.20. Pyruvate can then be transported back to the mitochondrial matrix, 
where it enters the TCA cycle via pyruvate dehydrogenase. Note that malic enzyme 
plays a role in fatty acid synthesis. 


23.4 | How Are Unsaturated Fatty Acids Oxidized? 


Unsaturated fatty acids are also catabolized by B-oxidation, but two additional mito- 
chondrial enzymes—an isomerase and a novel reductase—are required to handle the 
cis double bonds of naturally occurring fatty acids. 


23.4a An Isomerase and a Reductase Facilitate the B-Oxidation 
of Unsaturated Fatty Acids 


As an example of oxidation of unsaturated fatty acids, consider the breakdown of 
oleic acid, an 18-carbon chain with a double bond at the 9,10-position. The reactions 
of B-oxidation proceed normally through three cycles, producing three molecules of 
acetyl-CoA and leaving the degradation product cis-A3-dodecenoyl-CoA, shown in 
Figure 23.21. This intermediate is not a substrate for acyl-CoA dehydrogenase. With 
a double bond at the 3,4-position, it is not possible to form another double bond at the 
2,3- (or B-) position. As shown in Figure 23.21, this problem is solved by enoyl-CoA 
isomerase, an enzyme that rearranges this cis-A* double bond to a trans-A* double 
bond. This latter species can proceed through the normal route of B-oxidation. 


23.4b Degradation of Polyunsaturated Fatty Acids 
Requires 2,4-Dienoyl-CoA Reductase 


Polyunsaturated fatty acids pose a slightly more complicated situation for the cell. 
Consider, for example, the case of linoleic acid shown in Figure 23.22. As with oleic 
acid, B-oxidation proceeds through three cycles, and enoyl-CoA isomerase converts 
the cis-A? double bond to a trans-A? double bond to permit one more round of 
B-oxidation. What results this time, however, is a cis-A*+ enoyl-CoA, which is converted 
normally by acyl-CoA dehydrogenase to a trans-A’, cis-A* species. This, however, is a 
poor substrate for the enoyl-CoA hydratase. This problem is solved by 2,4-dienoyl-CoA 
reductase, the product of which depends on the organism. The mammalian form of 
this enzyme produces a trans-A? enoyl product, as shown in Figure 23.22; this enoyl 
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produce three molecules of acetyl-CoA and leave 
cis-A>-dodecenoyl-CoA. Rearrangement by enoyl- 
CoA isomerase gives the trans-A? species, which then 
proceeds normally through the B-oxidation pathway. 
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Figure 23.22 The oxidation pathway for polyunsaturated fatty acids in mammals, illustrated for 
linoleic acid. Three cycles of B-oxidation on linoleoyl-CoA yield the cis-A?, cis-A® intermediate, which 
is converted to a trans-A’, cis-A® intermediate. An additional round of B-oxidation gives cis-A* enoyl- 
CoA, which is oxidized to the trans-A~, cis-A* species by acyl-CoA dehydrogenase. The subsequent 
action of 2,4-dienoyl-CoA reductase yields the trans-A? product, which is converted by enoyl-CoA 
isomerase to the trans-A? form. Normal B-oxidation then produces five molecules of acetyl-CoA. 
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A Deeper Look 


Can Natural Antioxidants in Certain Foods Improve Fat Metabolism? 


Numerous beneficial effects have been attributed in recent years to 
the polyphenolic compounds in foods such as chocolate, red wine, 
and (black or green) tea. Principal among these compounds are the 
catechins, a class of antioxidants that have demonstrated protective 
effects against certain cancers, obesity, and heart disease as well as 
other beneficial health effects. Is it possible to demonstrate specific 
effects of catechins on cellular metabolism? Green tea extract, con- 
sisting almost entirely of six catechins (shown in the figure), added 
to the diets of rats increased muscle glycogen content, decreased 
muscle fatty acid synthesis, and increased fatty acid oxidation by rat 
skeletal muscle by 36%. In these same rats, exercise endurance capac- 
ity for swimming and running increased significantly, suggesting that 
fatty acids can be an effective source of energy in exercising muscle. 
Measurements of the catechin content of a range of teas, wines, 
and other beverages indicate that the broadest range and the highest 
levels of these compounds are found in tea, but that red wine has 


ee : Tea plants 
significant levels of catechins. (Green tea and black tea are made i 
from the same shrub, Camellia sinensis.) 
(a) R = H (+)-catechin, (b) R = H (-)-epicatechin, 
R = OH (+)-gallocatechin (GC) R = OH (-)-epigallocatechin (EGC) 
OH OH 
OH OH 
HO. O A 
R ` R 
“OH 
OH 


Catechins found in various varieties of tea. 


iStockphoto.com/micheldenijs 


(c) R = H (-)-epicatechin gallate (ECg), 
R = OH (-)-epigallocatechin gallate (EGCg) 
OH 
OH 
HO O ww 
8 R 
"te 
H OH 
a (0) 
OH 
OH 


product can be converted by an enoyl-CoA isomerase to the trans-A? enoyl-CoA, 
which can then proceed normally through the B-oxidation pathway. Escherichia coli 
possesses a 2,4-dienoyl-CoA reductase that reduces the double bond at the 4,5-position 
to yield the trans-A? enoyl-CoA product in a single step. 


23.5 | Are There Other Ways to Oxidize Fatty Acids? 


Although -oxidation in mitochondria! is the principal pathway of fatty acid catabo- 
lism, several other minor pathways play important roles in fat catabolism. For exam- 
ple, organelles other than mitochondria, including peroxisomes and glyoxysomes, carry 
out B-oxidation processes. 


23.5a Peroxisomal B-Oxidation Requires FAD-Dependent Acyl-CoA Oxidase 


Peroxisomes are so named because they carry out a variety of flavin-dependent 
oxidation reactions, regenerating oxidized flavins by reaction with oxygen to produce 
hydrogen peroxide, H,O,. Peroxisomal B-oxidation is similar to mitochondrial 


'B-Oxidation does not occur significantly in plant mitochondria. Most B-oxidation in plants occurs in 
peroxisomes. 


ad 


Structure of acyl-CoA oxidase monomer from rat 
liver (PDB id = 1IS2). The enzyme is a homodimer, 
each monomer consisting of an N-terminal a-helical 
domain (blue), a B-sheet domain (orange), and a 
C-terminal a-helical domain (violet). 
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H H 1 
| | 
R — CHa— Cp Ca — C— SCoA 
| 
H H 
Fatty acyl-CoA 
[FAD] HO% 
Acyl-CoA 
Oxidase 
[FADH,] Op 
Ii 
| 
R—CH,—G=C—C—SCoA HO + +40, 
| 
Tal 
trans-A?-Enoyl-CoA 


Figure 23.23 The acyl-CoA oxidase reaction in 
peroxisomes. Electrons captured as FADH; are used 
to produce the hydrogen peroxide required for deg- 
radative processes in peroxisomes and thus are not 
available for eventual generation of ATP. (Compare 
this reaction with the acyl-CoA dehydrogenase of 
mitochondria shown in Figure 23.12.) 


Figure 23.24 Branched-chain fatty acids are 
oxidized by a-oxidation, as shown for phytanic 
acid. The product of the phytanic acid a-oxidase, 
pristanic acid, is a suitable substrate for normal 
B-oxidation. Isobutyryl-CoA and propionyl-CoA 
can both be converted to succinyl-CoA, which can 
enter the TCA cycle. 


B-oxidation, except that the initial double-bond formation is catalyzed by an FAD- 
dependent acyl-CoA oxidase (Figure 23.23). The action of this enzyme in the peroxi- 
somes transfers the liberated electrons directly to oxygen instead of the 
electron-transport chain. As a result, each two-carbon unit oxidized in peroxisomes 
produces fewer ATPs. (Compare Figure 23.23 with Figure 23.12.) The enzymes 
responsible for fatty acid oxidation in peroxisomes are inactive with acyl chains of 
eight carbons or fewer. Such short-chain products must be transferred to the mito- 
chondria for further breakdown. Similar B-oxidation enzymes are also found in 
glyoxysomes—peroxisomes in plants that also carry out the reactions of the glyoxyl- 
ate pathway. 


23.5b Branched-Chain Fatty Acids Are Degraded Via a-Oxidation 


Although B-oxidation is universally important, there are some instances in which it 
cannot operate effectively. For example, branched-chain fatty acids with alkyl branches 
at odd-numbered carbons are not effective substrates for B-oxidation. For such species, 
a-oxidation is a useful alternative. Consider phytol, a breakdown product of chlorophyll 
that occurs in the fat of ruminant animals such as sheep and cows and also in dairy 
products. Ruminants oxidize phytol to phytanic acid, and digestion of phytanic acid 
from dairy products is thus an important dietary consideration for humans. The 
methyl group at C-3 will block B-oxidation, but, as shown in Figure 23.24, phytanic acid 


CH; CH; 
| | 
CH3— (CH — CH — CH, — CHo) — C == CH — CHOH 
Phytol 
CH; CH; 
| | 
CH; — (CH — CH; — CH; — CH») 3 — CH — CHy—COO- 
Phytanic acid 
HO Phytanic acid a-hydroxylase 
CH; 


CH, —(CH— CH, — CH; — 


a IN 
eA d Phytanic acid a-oxidase 


Hs 


CH; fos CH, — CH, — CHo) — CH 
Pristanic acid 
ATP + CoA 
Acyl-CoA synthase 
AMP + B® 
cH, | CH; O 
| 9¢ a 
CH; —(CH— CHj CH, — CHo) 3 CH—C—SCoA 
Six cycles of B-oxidation 
(0) 


oO (0) 
o | | 
i—C-—-SCoA + CH,;—C—SCoA + CHs—CH,—C—SCoA 


~— Tsobutyryl- -CoA 3 Acetyl-CoA 


3 Propionyl-CoA 
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a-hydroxylase places an —OH group at the a-carbon, and phytanic acid a-oxidase 
decarboxylates it to yield pristanic acid. The CoA ester of this metabolite can undergo 
B-oxidation in the normal manner. The terminal product, isobutyryl-CoA, can be sent 
into the TCA cycle by conversion to succinyl-CoA. 


23.5¢ w-Oxidation of Fatty Acids Yields Small 
Amounts of Dicarboxylic Acids 


In the endoplasmic reticulum of eukaryotic cells, the oxidation of the terminal carbon 
of a normal fatty acid—a process termed omega oxidation (w-oxidation)—can lead to 
the synthesis of small amounts of dicarboxylic acids (Figure 23.25). Cytochrome P-450, 
a monooxygenase enzyme that requires NADPH as a coenzyme and uses O, as a sub- 
strate, places a hydroxyl group at the terminal carbon. Subsequent oxidation to a car- 
boxyl group produces a dicarboxylic acid. Either end can form an ester linkage to CoA 
and be subjected to B-oxidation, producing a variety of smaller dicarboxylic acids. 
(Cytochrome P-450—dependent monooxygenases also play an important role as agents 
of detoxification, the degradation and metabolism of toxic hydrocarbon agents.) 


23.6 | What Are Ketone Bodies, and What Role 


Do They Play in Metabolism? 


Most of the acetyl-CoA produced by the oxidation of fatty acids in liver mitochondria 
undergoes further oxidation in the TCA cycle, as stated earlier. However, some of this 
acetyl-CoA is converted to three important metabolites: acetone, acetoacetate, and 
B-hydroxybutyrate. The process is known as ketogenesis, and these three metabolites 
are traditionally known as ketone bodies, despite the fact that B-hydroxybutyrate does 
not contain a ketone function. 


23.6a Ketone Bodies Are a Significant Source of Fuel 
and Energy for Certain Tissues 


Ketone bodies are synthesized primarily in the liver but are important sources of fuel 
and energy for many tissues, including brain, heart, and skeletal muscle. The brain, 
for example, normally uses glucose as its source of metabolic energy. However, during 
periods of starvation, ketone bodies may be the major energy source for the brain. 
Acetoacetate and B-hydroxybutyrate are normal substrates for kidney cortex and for 
heart muscle. 

Ketone body synthesis occurs only in the mitochondrial matrix. The reactions 
responsible for the formation of ketone bodies are shown in Figure 23.26. The first reac- 
tion—the condensation of two molecules of acetyl-CoA to form acetoacetyl-CoA—is 


w-oxidation 


v 
COO- 


(CHg) 19 


COO- 


Figure 23.25 Dicarboxylic acids can be formed by 
oxidation of the omega carbon of fatty acids in a 
cytochrome P-450—dependent reaction. 


Human Biochemistry 


Large Amounts of Ketone Bodies Are Produced in Diabetes Mellitus 


Diabetes mellitus is the most common endocrine disease and one of 
the leading causes of death in the United States, with approximately 
6 million diagnosed cases and an estimated 4 million more borderline 
but undiagnosed cases. Diabetes is characterized by an abnormally 
high level of glucose in the blood. In type 1 diabetes (representing 
10% or less of all cases), elevated blood glucose results from inade- 
quate secretion of insulin by the islets of Langerhans in the pancreas. 
Type 2 diabetes (at least 90% of all cases) results from an insensitivity 
to insulin. Type 2 diabetics produce normal or even elevated levels 
of insulin, but their cells are not responsive to insulin, often due to 
a shortage of insulin receptors. In both cases, transport of glucose 


into muscle, liver, and adipose tissue is significantly reduced, and 
despite abundant glucose in the blood, the cells are metabolically 
starved. They respond by turning to increased gluconeogenesis and 
catabolism of fat and protein. In type 1 diabetes, increased gluconeo- 
genesis consumes most of the available oxaloacetate, but breakdown 
of fat (and, to a lesser extent, protein) produces large amounts of 
acetyl-CoA. This increased acetyl-CoA would normally be directed 
into the TCA cycle, but with oxaloacetate in short supply, it is used 
instead for production of unusually large amounts of ketone bodies. 
Acetone can often be detected on the breath of type 1 diabetics, an 
indication of high plasma levels of ketone bodies. 
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Figure 23.27 Reconversion of ketone bodies to 
acetyl-CoA in the mitochondria of many tissues 
(other than liver) provides significant metabolic 
energy. 
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Figure 23.26 The formation of ketone bodies, synthesized primarily in liver mitochondria. 


catalyzed by thiolase, which is also known as acetoacetyl-CoA thiolase or acetyl-CoA 
acetyltransferase. This is the same enzyme that carries out the thiolase reaction in 
B-oxidation, but here it runs in reverse. The second reaction adds another molecule of 
acetyl-CoA to give 3-hydroxy-3-methylglutaryl-CoA, commonly abbreviated HMG- 
CoA. These two mitochondrial matrix reactions are analogous to the first two steps in 
cholesterol biosynthesis, a cytosolic process. HMG-CoA is converted to acetoacetate 
and acetyl-CoA by the action of HMG-CoA lyase in a mixed aldol-Claisen ester cleav- 
age reaction. This reaction is mechanistically similar to the reverse of the citrate synthase 
reaction in the TCA cycle. A membrane-bound enzyme, B-hydroxybutyrate dehydroge- 
nase, then can reduce acetoacetate to B-hydroxybutyrate. 

Acetoacetate and B-hydroxybutyrate are transported through the blood from liver to tar- 
get organs and tissues, where they are converted to acetyl-CoA (Figure 23.27). Ketone bodies 
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are easily transportable forms of fatty acids that move through the circulatory system without 
the need for complexation with serum albumin and other fatty acid-binding proteins. 


23.6b B-Hydroxybutyrate Is a Signaling Metabolite 


In addition to its role as a circulating energy source for certain tissues in the fasting 
state, B-hydroxybutyrate (BOHB) functions as a signaling molecule (Figure 23.28). 
BOHB is a ligand for two G-protein—coupled membrane receptor proteins (GPCRs) 
that regulate a variety of cellular metabolic processes via second messengers such 
as cAMP and Ca’. BOHB also binds and activates the hydroxycarboxylic acid 
receptor 2 (HCAR2); in contrast, activity of the free fatty acid receptor 3 (FFA R3) is 
antagonized when BOHB binds to it. 

Ina separate signaling role, 8-hydroxybutyrate provides indirect activation of histone 
acetyltransferases (HATs) that acetylate lysine residues on histones (acetylation acti- 
vates gene expression) and a variety of other nuclear and cytosolic signaling proteins. 
As shown in Figure 23.28, uptake of B-hydroxybutyrate increases the cellular pool of 
acetyl-CoA, the substrate for lysine acetylation by HATs. B-hydroxybutyrate is also an 
inhibitor of several histone deacetylase enzymes (known as HDACs; Section 29.3) that 
remove acetyl groups from lysine residues on these same signaling proteins. By these 
various mechanisms, B-hydroxybutyrate regulates gene expression, lipid metabo- 
lism, metabolic rate, and resistance to oxidative stress. 


Nucleus 


E 2- 
Histone 
acetyltransferases 


$ Histone acetylation | Lipolysis 

i Non-histone protein acetylation | Metabolic rate 
i Gene expression changes 

$ Resistance to oxidative stress 


Figure 23.28 Cellular signaling mediated by B-hydroxybutyrate (BOHB), including activation of 

the HCAR2 receptor, antagonism of the FFAR3 receptor, activation of histone acetyltransferases 
(HATs), and inhibition of deacetylases (HDACs). B-Hydroxybutyrate taken up by mitochondria is 
converted to acetyl-CoA and then to citrate (by citrate synthase—CS). Citrate exported to the cytosol 
can be converted back to acetyl-CoA by ATP citrate lyase (ACLY). 
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Foundational Biochemistry Things You Should Know after Reading Chapter 23. 


Triacylglycerols are a major form of stored energy in animals. 


Hormones trigger the release of fatty acids from adipose tissue. Deg- 
radation of dietary triacylglycerols occurs primarily in the duodenum. 


Degradation of fatty acids occurs by oxidation of the carbon that is 
B to the carboxyl carbon. The process is referred to as B-oxidation. 


Coenzyme A activates fatty acids for degradation via the formation of 
thioesters in the acyl-CoA synthetase reaction. 


The mechanism of the acyl-CoA synthetase involves nucleophilic 
attack by the carboxylate oxygen of a fatty acid at the a-P of ATP, 
forming a reactive mixed anhydride that breaks down to yield a fatty- 
acyl-CoA. 


Carnitine carries fatty acyl groups across the inner mitochondrial 
membrane. 


B-Oxidation involves a repeated sequence of four reactions: acyl-CoA 
dehydrogenase, enoyl-CoA hydratase, L-hydroxyacyl-CoA dehydro- 
genase, and thiolase. 


Repetition of the B-oxidation cycle yields a succession of acetate units. 


Three of the enzymes in the B-oxidation cycle, the enoyl-Co-A 
hydratase (ECH), the hydroxyacyl-CoA dehydratase (HACD), and 
the ketoacyl thiolase (KACT), are associated in a trifunctional 
enzyme complex. Fatty acyl substrates migrate through substrate 
channels in this complex without diffusing into the bulk solvent. 


Complete B-oxidation of one palmitic acid yields 113.7 molecules of 
ATP. 


B-Oxidation of odd-carbon fatty acids yields propionyl-CoA. Net 
oxidation of succinyl-CoA obtained from propionyl-CoA requires 
conversion to acetyl-CoA. 


Propionyl-CoA is converted to succinyl-CoA by the combined action 
of propionyl-CoA carboxylase, methylmalonyl-CoA epimerase, and 
methylmalonyl-CoA mutase. Methylmalonyl-CoA mutase is a vita- 
min B-dependent enzyme. 


An isomerase and a reductase facilitate the B-oxidation of unsatu- 
rated fatty acids. 


Degradation of polyunsaturated fatty acids requires 2,4-dienoyl-CoA 
reductase. 


Peroxisomal B-oxidation requires FAD-dependent acyl-CoA oxidase. 
Branched-chain fatty acids are degraded via a-oxidation. 
w-Oxidation of fatty acids yields small amounts of dicarboxylic acids. 


The ketone bodies—acetone, acetoacetate, and B-hydroxybutyrate— 
are a significant source of fuel and energy for certain tissues, including 
brain, heart, and skeletal muscle. 


Ketone body formation involves several enzymes, including thiolase, 
HMG-CoA synthase, HMG-CoA lyase, and 6-hydroxybutyrate de- 
hydrogenase. 


B-Hydroxyburyate is a signaling metabolite. It is a ligand for two 
G-protein-coupled membrane receptor proteins that regulate a va- 
riety of cellular metabolic processes via second messengers such as 
cAMP and Ca?*. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. 


Determining the Amount of Water Produced from Fatty Acid Oxidation 
Calculate the volume of metabolic water available to a camel through 
fatty acid oxidation if it carries 30 pounds of triacylglycerol in its hump. 


. Determining the Amount of ATP Produced from Fatty Acid Oxidation 


Calculate the approximate number of ATP molecules that can be 
obtained from the oxidation of cis-11-heptadecenoic acid to CO, and 
water. 


. Assessing the Mechanism of Oxidation of Phytanic Acid 


Phytanic acid, the product of chlorophyll that causes problems 
for individuals with Refsum’s disease, is 3,7,11,15-tetramethyl 
hexadecanoic acid. Suggest a route for its oxidation that is 
consistent with what you have learned in this chapter. (Hint: 

The methyl group at C-3 effectively blocks hydroxylation and 
normal B-oxidation. You may wish to initiate breakdown in some 
other way.) 


. Examining the Labeling of Glucose from “C-labeled Acetate 


Even though acetate units, such as those obtained from fatty acid 
oxidation, cannot be used for net synthesis of carbohydrate in 
animals, labeled carbon from '4C-labeled acetate can be found in 
newly synthesized glucose (for example, in liver glycogen) in animal 
tracer studies. Explain how this can be. Which carbons of glucose 
would you expect to be the first to be labeled by '*C-labeled acetate? 


. Assessing the Fatty Acid-Binding Capability of Serum Albumin Human 


serum albumin (66.4 kDa) is a soluble protein present in blood at 


0.75 mM or so. Among other functions, albumin acts as the major 
transport vehicle for fatty acids in the circulation, carrying fatty acids 
from storage sites in adipose tissue to their sites of oxidation in liver 
and muscle. The albumin molecule has up to 11 distinct binding sites. 
Consult the biochemical literature to learn about the fatty acid— 
binding sites of albumin. Where are they located on the protein? What 
are their relative affinities? (Two suitable references with which to begin 
your study are Bhattacharya, A. A., Griine, T., and Curry, S., 2000. 
Crystallographic analysis reveals common modes of binding of 
medium- and long-chain fatty acids in human serum albumin. Journal 
of Molecular Biology 303:721—732; and Simard, J. R., et al., 2005. 
Locating high-affinity fatty acid-binding sites on albumin by X-ray 
crystallography and NMR spectroscopy. Proceedings of the National 
Academy of Sciences, U.S.A. 102:17958-17963.) 


Fatty Acid Catabolism and Gluconeogenesis The catabolism of fatty 
acids cannot provide the carbons needed for gluconeogenesis (see 
Chapter 22), but can provide other molecules to benefit this pathway? 
Name at least two such molecules. 


. Fatty Acid Catabolism in Mitochondria or Peroxisomes and Weight 


Regulation What difference in weight regulation could be predicted if 
more fatty acids underwent catabolism in peroxisomes (see Section 
23.5a) rather than in the mitochondria? 


8. Binding Properties of Albumin Refer to Human Biochemistry box, 


“Serum albumin — Tramp Steamer of the Bloodstream” and explain 
how elevated levels of free fatty acids could affect the pharmacology 
of drugs. 
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10. 


Tl. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


Understanding the Implications of Storing Energy in the Form of Fat 
Versus Protein or Carbohydrate Overweight individuals who diet to 
lose weight often view fat in negative ways, but the “weighty” 
consequences might be even worse if excess calories were stored in 
other forms. Consider a person who is 10 pounds “overweight,” and 
estimate how much more he or she would weigh if excess energy were 
stored as carbohydrate instead of fat. 


Understanding the Consequences of Vitamin B}, Deficiency What 
would be the consequences of a deficiency in vitamin Bj» for fatty 
acid oxidation? What metabolic intermediates might accumulate? 


Balancing the Equations for Fatty Acid Oxidation Write properly balanced 
chemical equations for the oxidation to CO, and water of (a) myristic 
acid, (b) stearic acid, (c) a-linolenic acid, and (d) arachidonic acid. 


Tracing Tritium Incorporation into Acetate from Fatty Acid Oxidation 
How many tritium atoms are incorporated into acetate if a molecule 
of palmitic acid is oxidized in 100% tritiated water? 


Fatty Acid Oxidation in Isotopically-labeled Water If the B-oxidation of 
a fatty acid happened in the presence of H,!*8O, which products would 
contain !80? 


Understanding the Consequences of Carnitine Deficiency What would 
be the consequences of a carnitine deficiency for fatty acid oxidation? 


Glucagon and Lipases Glucagon is a hormone released when blood 
sugar levels are low. Give a reason for why glucagon activates lipases. 


Villi on the Surface of Epithelial Cells of the Digestive System The 
membranes of the epithelial cells of the digestive system possess 
indentations known as microvilli (see Figure 23.3). What is the 
advantage of these structures? 


Assessing Energy Consumption by a Migrating Bird The ruby-throated 
hummingbird flies 500 miles nonstop across the Gulf of Mexico. The 
flight takes 10 hours at 50 mph. The hummingbird weighs about 4 grams 
at the start of the flight and about 2.7 grams at the end. Assuming that 
all the lost weight is fat burned for the flight, calculate the total energy 
required by the hummingbird in this prodigious flight. Does anything 
about the results of this calculation strike you as unusual? 
Understanding Human Energy Consumption During Exercise Energy 
production in animals is related to oxygen consumption. The ruby- 
throated hummingbird consumes about 250 mL of oxygen per hour 
during its migration across the Gulf of Mexico. Use this number and the 
data in problem 11 to determine a conversion factor for energy expended 
per liter of oxygen consumed. If a human being consumes 12.7 kcal/min 
while running 8-minute miles, how long could a human run on the energy 
that the hummingbird consumes in its trans-Gulf flight? How many 
8-minute miles would a person have to run to lose 1 pound of body fat? 


Understanding the Mechanism of the HMG-CoA Synthase Reaction 
Write a reasonable mechanism for the HMG-CoA synthase reaction 
shown in Figure 23.26. 


Understanding the Mechanism of the Methylmalonyl-CoA Mutase 
Reaction The methylmalonyl-CoA mutase reaction (see Figure 23.18) 
involves vitamin B, as a coenzyme. Write a reasonable mechanism for 
this reaction. (Hint: The reaction begins with abstraction of a 
hydrogen atom—that is, a proton plus an electron—from the substrate 
by vitamin B,. Consider the chemistry shown in A Deeper Look: 
“The Activation of Vitamin B,2” as you write your mechanism.) 


Assessing the Oxidation States of Cobalt in the Methylmalonyl-CoA 
Mutase Reaction Discuss the changes of the oxidation state of cobalt 
in the course of the methylmalonyl-CoA mutase reaction. Why do 
they occur as you suggested in your answer to problem 20? 


Extending the Mechanism of Methylmalonyl-CoA Mutase to Similar 
Reactions Based on the mechanism for the methylmalonyl-CoA 
mutase (see problem 20), write reasonable mechanisms for the 
following reactions. 


Problems 817 


H H 
ee == -O00C—C—CH, 
i CH—COO- CH—COO- 
NH,* ea 
H H 
CH,—CH—C—OH —> CH; i C—H + HO 
DH M E 
H H 
CH,—CH—C—H —> CH,—C—C—H + HO 
hee bi OH ek H O 
H 
CH, l H —> CH,—C—H + NH,* 
NH,* OH | 


. Understanding the Toxicity of Unripened Akee Fruit A popular dish in 


the Caribbean islands consists of akee and salt fish. Unripened akee 
fruit (from the akee tree, native to West Africa but brought to the 
Caribbean by enslaved Africans) is quite poisonous. The unripened 
fruit contains hypoglycin (structure shown below), a metabolite that 
serves as a substrate for acyl-CoA dehydrogenase. However, the 
product of this reaction irreversibly inhibits the acyl-CoA dehydro- 
genase by reacting covalently with FAD on the enzyme. Consump- 
tion of unripened akee fruit can lead to vomiting and, in severe 
cases, convulsions, coma, and death. Write a reaction scheme to 
show the product of the acyl-CoA dehydrogenase reaction that 
reacts with FAD. 


CH, H 
i 
H,C=C — CH— CH,—C— COOT 
| 
NH;* 


Hypoglycin A 


. Exploring Substrate Channeling in Multi-Functional Enzyme Aggregates 


In mammalian mitochondria, three of the enzymes in the membrane- 
bound B-oxidation system are components of a multifunctional enzyme 
(MFE). Similarly, glyoxysomes (in plants), peroxisomes (in most 
species), and gram-negative bacteria carry out several reactions of 
B-oxidation via MFEs. In all these cases, the components of an MFE 
must cooperate to carry or channel the acyl-CoA substrate from one 
active site to the next in a cyclic fashion. The structure of the complete 
MFE from Pseudomonas fragi provides insights into how this channel- 
ing might occur. Consult the journal article that describes this structure 
(Ishikawa, M., et al., 2004. Structural basis for channelling mechanism 
of a fatty acid B-oxidation multienzyme complex. EMBO Journal 
23:2745-2754), and explain in your own words how this substrate 
channeling occurs. 


. Comparing the Electron Transfer Characteristics of FAD/FADH, and 


NAD*/NADH Study Figure 23.12 and comment on why nature uses 
FAD/FADH) as a cofactor in the acyl-CoA dehydrogenase reaction 
rather than NAD*/NADH. 


. Understanding a Ubiquitous Series of Metabolic Reactions Study 


Figure 23.9. Where else in metabolism have you seen the chemical 
strategy and logic of the B-oxidation pathway? Why is it that these 
two pathways are carrying out the same chemistry? 
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PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


27. Using the PSAFE model for enoyl-CoA dehydratase, give an example 
of a case in which conserved residues in slightly different positions 
can change the catalytic rate of reaction. 


Think-Pair-Share Question 


The B-oxidation of different fatty acids generates different amounts of 
products. Review the list of Cj; fatty acids shown in Table 8.1. 


a. For each fatty acid, determine the number of NADH and 
FADH, generated. In addition, determine the grand total of 
ATP generated, assuming that each NADH and FADH, 
generate 2.7 or 1.6 ATP molecules, respectively. Do include all 
the NADH, FADH), and GTP generated from acetyl-CoA 
through the TCA cycle. If NADPH is required to resolve 
“unsaturated bond issues,” subtract 4 ATP molecules. Use 
Figures 23.21 and 23.22 as your guide. 

b. As outlined in Section 23.2 and Table 23.2, the B-oxidation of 
fatty acids can be an important source of water. Does the 
B-oxidation of the different Cı fatty acids listed in Table 8.1 
yield different amounts of water? 
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Lipid Biosynthesis 


< Walruses basking on the beach. 
Blubber — a layer of fat reinforced by 
fibrous connective tissue just below the 
skin — insulates the walrus and serves 
as an energy reserve. 


Essential Question 

Lipids—including fatty acids, triglycerides, glycerophospholipids, sphingolipids, and 
steroids—are key components of membranes and essential energy sources for organisms. 
They may also serve as signal molecules in biological systems. All lipids are synthesized 
from smaller constituent molecules. What are the pathways of lipid synthesis in biological 
systems and how are lipids transported throughout the body? 


W: have already seen several cases in which the synthesis of a class of biomole- 
cules is conducted differently from degradation (glycolysis versus gluconeogen- 
esis and glycogen or starch breakdown versus polysaccharide synthesis, for example). 
Likewise, the synthesis of fatty acids and other lipid components is different from 
their degradation. 


24.1 | How Are Fatty Acids Synthesized? 


Fatty acid synthesis involves a set of reactions that follow a strategy different in sev- 
eral ways from the corresponding degradative process: 


1. Fatty acid synthesis occurs in the cytosol, whereas fatty acid degradation takes 
place in mitochondria. 

2. Intermediates in fatty acid synthesis are linked covalently to the sulfhydryl groups 
of special proteins, the acyl carrier proteins (ACPs). In contrast, fatty acid break- 
down intermediates are bound to the—SH group of coenzyme A. 

3. In animals, the enzymes of fatty acid synthesis are components of one long 
polypeptide chain, the fatty acid synthase, whereas no similar association exists for 
the degradative enzymes. (Plants and bacteria employ separate enzymes to carry 
out the biosynthetic reactions.) 

4. The coenzyme for the oxidation—reduction reactions of fatty acid synthesis is 
NADP*/NADPH, whereas degradation involves the NAD*/NADH couple. 


24 


To everything there is a season, and a time 


for every purpose under heaven...A time to 


break down and a time to build up. 


Ecclesiastes 3:1-3 


| Key Questions 


24.1 How Are Fatty Acids Synthesized? 
24.2 How Are Complex Lipids Synthesized? 


24.3 How Are Eicosanoids Synthesized, 
and What Are Their Functions? 


24.4 How Is Cholesterol Synthesized? 


24.5 How Are Lipids Transported 
Throughout the Body? 


24.6 How Are Bile Acids Biosynthesized? 


24.7 How Are Steroid Hormones 
Synthesized and Utilized? 
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24.1a Formation of Malonyl-CoA Activates Acetate 
Units for Fatty Acid Synthesis 


The design strategy for fatty acid synthesis is this: 


1. Fatty acid chains are constructed by the addition of two-carbon units derived from 
acetyl-CoA. 

2. The acetate units are activated by formation of malonyl-CoA (at the expense of ATP). 

3. The addition of two-carbon units to the growing chain is driven by decarboxylation 
of malonyl-CoA. 

4. The elongation reactions are repeated until the growing chain reaches 16 carbons 
in length (palmitic acid). 

5. Other enzymes then add double bonds and additional carbon units to the chain. 


24.1b Fatty Acid Biosynthesis Depends 
on the Reductive Power of NADPH 


Fatty acids are synthesized from acetyl-CoA building blocks in a reductive pathway 
driven by NADPH. The net reaction for the formation of palmitate from acetyl-CoA is 


Acetyl-CoA + 7 malonyl-CoA~ + 14 NADPH + 13 H* + H,O —> 
palmitate” + 7 HCO; + 8 CoASH + 14 NADP+ 


Levels of free fatty acids are very low in the typical cell. The palmitate made in this 
process is rapidly converted to CoA esters in preparation for the formation of triacyl- 
glycerols and phospholipids. 


24.1c Cells Must Provide Cytosolic Acetyl-CoA 
and Reducing Power for Fatty Acid Synthesis 


Eukaryotic cells face a dilemma in providing suitable amounts of substrate for fatty 
acid synthesis. Sufficient quantities of acetyl-CoA, malonyl-CoA, and NADPH must 
be generated in the cytosol for fatty acid synthesis. Malonyl-CoA is made by carbox- 
ylation of acetyl-CoA, so the problem reduces to generating sufficient acetyl-CoA 
and NADPH. 

There are three principal sources of acetyl-CoA (Figure 24.1): 


1. Amino acid degradation produces cytosolic acetyl-CoA. 

2. Fatty acid oxidation produces mitochondrial acetyl-CoA. 

3. Glycolysis yields cytosolic pyruvate, which (after transport into the mitochondria) 
is converted to acetyl-CoA by pyruvate dehydrogenase. 


Chemistry in History 


Dr. Konrad Emil Bloch 


Dr. Konrad Emil Bloch (1912-2000) began his 
studies at the Technical University of Munich, 
before being forced to flee Germany due to Jewish 
persecution before the start of World War II. 
Dr. Bloch earned his Ph.D. in Biochemistry from 
Columbia University where he taught for seven 
years before moving on to faculty positions at the 
University of Chicago and Harvard University. 

Though his research ranged over many areas of 
lipid metabolism, Bloch devoted more than twenty 


years to the study of cholesterol biosynthesis. 
He and his collaborators showed that squalene is 
synthesized from acetate units and that cholesterol 
is derived from squalene. Konrad Bloch and Feodor 
Lynen received the Nobel Prize in Medicine or 
Physiology in 1964 for their studies of cholesterol 
and fatty acid metabolism. 

Dr. Bloch stayed active as a scientist and a teacher 
until his death in Burlington, Massachusetts, in 
October 2000. 
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24.1 How Are Fatty Acids Synthesized? 
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Figure 24.1 The citrate-malate—pyruvate shuttle provides cytosolic acetate units and some reducing 
equivalents (electrons) for fatty acid synthesis. The shuttle collects carbon substrates, primarily from 
glycolysis but also from fatty acid oxidation and amino acid catabolism. Pathways that provide 
carbon for fatty acid synthesis are shown in blue; pathways that supply electrons for fatty acid 
synthesis are shown in red. 


The acetyl-CoA derived from amino acid degradation is normally insufficient for 
fatty acid biosynthesis, and the acetyl-CoA produced by pyruvate dehydrogenase and 
by fatty acid oxidation cannot cross the mitochondrial membrane to participate 
directly in fatty acid synthesis. Instead, acetyl-CoA is linked with oxaloacetate to 
form citrate, which is transported from the mitochondrial matrix to the cytosol 
(Figure 24.1). Here it can be converted back into acetyl-CoA and oxaloacetate by 
ATP-citrate lyase. In this manner, mitochondrial acetyl-CoA becomes the substrate 
for cytosolic fatty acid synthesis. (Oxaloacetate returns to the mitochondria in the 
form of either pyruvate or malate, which is then reconverted to acetyl-CoA and 
oxaloacetate, respectively.) 

NADPH can be produced in the pentose phosphate pathway as well as by malic 
enzyme (Figure 24.1). Reducing equivalents (electrons) derived from glycolysis in the 
form of NADH can be transformed into NADPH by the combined action of malate 
dehydrogenase and malic enzyme: 


Oxaloacetate + NADH + H* —> malate + NAD* 
Malate + NADP* —> pyruvate + CO, + NADPH + H+ 
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Quaternary structure of the E. coli biotin carboxyl 
carrier protein-biotin carboxylase (BCCP-BC) 
complex. The BC subunits are orange and blue; the 
BCCP subunits are pink and yellow (PDB id = 4HR7). 


How many of the 14 NADPH needed to form one palmitate can be made by this 
coupled MDH/malic enzyme route? The answer depends on the status of malate. Every 
citrate entering the cytosol produces one acetyl-CoA and one malate (Figure 24.1). Every 
malate oxidized by malic enzyme produces one NADPH, at the expense of a decarboxyl- 
ation to pyruvate. Thus, when malate is oxidized, one NADPH is produced for every 
acetyl-CoA. Conversion of eight acetyl-CoA units to one palmitate would then be accom- 
panied by production of eight NADPH. (As noted, a total of 14 NADPH are required. 
The remaining six are provided by the pentose phosphate pathway.) On the other hand, 
for every malate returned to the mitochondria, one NADPH fewer is produced. 


24.1d Acetate Units Are Committed to Fatty Acid 
Synthesis by Formation of Malonyl-CoA 


Rittenberg and Bloch showed in the late 1940s that acetate units are the building blocks 
of fatty acids. Their work, together with the discovery by Salih Wakil that bicarbonate 
is required for fatty acid biosynthesis, eventually made clear that this pathway involves 
synthesis of malonyl-CoA. The carboxylation of acetyl-CoA to form malonyl-CoA 
is essentially irreversible and is the committed step in the synthesis of fatty acids 
(Figure 24.2). The reaction is catalyzed by acetyl-CoA carboxylase, which contains a 
biotin prosthetic group. This carboxylase is the only enzyme of fatty acid synthesis in 
animals that is not part of the multienzyme complex fatty acid synthase. 


24.le Acetyl-CoA Carboxylase Is Biotin Dependent 
and Displays Ping-Pong Kinetics 


The biotin prosthetic group of acetyl-CoA carboxylase is covalently linked to the 
é-amino group of an active-site lysine in a manner similar to pyruvate carboxylase 
(see Figure 22.2). The reaction mechanism is also analogous to that of pyruvate 
carboxylase (see Figure 22.3): ATP-driven carboxylation of biotin is followed by transfer 
of the activated CO; to acetyl-CoA to form malonyl-CoA. The enzyme from Escherichia 
coli has three components: (1) a biotin carboxyl carrier protein (BCCP, a dimer of 22.5 
kDa subunits); (2) biotin carboxylase (BC, a dimer of 51 kDa subunits), which adds CO, 
to the prosthetic group; and (3) carboxyltransferase (an aß tetramer with 30 and 35 kDa 
subunits), which transfers the activated CO, unit to acetyl-CoA. The long, flexible 
biotin—lysine chain (biocytin) enables the activated carboxyl group to be carried be- 
tween the biotin carboxylase and the carboxyltransferase (Figure 24.3). 


24.1f Acetyl-CoA Carboxylase in Animals Is a Multifunctional Protein 


In animals, acetyl-CoA carboxylase (ACC) is a filamentous polymer (4 to 8 X 10° Da) 
composed of 265 kDa protomers. Each of these subunits contains the biotin carboxyl 
carrier moiety, biotin carboxylase, and carboxyltransferase activities as well as allo- 
steric regulatory sites. Animal ACC is thus a multifunctional protein. The polymeric 
form is active, but the 265 kDa protomers are inactive. The activity of ACC is thus 
dependent upon the position of the equilibrium between these two forms: 


Inactive protomers —— active polymer 


Because this enzyme catalyzes the committed step in fatty acid biosynthesis, it is 
carefully regulated. Palmitoyl-CoA, the final product of fatty acid biosynthesis, shifts 
the equilibrium toward the inactive protomers, whereas citrate, an important allosteric 
activator of this enzyme, shifts the equilibrium toward the active polymeric form of the 
enzyme. Acetyl-CoA carboxylase shows the kinetic behavior of a Monod—Wyman— 
Changeux V-system! allosteric enzyme in which allosteric effectors shift the T/R 
equilibrium between active R conformers and inactive T conformers. 


'The MWC J-system is an allosteric model characterized by (1) S binds equally well to R and T, but 
only R is active (T is inactive), and (2) effectors (A binds only to R; I binds only to T) regulate activity 
by shifting the relative R versus T distribution. 
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Figure 24.3 In the acetyl-CoA carboxylase reaction, the biotin ring, on its flexible tether, acquires 
carboxyl groups from carbonylphosphate on the biotin carboxylase subunit and transfers them to 
acyl-CoA molecules on the carboxyltransferase subunits. Colors of the domains correspond to those 
in Figure 24.4. 
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Figure 24.2 The acetyl-CoA carboxylase reaction 
produces malonyl-CoA for fatty acid synthesis. 

(a) The reaction uses bicarbonate as a substrate and 
is driven by ATP. (b) A mechanism for the acetyl- 
CoA carboxylase reaction. (Step 1) Bicarbonate is 
activated for carboxylation reactions by formation of 
N-carboxybiotin. ATP drives the reaction forward, 
with transient formation of a carbonylphosphate 
intermediate. (Step 2) In a typical biotin-dependent 
reaction, nucleophilic attack by the acetyl-CoA 
carbanion on the carboxyl carbon of 
N-carboxybiotin—a transcarboxylation—yields the 
carboxylated product. 
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Carboxyltransferase domain of 
yeast acetyl-CoA carboxylase 
(PDB id = 1OD2) 


The carboxyltransferase component of acetyl-CoA 
carboxylase-1 from Saccharomyces cerevisiae. Unlike 
the E. coli carboxyltransferase (CT) a28> 
heterotetramer, the yeast CT domain is a component 
of the ACC dimer. The N- and C-subdomains of 
one monomer are cyan and yellow, whereas those of 
the other monomer are purple and green. CoA is 
shown as a ball-and-stick model in one subunit. 


Dephospho-acetyl-CoA carboxylase 
(Low [citrate] activates, high [fatty acyl-CoA] 


inhibits) 


ATP P; 


Phosphatases 
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Phospho-acetyl-CoA carboxylase 
(High [citrate] activates, low [fatty acyl-CoA] 
inhibits) 


boboda 


Figure 24.5 The activity of acetyl-CoA carboxylase 
is modulated by phosphorylation and 
dephosphorylation. The dephospho form of the 
enzyme is activated by low [citrate] and inhibited 
only by high levels of fatty acyl-CoA. In contrast, 
the phosphorylated form of the enzyme is activated 
only by high levels of citrate but is very sensitive to 
inhibition by fatty acyl-CoA. 


24.1g Phosphorylation of ACC Modulates Activation 
by Citrate and Inhibition by Palmitoyl-CoA 


The regulatory effects of citrate and palmitoyl-CoA are dependent on the phosphory- 
lation state of acetyl-CoA carboxylase. The animal enzyme is phosphorylated at eight 
to ten sites on each enzyme subunit (Figure 24.4). Some of these sites are regulatory, 
whereas others are silent and have no effect on enzyme activity. Unphosphorylated 
acetyl-CoA carboxylase binds citrate with high affinity and thus is active at very low 
citrate concentrations (Figure 24.5). Phosphorylation of the regulatory sites decreases 
the affinity of the enzyme for citrate, and in this case, high levels of citrate are required 
to activate the carboxylase. The inhibition by fatty acyl-CoAs operates in a similar 
but opposite manner. Thus, low levels of fatty acyl-CoA inhibit the phosphorylated 
carboxylase, but the dephosphoenzyme is inhibited only by high levels of fatty acyl- 
CoA. Specific phosphatases act to dephosphorylate ACC, thereby increasing the 
sensitivity to citrate. 


24.1h Acyl Carrier Proteins Carry the Intermediates 
in Fatty Acid Synthesis 


The basic building blocks of fatty acid synthesis are acetyl and malonyl groups, but they 
are not transferred directly from CoA to the growing fatty acid chain. Rather, they are 
first passed to ACP. This protein consists (in Æ. coli) of a single polypeptide chain of 
77 residues to which is attached (on a serine residue) a phosphopantetheine group, the 
same group that forms the business end of coenzyme A. Thus, ACP is a somewhat larger 
version of coenzyme A, specialized for use in fatty acid biosynthesis (Figure 24.6). 
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Figure 24.4 Schematic of the acetyl-CoA carboxylase polypeptide, with domains and phosphoryla- 


tion sites indicated, along with the protein kinases responsible. Phosphorylation at Ser!?” is primarily 
responsible for decreasing the affinity for citrate. 
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Figure 24.6 Fatty acids are conjugated both to coenzyme A and to acyl carrier protein through the 
sulfhydryl of phosphopantetheine prosthetic groups. 


24.li In Some Organisms, Fatty Acid Synthesis 
Takes Place in Multienzyme Complexes 


The enzymes that catalyze formation of acetyl-ACP and malonyl-ACP and the subse- 
quent reactions of fatty acid synthesis are organized quite differently in different 
organisms. Fatty acid synthesis in mammals occurs on homodimeric fatty acyl 
synthase I (FAS I), each 270 kDa polypeptide of which contains all reaction centers 
required to produce a fatty acid. In lower eukaryotes, such as yeast and fungi, the 
enzymatic activities of FAS are distributed over two multifunctional peptide chains, 
which form 2.6-megadalton a¢85 complexes. In plants, most bacteria, and parasites, 
the enzymes of fatty acid synthesis are separated and independent, and this collection 
of enzymes is referred to as fatty acyl synthase IT (FAS IT). 


24.1} Decarboxylation Drives the Condensation 
of Acetyl-CoA and Malonyl-CoA 


The individual steps in the elongation of the fatty acid chain are quite similar across 
all organisms. The mammalian pathway (Figure 24.7) is a cycle of elongation that 
involves six enzyme activities. The elongation cycle is initiated by transfer of the acyl 
moiety of acetyl-CoA to the acyl carrier protein by the malonyl-CoA—acetyl-CoA- 
ACP transacylase (MAT), which also transfers the malonyl group of malonyl-CoA 
to ACP. 

The B-ketoacyl-ACP synthase (KS) catalyzes the decarboxylative condensation of 
the acyl group with malonyl-ACP to produce a B-ketoacyl-ACP intermediate (aceto- 
acetyl-ACP in the first cycle). The mechanism (Figure 24.8) begins with acetyl group 
transfer to MAT, followed with attack by the ACP thiol sulfur to form an acetyl-ACP. 
The acetyl group is transferred to a cysteine sulfur on KS, freeing the ACP thiol to 
acquire the malonyl group. In the condensation reaction that follows, decarboxylation 
of the malonyl group creates a transient, highly nucleophilic carbanion that can attack 
the acetate group. 

The net reaction for each turn of this cycle (see Figure 24.7) is addition of a two- 
carbon unit to the acyl group. Why is the three-carbon malonyl group used here as a 
two-carbon donor? The answer is that this is yet another example of a decarboxylation 
driving a desired but otherwise thermodynamically unfavorable reaction. The decarbox- 
ylation that accompanies the reaction with malonyl-ACP drives the synthesis of 
acetoacetyl-ACP. Note that hydrolysis of ATP drove the carboxylation of acetyl-CoA 
to form malonyl-ACP, so, indirectly, ATP is responsible for the condensation reaction to 
form acetoacetyl-ACP. Malonyl-CoA can be viewed as a form of stored energy for driv- 
ing fatty acid synthesis. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


C— CH, — CH, —N—C— C—C—CH,—0 —P — O — CH, — Ser 


How Are Fatty Acids Synthesized? 


Adenine 


827 


828 Chapter 24 Lipid Biosynthesis 


Figure 24.7 The pathway of palmitate synthesis from acetyl-CoA and 
malonyl-CoA. Acetyl and malonyl building blocks are introduced as ACP 
conjugates. Decarboxylation drives the B-ketoacyl-ACP synthase and results in 
the addition of two-carbon units to the growing chain. The first turn of the 
cycle begins at ® and goes to butyryl-ACP; subsequent turns of the cycle are 


indicated as @ through ©. 
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It is also worth noting that the carbon of the carboxyl group that was added to drive 
this reaction is the one removed by the condensing enzyme. Thus, all the carbons of 
acetoacetyl-ACP (and of the fatty acids to be made) are derived from acetate units of 
acetyl-CoA. 
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Figure 24.8 A mechanism for mammalian ketoacyl synthase. An acetyl | ` | Os X2 
group is transferred from CoA to MAT, then to the acyl carrier protein, SH C=O Ss C 
and then to ketoacyl synthase. Next, a malonyl group is transferred to | | 
MAT and then to the acyl carrier protein. Decarboxylation of the CHo O a | =" oh 
malonyl group creates a transient carbanion on the acyl group of ACP, cC=ọo CO CH. Cc=o 
which attacks the KS acetyl group to form a ketoacyl-ACP. A cycle (see 7 | 
Figure 24.7) of keto group reduction (by KR), water removal (by DH), S S 
and double-bond reduction (by ER; see next section) will finally produce l 
ACP ACP 


an acyl group increased in length by two carbons. 


24.1k Reduction of the B-Carbonyl Group Follows a Now-Familiar Route 


The next three steps—reduction of the B-carbonyl group by B-ketoacyl-ACP reductase 
(KR) to form a B-alcohol, then dehydration by B-hydroxyacyl-ACP dehydratase (DH) 
and reduction by 2,3-trans-enoyl-ACP reductase (ER) to saturate the chain (see 
Figure 24.7)—look very similar to the fatty acid degradation pathway in reverse. 
However, there are two crucial differences between fatty acid biosynthesis and fatty 
acid oxidation (besides the fact that different enzymes are involved): First, the alcohol 
formed in biosynthesis has the D-configuration rather than the L-form seen in 
catabolism; and second, the reducing coenzyme is NADPH, whereas NAD* and 
FAD are the oxidants in the catabolic pathway. 

The net result of the first turn of the biosynthetic cycle is the synthesis of a four- 
carbon unit, a butyryl group, from two smaller building blocks. In the next cycle of the 
process, this butyryl-ACP condenses with another malonyl-ACP to make a six-carbon B- 
ketoacyl-ACP and CO). Subsequent reduction to a B-alcohol, dehydration, and another 
reduction yield a six-carbon saturated acyl-ACP. This cycle continues with the net addi- 
tion of a two-carbon unit in each turn until the chain is 16 carbons long (see Figure 24.7). 
The KS cannot accommodate larger substrates, so the reaction cycle ends with a 16- 
carbon chain. Hydrolysis of the C)¢-acyl-ACP yields a palmitic acid and the free ACP. 

In the end, seven malonyl-CoA molecules and one acetyl-CoA yield a palmitate 
(shown here as palmitoyl-CoA): 


Acetyl-CoA + 7 malonyl-CoA” + 14 NADPH + 14 H* —> 
palmitoyl-CoA + 7 HCO,- + 14 NADP+ + 7 CoASH 


The formation of seven malonyl-CoA molecules requires 


7 Acetyl-CoA + 7 HCO; + 7 ATP —> 
7 malonyl-CoA~ + 7 ADP?" + 7 P2- + 7H* 


Thus, the overall reaction of acetyl-CoA to yield palmitic acid is 


8 Acetyl-CoA + 7 ATP*~ + 14 NADPH + 7 H+ —> 
palmitoyl-CoA + 14 NADP* + 7 CoASH + 7 ADP? + 7 PP- 


Note: These equations are stoichiometric and charge balanced. See problem 1 at the 
end of the chapter for practice in balancing these equations. 
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AT: Acetyl transferase 

MPT: Malonyl/palmitoyl transferase 

MAT: Malonyl-CoA-acetyl-CoA-ACP 
transacylase 

TE: Thioesterase 

ACP: Acyl carrier protein 

PPT: Phosphopantetheinyl 
transferase 

KR: B-Ketoacyl reductase 

KS: B-Ketoacyl synthase 

ER: B-Enoyl reductase 

DH: Dehydratase 
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24.11 Eukaryotes Build Fatty Acids on Megasynthase Complexes 


The multiple enzyme domains of eukaryotic fatty acyl synthases are arrayed on large 
protein structures termed megasynthases. The individual enzyme domains of these 
structures in all eukaryotes are homologous to the corresponding small, discrete en- 
zymes of bacterial FAS pathways. Remarkably, however, lower eukaryotes such as 
fungi and higher eukaryotes such as mammals have evolved entirely different mega- 
synthase architectures for fatty acid synthesis. Mammalian homodimeric FAS has a 
flattened X-shape, whereas the fungal dodecameric FAS is a large, closed barrel, with 
two reaction chambers separated by equatorial stabilizing struts (Figure 24.9). In the 
fungal structure, the six a-subunits form a central ring that is a trimer of dimers 
(Figure 24.10a,b). Each a-subunit contributes an extended a-helical segment to the 
center of the structure. Pairs of these helices form three coiled-coil struts anchored by 
a six-helix bundle in the center of the barrel. Each a-subunit contains KR and KS 
domains. Three KR and three KS active sites are oriented toward the upper reaction 
chamber, and three of each face the lower chamber. The -subunit trimers form 
rounded caps over the upper and lower reaction chambers. Each chamber contains 
three pores that allow substrates (acetyl-CoA and malonyl-CoA) to diffuse in and 
palmitoyl-CoA to exit. On each end of the structure, the active sites of the four B- 
subunit enzyme domains (see Figure 24.9) are oriented toward the interior of the reac- 
tion chamber. Three ACP domains in each chamber shuttle growing acyl chains from 
site to site during the catalytic cycle. Each ACP is tethered by two flexible linker pep- 
tides, which facilitate its site-to-site movement (Figure 24.10c). The phosphopantethe- 
ine arm on each ACP can extend outward to reach into active sites or may retract to 
insert its acyl chain in a protective hydrophobic cavity during intersite transport. 

The homodimeric mammalian FAS contains all six functional enzyme domains on 
each subunit (Figures 24.9 and 24.11). In the X-shaped dimer, three of the domains— 
including KS, ER, and DH—form dimeric structures, whereas the KR and MAT 


Fungal fatty acid synthase Mammalian fatty acid synthase 


Reaction chamber 


* = Active sites 


Re ion ‘ 
Caco Reaction 


chamber 


chamber 


Reaction chamber 


«es Harn} er f ke | 


Figure 24.9 Organization of enzyme functions on two eukaryotic fatty acid synthases. (left) Fungal 
FAS is a closed barrel 260 Å high and 230 Å wide. (right) Mammalian (pig) FAS is an asymmetric 
X-shape 210 Å high, 180 Å wide, and 90 Å deep. The arrangement of functional domains along the 
FAS polypeptides is shown at the bottom of the figure. KS domains form dimers in both structures. 
KR domains form dimers in the fungal enzyme, whereas ER and DH domains form dimers in the 
mammalian complex. 
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Figure 24.10 Structure of the yeast fatty acyl synthase. (a) FAS from 

S. cerevisiae possesses two trimeric reaction chambers separated by 
equatorial stabilizing struts. ACP domains are located at the equatorial base 
of each reaction chamber, close to the catalytic site of KS. (b) The equatorial 
base of this structure is an a, trimer of dimers, with alternating KS and KR 
domains. The central stabilizing struts are a-helical extensions of the 
functional domains arranged around the outside of the ring. A central six- 
helix bundle stabilizes the structure. (c) The ACP domains (red) are tethered 
on flexible linkers (yellow) so that they can move from one active site to the 
next in the catalytic cycle. Comparison of the ACP domains of S. cerevisiae 
and E. coli reveals that the ACP domains probably extend the acyl- 
phosphopantetheine group to active sites but then retract the acyl group into 
a hydrophobic cleft while moving from one site to the next (PDB = 2UV8; 
images courtesy of Marc Leibundgut and Nenad Ban, ETH [Zurich]). 


Central 
anchor 


Peripheral 
anchor 


domains are separated and lie near the ends of the extended arms. The arms form reac- 
tion chambers on either side of the structure. The flexible ACP domains do not 
appear in this structure (probably because they are not fixed in any one position in the 
crystals used for the structural studies). However, since it follows the KR domain in the 
polypeptide sequence, the ACP domain probably lies at the end of each KR arm, where 
it can rotate to interact with the adjacent active sites. 

In both the fungal and the mammalian FAS structures, the close association of enzy- 
mic domains within one large complex permits efficient transfer of intermediates from 
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Figure 24.11 Structural studies reveal that 


mammalian FAS homodimer is X-shaped. 


The ACP domains are probably located 
adjacent to the KR domains at the ends 
of the arms (PDB id = 2VZ8; image 
courtesy of Marc Leibundgut and Nenad 
Ban, ETH [Zurich]). 


one active site to the next. In addition, the presence of all these enzyme domains on one 
or two polypeptides allows the cell to coordinate synthesis of all enzymes needed for 
fatty acid synthesis. 


24.1m Ci, Fatty Acids May Undergo Elongation and Unsaturation 


As seen already, palmitate is the primary product of the fatty acid synthase. Cells 
synthesize many other fatty acids. Shorter chains are easily made if the chain is re- 
leased before reaching 16 carbons in length. Longer chains are made through special 
elongation reactions, which occur both in the mitochondria and at the surface of the 
endoplasmic reticulum (ER). 


Additional Elongation The fatty acid elongation reactions in the ER are actually 
quite similar to those we have just discussed: addition of two-carbon units at the 
carboxyl end of the chain by means of oxidative decarboxylations involving malonyl- 
CoA. As was the case for the fatty acid synthase, this decarboxylation provides the 
thermodynamic driving force for the condensation reaction. The mitochondrial 
reactions involve addition (and subsequent reduction) of acetyl units. These reac- 
tions (Figure 24.12) are essentially a reversal of fatty acid oxidation, except that 
NADPH is utilized in the saturation of the double bond, instead of FADH). 


Introduction of a Single cis Double Bond Both prokaryotes and eukaryotes are capable 
of introducing a single cis double bond in a newly synthesized fatty acid. Bacteria 
such as E. coli carry out this process in an O,-independent pathway, whereas eukary- 
otes have adopted an O,-dependent pathway. There is a fundamental chemical differ- 
ence between the two. The O,-dependent reaction can occur anywhere in the fatty 
acid chain, with no (additional) need to activate the desired bond toward dehydroge- 
nation. However, in the absence of O2, some other means must be found to activate 
the bond in question. Thus, in the bacterial reaction, dehydrogenation occurs while 
the bond of interest is still near the B-carbonyl or B-hydroxy group and the thioester 
group at the end of the chain. 

In E. coli, the biosynthesis of a monounsaturated fatty acid begins with four normal 
cycles of elongation to form a 10-carbon intermediate, B-hydroxydecanoyl-ACP 
(Figure 24.13). At this point, B-hydroxydecanoyl] thioester dehydrase forms a double 
bond ß,y to the thioester and in the cis configuration. This is followed by three rounds 
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of the normal elongation reactions to form palmitoleoyl-ACP. Elongation may 
terminate at this point or may be followed by additional biosynthetic events. The prin- 
cipal unsaturated fatty acid in E. coli, cis-vaccenic acid, is formed by an additional elon- 
gation step, using palmitoleoyl-ACP as a substrate. 


24.1In Unsaturation Reactions Occur in Eukaryotes 
in the Middle of an Aliphatic Chain 


The addition of double bonds to fatty acids in eukaryotes does not occur until the 
fatty acyl chain has reached its full length (usually 16 to 18 carbons). Dehydrogena- 
tion of stearoyl-CoA occurs in the middle of the chain, despite the absence of any 
useful functional group on the chain to facilitate activation: 


CH3—(CH)2);s6CO—SCoA —> CH;—(CH2);CH=CH(CH)2)7;CO—SCoA 


This impressive reaction is catalyzed by stearoyl-CoA desaturase, a 53 kDa enzyme 
containing a nonheme iron center. NADH and O, are required, as are two other pro- 
teins: cytochrome b; reductase (a 43 kDa flavoprotein) and cytochrome bs (16.7 kDa). 
All three proteins are associated with the ER membrane. Cytochrome b; reductase 
transfers a pair of electrons from NADH through FAD to cytochrome b; (Figure 24.14). 
Oxidation of reduced cytochrome b; is coupled to reduction of nonheme Fe** to Fe?* 
in the desaturase. The Fe** accepts a pair of electrons (one at a time in a cycle) from 
cytochrome b; and creates a cis double bond at the 9,10-position of the stearoyl-CoA 
substrate. O, is the terminal electron acceptor in this fatty acyl desaturation cycle. 
Note that two water molecules are made, which means that four electrons are trans- 
ferred overall. Two of these come through the reaction sequence from NADH, and 
two come from the fatty acyl substrate that is being dehydrogenated. 
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Figure 24.12 Fatty acids longer than 
palmitate are made in a sequence of four 
steps. (1) Elongation of fatty acids in 
mitochondria is initiated by the thiolase 
reaction. The B-ketoacyl intermediate thus 
formed undergoes the same three 
reactions (in reverse order) that are the 
basis of B-oxidation of fatty acids. 

(2) Reduction of the B-keto group is 
followed by (3) dehydration to form a 
double bond. (4) Reduction of the double 
bond yields a fatty acyl-CoA that is 
elongated by two carbons. Note that the 
reducing coenzyme for the second step is 
NADH, whereas the reductant for the 
fourth step is NADPH. 
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Figure 24.13 Double bonds are introduced into the 
growing fatty acid chain in Æ. coli by specific 
dehydrases. Palmitoleoyl-ACP is synthesized by a 
sequence of reactions involving four rounds of chain 
elongation, followed by double-bond insertion by 
B-hydroxydecanoyl thioester dehydrase and three 
additional elongation steps. Another elongation 
cycle produces cis-vaccenic acid. 
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Figure 24.14 The conversion of stearoyl-CoA to oleoyl-CoA in eukaryotes is catalyzed by stearoyl- 
CoA desaturase in a reaction sequence that also involves cytochrome b; and cytochrome b; reductase. 


Two electrons are passed from NADH through the chain of reactions as shown, and two electrons are 
derived from the fatty acyl substrate. 


24.10 The Unsaturation Reaction May Be Followed by Chain Elongation 


Additional chain elongation can occur following this single desaturation reaction. 
The oleoyl-CoA produced can be elongated by two carbons to form a 20:1 cis-A! 
fatty acyl-CoA. If the starting fatty acid is palmitate, reactions similar to the 
preceding scheme yield palmitoleoyl-CoA (16:1 cis-A®), which subsequently can be 
elongated to yield cis-vaccenic acid (18:1 cis-A!!). Similarly, Cie and Cj; fatty acids can 
be elongated to yield C), and C24 unsaturated fatty acids, such as are often found in 
sphingolipids. 


24.1p Mammals Cannot Synthesize Most 
Polyunsaturated Fatty Acids 


Organisms differ with respect to formation, processing, and utilization of polyunsat- 
urated fatty acids. E. coli, for example, does not have any polyunsaturated fatty acids. 
Eukaryotes do synthesize a variety of polyunsaturated fatty acids, certain organisms 
more than others. For example, plants manufacture double bonds between the A® and 
the methyl end of the chain, but mammals cannot. Plants readily desaturate oleic acid 
at the 12-position (to give linoleic acid) or at both the 12- and 15-positions (producing 
linolenic acid). Mammals require polyunsaturated fatty acids but must acquire them 
in their diet. As such, these fatty acids are referred to as essential fatty acids. On 
the other hand, mammals can introduce double bonds between the double bond at the 
8- or 9-position and the carboxyl group. Enzyme complexes in the ER desaturate 
the 5-position, provided a double bond exists at the 8-position, and form a double 
bond at the 6-position if one already exists at the 9-position. Thus, oleate can be 
unsaturated at the 6,7-position to give an 18:2 cis-A°,A° fatty acid. 


24.1q Arachidonic Acid Is Synthesized from Linoleic 
Acid by Mammals 


Mammals can add additional double bonds to unsaturated fatty acids in their diets. 
Their ability to make arachidonic acid from linoleic acid is one example (Figure 24.15). 
This fatty acid is the precursor for prostaglandins (the mediators of pain and inflam- 
mation) and other biologically active derivatives such as leukotrienes. Synthesis 
involves formation of a linoleoyl ester of CoA from dietary linoleic acid, followed by 
introduction of a double bond at the 6-position. The triply unsaturated product is then 
elongated (by malonyl-CoA with a decarboxylation step) to yield a 20-carbon fatty 
acid with double bonds at the 8-, 11-, and 14-positions. A second desaturation reaction 
at the 5-position, followed by a reverse acyl-CoA synthetase reaction (see Chapter 23), 
liberates the product, a 20-carbon fatty acid with double bonds at the 5-, 8-, 11-, and 
14-positions. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


NNN YN Ree coo- 
Linoleic acid (18:229!) 
CoA + ATP 
Acyl-CoA o 
synthetase AMP + B® | 
IN NSN YONI ROO C—S—CoA 
Linoleoyl-CoA (18:24%!?-CoA) 


Desaturation 
2H 
12 9 6 
i C—S—CoA 
Linolenoyl-CoA (18:34%9!?_CoA) | 
O 
Malonyl-CoA Elongation 
CO, + CoA 
14 11 8 
C— S — CoA 
(20:328:11.14_CoA) | 
O 
Desaturation 
2H 1 
14 1 8 5 C—S—CoA 
Arachidonoyl-CoA (20:4458:11.14_Coa) 
coe 
Y CoA o 
14 1 8 5 c—O” 
Arachidonic acid 


24.1r Regulatory Control of Fatty Acid Metabolism Is an Interplay of 
Allosteric Modifiers and Phosphorylation—Dephosphorylation Cycles 


The activities of the enzymes of fatty acid synthesis fluctuate in response to changes 
in physiological state. Fatty acid synthesis is also regulated by the direct action of 
metabolite effectors on key enzymes in the pathway. The control and regulation of 
fatty acid synthesis is intimately related to regulation of fatty acid breakdown, gly- 
colysis, and the TCA cycle. Acetyl-CoA is an important intermediate metabolite in all 
these processes. In these terms, it is easy to appreciate the interlocking relationships 
in Figure 24.16. Malonyl-CoA can act to prevent fatty acyl-CoA derivatives from en- 
tering the mitochondria by inhibiting the carnitine acyltransferase of the outer 
mitochondrial membrane that initiates this transport. In this way, when fatty acid 
synthesis is turned on (as signaled by higher levels of malonyl-CoA), B-oxidation is 
inhibited. As described in section 24.1f, citrate is an important allosteric activator of 
acetyl-CoA carboxylase, and fatty acyl-CoAs are inhibitors. The degree of inhibition 
is proportional to the chain length of the fatty acyl-CoA; longer chains show a higher 
affinity for the allosteric inhibition site on acetyl-CoA carboxylase. Palmitoyl-CoA, 
stearoyl-CoA, and arachidyl-CoA are the most potent inhibitors of the carboxylase. 


24.1s Hormonal Signals Regulate ACC and Fatty Acid Biosynthesis 


Several hormones act to control fatty acid synthesis, especially via regulation of 
acetyl-CoA carboxylase. As described in Section 24.1f, citrate activation and palmitoyl- 
CoA inhibition of acetyl-CoA carboxylase are strongly dependent on the phosphoryla- 
tion state of the enzyme. This provides a crucial connection to hormonal regulation. 
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Figure 24.15 Arachidonic acid is synthesized from 
linoleic acid in eukaryotes. This is the means by 
which animals synthesize fatty acids with double 
bonds at positions other than C-9. 
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Human Biochemistry 


w3 and w6—Essential Fatty Acids with Many Functions 


Linoleic acid and a-linolenic acid are termed essential fatty acids 
because animals cannot synthesize them and must acquire them 
in their diet. Linoleic acid is the precursor of arachidonic acid, 
and both of these are referred to as w6 fatty acids because, count- 
ing from the end (omega, w) carbon of the chain, the first double 
bond is at the sixth position (see figure). Similarly, a-linolenic acid 
is the precursor of eicosapentaenoic acid and docosahexaenoic acid 
(DHA), and these three are termed w3 fatty acids. Vegetable oils are 
rich in linoleic acid and thus satisfy the body’s w6 dietary require- 
ments, whereas fish oils (for example, cod, herring, menhaden, and 
salmon) are rich in w3 fatty acids. 

The w6 fatty acids are precursors of inflammation mediators, 
including the prostaglandins, thromboxanes, and leukotrienes (see 


(a) 


Section 24.3). The w3 fatty acids have beneficial effects in a variety of 
organs and biological processes, including growth regulation, platelet 
activation, and lipoprotein metabolism. The w3 fats are generally car- 
dioprotective, anti-inflammatory, and anticarcinogenic. 

Interestingly, especially high levels of DHA have been found in 
rod cell membranes in animal retina and in neural tissue. DHA is 
approximately 22% of total fatty acids in animal retina and 35% to 
40% of the fatty acids in retinal phosphatidylethanolamine. DHA 
supports neural and visual development, in part because it is a pre- 
cursor for eicosanoids that regulate numerous cell and organ func- 
tions. Infants can synthesize DHA and other polyunsaturated fatty 
acids, but the rates of synthesis are low. Strong evidence exists for 
the importance of these fatty acids in infant nutrition. 


w6 Essential fatty acid 


TIN 


(b) 


The “w” numbering system, where the position of the first double 
bond relative to the methyl (w) group is indicated (red box), is 
useful because the w double-bond position is retained during chain 
elongation and desaturation. (a) w6 fatty acids. (b) w3 fatty acids. 


Linoleic acid (18:2 w6) 


COOH 


Arachidonic acid (20:4 w6) 
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COOH 
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<<<— 


COOH 
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Figure 24.16 Regulation of fatty acid synthesis and 
fatty acid oxidation are coupled as shown. Malonyl- 
CoA, produced during fatty acid synthesis, inhibits 
the uptake of fatty acylcarnitine (and thus fatty acid 
oxidation) by mitochondria. When fatty acyl-CoA 
levels rise, fatty acid synthesis is inhibited and fatty 
acid oxidation activity increases. Rising citrate levels 
(which reflect an abundance of acetyl-CoA) 
similarly signal the initiation of fatty acid synthesis. 
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Many ofthe enzymes that act to phosphorylate acetyl-CoA carboxylase (see Figure 24.4) 
are controlled by hormonal signals. Glucagon is a good example (Figure 24.17). As 
noted in Chapter 22, glucagon binding to membrane receptors activates an intracellular 
cascade involving activation of adenylyl cyclase. Cyclic AMP produced by the cyclase 
activates protein kinase A, which then phosphorylates acetyl-CoA carboxylase. Unless 
citrate levels are high, phosphorylation causes inhibition of fatty acid biosynthesis. The 
carboxylase (and fatty acid synthesis) can be reactivated by a specific phosphatase, 
which dephosphorylates the carboxylase. Also indicated in Figure 24.17 is the simulta- 
neous activation by glucagon of triacylglycerol lipases, which hydrolyze triacylglycerols, 
releasing fatty acids for B-oxidation. Both the inactivation of acetyl-CoA carboxylase 
and the activation of triacylglycerol lipase are counteracted by insulin, whose receptor 
acts to stimulate a phosphodiesterase that converts cAMP to AMP. 


24.2 | How Are Complex Lipids Synthesized? 


Complex lipids consist of backbone structures to which fatty acids are covalently bound. 
Principal classes include the glycerolipids, for which glycerol is the backbone, and sphin- 
golipids, which are built on a sphingosine backbone. The two major classes of glycerolip- 
ids are glycerophospholipids and triacylglycerols. The phospholipids, which include both 
glycerophospholipids and sphingomyelins, are crucial components of membrane struc- 
ture. They are also precursors of hormones such as the eicosanoids (for example, prosta- 
glandins) and signal molecules (such as the breakdown products of phosphatidylinositol). 


24.2 How Are Complex Lipids Synthesized? 837 


Figure 24.17 Hormonal signals regulate fatty acid 
synthesis, primarily through actions on acetyl-CoA 
carboxylase. Availability of fatty acids also depends 
upon hormonal activation of triacylglycerol lipase. 
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Chapter 24 Lipid Biosynthesis 


Different organisms possess greatly different complements of lipids and therefore 
utilize somewhat different lipid biosynthetic pathways. For example, sphingolipids and 
triacylglycerols are produced only in eukaryotes. In contrast, bacteria usually have 
rather simple lipid compositions. Phosphatidylethanolamine accounts for at least 75% 
of the phospholipids in E. coli, with phosphatidylglycerol and cardiolipin accounting 
for most of the rest. E. coli membranes possess no phosphatidylcholine, phosphati- 
dylinositol, sphingolipids, or cholesterol. On the other hand, some bacteria (such as 
Pseudomonas) can synthesize phosphatidylcholine, for example. In this section and the 
one following, we consider some of the pathways for the synthesis of glycerolipids, 
sphingolipids, and the eicosanoids, which are derived from phospholipids. 


24.2a Glycerolipids Are Synthesized by Phosphorylation 
and Acylation of Glycerol 


A common pathway operates in nearly all organisms for the synthesis of 
phosphatidic acid, the precursor to other glycerolipids. Glycerokinase catalyzes 
the phosphorylation of glycerol to form glycerol-3-phosphate, which is then acyl- 
ated at both the l- and 2-positions to yield phosphatidic acid (Figure 24.18). The 
first acylation, at position 1, is catalyzed by glycerol-3-phosphate acyltransferase, 
an enzyme that in most organisms is specific for saturated fatty acyl groups. 
Eukaryotic systems can also use dihydroxyacetone phosphate as a starting point 
for synthesis of phosphatidic acid (Figure 24.18). Again a specific acyltransferase 
adds the first acyl chain, followed by reduction of the backbone keto group by 
acyldihydroxyacetone phosphate reductase, using NADPH as the reductant. Alter- 
natively, dihydroxyacetone phosphate can be reduced to glycerol-3-phosphate by 
glycerol-3-phosphate dehydrogenase. 


24.2b Eukaryotes Synthesize Glycerolipids 

from CDP-Diacylglycerol or Diacylglycerol 
In eukaryotes, phosphatidic acid is converted directly either to diacylglycerol or to 
cytidine diphosphodiacylglycerol (or simply CDP-diacylglycerol; Figure 24.19). From 
these two precursors, all other glycerophospholipids in eukaryotes are derived. 


Human Biochemistry 


Lipins—Phosphatases Essential for Triglyceride Synthesis and Other Functions 


Lipins are phosphatidic acid phosphatases (also known as PAPs) 
widely conserved in eukaryotes. Unlike other enzymes of the 
triacylglycerol biosynthetic pathway, lipins lack transmembrane 
domains and are extensively phosphorylated and thus confined 
to the cytosol until activated. Activation occurs upon PAP bind- 
ing to a complex of two ER/nuclear transmembrane proteins, 
Nem1p and Spo7p, followed by removal of the phosphate groups 
from PAP. The activated lipin-Nem1p-Spo7p complex dephos- 
phorylates phosphatidic acid to produce diacylglycerols, which 
can be acylated to form triacylglycerols. Lipins are critical for 
triacylglycerol biosynthesis in most eukaryotes, and they are also 
essential for maintenance of nuclear structure and organization 
of the ER membrane. 

Many cell types maintain a pool of lipins within the nucleus, 
where they regulate expression of genes encoding fatty acid— 
metabolizing enzymes. Nuclear import of certain lipins in response 
to nutrient depletion induces nuclear envelope remodeling and 
downregulation of certain transcription factors involved in lipid 
synthesis. 


Phosphatidic 
acid 


Lipins 


DAG CDP-DAG 
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Reference 
Siniossoglou, S., 2015. The many faces of lipins. ASBMB Today 14:14-15. 
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Figure 24.18 Synthesis of glycerolipids in eukaryotes begins with the formation of phosphatidic acid, 


which may be formed from dihydroxyacetone phosphate or glycerol as shown. 


Diacylglycerol is a precursor for synthesis of triacylglycerol, phosphatidylethanol- 
amine, and phosphatidylcholine. Triacylglycerol is synthesized mainly in adipose tis- 
sue, liver, and intestines and serves as the principal energy storage molecule in eukary- 
otes. Triacylglycerol biosynthesis in liver and adipose tissue occurs via diacylglycerol 
acyltransferase, an enzyme bound to the cytoplasmic face of the ER. A different route 
is used, however, in intestines. Recall (see Figure 23.3) that triacylglycerols from the 
diet are broken down to 2-monoacylglycerols by specific lipases. Acyltransferases 
then acylate 2-monoacylglycerol to produce new triacylglycerols (Figure 24.20). 
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Figure 24.19 Diacylglycerol and CDP-diacylglycerol are the principal precursors of glycerolipids in 
eukaryotes. Phosphatidylethanolamine and phosphatidylcholine are formed by reaction of diacylglycerol 
with CDP-ethanolamine or CDP-choline, respectively. 
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24.2c Phosphatidylethanolamine Is Synthesized 
from Diacylglycerol and CDP-Ethanolamine 


Phosphatidylethanolamine synthesis begins with phosphorylation of ethanolamine 
to form phosphoethanolamine (see Figure 24.19). The next reaction involves trans- 
fer of a cytidylyl group from CTP to form CDP-ethanolamine and pyrophosphate. 
As always, PP; hydrolysis drives this reaction forward. A specific phosphoethanol- 
amine transferase then links phosphoethanolamine to the diacylglycerol backbone. 
Biosynthesis of phosphatidylcholine is entirely analogous because animals can syn- 
thesize it directly. All of the choline used in this pathway must be acquired from the 
diet. On the other hand, yeast, certain bacteria, and animal livers can convert phos- 
phatidylethanolamine to phosphatidylcholine by methylation reactions involving 
S-adenosylmethionine. 


24.2d Exchange of Ethanolamine for Serine Converts 
Phosphatidylethanolamine to Phosphatidylserine 


Mammals synthesize phosphatidylserine (PS) in a calcium ion—dependent reaction 
involving aminoalcohol exchange (Figure 24.21). The enzyme catalyzing this reaction 
is associated with the ER and will accept phosphatidylethanolamine (PE) and other 
phospholipid substrates. A mitochondrial PS decarboxylase can subsequently con- 
vert PS to PE. No other pathway converting serine to ethanolamine has been found. 


24.2e Eukaryotes Synthesize Other Phospholipids 
Via CDP-Diacylglycerol 


Eukaryotes also use CDP-diacylglycerol, derived from phosphatidic acid, as a pre- 
cursor for several other important phospholipids, including phosphatidylinositol 
(PI), phosphatidylglycerol (PG), and cardiolipin (Figure 24.22). PI accounts for only 
about 2% to 8% of the lipids in most animal membranes, but breakdown products of 
PI, including inositol-1,4,5-trisphosphate and diacylglycerol, are second messengers 
in a vast array of cellular signaling processes. 
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Figure 24.21 The interconversion of phosphatidylethanolamine and phosphatidylserine in mammals. 


24.2 How Are Complex Lipids Synthesized? 841 


CH, OH 
2-Monoacylglycerol 
Oo 
| Il 
R= C—Scoa 


Monoacylglycerol acyltransferase 


CoASH 
i 
R= €— 0— CH, 
O 
Il 
Ry — C— O— ii 
CH, OH 
Diacylglycerol 
O 


Diacylglycerol acyltransferase 


CoASH 
= 
© 
m] 
i= C— O— CHo 
1 | 
Il 
Ry — C— O— CH 


Triacylglycerol 


Figure 24.20 Triacylglycerols are formed primarily 
by the action of acyltransferases on monoacylglycerol 
and diacylglycerol. 
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Figure 24.22 CDP-diacylglycerol is a precursor of phosphatidylinositol, phosphatidylglycerol, and 
cardiolipin in eukaryotes. 
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24.2f Dihydroxyacetone Phosphate Is a Precursor to the Plasmalogens 


Certain glycerophospholipids possess alkyl or alkenyl ether groups at the l-position 
in place of an acyl ester group. These glyceroether phospholipids are synthesized 
from dihydroxyacetone phosphate (Figure 24.23). Acylation of dihydroxyacetone 
phosphate (DHAP) is followed by an exchange reaction, in which the acyl group is 
removed as a carboxylic acid and a long-chain alcohol adds to the 1-position. This 


Figure 24.23 Biosynthesis of plasmalogens in animals. (1) Acylation at C-1 is followed by 


| (2) exchange of the acyl group for a long-chain alcohol. (3) Reduction of the keto group at 


C-2 is followed by (4 and 5) transferase reactions, which add an acyl group at C-2 and a 
polar head-group moiety (as shown here for phosphoethanolamine), and a (6) desaturase 
reaction that forms a double bond in the alkyl chain. The first two enzymes are of 


| cytoplasmic origin, and the last transferase is located at the endoplasmic reticulum. 
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long-chain alcohol is derived from the corresponding acyl-CoA by means of an acyl- 
CoA reductase reaction involving oxidation of two molecules of NADH. The 2-keto 
group of the DHAP backbone is then reduced to an alcohol, followed by acylation. 
The resulting l-alkyl-2-acylglycero-3-phosphate can react in a manner similar to 
phosphatidic acid to produce ether analogs of phosphatidylcholine, phosphatidyl- 
ethanolamine, and so forth (Figure 24.23). In addition, specific desaturase enzymes 
associated with the ER can desaturate the alkyl ether chains of these lipids as shown. 
The products, which contain a,8-unsaturated ether-linked chains at the C-1 position, 
are plasmalogens; they are abundant in cardiac tissue and in the central nervous sys- 
tem. The desaturases catalyzing these reactions are distinct from but similar to those 
that introduce unsaturations in fatty acyl-CoAs. 


24.2g Platelet-Activating Factor Is Formed by Acetylation 
of 1-Alkyl-2-Lysophosphatidylcholine 


A particularly interesting ether phospholipid with unusual physiological properties, 
1-alkyl-2-acetylglycerophosphocholine, also known as platelet-activating factor, pos- 
sesses an alkyl ether at C-1 and an acetyl group at C-2 (Figure 24.24). The very short 
chain at C-2 makes this molecule much more water soluble than typical glycerolipids. 
Platelet-activating factor displays a dramatic ability to dilate blood vessels (and thus 
reduce blood pressure in hypertensive animals) and to aggregate platelets. 


24.2h Sphingolipid Biosynthesis Begins with Condensation 
of Serine and Palmitoyl-CoA 


Sphingolipids, ubiquitous components of eukaryotic cell membranes, are present at 
high levels in neural tissues. The myelin sheath that insulates nerve axons is particu- 
larly rich in sphingomyelin and other related lipids. Prokaryotic organisms normally 
do not contain sphingolipids. Sphingolipids are built upon sphingosine backbones 
rather than glycerol. The initial reaction, which involves condensation of serine and 
palmitoyl-CoA with release of bicarbonate, is catalyzed by 3-ketosphinganine synthase, 
a PLP-dependent enzyme (Figure 24.25). Reduction of the ketone product to form 
sphinganine is catalyzed by 3-ketosphinganine reductase, with NADPH as a reactant. 
In the next step, sphinganine is acylated to form N-acyl sphinganine, which is then 
desaturated to form ceramide. Sphingosine itself does not appear to be an intermediate 
in this pathway in mammals. 


24.2i Ceramide Is the Precursor for Other 
Sphingolipids and Cerebrosides 


Ceramide is the building block for all other sphingolipids. Sphingomyelin, for example, 
is produced by transfer of phosphocholine from phosphatidylcholine (Figure 24.26). 
Glycosylation of ceramide by sugar nucleotides yields cerebrosides, such as galacto- 
sylceramide, which makes up about 15% of the lipids of myelin sheath structures. 
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Figure 24.24 Platelet-activating factor, formed from 1-alkyl-2-lysophosphatidylcholine by acetylation 
at C-2, is degraded by the action of acetylhydrolase. 
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24.2 How Are Complex Lipids Synthesized? 
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Figure 24.25 Biosynthesis of sphingolipids in animals begins with the 3-ketosphinganine synthase 
reaction, a PLP-dependent condensation of palmitoyl-CoA and serine. Subsequent reduction of the 
keto group, acylation, and desaturation (via reduction of an electron acceptor, X) form ceramide, the 
precursor of other sphingolipids. 
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Figure 24.26 Glycosylceramides (such as galactosylceramide), gangliosides, and sphingomyelins are 
synthesized from ceramide in animals. 
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24.3 How Are Eicosanoids Synthesized, and What Are Their Functions? 


Cerebrosides that contain one or more sialic acid (N-acetylneuraminic acid) moieties 
are called gangliosides. Several dozen gangliosides have been characterized, and the 
general form of the biosynthetic pathway is illustrated for the case of ganglioside 
GM, (Figure 24.26). Sugar units are added to the developing ganglioside from 
nucleotide derivatives, including UDP-—N-acetylglucosamine, UDP-galactose, and 
UDP-glucose. 


24.3 | How Are Eicosanoids Synthesized, 
and What Are Their Functions? 


Eicosanoids are signaling molecules derived from arachidonic acid. They are so 
named because they are all derived from 20-carbon fatty acids and they are ubiqui- 
tous breakdown products of phospholipids. In response to appropriate stimuli, cells 
activate the breakdown of selected phospholipids (Figure 24.27). Phospholipase A, 
(see Chapter 8) selectively cleaves fatty acids from the C-2 position of phospholipids. 
Often these are unsaturated fatty acids, among which is arachidonic acid. Arachidonic 
acid may also be released from phospholipids by the combined actions of phospholi- 
pase C (which yields diacylglycerols) and diacylglycerol lipase (which releases fatty 
acids). 


24.3a Eicosanoids Are Local Hormones 


Animal cells can modify arachidonic acid and other polyunsaturated fatty acids, in 
processes often involving cyclization and oxygenation, to produce so-called local 
hormones that (1) exert their effects at very low concentrations and (2) usually act 
near their sites of synthesis. These substances include the prostaglandins (PG) (Figure 
24.27), described in this section, as well as thromboxanes (Tx), leukotrienes, and other 
hydroxyeicosanoic acids. Thromboxanes, discovered in blood platelets (thrombocytes), 
are cyclic ethers (TXB, is actually a hemiacetal; see Figure 24.27) with a hydroxyl 
group at C-15. 


24.3b Prostaglandins Are Formed from Arachidonate 
by Oxidation and Cyclization 


All prostaglandins are cyclopentanoic acids derived from arachidonic acid. The 
biosynthesis of prostaglandins is initiated by an enzyme associated with the ER, 
called prostaglandin endoperoxide H synthase (PGHS), also known as cyclooxygenase 
(COX). The enzyme catalyzes simultaneous oxidation and cyclization of arachidonic 
acid. The enzyme is viewed as having two distinct activities, COX and peroxidase 
(POX), as shown in Figure 24.28. 


24.3c A Variety of Stimuli Trigger Arachidonate 
Release and Eicosanoid Synthesis 


The release of arachidonate and the synthesis or interconversion of eicosanoids can 
be initiated by a variety of stimuli, including histamine, hormones such as epinephrine 
and bradykinin, proteases such as thrombin, and even serum albumin. An important 
mechanism of arachidonate release and eicosanoid synthesis involves tissue injury 
and inflammation. When tissue damage or injury occurs, special inflammatory cells, 
monocytes and neutrophils, invade the injured tissue and interact with the resident 
cells (such as smooth muscle cells and fibroblasts). This interaction typically leads to 
arachidonate release and eicosanoid synthesis. Examples of tissue injury in which 
eicosanoid synthesis has been characterized include heart attack (myocardial infarc- 
tion), rheumatoid arthritis, and ulcerative colitis. 
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Figure 24.27 Arachidonic acid, derived from breakdown of phospholipids (PL), is the precursor 

of prostaglandins, thromboxanes, and leukotrienes. The letters used to name the prostaglandins are 
assigned on the basis of similarities in structure and physical properties. The class denoted PGE, 

for example, consists of B-hydroxyketones that are soluble in ether, whereas PGF denotes 1,3-diols 
that are soluble in phosphate buffer. PGA denotes prostaglandins possessing a,8-unsaturated ketones. 
The number following the letters refers to the number of carbon-carbon double bonds. Thus, PGE 
contains two double bonds. 
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A Deeper Look 


The Molecular Basis for the Action of Nonsteroidal 


Anti-inflammatory Drugs 


Prostaglandins are potent mediators of inflammation. The first and 
committed step in the production of prostaglandins from arachi- 
donic acid is the bis-oxygenation of arachidonate to prostaglandin 
PGG). This is followed by reduction to PGH; in a peroxidase reac- 
tion. Both these reactions are catalyzed by PGHS or COX. This 
enzyme is inhibited by the family of drugs known as nonsteroidal 
anti-inflammatory drugs, or NSAIDs. Aspirin, ibuprofen, flurbi- 
profen, and acetaminophen (trade name Tylenol) are all NSAIDs. 

There are two isoforms of COX in animals: COX-1, which car- 
ries out normal, physiological production of prostaglandins, and 
COX-2, which is induced by cytokines, mitogens, and endotoxins in 
inflammatory cells and is responsible for the production of prosta- 
glandins in inflammation. 

The enzyme structure shown in panel (a) is that of residues 33 to 
583 of COX-1 from sheep, inactivated by ibuprofen (cyan). 


(b) 


PDB id = 1EQG 


the other hand, COX-1 has bulkier isoleucine (red) at position 523, 
which prevents binding of the inhibitor. 

COX-2 inhibitors were introduced as pain medications in 1997, 
and by 2004 nearly half of the 100 million prescriptions written 
annually for NSAIDs in the United States were COX-2 inhibi- 
tors. However, several COX-2 inhibitors were taken off the U.S. 
market in late 2004 and early 2005, when their use was linked to 


The COX active site lies at the end of a long, narrow, hydropho- 
bic tunnel or channel. 

The COX-1 structure shown in panel (a) has a molecule of ibu- 
profen bound in the tunnel. Deep in the tunnel, at the far end, lies 
Tyr’, a catalytically important residue. Heme-dependent peroxi- 
dase activity is implicated in the formation of a proposed Tyr*® rad- 
ical, which is required for COX activity. Aspirin and other NSAIDs 
block the synthesis of prostaglandins by filling and blocking the 
tunnel, preventing the migration of arachidonic acid to Tyr?’ in the 
active site at the back of the tunnel. 

Why do the new COX-2 inhibitors bind to (and inhibit) COX-2 
but not COX-1? A single amino acid substitution makes all the dif- 
ference. Panel (b) shows an overlay of COX-1 (LEQG) and COX-2 
(1CX2) structures. COX-2 has a valine (blue) at position 523, which 
leaves room for binding of a celebrex-like inhibitor (orange). On 
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heart attacks and strokes in a small percentage of users. Since 
then, prescriptions for COX-2 inhibitors have dropped by 65%. 
Interestingly, although COX-2 inhibitors were originally intended 
to alleviate pain without the risk of adverse gastrointestinal ef- 
fects, less than 5% of patients that used COX-2 prescriptions at 
the peak of their popularity were at high risk for these adverse 
effects. 


In 1971, biochemist John Vane was the first to show that aspirin (acetylsalicylate; 
Figure 24.29) exerts most of its effects by inhibiting the biosynthesis of prostaglandins. 
Its site of action is PGHS. COX activity is destroyed when aspirin O-acetylates Ser™° on 
the enzyme. From this you may begin to infer something about how prostaglandins (and 
aspirin) function. Prostaglandins are known to enhance inflammation in animal tissues. 
Aspirin exerts its powerful anti-inflammatory effect by inhibiting this first step in their 
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Figure 24.28 Prostaglandin endoperoxide H 
synthase (PGHS), the enzyme that converts 
arachidonic acid to prostaglandin PGH)>, possesses 
two distinct activities: cyclooxygenase (COX) and a 
glutathione-dependent hydroperoxidase (POX). The 
mechanism of the reaction begins with hydrogen 5, 8,11,14-Eicosatetraenoic acid 
atom abstraction by a tyrosine radical on the (arachidonic acid) 
enzyme, followed by rearrangement to cyclize and 
incorporate two oxygen molecules. Reduction of the 
peroxide at C15 completes the reaction. COX is the 
site of action of aspirin and other analgesic agents. 


Peroxide radical 


a 4 


oun) Na ee i ian 
COOH 
Olm: a i N 
S 1 i N 4 oo $ 
j | 
OH 
PGG, 
POX < ` | 
n“ kee SS 
COOH 
A ee 
HOY “H 
PGH, 
`x 
b 
(a) (b) coo- | 
F oea 
HO e: NH Acetylsalicylate (aspirin) 
Acetaminophen [ 1 
Ser —OH 7 > Ser— O C CH; 
i CH; Active COOT Inactive 
HC A \ | cyclooxygenase OH cyclooxygenase 
CH— CHg CH— COOH 
HC = 
Ibuprofen Salicylate 


Figure 24.29 The structures of several common analgesic agents. (a) Acetaminophen is marketed under 
the trade name Tylenol. Ibuprofen is sold as Motrin, Nuprin, and Advil. (b) Acetylsalicylate (aspirin) 
inhibits the COX activity of endoperoxide synthase via acetylation (covalent modification) of Ser>°°. 


synthesis. Aspirin does not have any measurable effect on the peroxidase activity of the 
synthase. Other nonsteroidal anti-inflammatory agents, such as ibuprofen (Figure 24.29) 
and phenylbutazone, inhibit COX by competing at the active site with arachidonate or 
with the peroxyacid intermediate (PGG», as in Figure 24.28). 
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24.4 | How Is Cholesterol Synthesized? 


The most prevalent steroid in animal cells is cholesterol (Figure 24.30). Plants contain 
no cholesterol, but they do contain other steroids very similar to cholesterol in structure 
(see the Human Biochemistry box “Plant Sterols and Stanols—Natural Cholesterol 
Fighters” in Section 8.7). Cholesterol serves as a crucial component of cell membranes 
as well as a precursor to bile acids (such as cholate, glycocholate, taurocholate) and 
steroid hormones (such as testosterone, estradiol, progesterone). Also, vitamin D; is 
derived from 7-dehydrocholesterol, the immediate precursor of cholesterol. Liver is the 
primary site of cholesterol biosynthesis. 


24.4a Mevalonate Is Synthesized from Acetyl-CoA 
Via HMG-CoA Synthase 


The cholesterol biosynthetic pathway begins in the cytosol with the synthesis of 
mevalonate from acetyl-CoA (Figure 24.31). The first step is the B-ketothiolase— 
catalyzed Claisen condensation of two molecules of acetyl-CoA to form acetoacetyl- 
CoA. In the next reaction, acetyl-CoA and acetoacetyl-CoA join to form 3-hydroxy- 
3-methylglutaryl-CoA, which is abbreviated HMG-CoA. The reaction, a second 
Claisen condensation, is catalyzed by HMG-CoA synthase. The third step in the 
pathway is the rate-limiting step in cholesterol biosynthesis. Here, HMG-CoA under- 
goes two NADPH-dependent reductions to produce 3R-mevalonate (Figure 24.32). 
The reaction is catalyzed by HMG-CoA reductase, a 97 kDa glycoprotein that spans 
the ER membrane with its active site facing the cytosol. As the rate-limiting step, 
HMG-CoA reductase is the principal site of regulation in cholesterol synthesis. 
Three different regulatory mechanisms are involved: 


1. Phosphorylation by cAMP-dependent protein kinase inactivates the reductase. 
This inactivation can be reversed by two specific phosphatases (Figure 24.33). 

2. Degradation of HMG-CoA reductase. This enzyme has a half-life of only 3 hours, 
and the half-life itself depends on cholesterol levels: High [cholesterol] means a 
short half-life for HMG-CoA reductase. 

3. Gene expression. Cholesterol levels control the amount of mRNA. If [cholesterol] 
is high, levels of mRNA coding for the reductase are reduced. If [cholesterol] is low, 
more mRNA is made. 


24.4b Why Can’t Thiolase Be Used in Fatty Acid Synthesis? 


If acetate units can be condensed by the thiolase reaction to yield acetoacetate in the 
first step of cholesterol synthesis, why couldn’t this same reaction also be used in fatty 
acid synthesis, avoiding all the complexity of the fatty acyl synthase? The answer is 
that the thiolase reaction is more or less reversible but slightly favors the cleavage 
reaction. In the cholesterol synthesis pathway, subsequent reactions, including HMG- 
CoA reductase and the following kinase reactions, pull the thiolase-catalyzed con- 
densation forward. However, in the case of fatty acid synthesis, a succession of eight 
thiolase condensations would be distinctly unfavorable from an energetic perspective. 
Given the necessity of repeated reactions in fatty acid synthesis, it makes better ener- 
getic sense to use a reaction that is favorable in the desired direction. 


24.4 How Is Cholesterol Synthesized? 


Figure 24.30 The structure of cholesterol, drawn (a) in the traditional planar motif and (b) in a form 
that more accurately describes the conformation of the ring system. 
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Figure 24.31 The biosynthesis of 3R-mevalonate 
from acetyl-CoA. 
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Figure 24.32 Two successive NADPH-dependent reductions convert the thioester, HMG-CoA, to a 
primary alcohol. (a) A reaction mechanism for HMG-CoA reductase. (b) Homotetramer of the 
catalytic portion of the human HMG-CoA reductase. 


24.4c Squalene Is Synthesized from Mevalonate 


Squalene, a triterpene formed from mevalonate, is a precursor in the synthesis of 
cholesterol and all other sterols. The biosynthesis of squalene involves conversion of 
mevalonate to two key 5-carbon intermediates, isopentenyl pyrophosphate and dimeth- 
ylallyl pyrophosphate, which join to yield farnesyl pyrophosphate and then squalene. A 
series of four reactions converts mevalonate to isopentenyl pyrophosphate and then 
to dimethylallyl pyrophosphate (Figure 24.34). The first three steps each consume an 
ATP, two for the purpose of forming a pyrophosphate at the 5-position and the third 
to drive the decarboxylation and double-bond formation in the third step. Pyrophos- 
phomevalonate decarboxylase phosphorylates the 3-hydroxyl group, and this is fol- 
lowed by trans elimination of the phosphate and carboxyl groups to form the double 
bond in isopentenyl pyrophosphate. Isomerization of the double bond yields the 
dimethylallyl pyrophosphate. Condensation of these two 5-carbon intermediates pro- 
duces geranyl pyrophosphate; addition of another 5-carbon isopentenyl group gives 
farnesyl pyrophosphate. Both steps in the production of farnesyl pyrophosphate oc- 
cur with release of pyrophosphate, hydrolysis of which drives these reactions forward. 
Note too that the linkage of isoprene units to form farnesyl pyrophosphate occurs in 
a head-to-tail fashion. This is the general rule in biosynthesis of molecules involving 
isoprene linkages. The next step—the joining of two farnesyl pyrophosphates to 
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Figure 24.33 HMG-CoA reductase activity is 
modulated by a cycle of phosphorylation and 
dephosphorylation. 


24.4 How Is Cholesterol Synthesized? 
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Figure 24.34 The conversion of mevalonate to squalene. In the last step, two farnesyl-PP condense 


to form squalene. 
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Human Biochemistry 


Statins Lower Serum Cholesterol Levels 


Chemists and biochemists have long sought a means of lowering se- 
rum cholesterol levels to reduce the risk of heart attack and cardio- 
vascular disease. Because HMG-CoA reductase is the rate-limiting 
step in cholesterol biosynthesis, this enzyme is a likely drug target. 
Mevinolin, also known as lovastatin (see accompanying figure), was 
isolated from a strain of Aspergillus terreus and developed at Merck 
Sharpe & Dohme for this purpose. It is now a widely prescribed 
cholesterol-lowering drug. Dramatic reductions of serum choles- 
terol are observed at dosages of 20 to 80 mg per day. 

Lovastatin is administered as an inactive lactone. After oral in- 
gestion, it is hydrolyzed to the active mevinolinic acid, a competitive 


(a) 


wr 


CH; 


1 R=H Mevinolin (Lovastatin, MEVACOR®) 
2 R=CH; Synvinolin (Simvastatin, ZOCOR®) 


(b) HO 


Lipitor® 
(Atorvastatin) 


HMG-CoA reductase with 
NADP? (magenta), HMG (blue), and 
CoA (green) (PDB id = 1DQA). 


inhibitor of the reductase with a Kı of 0.6 nM. Mevinolinic acid is 
thought to behave as a transition-state analog (see Chapter 14) of 
the tetrahedral intermediate formed in the HMG-CoA reductase 
reaction (see figure). 

Derivatives of lovastatin have been found to be even more potent 
in cholesterol-lowering trials. Synvinolin lowers serum cholesterol 
levels at much lower dosages than lovastatin. Lipitor, shown bound 
at the active site of HMG-CoA reductase, was the best-selling drug 
of all time, with total sales exceeding $125 billion over a 15-year 
period. 


SCoA 


Tetrahedral intermediate 
in HMG-CoA reductase 
mechanism 


Mevalonate 


Lipitor® (ved) bound at the 
active site of HMG-CoA 
reductase (PDB id = 1HWK). 


The structures of (a) (inactive) lovastatin, (active) mevinolinic acid, mevalonate, and (b) Lipitor (atorvastatin). 
(c) HMG-CoA reductase with NADP*, HMG, and CoA. (d) Lipitor bound at the HMG-CoA reductase active site. 


produce squalene—is a tail-to-tail condensation and represents an important excep- 
tion to the general rule. 

Squalene monooxygenase, an enzyme bound to the ER, converts squalene to 
squalene-2,3-epoxide (Figure 24.35). This reaction employs FAD and NADPH as 
coenzymes and requires O, as well as a cytosolic protein called soluble protein activator. 
A second ER membrane enzyme, 2,3-oxidosqualene lanosterol cyclase, catalyzes the 
second reaction, which involves a succession of 1,2 shifts of hydride ions and methyl 
groups and produces lanosterol. 
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Figure 24.35 Cholesterol is synthesized from 
squalene via lanosterol. The primary route from 
lanosterol involves 20 steps, the last of which 
converts 7-dehydrocholesterol to cholesterol. 

An alternative route produces desmosterol as the 
penultimate intermediate. 
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George Holton/Photo Researchers, Inc 


Figure 24.36 Photograph of an arterial plaque. The 
view is into the artery (orange), with the plaque 
shown in yellow at the back. 


24.4d Conversion of Lanosterol to Cholesterol 
Requires 20 Additional Steps 


Although lanosterol may appear similar to cholesterol in structure, another 20 steps 
are required to convert lanosterol to cholesterol (Figure 24.35). The enzymes re- 
sponsible for this are all associated with the ER. The primary pathway involves 
7-dehydrocholesterol as the penultimate intermediate. An alternative pathway, also 
composed of many steps, produces the intermediate desmosterol. Reduction of the 
double bond at C-24 yields cholesterol. Cholesterol esters—a principal form of 
circulating cholesterol—are synthesized by acyl-CoA:cholesterol acyltransferases 
(ACATs) on the cytoplasmic face of the ER. 


24.5 | How Are Lipids Transported Throughout the Body? 


Lipids are universal energy sources for the organism and must circulate to the sites 
where they are utilized. When most lipids circulate in the body, they do so in the form 
of lipoprotein complexes. Simple, unesterified fatty acids are bound to serum albumin 
and other proteins in blood plasma, but phospholipids, triacylglycerols, cholesterol, 
and cholesterol esters are all transported in the form of lipoproteins. At various sites 
in the body, lipoproteins interact with specific receptors and enzymes that transfer or 
modify their lipid cargoes. It is now customary to classify lipoproteins according to 
their densities (Table 24.1). The densities are related to the relative amounts of lipid 
and protein in the complexes. Because most proteins have densities of about 1.3 to 1.4 
g/mL, and lipid aggregates usually possess densities of about 0.8 g/mL, the more pro- 
tein and the less lipid in a complex, the denser the lipoprotein. Thus, there are high- 
density lipoproteins (HDLs), low-density lipoproteins (LDLs), intermediate-density 
lipoproteins (IDLs), very-low-density lipoproteins (VLDLs), and also chylomicrons. 
Chylomicrons have the lowest protein-to-lipid ratio and thus are the lowest-density 
lipoproteins. They are also the largest. 


24.5a Lipoprotein Complexes Transport 
Triacylglycerols and Cholesterol Esters 


HDL and VLDL are assembled primarily in the ER of liver cells (with smaller amounts 
produced in the intestine), whereas chylomicrons form in the intestine. LDL is not 
synthesized directly but rather is made from VLDL. LDL appears to be the major 
circulatory complex for cholesterol and cholesterol esters. The primary task of 
chylomicrons is to transport triacylglycerols. Despite all this, it is extremely important 
to note that each of these lipoprotein classes contains some of each type of lipid. The 
relative amounts of HDL and LDL are important in the disposition of cholesterol in 
the body and in the development of arterial plaques (Figure 24.36). The structures of the 
various lipoproteins are approximately similar, and they consist of a core of mobile 
triacylglycerols or cholesterol esters surrounded by a single layer of phospholipid, into 


| Table 241 | “ZB Composition and Properties of Human Lipoproteins 


Composition (% dry weight) 


Lipoprotein Density Diameter ———— a 
Class (g/mL) (nm) Protein Cholesterol Phospholipid Triacylglycerol 
HDL 1063-1210 5-15 338 ss 9 o g 
LDL 1.019-1.063 18-28 25 50 21 4 
IDL 1.006-1.019 25-50 18 29 22 31 
VLDL 0.95-1.006 30-80 10 22 18 50 
Chylomicrons <0.95 100-500 1-2 8 7 84 


Adapted from Brown, M., and Goldstein, J. In Braunwald, E., et al., eds., Harrison’s Principles of Internal 
Medicine, 11th ed, 1987. New York: McGraw-Hill; and Vance, D., and Vance, J., eds. Biochemistry of Lipids and 
Membranes, 1985. Menlo Park, CA: Benjamin/Cummings. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Human Biochemistry 


24.5 How Are Lipids Transported Throughout the Body? 857 


APOC3—An Apolipoprotein That Regulates Plasma Triglyceride Levels 


Elevated plasma triglycerides (>150 mg/dL) are a demonstrated 
risk factor for cardiovascular disease and heart attacks. Life- 
style factors such as smoking, lack of exercise, and weight gain 
all contribute to high triglyceride levels. However, heritability 
also plays a role in individual variation in triglyceride levels. 
For example, one in 20 people in Amish communities are ge- 
netically predisposed to triglyceride levels of 50 or less and can 
enjoy diets high in fat with no ill effects. At the other extreme, 
familial chylomicronemia syndrome (FCS) is a serious, often life- 
threatening, hereditary condition affecting one in a million in- 
dividuals in which triglyceride levels often exceed 2000 mg/dL. 
Analysis of data from the Exome Sequencing Project of the Na- 
tional Heart, Lung, and Blood Institute has revealed that four loss- 
of-function mutations in APOC3, a gene encoding apolipoprotein 
C3 (apoC3), are associated with remarkably low plasma triglyceride 


levels. Individuals carrying at least one of these mutations were 
found to have 39% lower triglycerides than noncarriers and a 40% 
lower risk of coronary heart disease. 

Volanesorsen, sold under the brand name Waylivra by Ionis 
Pharmaceuticals, is an antisense oligonucleotide drug that targets 
messenger RNA for APOC3. It is approved for use in the European 
Union. In clinical trials, participants with FCS who were given vo- 
lanesorsen had an average 77% reduction in the level of triglycerides 
in the blood. 
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which is inserted a mixture of cholesterol and proteins (Figure 24.37). Note that the 
phospholipids are oriented with their polar head groups facing outward to interact with 
solvent water and that the phospholipids thus shield the hydrophobic lipids inside from 
the solvent water outside. The proteins also function as recognition sites for the various 
lipoprotein receptors throughout the body. A number of different apoproteins have 
been identified in lipoproteins (Table 24.2), and others may exist as well. The apoproteins 
have an abundance of hydrophobic amino acid residues, as is appropriate for interactions 
with lipids. A cholesterol ester transfer protein also associates with lipoproteins. 


24.5b Lipoproteins in Circulation Are Progressively 
Degraded by Lipoprotein Lipase 


The liver and intestines of animals are the primary sources of circulating lipids. 
Chylomicrons carry triacylglycerol and cholesterol esters from the intestines to other 
tissues, and VLDLs carry lipid from the liver, as shown in Figure 24.38. At various 
target sites, particularly in the capillaries of muscle and adipose cells, these particles 


(a) (b) 


Figure 24.37 A model for the structure of a typical 
lipoprotein. (a) A core of cholesterol and cholesteryl 
esters is surrounded by a phospholipid (monolayer) 
membrane. Apolipoprotein A-I is modeled here as a 
long amphipathic a-helix, with the nonpolar face of 
the helix embedded in the hydrophobic core of the 
lipid particle and the polar face of the helix exposed 
to solvent. (b) A ribbon diagram of apolipoprotein 
A-I. (Adapted from Borhani, D. W., Rogers, D. P., 
Engler, J. A., and Brouillette, C. G., 1997. Crystal 
structure of truncated human apolipoprotein A-I 
suggests a lipid-bound conformation. Proceedings of 
the National Academy of Sciences 94:12291—12296.) 
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| Table 24.2 | YZ) Apoproteins of Human Lipoproteins 


Concentration 


in Plasma 
Apoprotein M, (mg/100 mL) Distribution 
A-1 28,300 90-120 Principal protein in HDL 
A-2 8700 30-50 Occurs as dimer mainly in HDL 
B-48 240,000 <5 Found only in chylomicrons 
B100 500,000 80-100 Principal protein in LDL 
C-1 7000 4-7 Found in chylomicrons, VLDL, HDL 
C-2 8800 3-8 Found in chylomicrons, VLDL, HDL 
C-3 8800 8-15 Found in chylomicrons, VLDL, IDL, HDL 
D 32,500 8-10 Found in HDL 
E 34,100 3-6 Found in chylomicrons, VLDL, IDL, HDL 


Adapted from Brown, M., and Goldstein, J. In Braunwald, E., et al., eds., Harrison’s Principles of Internal Medicine, 11th 
ed, 1987. New York: McGraw-Hill; and Vance, D., and Vance, J., eds. Biochemistry of Lipids and Membranes, 1985. 
Menlo Park, CA: Benjamin/Cummings. 


@ Synthesis of apoproteins, phosphatidylcholine, 
triacylglycerol, cholesterol, cholesterol esters 
occurs in the endoplasmic reticulum 


C) a ® Assembly of components into prelipoprotein 
particles in the ER, then transfer to Golgi 


© Golgi processes the particles with additional 
phospholipids and perhaps also cholesterol 
and cholesterol esters added 


into the circulation 
Liver cell 


Extracellular space 


Figure 24.38 Lipoprotein components are synthesized predominantly in the ER of liver cells. 
Following assembly of lipoprotein particles (red dots) in the ER and processing in the Golgi, 
lipoproteins are packaged in secretory vesicles for export from the cell (via exocytosis) and released 
into the circulatory system. 


are degraded by lipoprotein lipase, which hydrolyzes triacylglycerols. Lipase action 
causes progressive loss of triacylglycerol (and apoprotein) and makes the lipoproteins 
smaller. This process gradually converts VLDL particles to IDL and then LDL par- 
ticles, which are either returned to the liver for reprocessing or redirected to adipose 
tissues and adrenal glands. Every 24 hours, nearly half of all serum LDL is removed 
from circulation in this way. The LDL binds to specific LDL receptors, which cluster 
in domains of the plasma membrane known as coated pits (invaginations of the mem- 
brane discussed in subsequent paragraphs). These domains eventually invaginate to 
form coated vesicles (Figure 24.39), which pinch off from the plasma membrane and 
form endosomes (literally “bodies inside” the cell). In the low pH environment of the 
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Figure 24.39 Endocytosis and degradation of lipoprotein particles. (ACAT is acyl-CoA cholesterol 
acyltransferase.) 


endosome, the LDL particles dissociate from their receptors. The endosomes then 
fuse with lysosomes, and the LDLs are degraded by lysosomal acid lipases. 

HDLs have much longer life spans in the body (5 to 6 days) than other lipoproteins. 
Newly formed HDL contains virtually no cholesterol ester. However, over time, choles- 
terol esters are accumulated through the action of lecithin: cholesterol acyltransferase 
(LCAT), a 59 kDa glycoprotein associated with HDLs. Another associated protein, 
cholesterol ester transfer protein, transfers some of these esters to VLDL and LDL. 
Alternatively, HDLs function to return cholesterol and cholesterol esters to the liver. 
This latter process apparently explains the correlation between high HDL levels and 
reduced risk of cardiovascular disease. (High LDL levels, on the other hand, are corre- 
lated with an increased risk of coronary artery and cardiovascular disease.) 


24.5c The Structure of the LDL Receptor Involves Five Domains 


The LDL receptor in plasma membranes (Figure 24.40) consists of 839 amino acid 
residues and is composed of five domains, two of which contain multiple subdomains. 
The N-terminal LDL-binding domain (292 residues) contains seven cysteine-rich 
repeats, denoted R1 to R7. The next segment (417 residues) contains three epidermal 
growth factor repeats, as well as a B-propellor module. This is followed in the sequence 
by a 58-residue segment of O-linked oligosaccharides, a 22-residue membrane- 
spanning segment, and a 50-residue segment extending into the cytosol. The cluster- 
ing of receptors prior to the formation of coated vesicles requires the presence of this 
cytosolic segment. Note that the LDL particle binds specifically to the receptor at the 
fourth and fifth cysteine-rich repeats (R4 and RS). 
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Figure 24.40 The structure of the LDL receptor. 
The amino-terminal binding domain is responsible 
for recognition and binding of LDL apoprotein. 
The B-100 apolipoprotein of the LDL particle is 
presumed to bind to the fourth and fifth cysteine- 
rich repeats (R4 and R5). The O-linked 
oligosaccharide-rich domain may act as a molecular 
spacer, raising the binding domain above the 
glycocalyx. The cytosolic domain is required for 
aggregation of LDL receptors during endocytosis. 


Figure 24.39 shows the release of LDL particles in endosomes that pinch off from 
the plasma membrane when cells take up LDLs. What molecular events trigger the 
release of LDL particles? A collaboration by three Nobel laureates has provided an 
answer. Johann Deisenhofer, Michael Brown, and Joseph Goldstein determined the 
structure of the extracellular domain of the LDL receptor at pH 5.3, the typical pH 
inside endosomes. At this low pH, the receptor polypeptide is folded back on itself, with 
the B-propellor domain associated with R4 and RS, the two repeats that normally bind 
the LDL particle (Figure 24.41). The implication is that the B-propellor displaces the 
LDL particle in the lower pH environment of the endosome. What residues at the inter- 
face between the propellor and the R4 and R5 repeats act as the pH sensors? Three 
histidines at the propellor-R4/R5 interface—His!*’, His, and His°*°—are the likely 
pH-sensing residues. His!” lies at the tip of a loop on R5, whereas His*® and His>® are 
on the surface of the propellor domain (Figure 24.41). These three His residues form a 
cluster at the three-way junction between R4, R5, and the B-propellor. 


24.5d Defects in Lipoprotein Metabolism Can 
Lead to Elevated Serum Cholesterol 


The mechanism of LDL metabolism and the various defects that can occur therein 
have been studied extensively by Brown and Goldstein. Familial hypercholesterolemia 
is the term given to a variety of inherited metabolic defects that lead to greatly ele- 
vated levels of serum cholesterol, much of it in the form of LDL particles. The general 
genetic defect responsible for familial hypercholesterolemia is the absence or dysfunc- 
tion of LDL receptors in the body. Only about half the normal level of LDL receptors 
is found in heterozygous individuals (persons carrying one normal gene and one 
defective gene). Homozygotes (with two copies of the defective gene) have few, if any, 
functional LDL receptors. In such cases, LDLs (and cholesterol) cannot be absorbed, 
and plasma levels of LDL (and cholesterol) are very high. Typical heterozygotes 
display serum cholesterol levels of 300 to 400 mg/dL, but homozygotes carry serum 
cholesterol levels of 600 to 800 mg/dL or even higher. There are two possible causes of 
an absence of LDL receptors—either receptor synthesis does not occur at all, or the 
newly synthesized protein does not successfully reach the plasma membrane due to 
faulty processing in the Golgi or faulty transport to the plasma membrane. Even when 
LDL receptors are made and reach the plasma membrane, they may fail to function 
for two reasons. They may be unable to form clusters competent in coated pit forma- 
tion because of folding or sequence anomalies in the carboxy-terminal domain, or 
they may be unable to bind LDL because of sequence or folding anomalies in the 
LDL-binding domain. 


(b) (c) 
R5 R4 
R3 


R2 


B-propellor 


Figure 24.41 The B-propellor domain of the LDL receptor displaces LDL particles in endosomes. 
(a) The cysteine-rich repeats R4 and R5 are the site of LDL particle binding. Each repeat 
contains two loops connected by three disulfide bonds. The second loop in each repeat carries 
acidic residues (Asp and Glu) and forms a Ca?*-binding site. (b) The B-propellor domain of the 
LDL receptor. (c) The structure of the LDL receptor extracellular domain at pH 5.3 (similar to 
that found in endosomes). The B-propellor domain is associated with cysteine-rich domains R4 
and RS. A cluster of histidines (His!*°, His*?, and His°*°) and a variety of hydrophobic and 
charged interactions mediate the association (PDB id = 1N7D). 
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New Cholesterol-Lowering Drugs Target PCSK9, an LDL Receptor Chaperone 


Michael Brown and Joseph Goldstein (recipients of the Nobel Prize 
in Physiology or Medicine, 1985) established the genetic connection 
between LDL and heart attacks, showing that loss-of-function 
mutations in LDL receptors raise serum LDL and increase the risk 


Ser153 


Catalytic domain 


Pro-domain 


PCSK9 is a 692-residue protein consisting of an N-terminal prodomain, a 
catalytic domain (residues 153-447) and a C-terminal domain composed of 
three B—sandwich folds. Autocatalytic cleavage between Gln-152 and Ser-153 
leaves the prodomain tightly bound (PDB id = 2PMW). Thr-61 marks the 


of heart attacks. The opposite correlation—that mutations that 
increase LDL receptor function will lower serum LDL and protect 
against coronary heart disease and heart attacks—also turns out 
to be true. These latter mutations occur in a gene that encodes a 
protein called PCSK9 (for proprotein convertase subtilisin/kexin 
type 9) that is secreted into the circulation by liver and other or- 
gans. PCSK9 binds to liver LDL receptors (LDLRs) and disrupts 
the recycling mechanism that normally returns the receptors to the 
cell surface after internalization (Figure 24.39). Thus, the normal 
function of PCSK9 results in a decrease in the number of LDL 
receptors and an increase in serum LDL. 

Interestingly, PCSK9 is a zymogen precursor that undergoes au- 
tocatalytic cleavage, leaving an N-terminal prodomain (see figure) 
tightly bound in its active site. Although the prodomain is required 
for proper folding of the rest of the protein, it renders the protease 
domain inactive. Once secreted from the liver cell, PCSK9 acts as 
a molecular chaperone that binds to LDL receptors on the cell 
surface. That is, PCSK9 does not function as an enzyme; it acts as 
a chaperone! The PCSK9-LDLR complex is taken up by liver cells 
and directed to lysosomes and degraded. 

The function of PCKS9 suggested a novel cholesterol-lowering 
strategy: If the levels of LDL receptors could be raised, more 
LDL would be taken from the blood, lowering serum levels of 
cholesterol. Two new PCKS9 inhibitors have received Food and 
Drug Administration (FDA) approval as drugs. Both are mono- 
clonal antibodies against PCKS9 (Praluent from Sanofi and Re- 
patha from Amgen). They attach to PCKS9, preventing it from 
binding to LDL receptors, thus aborting its role as a chaperone 
that targets LDL receptors for destruction in the lysosome. 
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24.6 | How Are Bile Acids Biosynthesized? 


Bile acids, which exist mainly as bile salts, are polar carboxylic acid derivatives of 
cholesterol that are important in the digestion of food, especially the solubilization of 
ingested fats. The Na* and K* salts of glycocholic acid and taurocholic acid are the 
principal bile salts (Figure 24.42). Glycocholate and taurocholate are conjugates of 
cholic acid with glycine and taurine, respectively. Because they contain both nonpolar 
and polar domains, these bile salt conjugates are highly effective as detergents. These 
substances are made in the liver, stored in the gallbladder, and secreted as needed into 
the intestines. 

The formation of bile salts represents the major pathway for cholesterol degrada- 
tion. The first step involves hydroxylation at C-7 (Figure 24.42). 7a-Hydroxylase, 
which catalyzes the reaction, is a mixed-function oxidase involving cytochrome 
P-450. Mixed-function oxidases use O, as substrate. One oxygen atom goes to hydrox- 
ylate the substrate while the other is reduced to water (Figure 24.43). The function of 
cytochrome P-450 is to activate O, for the hydroxylation reaction. Such hydroxyl- 
ations are quite common in the synthetic routes for cholesterol, bile acids, and steroid 
hormones and also in detoxification pathways for aromatic compounds. Several of 
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Niemann-Pick Type C Disease—A Hydrophobic Handoff Fumbled 


Niemann-Pick type C (NPC) is a lipid storage disease that causes 
a variety of neurological symptoms and arises from abnormal 
accumulation of cholesterol and other lipids in the lysosomes of 
cells in the liver, spleen, and brain. In most cases, neurological symp- 
toms begin appearing between the ages of 4 and 10. NPC is always 
fatal. Although it occurs in all races and populations, a higher inci- 
dence of NPC occurs among Ashkenazi Jews, French Canadians in 
Nova Scotia, Maghrebs in northern Africa, and Hispanic people in 
New Mexico and Colorado. 

Nearly all cases of NPC are caused by mutations in the genes for 
two proteins, termed NPC1 and NPC2. NPC2 is a soluble protein 
in the lysosomal lumen. NPC1 is a lysosomal membrane protein 
containing three luminal domains—an N-terminal domain (NTD), a 
middle luminal domain (MLD), a C-terminal domain (CTD) and a 
transmembrane domain (TM). The transmembrane domain includes 
a sterol-sensing domain (SSD) that is highly conserved in membrane 
proteins involved in cholesterol metabolism. Cryo-EM studies of the 
NPC1—NPC2 complex at pH 5.5 (the pH of the lysosomal lumen) 


Cytosol 


The hydrophobic handoff of cholesterol from NPC2 to NPC1. A 
conformation change in NTD transfers cholesterol to the tunnel of the 
SSD and then to the cytosol. Adapted from Qian, H., et al., 2020. 
Structural basis of low-pH-dependent lysosomal cholesterol egress by 
NPC1 and NPC2. Cell 182:98-111. 


reveal a central tunnel that is enclosed by the MLD and CTD domains 
and that connects the lumenal domains to the SSD. The lumenal end 
of the tunnel opens to the NTD and the rest traverses through the in- 
terface between the MLD and CTD domains, all the way down to the 
membrane-facing surface of the sterol-sensing domain. Cholesterol 
binds to NPC2 with its 8-carbon side chain buried within a hydropho- 
bic binding pocket and the 38-hydroxyl group exposed on the surface 
(see figure). In contrast, NPC1 binds cholesterol in the opposite orien- 
tation, with the 38-hydroxyl deep in the binding site and the side chain 
partially exposed. 

Brown and Goldstein have proposed a hydrophobic handoff 
model for the interaction of NPC1 and NPC2 that enables cholesterol 
transfer between these proteins and that is essential for export of 
cholesterol from the lysosomes. Structural studies suggest a K-turn 
trajectory for cholesterol delivery. Much resembling a K-turn for 
steering a vehicle, cholesterol first binds NPC2 exposing its hydroxyl 
end, reverses into NPC1(NTD), and eventually enters the central tun- 
nel with the hydrophobic side facing the membrane. 


Structure of the NPCI-NPC2 complex NPC! is gold. NPC2 is aqua. 
Cholesterol is shown in gray. (PDB id = 6W5V). 
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Figure 24.42 Cholic acid, a bile salt, is synthesized 
from cholesterol via 7a-hydroxycholesterol. 
Conjugation with taurine or glycine produces 
taurocholic acid and glycocholic acid, respectively. 
Taurocholate and glycocholate are freely water 
soluble and are highly effective detergents. 
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Human Biochemistry 


Steroid 5a-Reductase—A Factor in Male Baldness, 


Prostatic Hyperplasia, and Prostate Cancer 


An enzyme that metabolizes testosterone may be involved in the benign 
conditions of male-pattern baldness (also known as androgenic alopecia) 
and benign prostatic hyperplasia (prostate gland enlargement), as well as 
potentially fatal prostate cancers. Steroid Sa-reductases are membrane- 
bound enzymes that catalyze the NADPH-dependent reduction of 
testosterone to dihydrotestosterone (DHT) (see accompanying figure). 
Two isoforms of Sa-reductase have been identified. In humans, the 


NHC(CHs)5 
l 
C=O 
Gre 
CH; 
0) i © N H 
Dihydrotestosterone Finasteride 


type I enzyme predominates in the sebaceous glands of skin and liver, 
whereas type I is most abundant in the prostate, seminal vesicles, liver, 
and epididymis. DHT is a contributory factor in male baldness and 
prostatic hyperplasia, and it has also been shown to act as a mitogen 
(a stimulator of cell division). For these reasons, Sa-reductase inhibi- 
tors are potential candidates for treatment of these human conditions. 

Finasteride (see figure) is a specific inhibitor of type II Sa- 
reductase. It has been used clinically for treatment of benign prostatic 
hyperplasia, and it is also marketed under the trade name Propecia 
by Merck as a treatment for male baldness. Type II Sa-reductase 
inhibitors may also be potential therapeutic agents for treatment 
of prostate cancer. Somatic mutations occur in the gene for type II 
5a-reductase during prostate cancer progression. 

Because type I 5a-reductase is the predominant form of the 
enzyme in human scalp, the mechanism of finasteride’s promotion 
of hair growth in men with androgenic alopecia has been uncer- 
tain. However, scientists at Merck have shown that, whereas type I 
5a-reductase predominates in sebaceous ducts of the skin, type II 
5a-reductase is the only form of the enzyme present in hair follicles. 
Thus, finasteride’s therapeutic effects may arise from inhibition of 
the type II enzyme in the hair follicle itself. 
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Figure 24.43 The mixed-function oxidase activity of 
7a-hydroxylase. 


these are considered in the next section. 7a-Hydroxycholesterol is the precursor for 
cholic acid. Conjugation with taurine or glycine converts cholic acid to taurocholic 
and glycocholic acids, respectively. 


24.7 | How Are Steroid Hormones 
| Synthesized and Utilized? 


Steroid hormones are crucial signal molecules in mammals, controlling metabolism, 
salt balance, and sexual function. Their biosynthesis begins with the desmolase reac- 
tion, which converts cholesterol to pregnenolone (Figure 24.44). Desmolase is found 
in the mitochondria of tissues that synthesize steroids (mainly the adrenal glands and 
gonads). Desmolase activity includes two hydroxylases and utilizes cytochrome 
P-450. 

Steroid hormones act in a different manner from most hormones we have consid- 
ered. In many cases, they do not bind to plasma membrane receptors but rather pass 
easily across the plasma membrane. Steroids may bind directly to receptors in the 
nucleus or cytosolic steroid hormone receptors, which then enter the nucleus. In the 
nucleus, the hormone-receptor complex binds directly to specific nucleotide sequences 
in DNA, increasing transcription of DNA to RNA. 


24.7a Pregnenolone and Progesterone Are the Precursors 
of All Other Steroid Hormones 


Pregnenolone is transported from the mitochondria to the ER, where a hydroxyl 
oxidation and migration of the double bond yield progesterone. Pregnenolone 
synthesis in the adrenal cortex is activated by adrenocorticotropic hormone (ACTH), a 
peptide of 39 amino acid residues secreted by the anterior pituitary gland. 

Progesterone is secreted from the corpus luteum during the latter half of the 
menstrual cycle and prepares the lining of the uterus for attachment of a fertilized 
ovum. If an ovum attaches, progesterone secretion continues to ensure the successful 
maintenance of a pregnancy. Progesterone is also the precursor for synthesis of the 
other sex hormone steroids and the corticosteroids. Male sex hormone steroids are called 
androgens, and female hormones, estrogens. Testosterone is an androgen synthesized in 
males primarily in the testes (and in much smaller amounts in the adrenal cortex). 
Androgens are necessary for sperm maturation. Even nongonadal tissue (liver, brain, 
and skeletal muscle) is susceptible to the effects of androgens. 

Testosterone is also produced in the ovaries (and in much smaller amounts in the 
adrenal glands) of females as a precursor for the estrogens. B-Estradiol is the most 
prominent estrogen (Figure 24.44). 


24.7b Cortisol and Other Corticosteroids Regulate 
a Variety of Body Processes 


Corticosteroids, including the glucocorticoids and mineralocorticoids, are made by the 
cortex of the adrenal glands on top of the kidneys. Cortisol (Figure 24.44) is representa- 
tive of the glucocorticoids, a class of compounds that (1) stimulate gluconeogenesis and 
glycogen synthesis in liver (by promoting the synthesis of PEP carboxykinase, 
fructose-1,6-bisphosphatase, glucose-6-phosphatase, and glycogen synthase); (2) inhibit 
protein synthesis and stimulate protein degradation in peripheral tissues such as muscle; 
(3) inhibit allergic and inflammatory responses; (4) exert an immunosuppressive effect, 
inhibiting DNA replication and mitosis as well as repressing the formation of antibod- 
ies and lymphocytes; and (5) inhibit formation of fibroblasts involved in healing wounds 
and slow the healing of broken bones. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


24.7 How Are Steroid Hormones Synthesized and Utilized? 


o 
| 


C—H 


~~ Isocaproic aldehyde 


3B-hydroxysteroid 
O dehydrogenase /A°-A* 
isomerase 
(Endoplasmic reticulum) 


— -> 


Desmolase 
(Mitochondria) 


HO HO 


Cholesterol Pregnenolone 


Testosterone 


Cortisol 
HC 


HO 
-Estradiol 


Aldosterone, the most potent of the mineralocorticoids (Figure 24.44), is involved in the 
regulation of sodium and potassium balances in tissues. Aldosterone increases the kidney’s 
capacity to absorb Na*, Cl”, and H,O from the glomerular filtrate in the kidney tubules. 

The dramatic effects of androgens on protein biosynthesis have led many athletes to 
use synthetic androgens, which go by the blanket term anabolic steroids. Despite numer- 
ous warnings from the medical community about side effects, which include kidney and 
liver disorders, sterility, and heart disease, abuse of such substances is epidemic. 
Stanozolol (Figure 24.45) was one of the agents found in the blood and urine of Ben 
Johnson following his record-setting performance in the 100-meter dash in the 1988 
Olympic Games. Because use of such substances is disallowed, Johnson lost his gold 
medal and Carl Lewis was declared the official winner. 
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Figure 24.44 The steroid hormones are synthesized 
from cholesterol, with intermediate formation of 
pregnenolone and progesterone. Testosterone, the 
principal male sex hormone steroid, is a precursor 
to B-estradiol. Cortisol, a glucocorticoid, and 
aldosterone, a mineralocorticoid, are also derived 
from progesterone. 


Figure 24.45 The structure of stanozolol, an 
anabolic steroid. 
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Chapter 24 Lipid Biosynthesis 


Foundational Biochemistry Things You Should Know after Reading Chapter 24. 


Formation of malonyl-CoA activates acetate units for fatty acid 
synthesis. 


Fatty acid biosynthesis depends on the reductive power of NADPH. 


The citrate-malate—pyruvate shuttle provides cytosolic acetate units 
and reducing equivalents for fatty acid synthesis. 


Acetate units are committed to fatty acid synthesis by formation of 
malonyl-CoA. 


Acetyl-CoA carboxylase is biotin dependent and displays ping-pong 
kinetics. 


Acetyl-CoA carboxylase in animals is a multifunctional protein. 


Phosphorylation of ACC modulates activation by citrate and inhibi- 
tion by palmitoyl-CoA. 


Acyl carrier proteins carry the intermediates in fatty acid synthesis. 


Fatty acid synthesis takes place in multienzyme complexes in some 
organisms. 


Decarboxylation drives the condensation of acetyl-CoA and malonyl- 
CoA in fatty acid synthesis. 


Reduction of the B-carbonyl group of a newly added acetate unit in 
fatty acid synthesis proceeds via reduction to a hydroxyl group, dehy- 
dration to form a double bond, and reduction of the double bond to 
form a single bond. 


The multiple enzyme domains of eukaryotic fatty acid synthases are 
arrayed on large protein structures termed megasynthases. 


Palmitate is the primary product of the fatty acid synthase. Longer 
chains are made through special elongation reactions, both in mito- 
chondria and on the surface of the endoplasmic reticulum. 


Unsaturation reactions occur in eukaryotes in the middle of an ali- 
phatic chain. 


Mammals cannot synthesize most polyunsaturated fatty acids, but 
arachidonic acid is synthesized from linoleic acid. 


Glucagon and insulin regulate acetyl-CoA carboxylase and fatty acid 
biosynthesis. Fatty acid synthesis is regulated by allosteric effectors, 
hormones, and phosphorylation/dephosphorylation. 


Glycerolipids are synthesized by phosphorylation and acylation of 
glycerol. 


Eukaryotes synthesize glycerolipids from CDP-diacylglycerol or 
diacylglycerol. 


Phosphatidylethanolamine is synthesized from diacylglycerol and 
CDP-ethanolamine. 


Exchange of ethanolamine for serine converts phosphatidylethanol- 
amine to phosphatidylserine. 


CDP-diacylglycerol is the precursor for phosphatidylinositol, phos- 
phatidylglycerol and diphosphatidylglycerol (cardiolipin). 


Sphingolipid biosynthesis begins with condensation of serine and 
palmitoyl-CoA, forming ceramides in four steps. Ceramides are the 
precursors for other sphingolipids and cerebrosides. 


Eicosanoids, including prostaglandins, thromboxanes, and leukotri- 
enes, are derived from arachidonic acid. 


Aspirin and the NSAIDs are inhibitors of the cyclooxygenase that is 
the first step in the conversion of arachidonic acid to the eicosanoids 
that facilitate inflammation. 


Cholesterol synthesis begins with formation of mevalonate from 
acetyl-CoA via the HMG-CoA synthase and HMG-CoA reductase 
reactions. 


Squalene is synthesized from mevalonate. Cyclization of squalene yields 
lanosterol, and cholesterol is formed after 20 steps from lanosterol. 


Most lipids, including phospholipids, triacylglycerols, cholesterol, and 
cholesterol esters, circulate in the body in the form of lipoprotein com- 
plexes. 


Lipoproteins are progressively degraded by lipoprotein lipases, con- 
verting chylomicrons to VLDLs, then to IDLs and LDLs, which are 
taken up by LDL receptors in the liver or redirected to adipose tissues 
and adrenal glands. 


HDLs function to return cholesterol and cholesterol esters to the liver. 


Defects in lipoprotein metabolism can lead to elevated serum choles- 
terol and resultant cardiovascular damage. 


Bile acids are synthesized via hydroxylation of cholesterol and sub- 
sequent steps that form cholic acid, taurocholic acid, and glycocholic 
acid. 


Pregnenolone and progesterone are the precursors of all other steroid 
hormones, including testosterone, estradiol, cortisol, and aldosterone. 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. Explaining the Stoichiometry of Fatty Acid Synthesis Carefully count 


and account for each of the atoms (that is, explain where each one 
comes from) and charges in the equations for the synthesis of 
palmitoyl-CoA, the synthesis of malonyl-CoA, and the overall 
reaction for the synthesis of palmitoyl-CoA from acetyl-CoA. 


. Tracing Carbon Atom Incorporation in Fatty Acids (Integrates with 


Chapters 18 and 19.) Use the relationships shown in Figure 24.1 to 
determine which carbons of glucose will be incorporated into 
palmitic acid. Consider the cases of both citrate that is immediately 
exported to the cytosol following its synthesis and citrate that enters 
the TCA cycle. 


. Modeling the Regulation of Acetyl-CoA Carboxylase Based on the 


information presented in the text and in Figures 24.4 and 24.5, 
suggest a model for the regulation of acetyl-CoA carboxylase. 


Consider the possible roles of subunit interactions, phosphorylation, 
and conformation changes in your model. 


Mechanism of Acetyl-CoA Carboxylase (Integrates with Chapter 19.) 
What part of the enzyme mechanism is similar between citrate 
synthase (see Chapter 19) and acetyl-CoA carboxylase? 


. Estimating Active Site Separation in Animal Fatty Acid Synthase 


Consider the role of the pantothenic acid groups in animal FAS and 
the size of the pantothenic acid group itself, and estimate a maximal 
separation between the malonyl transferase and the B-ketoacyl-ACP 
synthase active sites. 


6. Assessing the Stoichiometry of Stearoyl-CoA Desaturase Carefully 


study the reaction mechanism for the stearoyl-CoA desaturase in 
Figure 24.14, and account for all of the electrons flowing through the 
reactions shown. Also account for all of the hydrogen and oxygen 
atoms involved in this reaction, and convince yourself that the 
stoichiometry is correct as shown. 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


. Hormonal Regulation of Fatty Acid Biosynthesis Glucagon and insulin 


are hormones released when blood glucose levels are low or high, 
respectively. Why does it make biochemical sense that glucagon inhibits 
and insulin activates fatty acid biosynthesis? (Hint: see Figure 24.17.) 


. Understanding the Stoichiometry of Glycerophospholipid Synthesis 


Write a balanced, stoichiometric reaction for the synthesis of 
phosphatidylethanolamine from glycerol, fatty acyl-CoA, and 
ethanolamine. Make an estimate of the A G°’ for the overall 
process. 


. Triglyceride Biosynthesis and Glycolysis Describe two biochemical 


links between the biosynthesis of triacylglycerides and the glycolysis 
pathway. 


Understanding the Stoichiometry of Cholesterol Synthesis Write a 
balanced, stoichiometric reaction for the synthesis of cholesterol 
from acetyl-CoA. 


Unspecific Squalene Monooxygenase and Squalene Epoxide Imagine 
the possibility that the squalene monooxygenase enzyme is not very 
specific and that it oxidized a different bond in squalene, as indicated 
in the figure. What could be the reaction products after the action of 
monooxygenase and cyclase (see Figure 24.35)? 


Tracing Carbon Atom Incorporation in Cholesterol Trace each of the 
carbon atoms of mevalonate through the synthesis of cholesterol, and 
determine the source (that is, the position in the mevalonate structure) 
of each carbon in the final structure. 


Understanding the Functional Role of O-linked Oligosaccharides in the 
LDL Receptor Suggest a structural or functional role for the O-linked 
saccharide domain in the LDL receptor (Figure 24.40). 


Assessing the Consequences of Low Cellular Levels of CTP Identify 
the lipid synthetic pathways that would be affected by abnormally 
low levels of CTP. 


Determining the ATP Requirements for Fatty Acid Synthesis Determine 
the number of ATP equivalents needed to form palmitic acid from 
acetyl-CoA. (Assume for this calculation that each NADPH is 

worth 3.5 ATPs.) 


Understanding the Mechanism of the 3-Ketosphinganine Synthase 
Reaction Write a reasonable mechanism for the 3-ketosphinganine 
synthase reaction, remembering that it is a pyridoxal phosphate— 
dependent reaction. 


Understanding the Involvement of FAD in the Conversion of Stearic 
Acid to Oleic Acid Why is the involvement of FAD important in the 
conversion of stearic acid to oleic acid? 


Understanding the Mechanism of the HMG-CoA Synthase Reaction 
Write a suitable mechanism for the HMG-CoA synthase reaction. 
What is the chemistry that drives this condensation reaction? 


Understanding the Mechanism of the B-Ketoacyl-ACP Synthase 
Reaction Write a suitable reaction mechanism for the B-ketoacyl-ACP 
synthase, showing how the involvement of malonyl-CoA drives this 
reaction forward. 


Studying the Structure of the Fatty Acyl Synthase In the FAS 
megasynthase structures, the multiple functional sites must lie 
within reach of the ACP and its bound acyl group substrates. 
Examine the mammalian FAS structure (see Figure 24.11), and 
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determine the distances between the various functional sites. You 
could approach this problem either by using a molecular modeling 
program (such as PyMol at www.pymol.org) or by consulting 
appropriate references (the following end-of-chapter reference is a 
good place to start: Maier, T., Leibundgut, M., and Ban, N., 2008. 
Science 321:1315—1322). Present quantitative evidence that the 
intersite distances can be traversed appropriately by the ACP group. 


. Modeling the LDL Receptor In the LDL receptor structure shown in 


Figure 24.41c, the B-propellor interaction with domains R4 and R5 is 
partly stabilized by salt bridges between acidic residues on R4 and R5 
that also coordinate Ca?* ions. Use a molecular modeling program or 
consult the literature to identify at least two such interactions. Two 
suitable references are Beglova, N., and Blacklow, S. C., 2005. Trends 
in Biochemical Sciences 30:309-316; and Rudenko, G., Henry, L., 

et al., 2002. Science 298:2353-2358. 


. Understanding the Function of the LDL Receptor Insights into 


the function of LDL receptors in displacing LDL particles in 
endosomes have come from an unlikely source: a study of LDL 
receptor binding by a human rhinovirus HRV2 (a common cold 
virus). Consult suitable references to learn how this study provided 
support for the model of LDL particle displacement presented in 
this chapter. Good references are Blacklow, S. C., 2004. Nature 
Structural and Molecular Biology 11:388—390; Verdaguer, N., et al., 
2004. Nature Structural and Molecular Biology 11:429-434; 

and Beglova, N., and Blacklow, S. C., 2005. Trends in Biochemical 
Sciences 30:309-316. 


. Detoxification of Oxygen (Integrates with Chapter 23.) An argument 


can be made that aerobic biochemistry is nothing more than the 
catalytic conversion of oxygen, an aggressive oxidizing agent, into 
harmless water. Which aspects of fatty acid biochemistry support this 
point of view (apart from their contribution to driving the electron 
transport/oxidative phosphorylation)? Describe one involvement of 
fatty acids in the detoxification of oxygen from Chapter 23 and this 
chapter. 


. Hibernating bears such as grizzlies undergo seasonal metabolic 


stresses. During late summer, grizzlies gorge, consuming more than 
50,000 kilocalories of food and gaining up to 16 pounds every day. 
Then, hibernating for up to 7 months, they subsist solely on stored 
body fat, without eating, drinking, urinating, or defecating. What 
might be the biochemical consequences of these dietary and meta- 
bolic changes? What signaling pathways and regulations might be 
involved? When you have composed your answer, consult the 
following reference to see how your ideas compare with the research 
presented: Nelson, O. L., Jansen, H. T., Galbreath, E., et al., 2014. 
Grizzly bears exhibit augmented insulin sensitivity while obese prior 
to a reversible insulin resistance during hibernation. Cell Metabolism 
20:376-382. 


. Most populations of phospholipids in cell membranes exhibit 


enrichment in particular fatty acids. For example, the phosphatidyl- 
inositol in mammalian membranes is predominantly 1-stearoyl- 
2-archidonoy! phosphatidylinositol. Assuming that this enrichment 
occurs after incorporation of PI into the membrane, propose a series 
of steps or a cycle of reactions that could achieve this enrichment. 
When you have constructed your model, consult the following 
reference to see how your model compares with the research 
presented: Epand, R. M., 2015. Enrichment of acyl chains in the 
lipids of the phosphatidylinositol cycle. ASBMB Today 14:16-17. 


. Comparing the Reaction of Fatty Acid Synthesis and the 


Tricarboxylic Acid Cycle Consider the synthesis of linoleic acid 
from palmitic acid, and identify a series of three consecutive 
reactions that exhibit chemistry similar to three reactions in the 
tricarboxylic acid cycle. 


. Writing an Equation to Describe the Synthesis of Behenic Acid Rewrite 


the equation in Section 24.1 to describe the synthesis of behenic acid 
(see Table 8.1). 
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PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at 
http://www. psafeonline.net.) 


28. Using the PSAFE structure for Niemann-Pick Cl, describe the 
structure of the sterol-binding pocket, and explain the interaction 
involved in cholesterol binding. 


29. Using the PSAFE structure for HMG-CoA reductase (HMGR), 
explain how HMGR is regulated. Why do researchers seek ways to 
inhibit HMGR? 


30. Using the PSAFE structure for cyclooxygenase-2, explain the 
mechanisms by which NSAIDs inhibit this enzyme. Discuss several 
examples. 


Think-Pair-Share Question 


Imagine yourself as a researcher at a pharmaceutical company. You have 
been tasked with the development of a drug that could lower triacylglyc- 
eride storage in the adipose tissues. Focusing only on triacylglyceride or 
fatty acid biochemistry (discussed in Chapters 22, 23, or 24), which 
enzymes, receptors, pathways, or processes could you suggest as potential 
targets for pharmacological intervention? List at least six possible targets. 
(Hint: Recall that enzymes can be activated or inhibited; receptors can be 
stimulated or blocked.) 
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Nitrogen Acquisition and 
Amino Acid Metabolism 


Nitrogen is an essential nutrient for all cells. Amino acids provide nitrogen for the synthesis of 
other nitrogen-containing biomolecules. Excess amino acids in the diet can be converted into 
a-keto acids and used for energy production. What are the biochemical pathways that form ammo- 
nium from inorganic nitrogen compounds prevalent in the inanimate environment? How is ammo- 
nium incorporated into organic compounds? How are amino acids synthesized and degraded? 


mino acids and nucleotides, as well as their polymeric forms (proteins and nucleic 

acids), are nitrogen-containing molecules upon which cell structure and function 
rely. In these compounds, the nitrogen atoms are typically bound to carbon and 
hydrogen atoms. That is, the nitrogen atom is in a reduced state. On the other hand, the 
prevalent forms of nitrogen in the environment are inorganic and oxidized, 
N- (dinitrogen gas) and NO; (nitrate ions) being the most abundant species. The two 
principal routes for nitrogen acquisition from the inanimate environment, nitrate as- 
similation and nitrogen fixation, lead to formation of ammonium ions (NH,’). 
Reactions that incorporate NH,* into organic linkage (the reactions of ammonium 
assimilation) are few, but they serve as the source of N atoms for all biologically 
significant nitrogen-containing molecules. Among these, glutamine synthetase merits 
particular attention because it conveys several important lessons in metabolic regula- 
tion. This chapter presents the pathways of amino acid biosynthesis and degradation; 
those involving the sulfur-containing amino acids provide an opportunity to introduce 
aspects of sulfur metabolism. 


25.1 | Which Metabolic Pathways Allow Organisms 


to Live on Inorganic Forms of Nitrogen? 
The prevalent forms of inorganic nitrogen in the environment can be interconverted 


through oxidation—reduction reactions carried out by organisms exploiting them to 
meet their particular metabolic needs. Collectively, these reactions constitute a 


25 


I was determined to know beans. 


Henry David Thoreau (1817-1862) 
The Writings of Henry David Thoreau, vol. 2, 
p. 178, Houghton Mifflin (1906) 


< Only plants and certain microorganisms are able to 
transform the oxidized, inorganic forms of nitrogen 
available in the inanimate environment into reduced, 
biologically useful forms. Soybean plants can meet their 
nitrogen requirements both by assimilating nitrate and, 
in symbiosis with bacteria, fixing N2. 


25.1 Which Metabolic Pathways Allow 
Organisms to Live on Inorganic 
Forms of Nitrogen? 


25.2 What Is the Metabolic Fate 
of Ammonium? 

25.3 What Regulatory Mechanisms Act 
on Escherichia coli Glutamine 
Synthetase? 


25.4 How Do Organisms Synthesize 
Amino Acids? 


How Does Amino Acid Catabolism 
Lead into Pathways of Energy 
Production? 


25.5 
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Nitrate reductase has two structural domains, the 
cytochrome b domain (top) and the molybdenum 


Corn nitrate reductase cytochrome 
domain (FAD shown in yellow) 


Fungal nitrate reductase molybdenum 


cofactor domain (Mo cofactor in gold) (2) NO. + 8 H* + 6 e —~+NH,* + 2 H,O (25.2) 
(PDB id = 2BIH) 
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nitrogen cycle. Ammonium ions, the most reduced of these substances, is the precur- 
sor to virtually all N-containing biological molecules. 


25.la Nitrogen Is Cycled between Organisms and the Inanimate 
Environment 


Nitrogen acquisition by biological systems is accompanied by its reduction to ammo- 
nium ion (NH,*) and the incorporation of NH,* into organic linkage as amino or 
amido groups (Figure 25.1). The reduction of NO; to NH4” occurs in green plants, 
various fungi, and certain bacteria in a two-step metabolic pathway known as nitrate 
assimilation. The formation of NH,* from N, gas is termed nitrogen fixation. N, fixa- 
tion is an exclusively prokaryotic process, although bacteria in symbiotic association 
with certain green plants also carry out nitrogen fixation. No animals are capable of 
either nitrogen fixation or nitrate assimilation, so they are totally dependent on plants 
and microorganisms for the synthesis of organic nitrogenous compounds, such as 
amino acids and proteins, to satisfy their requirements for this essential element. 

Animals release excess nitrogen in a reduced form, either as NH,* or as organic nitrog- 
enous compounds such as urea. The release of N occurs both during life and as a conse- 
quence of microbial decomposition following death. Various bacteria return the reduced 
forms of nitrogen back to the environment by oxidizing them. The oxidation of NH,* to 
NO,- by nitrifying bacteria, a group of chemoautotrophs, provides the sole source of chem- 
ical energy for the life of these microbes. Nitrate nitrogen also returns to the atmosphere as 
N; as a result of the metabolic activity of denitrifying bacteria. These bacteria are capable 
of using NO;~ and similar oxidized, inorganic forms of nitrogen as electron acceptors in 
place of O, in oxidative phosphorylation pathways. The NO;~ is reduced ultimately to 
dinitrogen (N2). These bacteria thus deplete the levels of combined nitrogen, that is, N joined 
with other elements in chemical compounds. Combined nitrogen is important as natural 
fertilizer. However, the denitrifying activity of bacteria is exploited in water treatment 
plants to reduce the load of combined nitrogen that might otherwise enter lakes, streams, 
and bays. 


(PDB id = 1CNE) 


25.1b Nitrate Assimilation Is the Principal Pathway 
for Ammonium Biosynthesis 
Nitrate assimilation occurs in two steps: the two-electron reduction of nitrate to 


nitrite, catalyzed by nitrate reductase (Equation 25.1), followed by the six-electron 
reduction of nitrite to ammonium, catalyzed by nitrite reductase (Equation 25.2). 


(1) NO; + 2H* + 2e —+NO, + H,O (25.1) 


Nitrate assimilation is the predominant means by which green plants, algae, and many 
microorganisms acquire nitrogen. The pathway of nitrate assimilation accounts for 


cofactor domain (bottom). more than 99% of the inorganic nitrogen (nitrate or N2) assimilated into organisms. 
Figure 25.1 The nitrogen cycle. Organic nitrogenous eine 
: ; + : 
compounds are formed by the incorporation of NH, Nitrate assimilatio 
into carbon skeletons. Note that denitrification and a 
nitrogen fixation are anaerobic processes. Nitrification 
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Nitrate Reductase Contains Cytochrome bss; and Molybdenum Cofactor 
NADH NOs- 
[-SH—> FAD >cytochrome b55; >MoCo] (25.3) 
NADH* NO g” 


A pair of electrons is transferred from NADH via enzyme-associated sulfhydryl 
groups, FAD, cytochrome bs5 and MoCo (an essential molybdenum-containing 
cofactor) to nitrate, reducing it to nitrite. The brackets [ ] denote the protein-bound 
prosthetic groups that constitute an e~ transport chain between NADH and nitrate. 
Nitrate reductases typically are cytosolic 220 kDa dimeric proteins. The structure of 
the molybdenum cofactor (MoCo) is shown in Figure 25.2a. Molybdenum cofactor is 
necessary for both nitrate reductase activity and the assembly of nitrate reductase 
subunits into the active dimeric holoenzyme form. Molybdenum cofactor is also an 
essential cofactor for a variety of enzymes that catalyze hydroxylase-type reactions, 
including xanthine dehydrogenase, aldehyde oxidase, and sulfite oxidase. 


Nitrite Reductase Contains Siroheme Six electrons are required to reduce NO, to 
NH,". Nitrite reductases in photosynthetic organisms obtain these electrons from six 
molecules of photosynthetically reduced ferredoxin (Fd,eq). 


Light— 6 Fdyeq NO g” 
[ (4Fe-4S) — siroheme] (25.4) 
6 Fd, NH,* 


Photosynthetic nitrite reductases are 63 kDa monomeric proteins having a tetranu- 
clear iron-sulfur cluster and a novel heme, termed siroheme, as prosthetic groups. 
The [4Fe—4S] cluster and the siroheme act as a coupled e` transfer center. Nitrite 
binds directly to siroheme, providing the sixth ligand, much as O, binds to the heme Spinach nitrate reductase (iron-sulfur cluster in gold, 
of hemoglobin. Nitrite is reduced to ammonium while liganded to siroheme. The © siroheme in red) (PDB id = 2AKJ) 

structure of siroheme is shown in Figure 25.2b. 
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Figure 25.2 The novel prosthetic groups of nitrate reductase and nitrite reductase. (a) The 
molybdenum cofactor of nitrate reductase. The fifth ligand for the Mo atom is the sulfur atom of 

a conserved cysteine residue in the nitrate reductase. The Mo cofactor is a pyranopterin ring system: 
To form a pterin, a pyrimidine ring (/eft) is fused with a piperazine ring (center [a piperazine ring is 
a six-membered ring, with two nitrogens directly opposite from one another]). Adding a pyran ring 
(right [a pyran ring contains one oxygen and five carbon atoms]) to the pterin yields a pyranopterin. 
(b) Siroheme, an essential prosthetic group of nitrite reductase. Siroheme is novel among hemes in 
having eight carboxylate-containing side chains. These carboxylate groups may act as H* donors 
during the reduction of NO,” to NH,*. 
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Figure 25.3 Domain organization within the Sequence Organization of the Nitrate Assimilation Enzymes 
enzymes of nitrate assimilation. The numbers denote 
residue number along the amino acid sequence of 
the proteins. 


Plant and Fungal Nitrate Reductases 
(~200 kDa homodimers) 


2 542 620 


656 787 917 


1 112 48 


Plant Nitrite Reductases 
(63 kDa monomers) 


FeS-siroheme/NO, 


473 518 566 


Fungal Nitrite Reductases 
(~250 kDa homodimers) 


FeS-siroheme/NO»,_ 


26 60 183 215 496 600 715 763 1176 


In higher plants, nitrite reductase is found in chloroplasts, where it has ready access to 
its primary reductant, photosynthetically reduced ferredoxin. Microbial nitrite reductases 
closely resemble nitrate reductases in having essential —SH groups and FAD prosthetic 
groups to couple enzyme-mediated NADPH oxidation to nitrite reduction (Figure 25.3). 


25.1c Organisms Gain Access to Atmospheric N; via the 
Pathway of Nitrogen Fixation 


Nitrogen fixation involves the reduction of nitrogen gas (N2) via an enzyme system 
found only in prokaryotic cells. The heart of the nitrogen fixation process is the 
enzyme known as nitrogenase, which catalyzes the reaction 


Ne + 10 H* + 8 © —> 2 NH + H, (25.5) 


Note that an obligatory reduction of two protons to hydrogen gas accompanies the 
biological reduction of N, to ammonia. Less than 1% of the inorganic N incorporated 
into organic compounds by organisms can be attributed to nitrogen fixation; how- 
ever, this process is the only way that organisms can tap into the enormous reservoir 
of N> in the atmosphere. 

Although nitrogen fixation is exclusively prokaryotic, N>-fixing bacteria may be either 
free living or living as symbionts with higher plants. For example, Rhizobia are bacteria 
that fix nitrogen in symbiotic association with soybeans and other leguminous plants. 
Because nitrogen in a metabolically useful form is often the limiting nutrient for plant 
growth, such symbiotic associations can be an important factor in plant productivity. 

Despite the wide diversity of bacteria in which nitrogen fixation takes place, all 
N>-fixing systems are nearly identical and all have four fundamental requirements: 
(1) the enzyme nitrogenase; (2) a strong reductant, such as reduced ferredoxin; (3) ATP; 
and (4) O,-free conditions. In addition, several modes of regulation act to control 
nitrogen fixation. 


The Nitrogenase Complex Is Composed of Two Metalloproteins Two metalloproteins 
constitute the nitrogenase complex: the Fe-protein or nitrogenase reductase and the 
MoFe-protein, which is another name for nitrogenase. Nitrogenase reductase is a 
60 kDa homodimer possessing a single [4Fe—4S] cluster as a prosthetic group. Nitro- 
genase reductase is inactivated by small amounts of O,. Nitrogenase reductase binds 
MgATP and hydrolyzes two ATPs per electron transferred during nitrogen fixation. 
Because reduction of N, to 2 NH4* + H; requires 8 electrons, 16 ATPs are consumed 
per N, reduced. 
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This ATP requirement seems paradoxical because the reaction is thermodynamically 
favorable: The €,’ for the reaction (N; + 8 e~ + 10 Ht > 2 NH,* + Hb) is —0.314 V. 
Ferredoxin, the most common e~ donor for nitrogen fixation, has an ©,’ that is more 
negative (see Table 3.3). The solution to the paradox is found in the very strong bonding 
between the two N atoms in N> (Figure 25.4). Substantial energy input is needed to 
overcome this large activation energy and break the N=N triple bond. In this biological 
system, the energy is provided by ATP. 

Nitrogenase, the MoFe-protein, is a 240-kDa a2B.-type heterotetramer. An aB-dimer 
serves as the functional unit, and each aB-dimer contains two types of metal centers: an 
unusual 8Fe-7S center known as the P-cluster (Figure 25.5a) and the novel 7Fe-1Mo-9S 
cluster known as the FeMo-cofactor (Figure 25.5b). Nitrogenase under unusual circum- 
stances may contain an iron-vanadium cofactor instead of the molybdenum-containing 
one. Like nitrogenase reductase, nitrogenase is very oxygen labile. 


The Nitrogenase Reaction In the nitrogenase reaction (Figure 25.6), electrons from 
reduced ferredoxin pass to nitrogenase reductase, which serves as electron donor to 
nitrogenase, the enzyme that actually catalyzes N, fixation. Electron transfer from 
nitrogenase reductase to nitrogenase takes place through docking of nitrogenase 
reductase with an aB-subunit pair of nitrogenase (Figure 25.7). Nitrogenase reductase 
transfers e~ to nitrogenase one electron at a time. N=N (or the substrate analog 
C=O) binding to FeMoCo displaces one of the nine S atoms in FeMoCo; thus, the 
FeMoCo structure shown in Figure 25.5b is Fe;MoSg, with the C atom of CO occupy- 
ing the site where the ninth S atom would be in the substrate-free complex. N, is bound 
within the FeMo-cofactor metal cluster until all electrons and protons are added; no 
free intermediates, such as HN—=NH or H,N—NH) are released. Electron transfer 
takes place in the following sequence: Fe-protein — P-cluster + FeMo-cofactor — N). 


(a) (b) 
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Figure 25.5 Structures of the two types of metal clusters found in nitrogenase. (a) The P-cluster 
consists of two Fe,S; clusters that share an S atom. (b) FeMoCo with bound carbon monoxide 
(C=O, like N,, has a triple bond between its two atoms and acts as a substrate analog of molecular 
nitrogen). CO is liganded to FeMoCo via two of its iron atoms. The side-chains of a-Val’° and 
a-His!® just nearby are catalytically important. Fe atoms are shown in orange, S in yellow, Mo in 
turquoise, C in gray, N in blue, and O in red. Note the key carbide C atom that occupies the center of 
the FeMoCo complex. Six of the 7 Fe atoms in FeMoCo have Fe-C metal-carbide bonds to this C 
atom, holding it in place. (Adapted from Figure 1(d) in Spatzal, T., Perez, K. A., Einsle, O., Howard, 
J. B., and Rees, D. C., 2014. Ligand binding to the Fe-Mo cofactor: Structures of CO-bound and 
reactivated nitrogenase. Science 345:1620—1623). See also Ohki, Y., et al., 2022. Nitrogen reduction by 
the Fe sites of synthetic [Mo3S,4Fe] cubes. Nature 607:86—-90. 
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Figure 25.4 The free energy of activation for the 
reduction of N» to 2 NH3. The triple bond in N» 
must be broken during nitrogen fixation. 
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Figure 25.6 The nitrogenase reaction. Depending on 
the bacterium, electrons for N, reduction may come 


from light, NADH, hydrogen gas, or pyruvate. The 
primary e~ donor for the nitrogenase system is 
reduced ferredoxin. 


Figure 25.7 Ribbon diagram of nitrogenase 
reductase (the Fe-protein, b/ue): nitrogenase (FeMo 
protein, green) complex. The Fe-protein iron-sulfur 
cluster is shown in yellow, bound ADP in orange. 
The nitrogenase FeMo cofactor is shown in cyan, 
the P-cluster in red (PDB id = 1N2C). 
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ATP hydrolysis is coupled to the transfer of an electron from the Fe-protein to the 
P-cluster. ATP hydrolysis leads to conformational change in the nitrogenase reduc- 
tase, so it no longer binds to nitrogenase. The ADP:oxidized nitrogenase reductase 
complex dissociates, making way for another ATP:reduced nitrogenase reductase 
complex to bind to nitrogenase. Interestingly, nitrogenase reductase is a member of 
the G-protein family; G proteins are molecular switches whose operation is driven by 
NTP hydrolysis. 

Nitrogenase is a rather slow enzyme: Its optimal rate of e~ transfer is about 12 e~ pairs 
per second per enzyme molecule; that is, it reduces only three molecules of nitrogen gas 
per second. Because its activity is so weak, nitrogen-fixing cells maintain large amounts 
of nitrogenase so that their requirements for reduced N can be met. As much as 5% of 
the cellular protein may be nitrogenase. 
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enzymes use their [4Fe-4S] cluster to generate a 
potent oxidant, the 5'-deoxyadenosyl radical, from 
SAM, and then use it to initiate radical chemistry 
on a variety of substrates. The Booker Lab seeks to 
characterize these complex reaction mechanisms to 
gain insight into the role of these enzymes in human 
health and disease, as well as to reveal how these 
enzymes solve difficult chemical problems. 


Dr. Booker did his undergraduate studies at Austin 
College before earning his Ph.D. in Biochemistry at 
MIT. At both schools, he had to actively seek out 
African-American role models in science because 
of their historical underrepresentation. In 2022, 
Dr. Booker was awarded both the American Society 
of Biochemistry and Molecular Biology (ASBMB)- 
Merck Award for his major contributions in 
biochemistry and molecular biology and the 
ASBMB Ruth Kirschstein Diversity in Science 
Award for sustained commitment to breaking down 
racial barriers. This was the first time ASBMB gave 
two major awards in the same year to one person. 
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Figure 25.8 Regulation of nitrogen fixation. (a) ADP inhibits nitrogenase activity and NH4* 
represses nif gene expression. (b) In some organisms, the nitrogenase complex is regulated by covalent 
modification. ADP-ribosylation of nitrogenase reductase leads to its inactivation. The modifying 
enzyme catalyzing ADP-ribosylation is dinitrogen reductase ADP-ribosy] transferase (or DraT), and 
the modifying enzyme catalyzing removal of ADP-ribose is dinitrogen reductase ADP-ribosyl 
glycohydrolase (or DraG). 


The Regulation of Nitrogen Fixation Two regulatory controls for nitrogen fixation are 
paramount (Figure 25.8): (1) ADP inhibits the activity of nitrogenase; thus, as the 
ATP/ADP ratio drops, nitrogen fixation is blocked. (2) NH,* represses the expression 
of the nif genes, the genes that encode the proteins of the nitrogen-fixing system. 
Repression of nif gene expression by ammonium, the primary product of nitrogen 
fixation, is an efficient and effective way of shutting down N; fixation when its end 
product is not needed. In addition, in some systems, covalent modification of nitroge- 
nase reductase leads to its inactivation. Inactivation occurs when Arg'! of nitrogenase 
reductase receives an ADP-ribosyl group donated by NAD*. 


25.2 | What Is the Metabolic Fate of Ammonium? 


Given the prevalence of N atoms in cellular components, it is surprising that only 
three enzymatic reactions introduce ammonium into organic molecules. Of these 
three, glutamate dehydrogenase and glutamine synthetase are responsible for most of 
the ammonium assimilated into carbon compounds. The third, carbamoyl-phosphate 
synthetase I, is a mitochondrial enzyme that participates in the urea cycle. 


25.2a The Citric Acid Cycle Intermediate a-Ketoglutarate Is Precursor 
to Both Glutamate and Glutamine 


Glutamate dehydrogenase (GDH) catalyzes the reductive amination of a-ketoglutarate to 
yield glutamate. Reduced pyridine nucleotides (NADH or NADPH) provide the 
reducing power: 


NH,” + a-ketoglutarate + NADPH + H* —> 
glutamate + NADP* + H,O (25.6) 


This reaction provides an important interface between nitrogen metabolism and 
cellular pathways of carbon and energy metabolism because a-ketoglutarate is a 
citric acid cycle intermediate. In vertebrates, GDH is an a¢-type multimeric enzyme 
localized in the mitochondrial matrix that uses NADPH as electron donor when 
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Figure 25.9 The glutamate dehydrogenase reaction. 


(a) 
NH} + 
(b) 
Figure 25.10 (a) The enzymatic reaction 
catalyzed by glutamine synthetase. (b) The 
reaction proceeds by (1) activation of the oO 
y-carboxyl group of Glu by ATP, followed 
by (2) amidation by NH,*. 
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operating in the biosynthetic direction (the direction of glutamate synthesis) 
(Figure 25.9). In mammals, GDH typically acts in the catabolic direction to gener- 
ate a-ketoglutarate from glutamate. The binding of NADP’, the electron acceptor 
for glutamate oxidation, shows negative cooperativity (see Chapter 15). The cata- 
bolic activity is allosterically activated by ADP and inhibited by GTP. Interestingly, 
mammalian GDH plays a prominent role in amino acid catabolism because it is the 
only mammalian enzyme that can carry out the oxidative deamination of an amino 
acid. In plants and bacteria, GDH usually operates in the anabolic direction of glu- 
tamate synthesis. 

Glutamine synthetase (GS) catalyzes the ATP-dependent amidation of the 
y-carboxyl group of glutamate to form glutamine (Figure 25.10). The reaction pro- 
ceeds via a y-glutamyl-phosphate intermediate, and GS activity depends on the pres- 
ence of divalent cations such as Mg?*. Glutamine is a major N donor in the biosynthesis 
of many organic N compounds such as purines, pyrimidines, and other amino acids, 
and GS activity is tightly regulated, as we shall soon see. The amide-N of glutamine 
provides the nitrogen atom in these biosyntheses. Glutamine is the most abundant 
amino acid in humans. 
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Carbamoy]-phosphate synthetase I, the third enzyme capable of using ammonium to 
form an N-containing organic compound, catalyzes an early step in the urea cycle. Two 
ATPs are consumed, one in the activation of HCO;~ for reaction with ammonium and 
the other in the phosphorylation of the carbamate formed (see also Figure 25.22): 


O 
|l 
NH,* + HCO,” + 2 ATP -H,N—C—O —PO ¿= + 2 ADP + P; +2H* (25.7) 


N-acetylglutamate is an essential allosteric activator for this enzyme. Both carbamoyl-P 
and N-acetylglutamate are precursors to arginine synthesis and intermediates in the 
urea cycle. 


25.2b The Major Pathways of Ammonium Assimilation 
Lead to Glutamine Synthesis 


In organisms that enjoy environments rich in nitrogen, GDH and GS acting in sequence 
furnish the principal route of NH4* incorporation (Figure 25.11). However, GDH has a 
significantly higher K», for NH,” than does GS. Consequently, in organisms such as 
green plants that grow under conditions where little NH4* is available, GDH is not effec- 
tive and GS is the only NH,*-assimilative reaction. Such a situation creates the need for 
an alternative mode of glutamate synthesis to replenish the glutamate consumed by the 
GS reaction. This need is filled by glutamate synthase (also known as GOGAT, the acro- 
nym for the other name of this enzyme—glutamate:oxo-glutarate amino-transferase). 
Glutamate synthase catalyzes the reductive amination of a-ketoglutarate using the 
amide-N of glutamine as the N donor: 


Reductant + a-KG + Gln —> 2 Glu + oxidized reductant (25.8) 


Two glutamates are formed—one from amination of a-ketoglutarate and the 
other from deamidation of Gln (Figure 25.12). These glutamates can now serve as 


(a) NH,* + a-ketoglutarate + vmm- lutamate + NADP* + H,O 


(b) Glutamate + NH,t + ce glutamine + ADP + P; 


SUM: 2 NH,* + a-ketoglutarate + NADPH + ATP 


glutamine + NADP* + ADP + P; + HO 


(a) NADH (yeast, Neurospora crassa) + H* NAD* 
NADPH (E. coli) + H* or + aKG + Gin ——> 2Glu + NADP or 
2 H* + 2 reduced ferredoxin (plants) 2 oxidized ferredoxin 
(b) = _ 
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Figure 25.12 The glutamate synthase reaction. (a, top) The overall reaction, showing the various 
reductants used by different organisms to drive this reductive amination reaction. (b, bottom) The 
amido group of glutamine that is transferred to a-ketoglutarate is highlighted in blue. (c, right) 
Molecular graphic of glutamate synthase (PDB id = 1LM1). FAD is shown in blue, the Fe-S cluster 
in yellow. 


Figure 25.11 The GDH/GS pathway 

of ammonium assimilation. The sum of 
these reactions is the conversion of 1 
a-ketoglutarate to | glutamine at the 
expense of 2 NH,*, 1 ATP, and 1 
NADPH. 
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(a) 2NH,* +2 ATP + 2 glutamate —CS > 2 glutamine + 2 ADP + 2 P; 


GOGAT 


(b) NADPH + a-ketoglutarate + glutamine 2 glutamate + NADP* 


SUM: 2 NH,* + a-ketoglutarate + NADPH + 2 ATP 


glutamine + NADPT + 2 ADP + 2 P; 


Figure 25.13 The GS/GOGAT pathway of ammonium assimilation. (a) The glutamine synthetase 
reaction; (b) the GOGAT reaction. The sum of these reactions results in the conversion of one 
a-ketoglutarate to one glutamine at the expense of two ATP and one NADPH. 


ammonium acceptors for glutamine synthesis by GS. Organisms variously use 
NADH, NADPH, or reduced ferredoxin as reductant. Glutamate synthases are typi- 
cally large, complex proteins; in Escherichia coli, GOGAT is an 800-kDa flavoprotein 
containing both FMN and FAD, as well as [4Fe—4S] clusters. 

Together, GS and GOGAT constitute a second pathway of ammonium assimila- 
tion, in which GS is the only NH,*-fixing step; the role of GOGAT is to regenerate 
glutamate (Figure 25.13). Note that this pathway consumes two equivalents of ATP 
and one NADPH (or similar reductant) per pair of N atoms introduced into Gln, in 
contrast to the GDH/GS pathway, in which only one ATP and one NADPH are con- 
sumed per pair of NH," fixed. Clearly, coping with a nitrogen-limited environment 
has its cost. 


25.3 | What Regulatory Mechanisms Act on 
Escherichia coli Glutamine Synthetase? 


As indicated earlier, glutamine plays a pivotal role in nitrogen metabolism by donat- 
ing its amide nitrogen to the biosynthesis of many important organic N compounds. 
Consistent with its metabolic importance, in prokaryotic cells such as E. coli, GS is 
regulated at three different levels: 


1. Its activity is regulated allosterically by feedback inhibition. 

2. GS is interconverted between active and inactive forms by covalent modification. 

3. Cellular amounts of GS are carefully controlled at the level of gene expression and 
protein synthesis. 


Eukaryotic versions of glutamine synthetase show none of these regulatory features. 

E. coli GS is a 600-kDa dodecamer (a )-type subunit organization) of identical 
52-kDa monomers (each monomer contains 468 amino acid residues). These mono- 
mers are arranged as a stack of two hexagons (Figure 25.14). The active sites are located 
at subunit interfaces within the hexagons; these active sites are recognizable in the X-ray 
crystallographic structure by the pair of divalent cations that occupy them. Adjacent 
subunits contribute to each active site, thus accounting for the fact that GS monomers 
are catalytically inactive. 


25.3a Glutamine Synthetase Is Allosterically Regulated 


Nine distinct feedback inhibitors (Gly, Ala, Ser, His, Trp, CTP, AMP, carbamoy]I-P, 
and glucosamine-6-P) act on GS. Gly, Ala, and Ser are key indicators of amino acid 
metabolism in the cell; each of the other six compounds represents an end product of 
a biosynthetic pathway dependent on Gln (Figure 25.15). AMP competes with ATP 
for binding at the ATP substrate site. Gly, Ala, and Ser compete with Glu for binding 
at the active site. Carbamoyl-P binds at a site that overlaps both the Glu site and the 
site occupied by the y-PO, of ATP. 
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(b) 


Figure 25.14 The subunit organization of bacterial 
glutamine synthetase. (a) Schematic; (b) molecular 
structure (note the pairs of metal ions [dark blue] 

that define the active sites; PDB id = 1FPY). 
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25.3b Glutamine Synthetase Is Regulated by Covalent Modification 


Each GS subunit can be adenylylated at a specific tyrosine residue (Tyr?) in an ATP- 
dependent reaction (Figure 25.16). Adenylylation inactivates GS. If we define n as the 
average number of adenylyl groups per GS molecule, GS activity is inversely propor- 
tional to n. The number n varies from 0 (no adenylyl groups) to 12 (every subunit in each 
GS molecule is adenylylated). Adenylylation of GS is catalyzed by the bifunctional con- 
verter enzyme ATP-GS-adenylyl transferase, or simply adenylyl transferase (AT). How- 
ever, whether or not this covalent modification occurs is determined by a highly 
regulated cycle (Figure 26.17). The bifunctional AT not only catalyzes adenylylation of 
GS, it also catalyzes deadenylylation—the phosphorolytic removal of the Tyr-linked ade- 
nylyl groups as ADP. The direction in which AT operates depends on the nature of a 
regulatory protein, Pp, associated with it. Py is a 44-kDa protein (trimer of 12 to 13 kDa 
subunits): The state of Py controls the direction in which AT acts. If Py is not uridylylated, 
that is, if it is in its so-called Pira form, the AT:Pira complex acts to adenylylate GS. When 
Py is uridylylated, that is, in its so-called Prp form, the AT:Pip complex catalyzes the 
deadenylylation of GS. The active sites of AT:Pira and AT:Pyp are different, consistent 
with the difference in their catalytic roles. In addition, the AT:Py4 and AT:Ppp complexes 
are allosterically regulated in a reciprocal fashion by the effectors a-KG and Gln. Gln 
activates AT:Py, activity and inhibits AT:Pip activity; the effect of a-KG on the activ- 
ities of these two complexes is diametrically opposite (Figure 25.17). Further, Gln favors 
conversion of Pip to Pra, whereas a-ketoglutarate favors the Pim over the Pira form. 

Clearly, the determining factor regarding the degree of adenylylation, n, and hence 
the relative activity of GS, is the [Gln]/[a-KG] ratio. A high [Gln] level signals cellular 
nitrogen sufficiency, and GS becomes adenylylated and inactivated. In contrast, a high 
[a-KG] level is an indication of nitrogen limitation and a need for ammonium fixation 
by GS. Note that Gln affects all four reactions of the cyclic cascade system, whereas 
a-KG affects only three. Thus, Gln is ultimately the key effector in this system. 


25.3c Glutamine Synthetase Is Regulated through Gene Expression 


The gene that encodes the GS subunit in Æ. coli is designated GlnA. The GinA 
gene is actively transcribed to yield GS mRNA for translation and synthesis of GS 
protein only if a specific transcriptional enhancer, NR,, is in its phosphorylated 


È NH, 
OCH 4 | p) 
To NSS 
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397 « — 12 


Adenylylated Tyr’’‘ in GS monomer 


Figure 25.16 Covalent modification of GS: Adenylylation of Tyr?” in the glutamine synthetase 
polypeptide via an ATP-dependent reaction catalyzed by the converter enzyme adenylyl transferase. 
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form, NR,-P. In turn, NR; is phosphorylated in an ATP-dependent reaction cata- 
lyzed by NRq, a protein kinase (Figure 25.18). However, if NRy is complexed with 
Pya, it acts not as a kinase but as a phosphatase, and the transcriptionally active 
form of NR;, namely NR;-P, is converted back to NR; with the result that GinA 
transcription halts. Recall from the foregoing discussion that a high [Gln]/[a-KG] 
ratio favors Pya at the expense of Pimp. Under such conditions, GS gene expression 
is not necessary. 


25.3d Glutamine in the Human Body 


Glutamine is not among the 10 most commonly occurring amino acids in proteins 
(see Figure 5.17), but it is the most abundant amino acid in human body fluids and 
tissues. The abundance of glutamine reflects its prominent anabolic role as the source 
of N atoms in the synthesis of purines and pyrimidines, amino sugars (Chapter 18), 
and other nitrogen-containing biological molecules. 

Mammalian cells grown in culture rely on glucose and glutamine as their principal 
substrates for energy production, generation of reducing equivalents in the form of 
NADPH, and the carbon and nitrogen atoms for anabolic growth. Donation of the 
amide group of glutamine during nucleotide biosynthesis or the creation of other 
N-containing biomolecules creates glutamate. Oxidative deamination of Glu by GDH 
or transfer of its a-amino group through transamination (see Figure 25.19) leads to the 
formation of a-ketoglutarate, thus replenishing a key citric acid cycle intermediate. 
Thus, carbon for biosynthesis is provided by glutamine through the amphibolic nature 
of the citric acid cycle. 


Glutamine in the Brain Glutamine is by far the most abundant amino acid in the 
central nervous system. Glutamine synthetase in astrocytes, a star-shaped type of 
glial cell, aids in meeting the demand for Gln. (Glial cells are cells that support and 
nourish neurons.) Gln serves as the precursor for both glutamate and y-aminobutyr- 
ate, two neurotransmitters that are synthesized in neurons. Excessive accumulation 
of Gln in the central nervous system can interfere with brain function. 


NR, NRi— ® NR,-P activates GlnA 
transcription; GS is 
synthesized 

NRir-Pira 
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Figure 25.18 Transcriptional regulation of GinA 
expression through the reversible phosphorylation 
of NR}. 
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Glutamine and Cancer Cancer cells growing under oxygen-limiting (hypoxic) condi- 
tions are dependent on glutamine for anabolic purposes. Glutamine uptake by cancer 
cells is followed by hydrolysis by glutaminase (an amidohydrolase) to produce gluta- 
mate and NH,*. Mitochondrial glutamate dehydrogenase converts the glutamate into 
a-ketoglutarate (Figure 25.9, running in reverse). The a-ketoglutarate can then enter 
the citric acid cycle, where it can be reductively carboxylated by isocitrate dehydroge- 
nase running in reverse, to form citrate. Citrate is a versatile anabolic precursor that 
can be used to synthesize many needed metabolites, including acetyl-CoA for lipid 
synthesis. (Export of mitochondrial citrate to the cytosol makes it available to citrate- 
ATP lyase, which converts it to OAA and acetyl-CoA [see Figure 24.1]). Certain cancer 
cell lines are very sensitive to glutamine deprivation. Therapeutic agents that block the 
use of glutamine for anabolic purposes are potential drugs in cancer chemotherapy. 


25.4 | How Do Organisms Synthesize Amino Acids? 


Organisms show substantial differences in their capacity to synthesize the 20 amino 
acids common to proteins. Typically, plants and microorganisms can form all of their 
nitrogenous metabolites, including all of the amino acids, from inorganic forms of N 
such as NH,* and NO; . In these organisms, the a-amino group for all amino acids is 
derived from glutamate, usually via transamination of the corresponding a-keto acid 
analog of the amino acid (Figure 25.19). In many cases, amino acid biosynthesis is 
thus a matter of synthesizing the appropriate a-keto acid carbon skeleton, followed 
by transamination with Glu. The amino acids can be classified according to the 
source of intermediates for the a-keto acid biosynthesis (Table 25.1). For example, the 
amino acids Glu, Gln, Pro, and Arg (and, in some instances, Lys) are all members of 
the a-ketoglutarate family because they are all derived from the citric acid cycle inter- 
mediate a-ketoglutarate. We return to this classification scheme later when we dis- 
cuss the individual biosynthetic pathways. 


| Table 251 | The Grouping of Amino Acids into Families According to the 


Metabolic Intermediates That Serve as Their Progenitors 


a-Ketoglutarate Family Oxaloacetate Family 

Glutamate Aspartate 

Glutamine Asparagine 

Proline Methionine 

Arginine Threonine 

Lysine* Isoleucine 
Lysine* 

Pyruvate Family 3-Phosphoglycerate Family 

Alanine Serine 

Valine Glycine 

Leucine Cysteine 


Phosphoenolpyruvate and Erythrose-4-P Family 


The aromatic amino acids 
Phenylalanine 
Tyrosine 
Tryptophan 


The remaining amino acid, histidine, is derived from PRPP 
(5-phosphoribosyl-1-pyrophosphate) and ATP. 


*Different organisms use different precursors to synthesize lysine. 
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Human Biochemistry 


Human Dietary Requirements for Amino Acids 


Humans can synthesize only 10 of the 20 common amino acids (see 
table); the others must be obtained in the diet (Table 1). Those that 
can be synthesized are classified as nonessential, meaning it is not es- 
sential that these amino acids be part of the diet. In effect, humans 
can synthesize the a-keto acid analogs of nonessential amino acids 
and form the amino acids by transamination. In contrast, humans 
are incapable of constructing the carbon skeletons of essential 
amino acids, so they must rely on dietary sources for these essential 
metabolites. Excess dietary amino acids cannot be stored for future 
use, nor are they excreted unused. Instead, they are converted to 
common metabolic intermediates that can be either oxidized by 
the citric acid cycle to generate metabolic energy or used to form 
glucose (see Section 25.5). 

Since autotrophic cells (and many prokaryotic cells) synthesize 
all 20 amino acids, several questions arise regarding human dietary 
requirements for amino acids. First, why is it that humans lack 
the ability to do what other organisms can do? The answer is that, 
over evolutionary time, human diets provided adequate amounts 
of those amino acids classified as essential. Thus, the loss of the 
metabolic pathways for synthesis of essential amino acids did not 
impair the fitness of humans. That is, synthesizing essential amino 
acids became superfluous, and no evolutionary pressure operated on 
humans to retain the genes for these pathways. A second question 
now emerges: Are there significant differences between synthesis of 
amino acids humans can make (the so-called nonessential amino 


| Table 1 | Essential and Nonessential Amino Acids in 


Humans 

Essential Nonessential 
Arginine* Alanine 
Histidine* Asparagine 
Isoleucine Aspartate 
Leucine Cysteine** 
Lysine Glutamate 
Methionine Glutamine 
Phenylalanine Glycine 
Threonine Proline 
Tryptophan Serine 
Valine Tyrosine* 


*Arginine and histidine are essential in the diets of juveniles, not adults. 

Tyrosine is classified as nonessential only because it is readily formed from essential 
phenylalanine. 

**Cysteine is classified as nonessential only because it is readily formed from essential 
methionine. 


acids) and those they can’t? Table 2 summarizes amino acid biosyn- 
thesis from citric acid cycle intermediates in terms of the number 
of reactions needed to make an amino acid from a TCA cycle 
intermediate. Nonessential amino acids are shown in blue; essential 
amino acids in red. Two conclusions stand out: Nonessential amino 
acids require fewer reaction steps for synthesis than essential amino 
acids, and nonessential amino acids tend to be more abundantly 
represented in proteins than essential amino acids. Thus, evolution- 
ary loss was not random: The biosynthetic pathways lost were those 
for amino acids requiring the most reaction steps. 


Reference 


Srinivasan, V., Morowitz, H., and Smith, E., 2007. Essential amino acids, 
from LUCA to LUCY. Complexity 13:8-9. 


| Table 2 | Nonessential Amino Acids Require Fewer 


Reactions for Synthesis 


Amino Acid Reaction Steps Mole % in Proteinst 
1 Alanine 1 T9. 
2 Aspartic acid 1 5.3 
3 Glutamic acid 1 6.7 
4 Asparagine 2 4.1 
5 Glutamine 2 4.0 
6 Serine 5 6.9 
7 Glycine 6 6.9 
8 Proline 6 4.8 
9 Cysteine? 7 1.5 
10 Threonine 6 5.4 
11 Valine 9 6.7 
12 Isoleucine 13 5.9 
13 Leucine 14 10.0 
14 Lysine 14 5.9 
15 Methionine 17 2.4 
16 Arginine 24 5.4 
17 Histidine 27 2.3 
18 Phenylalanine 29 4.0 
19 Tyrosine? 30 3.0 
20 Tryptophan 33 1.1 


tMole percentages are taken from amino acid representations among proteins in the 
Swiss-Prot protein knowledgebase: ca.expasy.org/sprot. 

+Note that “nonessential” cysteine can be made only from “essential” methionine and 
“nonessential” tyrosine can be made only from “essential” phenylalanine. 
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25.4a Amino Acids Are Formed from a-Keto Acids by Transamination 


Transamination involves transfer of an a-amino group from an amino acid to the 
a-keto position of an a-keto acid (Figure 25.19). In the process, the amino donor 
becomes an a-keto acid and the a-keto acid acceptor becomes an a-amino acid: 


Amino acid, + a-keto acid; —> a-keto acid, + amino acid, (25.9) 


The predominant amino acid/a-keto acid pair in these reactions is glutamate/ 
a-ketoglutarate, with the net effect that glutamate is the primary amino donor 
for the synthesis of amino acids. Transamination reactions are catalyzed by 
aminotransferases (the preferred name for enzymes formerly termed transami- 
nases). Aminotransferases are named according to their amino acid substrates, as 
in glutamate—aspartate aminotransferase. Aminotransferases are prime examples 
of enzymes that catalyze double displacement (ping-pong)—type bisubstrate reac- 
tions (see Figure 13.23). 


A Deeper Look 


The Mechanism of the Aminotransferase (Transamination) Reaction 


The aminotransferase (transamination) reaction is a workhorse 
in biological systems. It provides a general means for exchange of 
nitrogen between amino acids and a-keto acids. Aminotransferases 
depend on the coenzyme pyridoxal-phosphate (PLP), a vitamin Bs 
derivative. The mechanism involves loss of the C, proton of the 
amino acid, followed by an aldimine—ketimine tautomerization— 
literally a flip-flop of the Schiff base double bond from the pyridoxal 


aldehyde carbon to the a-carbon of the amino acid substrate. This 
is followed by hydrolysis of the ketimine intermediate to yield 
the product a-keto acid. Left in the active site is a pyridoxamine 
phosphate intermediate, which combines with another (substrate) 
a-keto acid to form a second ketimine, which rearranges to form an 
aldimine, followed by release as an amino acid. Transaldiminization 
with a lysine at the active site completes the reaction. 


Transamination | 
E-PLP complex intermediate H 
| ie 
| | areal 
R—C—coOo~ — ~ R'—C— C007 = 
i | c | < 7 
NH,* X 
i non w 
+ òT 
E*PLP 2-0,PO an | -OsPO 
l 
complex N 
n CH; 
H 
Aldimine Ketimine 


The mechanism of PLP-catalyzed transamination reactions. 
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(a) ao- SOE 
COO COO 
| | 
CHo R CHo R 
| | Pyridoxal phosphate- | | 
CHo + C=O 3 > CH, + H—C— NH4 
| | dependent | | 
H — C —iee COO aminotransferase c= COO 
| | 
(0070 COO™ 
Glutamate a-Keto acid a-KG a-Amino acid 
(b) 
COO 
| 
CHo COO COO 
| | Glutamate- | 
CHo + CH, = + CHo 
| | aspartate | 
H — C — NH C=O aminotransferase H — C — Ng 
| | | 
COO COO™ COO- 
Glutamate Oxaloacetate a-KG Aspartate 


25.4b The a-Ketoglutarate Family of Amino Acids 
Includes Glu, Gln, Pro, Arg, and Lys 


As indicated in Table 25.1, the amino acids can be grouped into families on the basis 
of the metabolic intermediates that serve as their precursors. Amino acids derived 
from a-ketoglutarate include glutamate (Glu), glutamine (Gln), proline (Pro), argi- 
nine (Arg), and in fungi and protista such as Euglena, lysine (Lys). The routes for Glu 
and Gln synthesis were described when we considered pathways of ammonium 
assimilation. 

Proline is derived from glutamate via a series of four reactions involving activation, 
then reduction, of the y-carboxyl group to an aldehyde (glutamate-5-semialdehyde ), 
which spontaneously cyclizes to yield the internal Schiff base, A’-pyrroline-5-carboxylate 
(Figure 25.20). NADPH-dependent reduction of the pyrroline double bond gives 
proline. 

Arginine biosynthesis involves enzymatic steps that are also part of the urea cycle, a 
metabolic pathway that allows certain animals (including humans) to excrete any excess 
nitrogen arising from overconsumption of protein. Net synthesis of arginine depends on 
the formation of ornithine. Interestingly, ornithine is derived from glutamate via a reac- 
tion pathway reminiscent of the proline biosynthetic pathway (Figure 25.21). Glutamate 
is first N-acetylated in an acetyl-CoA-—dependent reaction to yield N-acetylglutamate 
(Figure 25.21). An ATP-dependent phosphorylation of N-acetylglutamate to give 
N-acetylglutamate-5-phosphate primes this substrate for a reduced pyridine nucleotide- 
dependent reduction to the semialdehyde. N-acetylglutamate-5-semialdehyde then is 
aminated by a glutamate-dependent aminotransferase, giving N-acetylornithine, which 
is deacylated to ornithine. In mammals, ornithine is made from glutamate via a pathway 
that does not involve an N-acetyl block. 

Ornithine has three metabolic roles: (1) to serve as a precursor to arginine, (2) to func- 
tion as an intermediate in the urea cycle, and (3) to act as an intermediate in Arg degra- 
dation. In any case, the 6-NH;* of ornithine is carbamoylated in a reaction catalyzed 
by ornithine transcarbamoylase. The carbamoyl group is derived from carbamoyl-P 
synthesized by carbamoyl-phosphate synthetase I (CPS-I). CPS-I is the mitochondrial 
CPS isozyme; it uses two ATPs in catalyzing the formation of carbamoyl-P from NH, 


Figure 25.19 Glutamate-dependent transamination 
of a-keto acid carbon skeletons is a primary 
mechanism for amino acid synthesis. (a) The 
generic pyridoxal-P dependent aminotransferase 
reaction. (b) A prime example is the glutamate- 
aspartate aminotransferase reaction: the 
transamination of oxaloacetate by glutamate to 
yield aspartate and a-ketoglutarate. 
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Figure 25.20 The pathway of proline biosynthesis from glutamate. The enzymes are (1) y-glutamyl 
kinase, (2) glutamate-5-semialdehyde dehydrogenase, and (4) A!-pyrroline-5-carboxylate reductase; 
reaction (3) occurs nonenzymatically. 
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Figure 25.21 The bacterial pathway of ornithine biosynthesis from glutamate. The enzymes are 
(1) N-acetylglutamate synthase, (2) N-acetylglutamate kinase, (3) N-acetylglutamate-5-semialdehyde 
dehydrogenase, (4) N-acetylornithine 6-aminotransferase, and (5) N-acetylornithine deacetylase. 
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Figure 25.22 The mechanism of action of CPS-I. 


and HCO;~, one to activate the bicarbonate ion and the other to phosphorylate the 
carbamate arising from carboxyphosphate through its reaction with the ammonium ion 
(Figure 25.22). CPS-I represents the committed step in the urea cycle, and CPS-I is 
allosterically activated by N-acetylglutamate. Because N-acetylglutamate is both a pre- 
cursor to ornithine synthesis and essential to the operation of the urea cycle, it serves to 
coordinate these related pathways. 

The product of the ornithine transcarbamoylase reaction is citrulline (Figure 25.23). 
Ornithine and citrulline are two œ-amino acids of metabolic importance that neverthe- 
less are not among the 20 a-amino acids commonly found in proteins. Like CPS-I, 
ornithine transcarbamoylase is a mitochondrial enzyme. The reactions of ornithine syn- 
thesis and the rest of the urea cycle enzymes occur in the cytosol. 

The pertinent feature of the citrulline side chain is the ureido group. In a complex 
reaction catalyzed by argininosuccinate synthetase, this ureido group is first activated by 
ATP to yield a citrullyl-AMP derivative, followed by displacement of AMP by aspartate 
to give argininosuccinate (Figure 25.23). The formation of arginine is then accomplished 
by argininosuccinase, which catalyzes the nonhydrolytic elimination of fumarate from 
argininosuccinate. This reaction completes the biosynthesis of Arg. 


25.4c The Urea Cycle Acts to Excrete Excess N through Arg Breakdown 


The carbon skeleton of arginine is derived principally from a-ketoglutarate, but the 
N and C atoms composing the guanidino group (Figure 25.23) of the Arg side chain 
come from NH,*, HCO;~ (as carbamoyl-P), and the a-NH> groups of glutamate and 
aspartate. The circle of the urea cycle is closed when ornithine is regenerated from 
Arg by the arginase-catalyzed hydrolysis of arginine. Urea is the other product of this 
reaction and lends its name to the cycle. Note that the N atoms in urea arose from 
NH,* and the Asp amino group. In terrestrial vertebrates, urea synthesis is required 
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Figure 25.23 The urea cycle series of reactions: The enzymes are (1) ornithine transcarbamoylase 
(OTCase), (2a and 2b) argininosuccinate synthetase, (3) argininosuccinase, and (4) arginase. 
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A Deeper Look 


The Urea Cycle as Both an Ammonium and a Bicarbonate Disposal Mechanism 


Excretion of excess NH,* as nontoxic urea has traditionally been 
viewed as the physiological role of the urea cycle. However, the 
urea cycle also provides a mechanism for the excretion of excess 
HCO,- arising principally from a-carboxyl groups generated dur- 
ing the catabolism of a-amino acids. The following equations 


and the second is simply protonated and dehydrated to form CO, 
(reaction 2), which is easily excreted. One interpretation of the pre- 
ceding is that these coupled reactions allow a weak acid (NH4*) 
to protonate the conjugate base of a stronger acid (HCO; ). This 
protonation may appear thermodynamically unfavorable, but re- 


illustrate this property: call that in the urea cycle, four equivalents of ATP are consumed 
per equivalent of urea synthesized: two ATPs in the synthesis of 
carbamoyl-P, and two more as one ATP is converted to AMP + PP; 
in the synthesis of argininosuccinate from citrulline (Figure 25.23). 
If this interpretation is correct, the urea cycle may be considered 
an ATP-driven proton pump that transfers H* ions from NH,* 
to HCO; against a thermodynamic barrier. In the process, the 
potentially toxic waste products, ammonium and bicarbonate, are 
rendered innocuous and excreted. 


(1) HCO; + 2 NHy* —> H,NCONH, + 2 H,0 + H+} 
(2) HCO; + H* — H,0 + CO, 
Sum: 
2 HCO; + 2 NH,* —> H,NCONH; + CO, + 3 H,O 


That is, two moles of HCO; are eliminated in the synthesis of each 
mole of urea: One is incorporated into the product, urea (reaction 1), 


to excrete excess nitrogen generated by increased amino acid catabolism—for exam- 
ple, following dietary consumption of more than adequate amounts of protein. Urea 
formation is basically confined to the liver. 

Young adults eating a typical American diet will consume about 70 g (females) to 
105 g (males) of protein per day. These amounts are significantly more than the rec- 
ommended dietary allowance (RDA) of 56 g/day. Because these individuals remain in 
nitrogen balance (neither increasing nor decreasing their net protein levels), their bod- 
ies must dispose of the excess N derived from the amino acids in this dietary protein. 
Glutamate is key to this process because of the position of glutamate dehydrogenase 
at the interface of amino acid and carbohydrate metabolism and the importance of 
glutamate to the urea cycle. 

Increases in amino acid catabolism lead to elevated glutamate levels and a rise in 
N-acetylglutamate, the allosteric activator of CPS-I. Stimulation of CPS-I raises overall 
urea cycle activity because activities of the remaining enzymes of the cycle simply 
respond to increased substrate availability. Removal of potentially toxic NH,* by 
CPS-I is another important aspect of this regulation. The urea cycle is linked to the 
citric acid cycle through fumarate, a by-product of the action of argininosuccinase 
(Figure 25.23, reaction 3). 

Lysine biosynthesis in some fungi and in the protist Euglena also stems from 
a-ketoglutarate, making lysine a member of the a-ketoglutarate family of amino 
acids in these organisms. (As we shall see, the other organisms capable of lysine 
synthesis—namely, bacteria, other fungi, algae, and green plants—trely on aspartate 
[derived from oxaloacetate] as a precursor.) To make lysine from a-ketoglutarate 
requires a lengthening of the carbon skeleton by one CH) unit to yield a-ketoadipate 
(Figure 25.24). This addition is accomplished by a series of reactions reminiscent of 
the initial stages of the citric acid cycle. First, a two-carbon acetyl-CoA unit is added 
to the a-carbon of a-ketoglutarate to form homocitrate. Then, in a reaction sequence 
like that catalyzed by aconitase, homoisocitrate is formed from homocitrate. Oxidative 
decarboxylation (as in isocitrate dehydrogenase) removes one carbon (the original 
a-carboxyl group of a-ketoglutarate), leaving a-ketoadipate. A glutamate-dependent 
aminotransferase enzyme then aminates a-ketoadipate to give a-aminoadipate. Next, 
the 5-COO™ group is activated in an ATP-dependent adenylylation reaction, priming 
this 6-COO~ group for reduction to an aldehyde by NADPH. a-Aminoadipic- 
6-semialdehyde is then reductively aminated by addition of glutamate to its aldehydic 
carbon in an NADPH-dependent reaction leading to the formation of saccharopine. 
Oxidative cleavage of saccharopine by way of an NAD*-dependent dehydrogenase 
activity yields a-ketoglutarate and /ysine. This pathway is known as the a-aminoadipic 
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Figure 25.24 Lysine biosynthesis in certain fungi 
and Euglena: the a-aminoadipic acid pathway. 
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acid pathway of lysine biosynthesis. Interestingly, lysine degradation in animals leads 
to formation of a-aminoadipate by a reverse series of reactions identical to those 
occurring along the last steps of this biosynthetic pathway. 


25.4d The Oxaloacetate Family of Amino Acids 
Includes Asp, Asn, Lys, Met, Thr, and Ile 


The members of the oxaloacetate family of amino acids include aspartate (Asp), 
asparagine (Asn), lysine (via the diaminopimelic acid pathway), methionine (Met), 


threonine (Thr), and isoleucine (Ile). 


Aspartate is formed from the citric acid cycle intermediate oxaloacetate by transfer 
of an amino group from glutamate via a PLP-dependent aminotransferase reaction 
(Figure 25.25). Like glutamate synthesis from a-ketoglutarate, aspartate synthesis is 
a drain on the citric acid cycle. As we already saw, the Asp amino group serves as the 
N donor in the conversion of citrulline to arginine. In addition, the entire aspartate 
molecule is used in the biosynthesis of pyrimidine nucleotides. 

Asparagine is formed by amidation of the B-carboxyl group of aspartate. In bacteria, 
in analogy with glutamine synthesis, the nitrogen added in this amidation comes directly 
from NH,*. In other organisms, asparagine synthetase catalyzes the ATP-dependent 
transfer of the amido-N of glutamine to aspartate to yield glutamate, AMP, PP;, and 


asparagine (Figure 25.26). 
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Figure 25.26 Asparagine biosynthesis from 
Asp, Gln, and ATP by asparagine synthetase. 
B-Aspartyl adenylate is an enzyme-bound 
intermediate. 
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Figure 25.25 Aspartate biosynthesis via 
transamination of oxaloacetate by glutamate. 
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Threonine, methionine, and lysine biosynthesis in bacteria proceeds from the com- 
mon precursor aspartate, which is converted first to aspartyl-B-phosphate and then to 
B-aspartyl-semialdehyde. The first reaction is an ATP-dependent phosphorylation cata- 
lyzed by aspartokinase (Figure 25.27, reaction 1). In E. coli, there are three isozymes of 
aspartokinase, designated aspartokinases I, II, and III. Each of these isozymes is 
uniquely controlled by one of the three end-product amino acids. Form I is feedback- 
inhibited by threonine and form III, by lysine. Form II is not feedback-inhibited, but its 
synthesis is repressed by methionine. 

B-Aspartyl-semialdehyde is formed via NADPH-dependent reduction of aspartyl- 
B-phosphate in a reaction catalyzed by B-aspartyl-semialdehyde dehydrogenase 
(Figure 25.27, reaction 2). From here, the pathway of lysine synthesis diverges. The 
methyl carbon of pyruvate is condensed with B-aspartyl-semialdehyde, and H20 is 
eliminated to yield the cyclic compound 2,3-dihydropicolinate (Figure 25.27, reaction 10). 
Thus, lysine synthesized by this pathway must be considered a member of both the oxa- 
loacetate and the pyruvate families of amino acids. Lysine is a feedback inhibitor of this 
branch-point enzyme. Dihydropicolinate is then reduced in an NADPH-dependent 
reaction to A/-piperidine-2,6-dicarboxylate (Figure 25.27, reaction 11). A series of reac- 
tions, including a hydrolytic opening of the piperidine ring, a succinylation, a glutamate- 
dependent amination, and the hydrolytic removal of succinate, results in the formation 
of the symmetric L,L-a,e-diaminopimelate (Figure 25.27, reactions 12 through 14). 
Epimerization of this intermediate to the meso form, followed by decarboxylation, 
yields the end product lysine (Figure 25.27, reactions 15 and 16). Because this pathway 
proceeds through the symmetric L,L-a,é-diaminopimelate, one-half of the CO, evolved 
in the terminal decarboxylase step is derived from the carboxyl group of pyruvate and 
one-half from the a-carboxyl of Asp. 

The other metabolic branch diverging from B-aspartyl-semialdehyde leads to threo- 
nine and methionine via homoserine, an analog of serine that is formed by the NADPH- 
dependent reduction of B-aspartyl-semialdehyde (Figure 25.27, reaction 3) catalyzed by 
homoserine dehydrogenase. To form methionine, the —OH group of homoserine is first 
succinylated by homoserine acyltransferase (Figure 25.27, reaction 6). Methionine is a 
feedback inhibitor of this enzyme. The succinyl group of O-succinylhomoserine is then 
displaced by cysteine to yield cystathionine (Figure 25.27, reaction 7). The sulfur atom in 
methionine is contributed by a cysteine sulfhydryl. Cystathionine is then split to give 
pyruvate, NH4*, and homocysteine, a nonprotein amino acid whose side chain is one 
—CH),— group longer than that of Cys (Figure 25.27, reaction 8). Methylation of the 
homocysteine —SH via methyl transfer from the methyl donor, N>-methyl-THF 
gives methionine (Figure 25.27, reaction 9). In higher plants, methionine is synthesized 
ina slightly different way: Phosphohomoserine (the product of reaction 4 in Figure 25.27) 
and cysteine are joined to form cystathionine with the release of P;, bypassing reaction 6 
and the O-succinylhomoserine intermediate. Reactions 7 and 8 constitute a transsulfury- 
lation pathway that transfers the —SH group of Cys to homoserine through cystathio- 
nine as an intermediate. Methylation of homocysteine (reaction 9) yields methionine. 


SAM as a Methyl Group Donor S-Adenosylmethionine (SAM) acts as a methyl- 
group donor in methylation reactions. The enzyme S-adenosylmethionine synthase 
catalyzes the reaction of methionine with ATP to form S-adenosylmethionine, or 
SAM (Figure 25.28). SAM is a substrate of methyltransferases in a variety of 
methyl-donor reactions, such as the formation of phosphatidylcholine from phos- 
phatidylethanolamine (see Figure 8.6), DNA methylation (see the Human Bio- 
chemistry box in Section 11.2), and methylation of Arg and Lys residues in the 
regulation of DNA-histone interactions in chromatin (see Figure 29.29). Following 
decarboxylation, SAM is a source of propylamine for the synthesis of polyamines 
(Figure 25.28). 

The remaining amino acids of the oxaloacetate family are threonine and isoleucine. 
Threonine, like methionine, is synthesized from homoserine. Indeed, homoserine is the 
primary alcohol analog of the secondary alcohol Thr. To move this —OH from C-4 to 
C-3 requires activation of the hydroxyl through ATP-dependent phosphorylation by 
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Figure 25.27 Biosynthesis of threonine, methionine, and lysine, members of the aspartate family of 
amino acids. This B-aspartyl phosphate pathway is found in plants, fungi, and bacteria but absent in 
mammals. Thus, threonine, methionine, lysine, and isoleucine (which is synthesized from threonine 

[see Figure 25.29]) are essential amino acids for humans; they have to be obtained from dietary protein. 
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Figure 25.28 The synthesis of S-adenosylmethionine (SAM) and its fates. 
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homoserine kinase (see Figure 25.27, reaction 4). As the first reaction unique to Thr 
biosynthesis, homoserine kinase is feedback inhibited by threonine. The last step is cata- 
lyzed by threonine synthase, a PLP-dependent enzyme (see Figure 25.27, reaction 5). 

Tsoleucine is included in the oxaloacetate family of amino acids because four of its six 
carbons derive from Asp (via threonine) and only two come from pyruvate. Neverthe- 
less, four of the five enzymes necessary for isoleucine synthesis are common to the path- 
way for biosynthesis of valine, so discussion of isoleucine synthesis is presented under 
the biosynthesis of the pyruvate family of amino acids. 


25.4e The Pyruvate Family of Amino Acids Includes Ala, Val, and Leu 


The pyruvate family of amino acids includes alanine (Ala), valine (Val), and leucine 
(Leu). Transamination of pyruvate, with glutamate as amino donor, gives alanine. 
Because these transamination reactions are readily reversible, alanine degradation 
occurs via the reverse route, with a-ketoglutarate serving as amino acceptor. 

Transamination of pyruvate to alanine is a reaction found in virtually all organ- 
isms, but valine, leucine, and isoleucine are essential amino acids, and as such, they 
are not synthesized in mammals. In the organisms that synthesize them, the pathways 
of valine and isoleucine synthesis can be considered together because a set of four 
enzymes is common to the last four steps of both pathways (Figure 25.29). Both path- 
ways begin with an a-keto acid. Isoleucine can be considered a structural analog of 
valine that has one extra —CH,— unit, and its a-keto acid precursor, namely, a-keto- 
butyrate, is one carbon longer than the valine precursor, pyruvate. Interestingly, 
a-ketobutyrate is formed from threonine by threonine deaminase (Figure 25.29, reac- 
tion 1). This PLP-dependent enzyme (also known as threonine dehydratase or serine 
dehydratase) is feedback-inhibited by isoleucine, the end product. Note that part of 
the carbon skeleton for Ile comes from Asp by way of Thr. From here on, the Val and 
Ile pathways employ the same set of enzymes. The first reaction involves the genera- 
tion of hydroxyethyl-thiamine pyrophosphate from pyruvate in a reaction analogous 
to those catalyzed by transketolase and the pyruvate dehydrogenase complex. The 
two-carbon hydroxyethyl group is transferred from TPP to the respective keto acid 
acceptor by acetohydroxy acid synthase (acetolactate synthase) to give a-acetolactate 
or a-aceto-a-hydroxybutyrate (Figure 25.29, reaction 2). NAD(P)H-dependent reduc- 
tion of these a-keto hydroxy acids yields the dihydroxy acids a, 8-dihydroxyisovaler- 
ate and a,B-dihydroxy-B-methylvalerate (Figure 25.29, reaction 3). Dehydration of 
each of these dihydroxy acids by dihydroxy acid dehydratase gives the appropriate 
a-keto acid carbon skeletons a-ketoisovalerate and a-keto-B-methylvalerate (Figure 
25.29, reaction 4). Transamination by the branched-chain amino acid aminotransferase 
(BCAT) yields Val or Ile, respectively (Figure 25.29, reaction 5). 

Leucine synthesis depends on these reactions as well, because a-ketoisovalerate is 
a precursor common to both Val and Leu (Figure 25.30). Although Val and Leu dif- 
fer by only a single —CH,— in their respective side chains, the carboxyl group of 
a-ketoisovalerate first picks up two carbons from acetyl-CoA to give a-isopropylma- 
late in a reaction catalyzed by isopropylmalate synthase; the enzyme is sensitive to 
feedback inhibition by Leu (Figure 25.30, reaction 1). Isopropylmalate dehydratase 
(Figure 25.30, reaction 2) converts the a-isomer to the B-form, which undergoes an 
NAD*-dependent oxidative decarboxylation by isopropylmalate dehydrogenase 
(Figure 25.30, reaction 3), so the carboxyl group of a-ketoisovalerate is lost as CO». 
Amination of a-ketoisocaproate by leucine aminotransferase (Figure 25.30, 
reaction 4) gives Leu. 


25.4f The 3-Phosphoglycerate Family of Amino Acids 

Includes Ser, Gly, and Cys 
Serine, glycine, and cysteine are derived from the glycolytic intermediate 3-phospho- 
glycerate. The diversion of 3-PG from glycolysis is achieved via 3-phosphoglycerate 
dehydrogenase (Figure 25.31, reaction 1). This NAD*-dependent oxidation of 3-PG 
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Figure 25.29 Biosynthesis of valine and isoleucine. Note that the enzymes catalyzing reactions 3, 4, 
and 5 serve both the Val and Ile synthetic pathways. 


yields 3-phosphohydroxypyruvate—which, as an a-keto acid, is a substrate for 
transamination by glutamate to give 3-phosphoserine (Figure 25.31, reaction 2). 
Serine phosphatase then generates serine (Figure 25.31, reaction 3). Serine inhibits 
the first enzyme, 3-PG dehydrogenase, and thereby feedback-regulates its own 
synthesis. 
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Figure 25.30 Biosynthesis of leucine. 
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Figure 25.31 Biosynthesis of serine from 
3-phosphoglycerate. 


Glycine is made from serine via two related enzymatic processes. In the first 
(Figure 25.32a), serine hydroxymethyltransferase, a PLP-dependent enzyme, catalyzes 
the transfer of the serine B-carbon to tetrahydrofolate (THF), the principal agent of 
one-carbon metabolism. Glycine and N*,N'°-methylene-THF are the products. In addi- 
tion, glycine can be synthesized by a reversal of the glycine oxidase reaction 
(Figure 25.32b). Here, glycine is formed when N°,N!°-methylene-THF condenses with 
NH,* and CO). Via this route, the B-carbon of serine becomes part of glycine. The 
conversion of serine to glycine is a prominent means of generating one-carbon deriva- 
tives of THF, which are so important for the biosynthesis of purines and the C-5 methyl 
group of thymine, as well as the amino acid methionine. Glycine itself contributes to 


both purine and heme synthesis. 


How Do Organisms Synthesize Amino Acids? 
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Figure 25.32 Biosynthesis of glycine from serine 
(a) via serine hydroxymethyltransferase and (b) via 
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Cysteine synthesis is accomplished by sulfhydryl transfer to serine (Figure 25.33). 
In some bacteria, H,S condenses directly with serine via a PLP-dependent enzyme- 
catalyzed reaction (Figure 25.33a), but in most microorganisms and green plants, 
the sulfhydrylation reaction requires an activated form of serine, O-acetylserine 
(Figure 25.33b). O-acetylserine is made by serine acetyltransferase, with the transfer of 
an acetyl group from acetyl-CoA to the —OH of Ser. This enzyme is inhibited by Cys. 
O-Acetylserine then undergoes sulfhydrylation by HS with elimination of acetate; 
the enzyme is O-acetylserine sulfhydrylase. 

In humans, cysteine is synthesized from methionine by a reverse transsulfurylation 
pathway (reactions 7, 8, and 9 in Figure 25.27 constitute the forward transsulfurylation 
pathway). Because cysteine can be made from methionine via this pathway, cysteine is 
not considered by nutritionists to be an essential amino acid, even though methionine 
is. Mutations in the reverse transsulfurylation pathway lead to a rare but potentially 
fatal disease called homocystinuria, caused by accumulation of both methionine and 
homocysteine. The disease is treated by putting patients on a low-protein diet to mini- 
mize methionine intake and giving them vitamin B, supplements. 


Sulfide Synthesis from Sulfate Involves S-Containing ATP Derivatives Given the prevailing 
oxidative nature of our environment and the reactivity and toxicity of HS, the source 
of sulfide for Cys synthesis merits discussion. In microorganisms and plants, sulfide is 
the product of sulfate assimilation. Sulfate is the common inorganic form of combined 
sulfur, and its assimilation involves several interesting ATP derivatives. ATP sulfurylase 
(Figure 25.34, reaction 1) catalyzes the formation of adenosine-5'-phosphosulfate 
(APS). Then, adenosine-5’-phosphosulfate-3'-phosphokinase catalyzes the formation 
of 3’-phosphoadenosine-5'-phosphosulfate (PAPS) from APS and ATP (Figure 25.34, 
reaction 2). Sulfite is then liberated upon reduction of PAPS by reduced thioredoxin, 
leaving 3'-phosphoadenosine-S’-phosphate as a product (Figure 25.34, reaction 3). Sul- 
fite (SO3") is then reduced to sulfide (S2-) in a multielectron transfer reaction catalyzed 
by sulfite oxidase (Figure 25.34, reaction 4); NADPH is the electron donor. Sulfite reduc- 
tase, like nitrite reductase, possesses siroheme as a prosthetic group (see Figure 25.2). 
3'-Phosphoadenosine-5’-phosphosulfate is an intermediate in sulfate assimilation and a 
substrate for synthesis of sulfate esters, such as the sulfated polysaccharides found in the 
glycocalyx of animal cells. 
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Figure 25.33 Cysteine biosynthesis. (a) Direct sulfhydrylation of serine by H3S. (b) H,S-dependent 
sulfhydrylation of O-acetylserine. (c) In mammals, Cys is synthesized from methionine (via 
homocysteine) and Ser by a reverse transsulfurylation pathway (b/ue-shaded reactions). The enzymes 
are CBS (cystathionine-B-synthase and CGL (cystathionine-y-lyase). (Figure part (c) adapted from 
Figure 2 in Lieu, E. L., Nguyen, T., Rhyne, S., and Kim, J., 2020. Amino acids in cancer. 
Experimental and Molecular Medicine 52:15-30.) 
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25.4g The Aromatic Amino Acids Are Synthesized from Chorismate 


The aromatic amino acids, phenylalanine, tyrosine, and tryptophan, are derived from 
a shared pathway that has chorismic acid (Figure 25.35) as a key intermediate. Indeed, 
chorismate is common to the synthesis of cellular compounds having benzene rings, 
including these amino acids, the fat-soluble vitamins E and K, folic acid, as well as 
coenzyme Q and plastoquinone (the two quinones necessary to electron transport 
during respiration and photosynthesis, respectively). Lignin, a polymer of nine-carbon 
aromatic units, is also a derivative of chorismate. Lignin and related compounds can 
account for as much as 35% of the dry weight of higher plants; clearly, enormous 
amounts of carbon pass through the chorismate biosynthetic pathway. 
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Figure 25.35 Some of the aromatic compounds derived from chorismate. 


Chorismate Is Synthesized from PEP and Erythrose-4-P Chorismate biosynthesis occurs 
via the shikimate pathway (Figure 25.36). The precursors for this pathway are the com- 
mon metabolic intermediates phosphoenolpyruvate and erythrose-4-phosphate. These 
intermediates are linked to form 3-deoxy-D-arabino-heptulosonate-7-phosphate 
(DAHP) by DAHP synthase (Figure 25.36, reaction 1). Although this reaction is remote 
from the ultimate aromatic amino acid end products, it is an important point for regu- 
lation of aromatic amino acid biosynthesis. In the next step on the way to chorismate, 
DAHP is cyclized to form a six-membered saturated ring compound, 5-dehydroquinate 
(Figure 25.36, reaction 2), in a reaction catalyzed by dehydroquinate synthase (NA D* is 
a coenzyme in this reaction but is not modified by it). A sequence of reactions ensues 
that introduces unsaturations into the ring through dehydration (Figure 25.36, reac- 
tion 3, 5-dehydroquinate dehydratase) and reduction (reaction 4, shikimate dehydroge- 
nase), yielding shikimate. Phosphorylation of shikimate by shikimate kinase (reaction 
5), then addition of PEP by 3-enolpyruvylshikimate-5-phosphate synthase (reaction 6), 
followed by chorismate synthase (reaction 7), gives chorismate. Thus, two equivalents 
of PEP are needed to form chorismate from erythrose-4-P. 


Phenylalanine and Tyrosine Share Prephenate as a Precursor At chorismate, the pathway 
separates into three branches, each leading specifically to one of the aromatic amino 
acids. The branches leading to phenylalanine and tyrosine both pass through prephenate 
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Figure 25.36 The shikimate pathway leading to the 
synthesis of chorismate. 


(Figure 25.37). In some organisms, such as E. coli, the branches are truly distinct because 
prephenate does not occur as a free intermediate but rather remains bound to the bifunc- 
tional enzyme that catalyzes the first two reactions after chorismate. In any case, choris- 
mate mutase is the first reaction leading to Phe or Tyr (Figure 25.37, reaction 1). In the Phe 
branch, the —OH group para to the prephenate carboxyl is removed by prephenate 
dehydratase (Figure 25.37, reaction 2). In the Tyr branch, this —OH is retained; instead, 
an oxidative decarboxylation of prephenate catalyzed by prephenate dehydrogenase 
(Figure 25.37, reaction 4) yields 4-hydroxyphenylpyruvate. Glutamate-dependent amino- 
transferases (phenylalanine aminotransferase [Figure 25.27, reaction 3] and tyrosine 
aminotransferase [reaction 5]) introduce the amino groups into the two a-keto acids, 
phenylpyruvate and 4-hydroxyphenylpyruvate, to give Phe and Tyr, respectively. Some 
mammals can synthesize Tyr from Phe obtained in the diet via phenylalanine-4- 
monooxygenase (also known as phenylalanine hydroxylase), using O, and tetrahydrobiop- 
terin, an analog of tetrahydrofolic acid, as co-substrates (Figure 25.38). 
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Figure 25.37 The biosynthesis of phenylalanine, tyrosine, and tryptophan from chorismate. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Chapter 25 Nitrogen Acquisition and Amino Acid Metabolism 


Figure 25.38 The formation of tyrosine from 
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Tryptophan Is Synthesized from Chorismate and PRPP The pathway of tryptophan 
synthesis is perhaps the most thoroughly studied of any biosynthetic sequence, par- 
ticularly in terms of its genetic organization and expression. Synthesis of Trp from 
chorismate requires six steps (see Figure 25.37). In most microorganisms, the first 
enzyme, anthranilate synthase (see Figure 25.37, reaction 6), is an a8 -type protein, 
with the B-subunit acting in a glutamine—amidotransferase role to provide the 
—NH, group of anthranilate. Or, given high levels of NH,*, the a-subunit can carry 
out the formation of anthranilate directly by a process in which the activity of the 
B-subunit is unnecessary. Furthermore, in certain enteric bacteria, such as E. coli 
and Salmonella typhimurium, the second reaction of the pathway, the phosphoribosyl- 
anthranilate transferase reaction (see Figure 25.37, reaction 7), is an activity catalyzed 


A Deeper Look 


Amino Acid Biosynthesis Inhibitors as Herbicides 


Unlike animals, plants can synthesize all 20 of the common 
amino acids. Inhibitors acting specifically on the plant enzymes 
that are capable of carrying out the biosynthesis of the essen- 
tial amino acids (that is, enzymes that animals lack) have been 
developed. These substances appear to be ideal for use as herbi- 
cides because they should show no effect on animals. Glyphosate, 
sold commercially as Roundup, is a PEP analog that acts as an 
uncompetitive inhibitor of 3-enolpyruvylshikimate-5S-P synthase 
(Figure 25.36). Sulfometuron methyl, a sulfonylurea herbicide that 


inhibits acetohydroxy acid synthase, an enzyme common to Val, 
Leu, and Ile biosynthesis (Figure 25.29), is the active ingredient 
in Oust. Aminotriazole, sold as Amitrole, blocks His biosynthesis 
by inhibiting imidazole glycerol-P dehydratase (Figure 25.40). PPT 
(phosphinothricin) is a potent inhibitor of glutamine synthetase. 
Although Gln is a nonessential amino acid and glutamine syn- 
thetase is a ubiquitous enzyme, PPT is relatively safe for animals 
because it does not cross the blood-brain barrier and is rapidly 
cleared by the kidneys. 
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by the a-subunit of anthranilate synthase. PRPP (5-phosphoribosyl-1-pyrophosphate), 
the substrate of this reaction, is also a precursor for purine biosynthesis (see 
Chapter 26). Phosphoribosyl-anthranilate then undergoes a rearrangement wherein 
the ribose moiety is isomerized to the ribulosyl form in enol-1-(o-carboxyphenylamino)- 
1-deoxyribulose-5-phosphate by N-(5'-phosphoribosyl)-anthranilate isomerase (see 
Figure 25.37, reaction 8). Decarboxylation and ring closure ensue to yield the indole 
nucleus as indole-3-glycerol phosphate (indole-3-glycerol phosphate synthase, reaction 
9). The final two reactions (10 and 11 in Figure 25.37) are both catalyzed by tryptophan 
synthase, an a)8>-type protein. The a-subunit cleaves indoleglycerol-3-phosphate to 
form indole and 3-glycerol phosphate. The indole is then passed to the B-subunit, 
which adds serine in a PLP-dependent reaction. 

X-ray crystallographic analysis of S. typhimurium tryptophan synthase shows that 
the active sites of the a- and B-subunits are separated from each other by 2.5 nm but are 
connected by a hydrophobic tunnel wide enough to accommodate indole (Figure 25.39). 
Indole, the product of the reaction catalyzed by the a-subunit (see Figure 25.37, reac- 
tion 10), can be transferred directly to the B-subunit, which catalyzes condensation with 
serine to yield Trp (see Figure 25.37, reaction 11). Thus, indole is not lost from the 
enzyme complex and diluted in the surrounding milieu. This phenomenon of direct 
transfer of enzyme-bound metabolic intermediates, or tunneling, increases the effi- 
ciency of the overall pathway by preventing loss and dilution of the intermediate. In 
addition, molecular tunnels protect reactive intermediates (some of which might be 
harmful) from potentially unproductive side reactions in the intracellular environment. 
Also, by directing the intermediate from one active site to another, these tunnels favor a 
particular reaction sequence. Molecular tunnels are found in many enzymes. 


25.4h Histidine Biosynthesis and Purine Biosynthesis 
Are Connected by Common Intermediates 


Like aromatic amino acid biosynthesis, histidine biosynthesis shares metabolic interme- 
diates with the pathway of purine nucleotide synthesis. The pathway involves ten sepa- 
rate steps, the first being an unusual reaction that links ATP and PRPP (Figure 25.40). 
Five carbon atoms from PRPP and one from ATP end up in histidine. Step 5 involves 
some novel chemistry: The substrate, phosphoribulosylformimino-5-aminoimidazole- 
4-carboxamide ribonucleotide, picks up an amino group (from the amide of glutamine) 
in a reaction accompanied by cleavage and ring closure to yield two imidazole 
compounds—the histidine precursor, imidazole glycerol phosphate, and a purine nucleo- 
tide precursor, 5-aminoimidazole-4-carboxamide ribonucleotide (AICAR). Note that 
AICAR as a purine nucleotide precursor can ultimately replenish the ATP consumed in 


907 


Figure 25.39 Tryptophan synthase ribbon diagram. 
The a-subunit is in blue, and the B-subunit is in 
orange (N-terminal domain) and red (C-terminal 
domain). The tunnel connecting them is outlined by 
the yellow dots and is shown with several indole 
molecules (green) packed in head-to-tail fashion. 
The labels “IPP” and “PLP” point to the active sites 
of the a- and B-subunits, respectively, in which a 
competitive inhibitor (indole propanol phosphate, 
IPP) and the coenzyme PLP are bound. (Adapted 
from Hyde, C. C., et al., 1988. Three-dimensional 
structure of the tryptophan synthase multienzyme 
complex from Salmonella typhimurium. Journal of 
Biological Chemistry 263:17857-17871.) 
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Figure 25.40 The pathway of histidine biosynthesis. The following enzymes catalyze each reaction: 
(1) ATP-phosphoribosyl transferase, (2) pyrophosphohydrolase, (3) phosphoribosyl-AMP cyclohydrolase, { 
(4) phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide isomerase, (5) imidazole To purine biosynthesis 


glycerol phosphate synthase, (6) imidazole glycerol-P dehydratase, (7) L-histidinol phosphate 
aminotransferase, (8) histidinol phosphate phosphatase, and (9) histidinol dehydrogenase. 
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reaction 1. Nine enzymes act in histidine’s ten synthetic steps. Reactions 9 and 10, the 
successive NAD*-dependent oxidations of an alcohol to an aldehyde and then to a car- 
boxylic acid, are catalyzed by the same dehydrogenase. 


25.5 | How Does Amino Acid Catabolism Lead 
into Pathways of Energy Production? 


In human adults, close to 90% of the energy requirement is met by oxidation of carbo- 
hydrates and fats; the remainder comes from oxidation of the carbon skeletons of amino 
acids derived from dietary protein. The dietary amount of free amino acids is trivial 
under most circumstances. The primary physiological purpose of amino acids is to serve 
as the building blocks for protein biosynthesis. However, if excess protein is consumed 
in the diet or if the amount of amino acids released during normal turnover of cellular 
proteins exceeds the requirements for new protein synthesis, the amino acid surplus 
must be catabolized. Also, if carbohydrate intake is insufficient (as during fasting or 
starvation) or if carbohydrates cannot be appropriately metabolized due to disease (as 
in diabetes mellitus), body protein becomes an important fuel for metabolic energy. 


25.5a The 20 Common Amino Acids Are Degraded by 20 Different 
Pathways That Converge to Just 7 Metabolic Intermediates 


Because the 20 common amino acids of proteins are distinctive in terms of their car- 
bon skeletons, each amino acid requires its own unique degradative pathway. Dietary 
recommendations indicate that protein should provide humans with about 30% of 
their daily caloric intake, so no single amino acid contributes significantly to energy 
needs. Degradation of the carbon skeletons of the 20 common a-amino acids con- 
verges to just 7 metabolic intermediates: acetyl-CoA, succinyl-CoA, pyruvate, a-keto- 
glutarate, fumarate, oxaloacetate, and acetoacetate (Table 25.2). Because succinyl-CoA, 
pyruvate, a-ketoglutarate, fumarate, and oxaloacetate can serve as precursors for 
glucose synthesis, amino acids giving rise to these intermediates are termed 
glucogenic. Those degraded to yield acetyl-CoA or acetoacetate are termed keto- 
genic, because these substances can be used to synthesize fatty acids or ketone bod- 
ies. Some amino acids are both glucogenic and ketogenic (Figure 25.41). Only leucine 
and lysine are solely ketogenic. 


The C-3 Family of Amino Acids: Alanine, Serine, and Cysteine The carbon skeletons of 
alanine, serine, and cysteine all converge to pyruvate (Figure 25.42). Transamination 
of alanine yields pyruvate: 


Alanine + a-ketoglutarate —— pyruvate + glutamate (25.10) 


| Table 25.2 Grouping of Amino Acids According to Their Degradation Product* 


Family C3 C-4 c-5 Succinyl-CoA Fumarate Acetyl-CoA Acetoacetate 

Degradation Pyruvate Oxaloacetate a-Ketoglutarate Succinyl-CoA Fumarate Acetyl-CoA Acetoacetate 

product 

Amino acids Alanine Aspartate Glutamate Valine Phenylalanine Leucine Leucine 
Serine Asparagine Glutamine Isoleucine Tyrosine Isoleucine Lysine 
Cysteine Proline Methionine Aspartate? Threonine Phenylalanine 
Glycine Arginine Tyrosine 
Threonine Histidine Tryptophan 
Tryptophan 


*Five of the amino acid degradation product families lead to metabolic intermediates that can serve as substrates for gluconeogenesis, namely pyruvate, oxaloacetate, 

a-ketoglutarate, succinyl-CoA, and fumarate. The other two families lead to acetyl-CoA or acetoacetate, and these intermediates cannot be used as substrates for gluconeogenesis. Thus, 
amino acids in these degradation families contribute to ketone body formation. 

Aspartate is readily deaminated to form oxaloacetate, but aspartate also leads to fumarate by virtue of the urea cycle (Figure 25.3, reaction 3) and reactions in the purine biosynthetic 
pathway (see Figure 26.3, reaction 9, and Figure 26.5). 
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Figure 25.41 Metabolic degradation of the common 
amino acids. Glucogenic amino acids are shown in 
pink, ketogenic in blue. Note that degradation of 12 
of the 20 amino acids leads to a citric acid cycle 
intermediate. 
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Deamination of serine by serine dehydratase also yields pyruvate. Cysteine is con- 
verted to pyruvate via a number of paths. 

The carbon skeletons of three other amino acids also become pyruvate. Glycine is 
convertible to serine and thus to pyruvate. The three carbon atoms of tryptophan that 
are not part of its indole ring appear as alanine (and, hence, pyruvate) upon tryptophan 
degradation. Threonine by one of its degradation routes is cleaved to glycine and acetal- 
dehyde. The glycine is then converted to pyruvate via serine; the acetaldehyde is oxidized 
to acetyl-CoA (Figure 25.42). 


The C-4 Family of Amino Acids: Aspartate and Asparagine Transamination of aspartate 
gives oxaloacetate: 


Aspartate + a-ketoglutarate oxaloacetate + glutamate (25.11) 


Hydrolysis of asparagine by asparaginase yields aspartate and NH,"*. Alternatively, 
participation of aspartate in the urea cycle and purine biosynthesis converts its carbon 
skeleton to a different citric acid cycle intermediate, namely, fumarate (Figure 25.23). 


The C-5 Family of Amino Acids Is Converted to a-Ketoglutarate via Glutamate The five- 
carbon citric acid cycle intermediate a-ketoglutarate is always a product of transami- 
nation reactions involving g/utamate. Thus, glutamate and any amino acid convertible 
to glutamate are classified within the C-5 family. These amino acids include gluta- 
mine, proline, arginine, and histidine (Figure 25.43). 
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Figure 25.44 Valine, isoleucine, and methionine are 
converted via propionyl-CoA to succinyl-CoA for 
entry into the citric acid cycle. The shaded carbon 
atoms of the three amino acids give rise to 
propionyl-CoA. 


Degradation of Valine, Isoleucine, and Methionine Leads to Succinyl-CoA The breakdown 
of valine, isoleucine, and methionine converges at propionyl-CoA (Figure 25.44). Methi- 
onine first becomes S-adenosylmethionine and then homocysteine (see Figure 25.28). 
The carboxyl groups from all three are lost as CO;. The two distal carbon atoms of 
isoleucine become acetyl-CoA. Propionyl-CoA subsequently is converted to methyl- 
malonyl-CoA and thence to succinyl-CoA via the same reactions mediating the oxida- 
tion of fatty acids that have odd numbers of carbon atoms (see Section 23.3). 


Leucine Is Degraded to Acetyl-CoA and Acetoacetate Leucine is one of only two purely 
ketogenic amino acids; the other is lysine. Deamination of leucine via a transamina- 
tion reaction yields a-ketoisocaproate, which is oxidatively decarboxylated to isovaleryl- 
CoA (Figure 25.45). Subsequent reactions, one of which is a biotin-dependent 
carboxylation, give B-hydroxy-B-methylglutaryl-CoA, which is then cleaved to yield 
acetyl-CoA and acetoacetate, a ketone body (see Figure 23.26). Neither of these prod- 
ucts is convertible to glucose. 


Degradation of the Branched-Chain Amino Acids Follows a Common Pathway Valine, 
leucine, and isoleucine are the three amino acids with branched aliphatic side 
chains, and all three are essential amino acids. Unlike the other 17 amino acids, 
which are broken down principally in the liver, Val, Ile, and Leu are also degraded 
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Figure 25.45 Leucine is degraded to acetyl-CoA and acetoacetate. 


in adipose tissue and skeletal muscle. The first three reactions in their catabolism are 
identical (Figure 25.46). These reactions, a transamination, followed by an oxidative 
decarboxylation to form a coenzyme A derivative, and then another dehydrogena- 
tion, are localized in the mitochondrial matrix. The liver has low levels of the first 
enzyme, branched-chain amino acid aminotransferase (BRAT), thus explaining 
branched-chain amino acid (BCAA) degradation in other tissues. The oxidative 
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Figure 25.46 The first three steps for degradation of 
valine, isoleucine, and leucine follow a common 
pathway. Reaction 1 is catalyzed by the branched- 
chain amino acid aminotransferase; reaction 2 is 
catalyzed by the branched-chain a-keto acid 
dehydrogenase (BCKDH); and reaction 3 is 
catalyzed by the acyl-CoA dehydrogenase of the 
fatty acid B-oxidation pathway (see Figure 23.9, 
reaction 1). The carbons leading to succinyl-CoA 
via propionyl-CoA are shown within blue-shaded 
regions; those leading to acetyl-CoA in orange- 
shaded regions; and those to acetoacetate in 
green-shaded regions. 
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Figure 25.47 Lysine degradation via 
the saccharopine, a-ketoadipate 
pathway culminates in the formation 
of the ketone body, acetoacetate. 


decarboxylation is carried out by the branched-chain a-keto acid dehydrogenase 
(BCKDH), an enzyme complex with striking similarity to the mitochondrial pyru- 
vate dehydrogenase and a-ketoglutarate dehydrogenase complexes discussed in 
Chapter 19 (see Figure 19.4 and Table 19.2). Hereditary defects in BCKDH leads to 
maple sugar urine disease. The a-keto acid derivatives of these amino acids accumu- 
late in the blood and ultimately, in the urine, where their presence causes the urine 
to smell like maple syrup. Such defects lead to abnormal development, intellectual 
disability, and shortened lifespan, even if dietary intake of branched-chain amino 
acids (BCAAs) is restricted. 


Lysine Degradation Lysine degradation proceeds by several pathways, but the sac- 
charopine pathway found in liver predominates (Figure 25.47). This degradative 
route proceeds backward along the lysine biosynthetic pathway through saccharo- 
pine and a-aminoadipate to a-ketoadipate (see Figure 25.24). Next, a-ketoadipate 
undergoes oxidative decarboxylation to glutaryl-CoA, which is then transformed 
into acetoacetyl-CoA and ultimately into the ketone body, acetoacetate. 


Phenylalanine and Tyrosine Are Degraded to Acetoacetate and Fumarate The first reac- 
tion in phenylalanine degradation is the hydroxylation reaction of tyrosine biosynthesis 
(see Figure 25.38). Both these amino acids thus share a common degradative pathway. 
Transamination of Tyr yields the a-keto acid p-hydroxyphenylpyruvate (Figure 25.48, 
reaction 1). p-Hydroxyphenylpyruvate dioxygenase, a vitamin C—dependent enzyme, 
then carries out a ring hydroxylation—oxidative decarboxylation to yield homogentis- 
ate (Figure 25.48, reaction 2). Ring opening and isomerization (Figure 25.48, reactions 
3 and 4) give 4-fumaryl-acetoacetate, which is hydrolyzed to acetoacetate and fumarate 
(reaction 6). 


Tryptophan Degradation Tryptophan is the least abundant amino acid, so its catab- 
olism to produce cellular energy is insignificant. However, tryptophan is a crucial 
precursor for synthesis of a variety of important substances, including the 
neurotransmitter serotonin (5-hydroxytryptamine) and the hormone melatonin 
(N-acetyl-S-methoxytryptamine). As noted earlier in the section on the degradation of 
the C-3 family of amino acids, degradation of the nonindole carbons of tryptophan 
yields pyruvate. The indole ring of Trp is converted by a series of reactions to 
a-ketoadipate and ultimately acetoacetate by the same reactions shown in Figure 25.47 
for lysine. The first reaction in tryptophan degradation is catalyzed by tryptophan-2,3- 
dioxygenase and results in the formation of N-formyl-kynurenine; and then kynurenine 
(Figure 25.49). If not degraded to a-ketoadipate, kynurenine leads to the formation of a 
wide range of secondary metabolites, including quinolinate (2,3-pyridinedicarboxylate), 
which is a precursor to the B vitamin, niacin, and thus, to NAD* and NADP?* synthe- 
sis. Interestingly, quinolinate is also a neurotoxic substance. Activation of the kynuren- 
ine pathway leading to quinolinate has been implicated in the neurodegeneration seen 
during aging, Alzheimer’s disease, Parkinson’s disease, Huntington’s disease, and 
amyotrophic lateral sclerosis. 
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kynurenine is not metabolized to a-ketoadipate. 
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Human Biochemistry 


Amino Acid Metabolism and Cancer 


Cancer is characterized by rapidly dividing cells with high energy 
demands. To meet the demands of growth, cancer cells extensively 
reorganize their metabolism. Glucose in the blood plasma (at more 
than 5 mM) is one source of energy and raw material for cancer 
cells, which elevate their glycolytic activity as a way of generating 
ATP for energy as well as metabolites for growth. Amino acids 
in blood plasma are another key resource for cancer-cell growth 
and proliferation, serving as the building blocks for synthesis of 
cancer-cell proteins but also as alternative fuels and as sources of 
carbon for anabolic pathways. In many cancers, amino acid me- 
tabolism is deregulated. The total concentration of amino acids in 
blood plasma exceeds 2 mM, with glutamine representing about 
40% of that total. Glutamine is exploited by cancer cells follow- 
ing deamination and anaplerotic metabolism (see Chapter 19) to 
a-ketoglutarate, which enters the citric acid cycle (see Chapter 
19). Branched-chain amino acids (BCAAs) such as isoleucine, 
leucine, and valine function as fuel sources through conversion to 
succinyl-CoA and as carbon sources for new lipid synthesis through 
formation of acetyl-CoA. The two key enzymes in BCAA me- 
tabolism, branched-chain aminotransferase (BCAT) and branched- 
chain keto acid dehydrogenase complex are overexpressed in many 
cancer types and the degree of overexpression correlates with 
malignancy. Aspartate, glutamate, glutamine, and glycine are pre- 


cursors to purine and pyrimidine biosynthesis; nucleotide synthesis 
is absolutely essential for rapidly proliferating cells. Glycine, serine, 
and methionine contribute to formation of methyl groups necessary 
for purine as well as thymine synthesis. 

Compared to normal cells, certain cancers show specific amino 
acid requirements or unique metabolic limitations and these 
dependencies make them vulnerable to targeted therapies. For 
example, both normal lymphocytes (a type of white blood cell) 
and the malignant lymphocytes of acute lymphoblastic leukemia 
(ALL) are highly dependent on the uptake of asparagine from 
the blood, since they do not express sufficient levels of asparagine 
synthetase. A successful strategy employed for many years is intro- 
duction of recombinant bacterial L-asparaginase into the plasma 
of ALL patients to lower Asn levels. Limiting exogenous aspara- 
gine or inhibiting its synthesis also reduces metastasis in breast 
cancer, resulting in cancer-cell death. Strategies addressing other 
cancers include limiting plasma levels of methionine, inhibiting 
glutaminase to block redirection of Gln into energy production 
or biosynthesis, and inhibiting BCAT to limit use of BCAAs as 
sources of carbon or fuel. As knowledge accumulates regarding 
the characteristic metabolic alterations defining different cancer- 
cell types, possibilities for therapeutic interventions will become 
even more promising. 


25.5b Animals Differ in the Form of Nitrogen That They Excrete 


Animals often enjoy a dietary surplus of nitrogen. Excess nitrogen liberated upon 
metabolic degradation of amino acids is excreted by animals in three different ways, 
based on the availability of water. Aquatic animals simply release free ammonia to 
the surrounding water; such animals are termed ammonotelic (from the Greek telos, 
meaning “end”). On the other hand, terrestrial and aerial species employ mechanisms 
that convert ammonium to less toxic waste compounds that require little H,O for 
excretion. Many terrestrial vertebrates, including humans, are ureotelic, meaning that 


Human Biochemistry 


Hereditary Defects in Phe Catabolism Underlie Alkaptonuria and Phenylketonuria 


Alkaptonuria and phenylketonuria are two human genetic diseases 
arising from specific enzyme defects in phenylalanine degradation. 
Alkaptonuria is characterized by urinary excretion of large amounts 
of homogentisate and results from a deficiency in homogentisate 
dioxygenase (step 3, Figure 25.48). Air oxidation of homogentisate 
causes urine to turn dark on standing, but the only malady suffered 
by carriers of this disease is a tendency toward arthritis later in life. 

In contrast, phenylketonurics, whose urine contains excessive 
phenylpyruvate (see accompanying figure), suffer severe intellectual 
disability if the defect is not recognized immediately after birth and 
treated by putting the victim on a diet low in phenylalanine. These 
individuals are deficient in phenylalanine hydroxylase (Figure 25.38), 


and the excess phenylalanine that accumulates is transaminated to 
phenylpyruvate and excreted. 
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Foundational Biochemistry 


they excrete excess N as urea, a highly water-soluble nonionic substance. Urea is 
formed by ureoteles via the urea cycle (see Figure 25.23). The uricotelic organisms are 
those animals using the third means of N excretion, conversion to uric acid, a rather 
insoluble purine analog. Birds and reptiles are uricoteles. Uric acid metabolism is 
discussed in the next chapter. Some animals can switch from ammonotelic to ureo- 
telic to uricotelic metabolism, depending on water availability. 


Foundational Biochemistry Things You Should Know after Reading Chapter 25. 


Nitrogen is cycled between organisms and the inanimate environment. 


The two principal forms of inorganic nitrogen in the environment are 
N; and nitrate. 


Nitrate assimilation is the principal pathway for ammonium biosyn- 
thesis and nitrogen acquisition in the biosphere. 


Nitrate assimilation is a two-step pathway consisting of nitrate reduc- 
tase and nitrite reductase. 


Organisms gain access to atmospheric N, via the pathway of nitrogen 
fixation. 


The nitrogenase enzyme complex that reduces N; to 2 NH4* is com- 
posed of two metalloproteins. 


Ammonium is introduced into organic linkage principally through 
three enzymatic reactions: glutamate dehydrogenase, glutamine syn- 
thetase, and carbamoyI-P synthetase I. 


The major pathways of ammonium assimilation, the GDH/GS route 
and the GOGAT/GS route, both lead to glutamine synthesis. 


E. coli glutamine synthetase is regulated allosterically by feedback 
inhibition through the action of nine metabolites that are key indica- 
tors of cellular nitrogen status. 


Glutamine synthetase is regulated by covalent modification through 
adenylylation of a Tyr side chain. 


Glutamine synthetase levels are regulated through controls on the 
expression of GinA, the gene encoding the GS subunit. 


Glutamine is the most abundant amino acid in human body fluids 
and the brain. 


Amino acids are formed from a-keto acids by transamination. 
Aminotransferases (transaminases) are pyridoxal-P-dependent enzymes. 


Amino acids can be grouped into families according to the metabolic 
intermediate that serves as their progenitor. 


The amino acid progenitors are a-ketoglutarate, oxaloacetate, pyru- 
vate, 3-phosphoglycerate, PEP, and erythrose-4-P. 


Humans can synthesize only 10 of the 20 common amino acids. 


Those amino acids that humans cannot synthesize are called essential 
amino acids. 


With only one minor exception, the pathways for essential amino acid 
synthesis contain more steps than the corresponding pathways for 
nonessential amino acid synthesis. 


The a-ketoglutarate family of amino acids includes glutamate, gluta- 
mine, proline, arginine, and lysine. 


The urea cycle acts to synthesize Arg and to excrete excess N through 
synthesis of urea, which is disposed of in the urine. 


The urea cycle can be viewed as both an ammonium and a bicarbon- 
ate disposal mechanism. 


The oxaloacetate family of amino acids includes Asp, Asn, Lys, Met, 
Thr, and Ile. 


Methionine, as S-adenosyl methionine (SAM), is the methyl donor in 
a variety of methylation reactions, including DNA methylation and 
histone modification. 


The pyruvate family of amino acids includes Ala, Val, and Leu. 


The 3-phosphoglycerate family of amino acids includes Ser, Gly, 
and Cys. 


The —SH group of cysteine comes from H3S. 

H2S synthesis from sulfate involves S-containing ATP derivatives. 
The aromatic amino acids are synthesized from chorismate. 
Chorismate is synthesized from PEP and erythrose-4-P. 


Chorismate is also a precursor for coenzyme Q, plastoquinone, vita- 
min E, and lignin. 

Anthranilate is an intermediate between chorismate and Trp in the 
tryptophan biosynthetic pathway. 


The pathways of essential amino acid biosynthesis are key targets of 
herbicides. 


Histidine biosynthesis and purine biosynthesis are connected by com- 
mon intermediates. 


Dietary protein is an important source of fuel for energy production. 


The 20 common amino acids are degraded by 20 different pathways 
that converge to just seven metabolic intermediates: a-ketoglutarate, 
oxaloacetate, pyruvate, succinyl-CoA, fumarate, acetyl-CoA, and ace- 
toacetate. 


Those amino acids that lead to a-ketoglutarate, oxaloacetate, pyru- 
vate, succinyl-CoA, or fumarate are considered glucogenic amino 
acids because these intermediates can serve as substrates for gluco- 
neogenesis. 


Those amino acids that lead to acetyl-CoA or acetoacetate are ke- 
togenic amino acids because they lead to the formation of ketone 
bodies. 


Only Leu and Lys are solely ketogenic. 
Ala, Ser, and Cys are degraded to pyruvate, as is Gly. 
Thr is degraded to pyruvate and acetyl-CoA. 


Asp and Asn are degraded to oxaloacetate. 
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e Glu, Gln, Pro, Arg, and His are degraded to a-ketoglutarate. 
e Degradation of Val, Ile, and Met leads to succinyl-CoA. 


e Leu is degraded to acetyl-CoA and acetoacetate. 


e Degradation of the branched-chain amino acids (Val, Leu, and Ile) 
follows a common pathway, a key reaction of which is the branched- 


chain a-keto acid dehydrogenase. 


e Lysis degraded to acetoacetate. 


Phe and Tyr are degraded to fumarate and acetoacetate. 


Hereditary defects in Phe catabolism lead to alkaptonuria or phenyl- 
ketonuria. 


Trp is degraded to pyruvate and acetoacetate. 


Trp is a precursor to the neurotransmitter serotonin, the hormone mela- 
tonin, and quinolinate, from which the B vitamin niacin can be formed. 


Amino acids in blood plasma are a key resource for cancer cells. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. The Oxidation States of Nitrogen What is the oxidation number of N 


in nitrate, nitrite, NO, NO, and N,? 


2. The Energetic Cost of Nitrogen Acquisition from the Environment 


How many ATP equivalents are consumed per N atom of ammonium 


formed by (a) the nitrate assimilation pathway and (b) the nitrogen 
fixation pathway? (Assume for this problem NADH, NADPH, and 
reduced ferredoxin are each worth three ATPs.) 


3. Structure of Nitrogenase Figure 25.5 shows the structures of metal 


clusters found in nitrogenase. One such cluster contains six iron atoms 


bonded to a single carbon atom. Carbon typically makes four 
chemical bonds. Explain how carbon could form six bonds. 


4. Glutamate Dehydrogenase Glutamate dehydrogenase catalyzes the 
reductive amination of a-ketoglutarate (see Figure 25.9). What is 
being reduced in this reaction? 


5. Regulation of Glutamine Synthetase by Covalent Modification 


Suppose at certain specific metabolite concentrations in vivo the cyclic 
cascade regulating E. coli glutamine synthetase has reached a dynamic 
equilibrium where the average state of GS adenylylation is poised at 


n = 6. Predict what change in n will occur if: 
a. [ATP] increases. 
b.  Pya/Pyp increases. 
c. [a-KG]/[Gln] increases. 
d. [P;] decreases. 
6. The Energetic Cost of Nitrogen Excretion via the Urea Cycle How 


many ATP equivalents are consumed in the production of one urea 
equivalent by the urea cycle? 


7. Why a High-Protein Diet Requires Greater Water Intake Why are 
persons on a high-protein diet (such as the Atkins diet) advised to 
drink lots of water? 


8. Succinylation in Biosynthesis of Lysine A succinylation step is 
involved in the biosynthesis pathway of lysine, as shown in Figure 
25.27. The formation of a carbon-nitrogen bond is an endergonic 


reaction. What provides the energy to drive this particular step in the 


pathway? 


9. The Energetic Cost of Lysine Synthesis How many ATP equivalents 
are consumed in the biosynthesis of lysine from aspartate by the 
pathway shown in Figure 25.27? 


10. PEP as a Precursor for Aromatic Amino Acid Biosynthesis If PEP 
labeled with '4C in the 2-position serves as the precursor to 
chorismate synthesis, which C atoms in chorismate are 
radioactive? 


11. Gluconeogenesis from Aspartate (Integrates with Chapter 22.) Write 
a balanced equation for the synthesis of glucose (by gluconeogenesis) 


from aspartate. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


Synthesis of Amino Acids from æ-Keto Acids For each of the 
20 common amino acids, give the name of the enzyme that catalyzes 
the reaction providing its a-amino group. 


Amino Acid Biosynthesis and TCA Cycle (Integrates with Chapter 19) 
For which two amino acids is their biosynthetic pathway very similar 
to the first half of the TCA cycle? 


The Essential Coenzyme in Amino Acid Metabolism Which vitamin is 
central in amino acid metabolism? Why? 


Vitamins and Coenzymes in Homocysteine Metabolism Vitamins Be, 
By, and folate may be recommended for individuals with high blood 
serum levels of homocysteine (a condition called hyperhomocystein- 

emia). How might these vitamins ameliorate hyperhomocysteinemia? 


Vitamins and Coenzymes in Branched-Chain Amino Acid Degrada- 
tion (Integrates with Chapter 19.) On the basis of the following 
information, predict a reaction mechanism for the mammalian 
branched-chain a-keto acid dehydrogenase complex (the BCKAD 
complex). This complex carries out the oxidative decarboxylation of 
the a-keto acids derived from valine, leucine, and isoleucine. 
a. One form of maple syrup urine disease responds well to adminis- 
tration of thiamine. 


b. Lipoic acid is an essential coenzyme. 
c. The enzyme complex contains a flavoprotein. 


Dietary Concerns in Phenylketonuria People with phenylketonuria 
must avoid foods containing the low-calorie sweetener Aspartame, 
also known as NutraSweet. Find the structure of Aspartame in the 
Merck Index (or other scientific source), and state why these people 
must avoid this substance. 


How the Herbicide Roundup Affects Plant Metabolism Glyphosate 
(otherwise known as Roundup) is an analog of PEP. It acts as a 
noncompetitive inhibitor of 3-enolpyruvylshikimate-5-P synthase; it 
has the following structure in its fully protonated state: 


HOOC—CH,—NH—CH),—PO3H) 


Consult Figures 25.35 and 25.36, and construct a list of the diverse 
metabolic consequences that might be experienced by a plant cell 
exposed to glyphosate. 


Glycine Synthesis from Glucose (Integrates with Chapter 18.) When 
cells convert glucose to glycine, which carbon atoms of glucose are 
represented in glycine? 


Glucogenic Amino Acids Review Figure 25.41 and explain why leucine 
and lysine cannot serve as precursors for gluconeogenesis. 


A Deficiency on 3-Phosphoglycerate Dehydrogenase Can Affect Amino 
Acid Metabolism Although serine is a nonessential amino acid, serine 
deficiency syndrome has been observed in humans. One such form 

of the syndrome is traceable to a deficiency in 3-phosphoglycerate 
dehydrogenase (see Figure 25.31). Individuals with this syndrome 

are serine-deficient and impaired in their ability to synthesize another 
common amino acid, as well as a class of lipids. Describe why. 
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22. Use the Protein Data Bank to Explore the Structure and Function of 
Glutamate Synthase Go to https://www.rcsb.org/, and examine the 
PDB file 1LM1 for glutamate synthase. Find its iron-sulfur cluster 
and FMN prosthetic group. Discover how this enzyme is organized 
into an N-terminal domain that functions in ammonia removal 
from glutamine (the glutaminase domain) and the a-ketoglutarate— 
binding site near the Fe/S and flavin prosthetic groups. Consult 
van den Heuvel, R. H. H., et al., 2002. Structural studies on the 
synchronization of catalytic centers in glutamate synthase. Journal 
of Biological Chemistry 277:24579-24583, to see how these two sites 
are connected by a tunnel for passage of ammonia from glutamine 
to a-ketoglutarate. 


23. The Energy Cost of Using Dietary Protein as Food The thermic effect 
of food is a term used to describe the energy cost of processing the 
food we eat, digesting it, and either turning it into precursors for 
needed biosynthesis, usable energy in the form of ATP, or storing the 
excess intake as fat. The thermic effect is usually approximated at 
10% of the total calories consumed, but the thermic effect of fat is 
only 2% to 3% of total fat calories and the thermic effect of protein is 
30% or more of calories consumed as protein. Why do you suppose 
dietary protein has a much higher thermic effect than either dietary 
carbohydrate or fat? 


24. From the dodecameric (a7) structure of glutamine synthetase shown 
in Figure 25.14, predict the relative enzymatic activity of GS 
monomers (isolated a-subunits). 
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25. Consider the synthesis and degradation of tyrosine as shown in 
Figures 25.37, 25.38, and 25.48 to determine where the carbon atoms 
in PEP and erythrose-4-P would end up in acetoacetate and fumarate. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


26. Examine the PSAFE Model for glutamine synthetase, and describe 
the formation of the GS dodecamer. 


27. Using the PSAFE Model for glutamate oxaloacetate transaminase 1 
as a guide, write a double-displacement (“ping-pong”) catalytic 
mechanism for this enzyme. 


Think-Pair-Share Question 


The biosynthesis of amino acids involves the consumption of ATP and 
reducing power in the form of NADPH or NADH, as is typical for 
anabolic reactions. For the list of amino acid biosynthesis reactions 
shown, describe the reduction step(s) (there can be more than one) and 
indicate the changes in oxidation numbers of the atom(s) involved. 


a. Biosynthesis of proline (Figure 25.20). 
Biosynthesis of ornithine (Figure 25.21). 
Biosynthesis of lysine in fungi (Figure 25.24). 
Biosynthesis of threonine (Figure 25.27). 
Biosynthesis of valine (Figure 25.29). 


moans p 


Biosynthesis of leucine (Figure 25.30). 
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Synthesis and Degradation 26 
of Nucleotides 


Guano, a substance found on some coasts frequented 
by sea birds, is composed chiefly of the birds’ 
partially decomposed excrement. ... The name for 
the purine guanine derives from the abundance of 
this base in guano. 


J. C. Nesbit 
On Agricultural Chemistry and the Nature 
and Properties of Peruvian Guano (1850) 


< Pigeon drinking at Gaia Fountain, Siena, Italy. The basic 
features of purine biosynthesis were elucidated initially 
from metabolic studies of nitrogen metabolism in 
pigeons. Pigeons excrete excess N as uric acid, 
a purine analog. 


Adam Woolfitt/Corbis 


Virtually all cells are capable of synthesizing purine and pyrimidine nucleotides. These com- 26.1 How Do Cells Synthesize Nucleotides? 
pounds then serve as essential intermediates in metabolism and as the building blocks for 26.2 How Do Cells Synthesize Purines? 


, i : eran 
DNA and RNA synthesis. How do cells synthesize purines and pyrimidines? 26.3 Can Cells Salvage Purines? 


26.4 How Are Purines Degraded? 


; o . . . a Ba 26.5 How Do Cells Synthesize Pyrimidines? 
No are ubiquitous constituents of life, actively participating in the majority 26.6 


of biochemical reactions. Recall that ATP is the energy currency of the cell, that 
uracil nucleotide derivatives of carbohydrates are common intermediates in cellular trans- 


How Are Pyrimidines Degraded? 


26.7 How Do Cells Form the 
Deoxyribonucleotides That Are 


formations of carbohydrates (see Chapter 22), and that biosynthesis of phospholipids Necessary for DNA Synthesis? 
proceeds via cytosine nucleotide derivatives (see Chapter 24). In Chapter 30, we will see 26.8 How Are Thymine Nucleotides 
that GTP serves as the immediate energy source driving the endergonic reactions of pro- Synthesized? 


tein synthesis. Many of the coenzymes (such as coenzyme A, NAD, NADP, and FAD) are 
derivatives of nucleotides. Nucleotides also act in metabolic regulation, as in the response 
of key enzymes of intermediary metabolism to the relative concentrations of AMP, ADP, 
and ATP (PFK is a prime example here; see also Chapter 18). Furthermore, cyclic deriva- 
tives of purine nucleotides such as cAMP and cGMP have no other role in metabolism 
than regulation. Last but not least, nucleotides are the monomeric units of nucleic acids. 
Deoxynucleoside triphosphates (dNTPs) and nucleoside triphosphates (NTPs) serve as 
the immediate substrates for the biosynthesis of DNA and RNA, respectively. 


26.1 | How Do Cells Synthesize Nucleotides? 


Nearly all organisms can make the purine and pyrimidine nucleotides via so-called de 
novo biosynthetic pathways. (De novo means “anew”; a less literal but more apt transla- 
tion might be “from scratch” because de novo pathways are metabolic sequences that 
form complex end products from rather simple precursors.) Many organisms also have 
salvage pathways to recover purine and pyrimidine compounds obtained in the diet or 
released during nucleic acid turnover and degradation. Whereas the ribose of nucleotides 
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can be catabolized to generate energy, the nitrogenous bases do not serve as energy 
sources; their catabolism does not lead to products used by pathways of ATP synthesis. 
Compared to slowly dividing cells, rapidly proliferating cells synthesize larger 
amounts of DNA and RNA per unit time. To meet the increased demand for nucleic 
acid synthesis, substantially greater quantities of nucleotides must be produced. The 
pathways of nucleotide biosynthesis thus become attractive targets for the clinical con- 
trol of rapidly dividing cells such as cancers or infectious bacteria. Many antibiotics 
and anticancer drugs are inhibitors of purine or pyrimidine nucleotide biosynthesis. 


26.2 | How Do Cells Synthesize Purines? 


Substantial insight into the de novo pathway for purine biosynthesis was provided in 
1948 by John Buchanan, who cleverly exploited the fact that birds excrete excess nitro- 
gen principally in the form of uric acid, a water-insoluble purine analog. Buchanan fed 
isotopically labeled compounds to pigeons and then examined the distribution of the 


Human PRPP synthetase (PDB id = 2H06). labeled atoms in uric acid (Figure 26.1). By tracing the metabolic source of the vari- 
PRPP synthetase initiates the pathway for purine ous atoms in this end product, he showed that the nine atoms of the purine ring system 
synthesis. (Figure 26.2) are contributed by aspartic acid (N-1), glutamine (N-3 and N-9), glycine 


(C-4, C-5, and N-7), CO, (C-6), and THF one-carbon derivatives (C-2 and C-8). THF is 
tetrahydrofolate, a coenzyme serving as a one-carbon transfer agent, not only in purine 
ring synthesis but also in amino acid metabolism (see Figures 25.27 and 25.32) and in 
synthesis of the pyrimidine thymine (see Figure 26.26). The formation and function of 
THF is summarized in A Deeper Look box “Tetrahydrofolate and One-Carbon Units.” 


re) H 
H | 
Ma N . 
N | \=0 26.2a IMP Is the Immediate Precursor to GMP and AMP 
ox ji 


The de novo synthesis of purines occurs in an interesting manner: The atoms forming 
| the purine ring are successively added to ribose-5-phosphate; thus, purines are directly 
H H synthesized as nucleotide derivatives by assembling the atoms that comprise the purine 
Uric acid ring system directly on the ribose. Beginning with ribose-5-P and ending with IMP, the 


Figure 26.1 Nitrogen waste is excreted by birds prin- pathway consists of 11 enzymatic steps (Figure 26.1). In step 1, ribose-5-phosphate is 
cipally as the purine analog, uric acid. activated via the direct transfer of a pyrophosphoryl group from ATP to C-1 of the 


B. subtilis glutamine phosphoribosyl pyrophosphate N°-formyl-THF 
amidotransferase (iron clusters in red, AMP in 
orange) (PDB id = 1GPH). Gln PRPP synthetase 


ribose, yielding 5-phosphoribosyl-a-pyrophosphate (PRPP) (Figure 26.3). The enzyme 
is ribose-5-phosphate pyrophosphokinase (also known as PRPP synthetase). PRPP is 
the limiting substance in purine biosynthesis. The two major purine nucleoside diphos- 
phates, ADP and GDP, are negative effectors of ribose-5-phosphate pyrophosphoki- 
nase. However, because PRPP serves additional metabolic needs, the next reaction is 
actually the committed step in the pathway. Because of this, some discussions of purine 
biosynthesis number the enzymes starting with Step 2. 

Step 2 is catalyzed by glutamine phosphoribosyl pyrophosphate amidotransferase. The 
anomeric carbon atom of the substrate PRPP is in the a-configuration; the product is a 
B-glycoside (recall that all the biologically important nucleotides are B-glycosides). The 
N atom of this N-glycoside becomes N-9 of the nine-membered purine ring; it is the 
first atom added in the construction of this ring. Glutamine PRPP amidotransferase is 
subject to feedback inhibition by GMP, GDP, and GTP as well as AMP, ADP, and ATP. 
The G series of nucleotides interacts at a guanine-specific allosteric site on the enzyme, 
whereas the adenine nucleotides act at an A-specific site. The pattern of inhibition by 
these nucleotides is such that residual enzyme activity remains until sufficient amounts 

CO, 
` _ Glycine 

artate aan 
a a: cs S g4 M°-formyl-THF 

On Enn’ 


Glutamine (amide-N) 


catalyzes the committed step in purine synthesis. Figure 26.2 The metabolic origin of the nine atoms in the purine ring system. 
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Figure 26.3 The de novo pathway for purine synthesis. The atoms added in each step are highlighted 
in color. Note the change in ribose-5-P representation after step 4. IMP (inosine monophosphate or 


inosinic acid) serves as a precursor to AMP and GMP. 
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A Deeper Look 


Tetrahydrofolate and One-Carbon Units (a) 
Folic acid, a B vitamin found in green plants, fresh fruits, yeast, and HoN N NS H 
liver, takes its name from folium, Latin for “leaf.” Folic acid is a pterin ba | J. 
(the 2-amino-4-oxo derivative of pteridine); pterins are named from HN N” cti DN = 
the Greek word pté ryj, for “wing,” because these substances were s 
first identified as the pigments in insect wings. Mammals cannot o H 
synthesize pterins and thus cannot make folates; they derive folates Folate 
from their diet or from microorganisms in their intestines. (See the NADPH + H+. 
Human Biochemistry box “Folate Analogs as Antimicrobial and 
Anticancer Agents” for the complete structure of folate.) 
Folates are acceptors and donors of one-carbon units for all oxi- NADIE 
dation levels of carbon except CO, (for which biotin is the relevant 
carrier). The active form is tetrahydrofolate (THF). THF is formed H 
through two successive reductions of folate by dihydrofolate reduc- l H 
tase (panel a of figure). One-carbon units in three different oxidation HoN Pa N fi 
states may be bound to THF at the N° or N" nitrogens (table shown Y | Be 
on page 925 and panel b of figure). The one-carbon unit carried by HN N ENER 
THF can come from formate (HCOO7), the a-carbon of glycine, the m | 
B-carbon of serine (see Figure 25.32), or the 3-position carbon in the Oo | 
imidazole ring of histidine. NADPH-dependent reactions intercon- Dihydrofolate 
vert the oxidation states of the various THF-bound one-carbon units. NADPH + H*. 
NADP+ 
H 
| 
(b) HoN N. N. h 
l-Carbon unit hie H 
oxidation y » : ze = | 
level: l N N | HN N H 
TNR Z Daan A | CH,—N—R 
eo (0) l 
Ẹ MN Tetrahydrofolate 
= 
L 
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v 
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2n 
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OEP 
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[e] 
ia 
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(Continued) 
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Oxidation States of Carbon in One-Carbon Units Carried by Tetrahydrofolate 
Oxidation Number* Oxidation Level One-Carbon Form? — Tetrahydrofolate Form 
—2 Methanol —CH; N>-Methyl-THF 
(most reduced) 
Formaldehyde —CH,— N>?,N!°-Methylene-THF 
Formate (most © —CH=O N?-Formyl-THF 
oxidized) —CH=O N°-Formyl-THF 
—CH=NH N°-Formimino-THF 
—CH= N°, N!°-Methenyl-THF 
*Calculated by assigning valence bond electrons to the more electronegative atom and then counting the 
charge on the quasi ion. A carbon assigned four valence electrons would have an oxidation number of 0. 
The carbon in N5-methyl-THF is assigned six electrons from the three C—H bonds and thus has an 
oxidation number of —2. 
tNote: All vacant bonds in the structures shown are to atoms more electronegative than C. 
of both adenine and guanine nucleotides are synthesized. Glutamine phosphoribosyl Azaserine o 
pyrophosphate amidotransferase is also sensitive to inhibition by the glutamine ana- WA 
log azaserine (Figure 26.4). Azaserine has been used as an antitumor agent because N= N = CH i O — CH, — CH ES 
it irreversibly inactivates glutamine-dependent enzymes by reacting with nucleophilic ò NH; om 
groups at the glutamine-binding site. Two such enzymes are found at steps 2 and 5 of 
the purine biosynthetic pathway. ĉi , 
Step 3 is carried out by glycinamide ribonucleotide synthetase (GAR synthetase) ii . 
via its ATP-dependent condensation of the glycine carboxyl group with the amine of 1 Pa 
5-phosphoribosyl-B-amine (see Figure 26.3). The reaction proceeds in two stages. First, HN — C — CH, — CH, — CH — C 
the glycine carboxyl group is activated via ATP-dependent phosphorylation. Next, an 7 | Nee 
amide bond is formed between the activated carboxyl group of glycine and the B-amine. NH; 


Glycine contributes C-4, C-5, and N-7 of the purine. 

Step 4 is the first of two THF-dependent reactions in the purine pathway of eukary- 
otes. (In E. coli and related organisms, formate, not N!°-formyl-THF, is the source of 
formyl groups, both here in and in step 10. In prokaryotes, these reactions depend on 
ATP for formate activation.) GAR transformylase transfers the N!'°-formyl group of 
N’°-formyl-THF to the free amino group of GAR to yield a-N-formylglycinamide ribo- 
nucleotide (FGAR). Thus, C-8 of the purine is “formyl-ly” introduced. Although all of 
the atoms of the imidazole portion of the purine ring are now present, the ring is not 
closed until Reaction 6. 

Step 5 is catalyzed by FGAR amidotransferase (also known as FGA M synthetase). ATP- 
dependent transfer of the glutamine amido group to the C-4-carbonyl of FGAR yields 
formylglycinamidine ribonucleotide (FGA M). The imino-N becomes N-3 of the purine. 

Step 6 is an ATP-dependent dehydration that leads to formation of the imidazole 
ring. ATP is used to phosphorylate the oxygen atom of the formyl group, activating 
it for the ring closure step that follows. Because the product is 5-aminoimidazole 
ribonucleotide, or AIR, this enzyme is called AIR synthetase. In higher organisms, the 
enzymatic activities for steps 3, 4, and 6 (GAR synthetase, GAR transformylase, and 
AIR synthetase) reside on a single multifunctional polypeptide. 

In step 7, carbon dioxide is added at the C-4 position of the imidazole ring by AIR 
carboxylase in an ATP-dependent reaction; the carbon of CO, will become C-6 of the 
purine ring. The product is carboxyaminoimidazole ribonucleotide (CAIR). 

In step 8, the amino-N of aspartate provides N-1 through linkage to the C-6 car- 
boxyl function of CAIR. ATP hydrolysis drives the condensation of Asp with CAIR. 
The product is N-succinylo-5-aminoimidazole-4-carboxamide ribonucleotide (SATCAR). 
SAICAR synthetase catalyzes the reaction. The enzymatic activities for steps 7 and 8 
reside on a single, bifunctional polypeptide in eukaryotes known as PAICS, an acro- 
nym for phosphoribosylaminoimidazole carboxylase and phosphoribosylaminoimid- 
azolesuccinocarboxamide synthetase, which are alternative names for AIR carboxylase 
and SAICAR synthetase. 


Figure 26.4 The structure of azaserine. Azaserine 
acts as an irreversible inhibitor of glutamine- 
dependent enzymes by covalently attaching to 
nucleophilic groups in the glutamine-binding site. 
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Human Biochemistry 


SAICAR Is a Key Signal for Metabolic Reprogramming in Cancer Cells 


Cancer cells show two distinctive hallmarks: altered metabolism and 
enhanced proliferation. How cancer cells coordinate their various meta- 
bolic pathways to sustain high growth rates is a subject of intensive 
biomedical investigation. A key player is pyruvate kinase isoform M2 
(PK M2)—see the Critical Developments in Biochemistry box “The 
Warburg Effect and Cancer” in Chapter 18. Interestingly, recent studies 
have demonstrated that the purine biosynthetic intermediate SAICAR 
is an allosteric activator of PK M2. SAICAR levels rise in glucose- 
starved cancer cells (from as little as 0.02 mM to as much as 0.7 mM). 
The effective SAICAR concentration for half-maximal activation of PK 
M2 is 0.3 mM. The kea of the PK M2 reaction increases by 2- to 3-fold 
upon SAICAR binding and its Km for PEP drops 20-fold (from 2 mM 
to 0.1 mM). This activation of PK M2 allows the cells to redirect their 
metabolism away from the use of glycolytic intermediates for anabolic 
purposes if glucose is not abundant, rebalancing the flow of metabolites 


to maintain pathways of energy production. Even more surprisingly, PK 
M2 is a protein kinase that uses PEP, not ATP, to phosphorylate more 
than 100 different human proteins (see also the Human Biochemistry 
box “Pyruvate Kinase M) Moonlights as a Protein Kinase in Cancer,” 
in Section 18.4). Most of these proteins are themselves protein kinases 
that are involved in the regulation of cell growth and division. Thus, 
SAICAR, by activating PK M2, is serving as a key signal to reprogram 
metabolism in cancer cells under nutrient-limited conditions. 
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Step 9 removes the four carbons of Asp as fumarate in a nonhydrolytic cleavage. 
The product is 5-aminoimidazole-4-carboxamide ribonucleotide (AICAR); the enzyme is 
adenylosuccinase (adenylosuccinate lyase). Adenylosuccinase acts again in that part of 
the purine pathway leading from IMP to AMP and takes its name from this latter reac- 
tion (see following). AICAR is also a by-product of the histidine biosynthetic pathway 
(see Figure 25.40), but because ATP is the precursor to AICAR in that pathway, no net 
purine synthesis is achieved. 

Step 10 adds the formyl carbon of N!°-formyl-THF as the ninth and last atom 
necessary for forming the purine nucleus. The enzyme that mediates this reaction is 
called AICAR transformylase; the products are THF and N-formylaminoimidazole- 
4-carboxamide ribonucleotide (FAICAR). 

Step 11 involves dehydration and ring closure to form the purine nucleotide IMP 
(inosine-5'-monophosphate or inosinic acid); this completes the initial phase of purine 
biosynthesis. The enzyme catalyzing this reaction is IMP cyclohydrolase (also known as 
IMP synthase and inosinicase). Unlike step 6, this ring closure does not require ATP. 
In bacteria and eukaryotes, but not archaea, the enzymatic activities catalyzing steps 
10 and 11 (AICAR transformylase and inosinicase) activities reside on a bifunctional 
polypeptide that dimerizes. 

Thus, in higher organisms, 6 of the 11 enzymatic activities for purine synthesis are 
localized to 3 polypeptides: The enzymes for steps 3, 4, and 6 are found on a trifunctional 
polypeptide, and those for steps 7 and 8 and for 10 and 11 are found on bifunctional 
polypeptides. In total then, 6 proteins achieve the 11 steps necessary for IMP formation, 
starting from PRPP. Benkovic and his colleagues have shown in a cultured human cell 
line that these six proteins cluster together in the cytosol to form multiprotein com- 
plexes when the cells are purine-depleted. This association is dynamic, because when the 
cell medium is supplemented with purines, the clustered proteins disperse. Benkovic and 
his colleagues have called these clusters purinosomes, since they represent the agents of 
de novo purine biosynthesis (Figure 26.5). Further, adenylosuccinate synthetase, IMP 
dehydrogenase, and GMP synthetase, enzymes downstream from IMP, which lead to 
AMP and GMP, respectively, are also part of the purinosome. (Adenylosuccinate syn- 
thetase and IMP dehydrogenase are described in the paragraphs under AMP and GMP 
Are Synthesized from IMP, which follows). The purinosome is an excellent example of 
metabolic channeling, the transfer of metabolites between different enzymic sites on a 
multifunctional polypeptide chain (discussed further in Section 26.5a). 

Note that six ATPs are required in the purine biosynthetic pathway from ribose- 
5-phosphate to IMP: one each at steps 1, 3, 5, 6, 7, and 8. However, seven high-energy 
phosphate bonds (equal to seven ATP equivalents) are consumed because a-PRPP forma- 
tion in step 1 followed by PP; release in step 2 represents the loss of two ATP equivalents. 
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three siblings and large and loving family. She now 
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Figure 26.5 Purinosomes—the dynamic metabolons that synthesize purines. Purinosomes are protein 
complexes that collectively bear all of the enzymatic activities needed to synthesize purine nucleotides 
from PRPP. The six proteins shown are PPAT (phosphoribosyl pyrophosphate amidotransferase; 

step 2), GART (GAR synthetase/GAR transformylase/AIR synthetase; steps 3, 4, and 6), PFAMS 
(phosphoribosylformylglycinamidine synthase; step 5), PAICS (AIR carboxylase/SAICAR synthetase; 
steps 7 and 8), ADSL (adenylosuccinate lyase; step 9), and ATIC (AICAR transformylase/IMP 
synthase; steps 10 and 11). As shown, purinosomes localize near mitochondria, the sources of formate 
and glycine, two essential substrates for this pathway. (Though they are also part of the purinosome, 
not shown in the figure are the three additional proteins—adenylosuccinate synthetase, IMP dehydro- 
genase, and GMP synthetase—necessary for AMP and GMP synthesis from IMP.) (Adapted from the 
figure in Alexandrov, T., 2020. Probing metabolism in time and space. Science 368:241—242.) 
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Folate Analogs as Antimicrobial and Anticancer Agents 


The dependence of de novo purine biosynthesis on folic acid 
compounds at steps 4 and 10 means that antagonists of folic 
acid metabolism indirectly inhibit purine formation and, in turn, 
nucleic acid synthesis, cell growth, and cell division. Clearly, 
rapidly dividing cells such as malignancies or infective bacteria 
are more susceptible to these antagonists than slower-growing 
normal cells. Among the folic acid antagonists are sulfonamides (see 
accompanying figure, part a). Folic acid is a vitamin for animals 
and is obtained in the diet. In contrast, bacteria synthesize folic acid 
from precursors, including p-aminobenzoic acid (PABA), and thus 


reductase, or DHFR (see A Deeper Look box “Tetrahydrofolate 
and One-Carbon Units”). Methotrexate (amethopterin), aminop- 
terin, and trimethoprim are three analogs of folic acid (see accom- 
panying figure, part b). The first two have been used in cancer 
chemotherapy and the treatment of autoimmune disorders. Each 
binds to DHFR with about 1000-fold greater affinity than folate or 
DHF, thus acting as a virtually irreversible inhibitor of THF forma- 
tion. Trimethoprim acts more effectively on bacterial DHFR and 
is prescribed for infections of the urinary tract. Next-generation 
folate-analog inhibitors of DHFR include lipophilic molecules that 


can passively diffuse across cell membranes. These molecules are 
being used to develop new anti-cancer drugs as well as antibiotics 
targeted at bacterial and parasitic infections. 


are more susceptible to sulfonamides than are animal cells. 
Formation of THF, the functional folate form, depends on 
reduction of folate (and dihydrofolate or DHF) by dihydrofolate 


(a) (0) 


/ 
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2-Amino, 4-amino analogs of folic acid 
®=H Aminopterin 
® = CH, Amethopterin (methotrexate) 
(a) Sulfa drugs, or sulfonamides, owe their antibiotic properties to their similarity to p-aminobenzoate (PABA), an important precursor in 


folic acid synthesis. Sulfonamides block folic acid formation by competing with PABA. (b) Precursors and analogs of folic acid employed 
as antimetabolites include methotrexate, aminopterin, and trimethoprim, as well as sulfonamides. 


26.2b AMP and GMP Are Synthesized from IMP 


IMP is the precursor to both AMP and GMP. These major purine nucleotides are 
formed via distinct two-step metabolic pathways that diverge from IMP. The branch 
leading to AMP (adenosine 5'-monophosphate) involves the displacement of the 6-O 
group of inosine with aspartate (Figure 26.6) in a GTP-dependent reaction, followed 
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by the nonhydrolytic removal of the four-carbon skeleton of Asp as fumarate; the Asp 
amino group remains as the 6-amino group of AMP. Adenylosuccinate synthetase and 
adenylosuccinase are the two enzymes catalyzing these reactions. Recall that adenylosuc- 
cinase also acted at step 9 in the pathway from ribose-5-phosphate to IMP. Fumarate 
production provides a connection between purine synthesis and the citric acid cycle. 

The formation of GMP from IMP requires oxidation at C-2 of the purine ring, fol- 
lowed by a glutamine-dependent amidotransferase reaction that replaces the oxygen on 
C-2 with an amino group to yield 2-amino,6-oxy purine nucleoside monophosphate, or 
as this compound is commonly known, guanosine monophosphate (Figure 26.6). The 
enzymes in the GMP branch are IMP dehydrogenase and GMP synthetase. Note that, 
starting from ribose-5-phosphate, eight ATP equivalents are consumed in the synthesis 
of AMP and nine in the synthesis of GMP. 


26.2c The Purine Biosynthetic Pathway Is Regulated at Several Steps 


The regulatory network that controls purine synthesis is schematically represented 
in Figure 26.7. The purine biosynthetic pathway from ribose-5-phosphate to IMP is 
allosterically regulated at the first two steps. Ribose-5-phosphate pyrophosphokinase, 
although not the committed step in purine synthesis, is subject to feedback inhibition 
by ADP and GDP. The enzyme catalyzing the next step, glutamine phosphoribosyl 
pyrophosphate amidotransferase (Gln PRPP amidotransferase) has two allosteric sites, 
one where the A series of nucleoside phosphates (AMP, ADP, and ATP) binds and 
feedback-inhibits, and another where the corresponding G series binds and inhibits. 
Furthermore, PRPP is a feed-forward activator of this enzyme. Thus, the rate of IMP 
formation by this pathway is governed by the levels of the final end products, the ade- 
nine and guanine nucleotides. 

The purine pathway splits at IMP. The first enzyme in the AMP branch, adenylo- 
succinate synthetase, is competitively inhibited by AMP. Its counterpart in the GMP 
branch, IMP dehydrogenase, is inhibited in a similar fashion by GMP. Thus, the fate 
of IMP is determined by the relative levels of AMP and GMP, so any deficiency in the 
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Figure 26.6 The synthesis of AMP and GMP from 

IMP. (a) AMP synthesis: The two reactions of AMP 
synthesis mimic steps 8 and 9 in the purine pathway 

leading to IMP. (b) GMP synthesis. 


Human adenylosuccinate lyase (AMP in red; 
PDB id = 2J91) 
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Figure 26.7 The regulatory circuit controlling purine biosynthesis. 


amount of either of the principal purine nucleotides is self-correcting. This reciprocity 
of regulation is an effective mechanism for balancing the formation of AMP and GMP 
to satisfy cellular needs. Note also that reciprocity is even manifested at the level of 
energy input: GTP provides the energy to drive AMP synthesis, whereas ATP serves 
this role in GMP synthesis (Figure 26.7). 

Further, IMP dehydrogenase has another function: It not only serves in purine 
nucleotide biosynthesis and allosteric regulation, but it is also a DNA-binding protein 
involved in the regulation of gene expression in eukaryotic cells. Depending on meta- 
bolic state and/or cellular stress, IMP dehydrogenase moves from the cytosol into the 
nucleus, where it acts in a transcriptional repressor role to ensure that cell proliferation 
is coupled to metabolic sufficiency. 


26.2d ATP-Dependent Kinases Form Nucleoside Diphosphates 
and Triphosphates from the Nucleoside Monophosphates 
The products of de novo purine biosynthesis are the nucleoside monophosphates 


AMP and GMP. These nucleotides are converted by successive phosphorylation reac- 
tions into their metabolically prominent triphosphate forms, ATP and GTP. The first 


GMP synthetase tetramer with Mg” (yellow) phosphorylation, to give the nucleoside diphosphate forms, is carried out by two base- 
adjacent to AMP (red; PDB id = 1GPM). specific, ATP-dependent kinases, adenylate kinase and guanylate kinase. 


Adenylate kinase: AMP + ATP —> 2 ADP 


Guanylate kinase: GMP + ATP —> GDP + ADP 


These nucleoside monophosphate kinases also act on deoxynucleoside monophos- 
phates to give dADP or dGDP. 

Oxidative phosphorylation (see Chapter 20) is primarily responsible for the 
conversion of ADP into ATP. ATP then serves as the phosphoryl donor for synthesis 
of the other nucleoside triphosphates from their corresponding NDPs in a reaction 
catalyzed by nucleoside diphosphate kinase, a nonspecific enzyme. For example, 


GDP + ATP == GTP + ADP 
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Because this enzymatic reaction is readily reversible and nonspecific with respect to 
both phosphoryl acceptor and donor, in effect any NDP can be phosphorylated by 
any NTP. The preponderance of ATP over all other nucleoside triphosphates means 
that, in quantitative terms, it is the principal nucleoside diphosphate kinase substrate. 
The enzyme does not discriminate between the ribose moieties of nucleotides and thus 
functions in phosphoryl transfers involving deoxy-NDPs and deoxy-NTPs as well. 


26.3 | Can Cells Salvage Purines? 


Nucleic acid turnover (synthesis and degradation) is an ongoing metabolic process in 
most cells. Messenger RNA in particular is actively synthesized and degraded. These 
degradative processes can lead to the release of free purines in the form of adenine, 
guanine, and hypoxanthine (the base in IMP). Purines represent a metabolic invest- 
ment by cells. So-called salvage pathways exist to recover them in useful form. Also, 
purine salvage is the only way to form purine nucleotides in some parasitic organisms, 
such as protozoans like Giardia. Salvage reactions involve resynthesis of nucleotides 
from bases via phosphoribosyltransferases. 


Base + PRPP =—— nucleoside-5'-phosphate + PP; 


The subsequent hydrolysis of PP; to inorganic phosphate by pyrophosphatases renders 
the phosphoribosyltransferase reaction effectively irreversible. 

The purine phosphoribosyltransferases are adenine phosphoribosyltransferase 
(APRT), which mediates AMP formation, and hypoxanthine-guanine phosphoribosyl- 
transferase (HGPRT), which can act on either hypoxanthine to form IMP or guanine 


to form GMP (Figure 26.8). Human HGPRT (Mg”* in yellow, a-PRPP in blue, 
purine analog in red; PDB id = 1D6N). 


26.4 | How Are Purines Degraded? 


Because nucleic acids are ubiquitous in cellular material, significant amounts are 
ingested in the diet. Nucleic acids are degraded in the digestive tract to nucleotides by 
various nucleases and phosphodiesterases. Nucleotides are then converted to nucleo- 
sides by base-specific nucleotidases and nonspecific phosphatases. 


NMP + HO —— nucleoside + P; 
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HGPRT reaction. 
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Lesch—Nyhan Syndrome—HGPRT Deficiency Leads to a Severe Clinical Disorder 


The symptoms of Lesch-Nyhan syndrome are tragic: a crippling, gouty 
arthritis due to excessive uric acid accumulation (uric acid is a purine 
degradation product, discussed in Section 26.4) and, worse, severe 
malfunctions in the nervous system that lead to intellectual disability, 
spasticity, aggressive behavior, and self-mutilation. Lesch-Nyhan 
syndrome results from a complete deficiency in HGPRT activity. 
The structural gene for HGPRT is located on the X chromosome, 
and the disease is a congenital, recessive, sex-linked trait manifested 
only in males. The severe consequences of HGPRT deficiency argue 
that purine salvage has greater metabolic importance than simply 
the energy-saving recovery of bases. Although HGPRT might seem 
to play a minor role in purine metabolism, its absence has profound 
consequences: De novo purine biosynthesis is dramatically increased, 
and uric acid levels in the blood are elevated. Presumably, these 
changes ensue because lack of consumption of PRPP by HGPRT 
elevates its availability for glutamine-PRPP amidotransferase, 
enhancing overall de novo purine synthesis and, ultimately, uric acid 
production (see accompanying figure). The dramatically elevated uric 
acid levels lead to the particular neurological aberrations characteristic 
of the syndrome. Fortunately, deficiencies in HGPRT activity in fetal 
cells can be detected following amniocentesis. However, no medication 
alleviates the neurological and behavioral consequences of this disease. 
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Nucleosides are hydrolyzed by nucleosidases or nucleoside phosphorylases to release 
the purine base: 


Nucleoside + H,O —cleosidase_, base + ribose 


nucleoside phosphorylase 


Nucleoside + P; base + ribose-1-P 


The pentoses liberated in these reactions provide the only source of metabolic energy 
available from purine nucleotide degradation. 

Feeding experiments using radioactively labeled nucleic acids as metabolic tracers 
have demonstrated that little of the nucleotide ingested in the diet is incorporated into 
cellular nucleic acids. Dietary purines are converted to uric acid in the gut and excreted, 
and pyrimidine nucleosides are inefficiently absorbed into the bloodstream. These find- 
ings confirm the de novo pathways of nucleotide biosynthesis as the primary source of 
nucleic acid precursors. Ingested bases are, for the most part, excreted. Nevertheless, 
cellular nucleic acids do undergo degradation in the course of the continuous recycling 
of cellular constituents. 


26.4a The Major Pathways of Purine Catabolism Lead to Uric Acid 


The major pathways of purine catabolism in animals lead to uric acid formation 
(Figure 26.9). The various nucleotides are first converted to nucleosides by intracellular 
nucleotidases. These nucleotidases are under strict metabolic regulation to ensure that 
their substrates, which act as intermediates in many vital processes, are not depleted 
below critical levels. Nucleosides are then degraded by the enzyme purine nucleoside 
phosphorylase (PNP) to release the purine base and ribose-l-P. Note that neither 
adenosine nor deoxyadenosine is a substrate for PNP. Instead, these nucleosides are 
first converted to inosine by adenosine deaminase. The PNP products are merged into 
xanthine by guanine deaminase and xanthine oxidase, and xanthine is then oxidized to 
uric acid by this latter enzyme. 
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Severe Combined Immunodeficiency Syndrome—A Lack of Adenosine 
Deaminase Is One Cause of This Inherited Disease 


Severe combined immunodeficiency syndrome, or SCID, is a group 
of related inherited disorders characterized by the lack of an 
immune response to infectious disease. This immunological insuf- 
ficiency is attributable to the inability of B and T lymphocytes 
to proliferate and produce antibodies in response to an antigenic 
challenge. About 30% of SCID patients suffer from a deficiency 
in the enzyme adenosine deaminase (ADA). ADA deficiency is also 
implicated in a variety of other diseases, including AIDS, anemia, 
and various lymphomas and leukemias. Gene therapy, the repair of 
a genetic deficiency through introduction of a functional recombi- 
nant version of the gene, has been performed on individuals with 


SCID due to a defective ADA gene (see Section 12.5). ADA is 
a Zn?+-dependent enzyme, and Zn** deficiency can also lead to 
reduced immune function. 

In the absence of ADA, deoxyadenosine is not deaminated to 
deoxyinosine as normal (see Figure 26.9). Instead, it is salvaged by 
a nucleoside kinase, which converts it to dAMP. Phosphorylation of 
dAMP leads to dATP, a potent feedback inhibitor of deoxynucleotide 
biosynthesis (discussed in Section 26.7). Without deoxyribonucleo- 
tides, DNA cannot be replicated and cells cannot divide (see accom- 
panying figure). Rapidly proliferating cell types such as lymphocytes 
are particularly susceptible if DNA synthesis is impaired. 
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26.4b The Purine Nucleoside Cycle in Skeletal Muscle 
Serves as an Anaplerotic Pathway 


Deamination of AMP to IMP by AMP deaminase (Figure 26.9) followed by resynthesis 
of AMP from IMP by the de novo purine pathway enzymes, adenylosuccinate synthetase 
and adenylosuccinate lyase, constitutes a purine nucleoside cycle (Figure 26.10). This cycle 
has the net effect of converting aspartate to fumarate plus NH,*. Although this cycle 
might seem like senseless energy consumption, it plays an important role in energy metab- 
olism in skeletal muscle: The fumarate that it generates replenishes the levels of citric acid 
cycle intermediates lost in amphibolic side reactions (see Chapter 19). Skeletal muscle 
lacks the usual complement of anaplerotic enzymes and relies on enhanced levels of 
AMP deaminase, adenylosuccinate synthetase, and adenylosuccinate lyase to compensate. 


26.4c Xanthine Oxidase 


Xanthine oxidase (Figure 26.9) is present in large amounts in liver, intestinal mucosa, and 
milk. It oxidizes hypoxanthine to xanthine and xanthine to uric acid. Xanthine oxidase is a 
rather indiscriminate enzyme, using molecular oxygen to oxidize a wide variety of purines, 
pteridines, and aldehydes, producing HO, as a product. Xanthine oxidase possesses 
FAD, nonheme Fe-S centers, and molybdenum cofactor (a molybdenum-containing pterin 
complex very similar to the one shown in Figure 25.2) as electron-transferring prosthetic 
groups. In humans and other primates, uric acid is the end product of purine catabolism 
and is excreted in the urine. Birds, terrestrial reptiles, and many insects also excrete uric 
acid, but in these organisms, uric acid represents the major nitrogen excretory compound 
because, unlike mammals, they do not also produce urea (see Section 25.4d). Instead, the 
catabolism of all nitrogenous compounds, including amino acids, is channeled into uric 
acid. This route of nitrogen catabolism allows these animals to conserve water by excreting 
crystals of uric acid in paste-like solid form. 
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Figure 26.9 The major pathways for purine catabo- 
lism convert the common purine nucleotides to uric 


acid. 


26.4d Gout Is a Disease Caused by an Excess of Uric Acid 


Gout is the clinical term describing the physiological consequences accompanying 
excessive uric acid accumulation in body fluids. Uric acid and urate salts are rather 
insoluble in water and tend to precipitate from solution if produced in excess. The most 
common symptom of gout is arthritic pain in the joints as a result of urate deposi- 
tion in cartilaginous tissue. The joint of the big toe is particularly susceptible. Urate 
crystals may also appear as kidney stones and lead to painful obstruction of the uri- 
nary tract. Hyperuricemia, chronic elevation of blood uric acid levels, occurs in about 
3% of the population as a consequence of impaired excretion of uric acid or overpro- 
duction of purines. Purine-rich foods such as caviar (fish eggs rich in nucleic acids) 
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may exacerbate the condition. The biochemical causes of gout are varied. A common 
treatment for gout is allopurinol (Figure 26.11). This hypoxanthine analog binds tightly 
to xanthine oxidase, thereby inhibiting its activity and preventing uric acid formation. 
Hypoxanthine and xanthine do not accumulate to harmful concentrations because 
they are more soluble and thus more easily excreted. 


26.4e Different Animals Oxidize Uric Acid to Form 
Various Excretory Products 


The subsequent metabolism of uric acid in organisms that don’t excrete it is shown in 
Figure 26.12. In molluscs and in mammals other than primates, uric acid is oxidized by 
urate oxidase to allantoin and excreted. In bony fishes (teleosts), uric acid degradation 
proceeds through yet another step wherein allantoin is hydrolyzed to allantoic acid 
by allantoinase before excretion. Cartilaginous fish (sharks and rays) and amphibians 
further degrade allantoic acid via the enzyme allantoicase to liberate glyoxylic acid and 
2 equivalents of urea. Even simpler animals, such as most marine invertebrates (crus- 
tacea and so forth), use urease to hydrolyze urea to CO, and ammonia. In contrast 
to animals that must rid themselves of potentially harmful nitrogen waste products, 
microorganisms often are limited in growth by nitrogen availability. Many possess 
an identical pathway of uric acid degradation, using it instead to liberate NH; from 
uric acid so that it can be assimilated into organic-N compounds essential to their 
survival. 


26.5 | How Do Cells Synthesize Pyrimidines? 


In contrast to purines, pyrimidines are not synthesized as nucleotide derivatives. 
Instead, the pyrimidine ring system is constructed before a ribose-5-P moiety is 
attached. Also, only two precursors, carbamoyl-P and aspartate, contribute atoms to 
the six-membered pyrimidine ring (Figure 26.13), compared to seven precursors for the 
nine purine atoms. 
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Figure 26.10 The purine nucleoside cycle for anaple- 
rotic replenishment of citric acid cycle intermediates 
in skeletal muscle. 
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Figure 26.11 Allopurinol, an analog of hypoxan- 
thine, is a potent inhibitor of xanthine oxidase. 
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Figure 26.12 The catabolism of uric acid to allan- 
toin, allantoic acid, urea, or ammonia in various 
animals. 


26.5a The Pyrimidine Biosynthetic Pathway from Carbamoyl-P 
and Aspartate to UMP 


Mammals have two enzymes for carbamoyl phosphate synthesis. Carbamoyl phosphate 
for pyrimidine biosynthesis is formed by carbamoyl phosphate synthetase IT (CPS-ID), 
a cytosolic enzyme. Recall that carbamoyl phosphate synthetase I is a mitochondrial 
enzyme dedicated to the urea cycle and arginine biosynthesis (see Figures 25.22 
and 25.23). The substrates of carbamoyl phosphate synthetase II are HCO; , H20, 
glutamine, and two ATPs (Figure 26.14). Thus, the N atom in carbamoyl phosphate 
made by CPS-II comes from the amide group of glutamine. The first ATP is used to 
form carboxy phosphate, an activated form of CO, (Figure 26.14, step 1). The glutamine 
amide displaces phosphate from carboxy phosphate to give carbamate (step 2). 
Phosphorylation of carbamate by the second ATP in the overall reaction yields 
carbamoyl phosphate (step 3). Because carbamoyl phosphate made by CPS-II in 
mammals has no fate other than incorporation into pyrimidines, mammalian CPS-II 
can be viewed as the committed step in the pyrimidine de novo pathway (Figure 26.15, 
step 1). Bacteria and plants have only one CPS, and its carbamoyl phosphate product 
is incorporated into arginine as well as pyrimidines. Thus, the committed step in 
bacterial pyrimidine synthesis is the next reaction, which is mediated by aspartate 
transcarbamoylase (ATCase). 

ATCase catalyzes the condensation of carbamoyl phosphate with aspartate to 
form carbamoyl aspartate (Figure 26.15, step 2). No ATP input is required at this step 
because carbamoyl phosphate is an activated carbamoyl group. 

Step 3 of pyrimidine synthesis involves ring closure and dehydration via linkage of 
the —NH) group introduced by carbamoyl phosphate with the B-COO™ of aspartate; 
this reaction is mediated by the enzyme dihydroorotase. The product of the reaction 
is dihydroorotate (DHO), a six-membered ring compound. Dihydroorotate is not a 
true pyrimidine, but its oxidation yields orotate, which is. This oxidation (step 4) is 
catalyzed by dihydroorotate dehydrogenase. Bacterial dihydroorotate dehydrogenases 
are NAD*-linked flavoproteins which are somewhat unusual in possessing both 
FAD and FMN; these enzymes also have nonheme Fe-S centers as additional 
redox prosthetic groups. The eukaryotic version of dihydroorotate dehydrogenase 
is a protein component of the inner mitochondrial membrane; its immediate e~ 
acceptor is a quinone, and oxidation of the reduced quinone by the mitochondrial 
e` transport chain can drive ATP synthesis via oxidative phosphorylation. At this 
stage, ribose-5-phosphate is joined to N-1 of orotate in appropriate N-B-glycosidic 
configuration, giving the pyrimidine nucleotide orotidine-5’-monophosphate, or OMP 
(step 5, Figure 26.15). The ribose phosphate donor is PRPP; the enzyme is orotate 
phosphoribosyltransferase. The next reaction is catalyzed by OMP decarboxylase. 
Decarboxylation of OMP gives UMP (uridine-5’-monophosphate, or uridylic acid), 
one of the two common pyrimidine ribonucleotides. 


26.5b UMP Synthesis Leads to Formation of the Two Most Prominent 
Ribonucleotides—UTP and CTP 
The two prominent pyrimidine ribonucleotide products are derived from UMP 


via the same unbranched pathway. First, UDP is formed from UMP via an ATP- 
dependent nucleoside monophosphate kinase. 


UMP + ATP == UDP + ADP 


Carbamoyl-P | — Aspartate 


ee: 


Figure 26.13 The metabolic origin of the six atoms of the pyrimidine ring. 
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Carbamoyl-P (CP) the expense of two ATP. 
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Figure 26.15 The de novo pyrimidine biosynthetic pathway synthesizes UMP from carbamoyl-P, 
aspartate, and PRPP. 
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Figure 26.16 CTP synthetase catalyzes the ATP- o NH, 
dependent transfer of the amido group of Gln to 
UTP, forming CTP. Hn NÝ 


| CTP synthetase 


As > | 
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39G D KRY 


+ ATP + ADP + ® yi 


+ H,O 


Then, UTP is formed by nucleoside diphosphate kinase. 
UDP + ATP =~ UTP + ADP 


Amination of UTP at the 6-position gives CTP. The enzyme CTP synthetase is a glu- 
tamine amidotransferase (Figure 26.16). ATP hydrolysis provides the energy to drive 
the reaction. Human CTP synthetase, like acetyl-CoA carboxylase (Chapter 24), is one 
of many metabolic enzymes known to assemble into filamentous structures. UTP and 
ATP favor such assembly and CTP and ADP reverse their effects. CTP synthetase fila- 
ments are enzymatically more active than the CTP synthetase tetramers from which 
they are formed. 


26.5c Pyrimidine Biosynthesis Is Regulated at ATCase 
in Bacteria and at CPS-II in Animals 


Pyrimidine biosynthesis in bacteria is allosterically regulated at aspartate transcarba- 
moylase (ATCase). Escherichia coli ATCase is feedback inhibited by the end product, 
CTP. ATP, which can be viewed as a signal of both energy availability and purine suf- 
ficiency, is an allosteric activator of ATCase. CTP and ATP compete for a common 
allosteric site on the enzyme. In many bacteria, UTP, not CTP, acts as the ATCase 
feedback inhibitor. 

In animals, CPS-II catalyzes the committed step in pyrimidine synthesis and serves 
as the focal point for allosteric regulation. UTP is a feedback inhibitor of the CPS-II 
activity of CAD, whereas PRPP is an allosteric activator. Figure 26.17 compares the 
regulatory circuits governing pyrimidine synthesis in bacteria and animals. 
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Figure 26.17 A comparison of the regulatory circuits that control pyrimidine synthesis in Æ. coli and 
animals. 
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26.5d “Metabolic Channeling” by Multifunctional Enzymes 
of Mammalian Pyrimidine Biosynthesis 


CPS-II (Figure 26.15), glutamine 5-phosphoribosyl-a-pyrophosphate amidotrans- 
ferase (see Figure 26.3), and CTP synthetase (Figure 26.16) are enzymes with metabo- 
lite tunnels (see also Figure 25.39). In a subtle distinction, metabolic channeling refers 
to the transfer of metabolites between different enzymic sites on a multifunctional poly- 
peptide chain as seen in Figure 26.5 for purinosomes. 

In bacteria, the six enzymes of de novo pyrimidine biosynthesis exist as distinct pro- 
teins, each independently catalyzing its specific step in the overall pathway. In contrast, 
in mammals, the six enzymatic activities are distributed among only three proteins, 
two of which are multifunctional polypeptides. The first three steps of pyrimidine 
synthesis, CPS-II, aspartate transcarbamoylase, and dihydroorotase, are all localized 
on a single 250-kDa multifunctional polypeptide called CAD, from the first letters 
of its three enzymatic activities. CAD polypeptides assemble to form hexameric, 1.5 
megadalton particles. Step 4 in Figure 26.15 is catalyzed by DHO dehydrogenase, a 
separate enzyme associated with the outer surface of the inner mitochondrial mem- 
brane, but the enzymatic activities mediating steps 5 and 6, namely, orotate phospho- 
ribosyltransferase and OMP decarboxylase in mammals, are also found on a single 
cytosolic polypeptide known as UMP synthase. 

Such multifunctional enzymes confer an advantage: The product of one reaction in a 
pathway is the substrate for the next. In multifunctional enzymes, such products remain 
bound and are channeled directly to the next active site, rather than dissociated into 
the surrounding medium for diffusion to the next enzyme. This metabolic channeling 
is more efficient for a variety of reasons: Transit time for movement from one active 
site to the next is shortened, substrates are not diluted into the solvent phase, chemi- 
cally reactive intermediates are protected from decomposition in the aqueous milieu, no 
pools of intermediates accumulate, and intermediates are shielded from interactions with 
other enzymes that might metabolize them. The end result is that metabolic channeling 
can increase the catalytic efficiency of a reaction sequence by as much as two orders 
of magnitude while minimizing the overall concentration of the metabolic intermediates 
involved. 


26.6 | How Are Pyrimidines Degraded? 


In some organisms, free pyrimidines are salvaged and recycled to form nucleotides 
via phosphoribosyltransferase reactions similar to those discussed earlier for purines. 
In humans, however, pyrimidines are recycled from nucleosides, but free pyrimidine 
bases are not salvaged. Pyrimidine catabolism results in degradation of the pyrimidine 
ring to products reminiscent of the original substrates, aspartate, CO», and ammonia 
(Figure 26.18). B-Alanine can be recycled into the synthesis of coenzyme A. Catabolism 
of the pyrimidine base, thymine (5-methyluracil), yields 8B-aminoisobutyric acid instead 
of B-alanine. 

Interestingly, exercise induces the release of B-aminoisobutyric acid (or BAIBA) 
from muscle. BAIBA leads to activation of fatty acid oxidation in the liver and pro- 
motes the formation of brown adipose tissue from white fat. Levels of BAIBA in the 
blood are inversely correlated with metabolic risk factors. It has been known for a long 
time that exercise protects against metabolic diseases. It is now proposed that BAIBA 
mediates this protective effect. 

Pathways presented thus far in this chapter account for the synthesis of the four 
principal ribonucleotides: ATP, GTP, UTP, and CTP. These compounds serve 
important coenzymic functions in metabolism and are the immediate precursors 
for ribonucleic acid (RNA) synthesis. Roughly 90% of the total nucleic acid in cells 
is RNA, with the remainder being deoxyribonucleic acid (DNA). DNA differs from 
RNA in being a polymer of deoxyribonucleotides, one of which is deoxythymidylic 
acid. We now turn to the synthesis of these compounds. 


26.6 How Are Pyrimidines Degraded? 
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Figure 26.18 Pyrimidine degradation. 
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Figure 26.19 Deoxyribonucleotide synthesis involves 
reduction at the 2'-position of the ribose ring of 
nucleoside diphosphates. 


Structure of the human ribonucleotide reductase 
R1 dimer. One monomer is shown in yellow, the 
other in cyan. The catalytic, or C, sites are occupied 
by GDP (orange); the specificity, or S, sites are 
occupied by TTP (green); and the activity, or A, 
sites are occupied by ATP (purple). This molecular 
graphic is a composite of PDB id = 3HND and 
PDB id = 3HNE. (Adapted from Figure 1 in 
Fairman, J. W., et al., 2011. Structural basis for 
regulation of human ribonucleotide reductase 

by nucleotide-induced oligomerization. Nature 
Structural and Molecular Biology 18:316—323.) 


26.7 | How Do Cells Form the Deoxyribonucleotides 
That Are Necessary for DNA Synthesis? 


The deoxyribonucleotides have only one metabolic purpose: to serve as precursors for 
DNA synthesis. In most organisms, ribonucleoside diphosphates (NDPs) are the sub- 
strates for deoxyribonucleotide formation. Reduction at the 2'-position of the ribose 
ring in NDPs produces 2'-deoxy forms of these nucleotides (Figure 26.19). This reac- 
tion involves replacement of the 2’-OH by a hydride ion (H: ~) and is catalyzed by 
an enzyme known as ribonucleotide reductase. Enzymatic ribonucleotide reduction 
involves a free radical mechanism, and three classes of ribonucleotide reductases are 
known, differing from each other in their mechanisms of free radical generation. Class I 
enzymes, found in Æ. coli and virtually all eukaryotes, are Fe-dependent and generate 
the required free radical on a specific tyrosyl side chain. 


26.7a E. coli Ribonucleotide Reductase Has Three 
Different Nucleotide-Binding Sites 


The enzyme system for dNDP formation consists of four proteins, two of which con- 
stitute the ribonucleotide reductase proper, an enzyme of the a8, type. The other two 
proteins, thioredoxin and thioredoxin reductase, function in the delivery of reducing 
equivalents, as we shall see shortly. The two proteins of ribonucleotide reductase are 
designated R1 (86 kDa) and R2 (43.5 kDa), and each is a homodimer in the holo- 
enzyme (Figure 26.20). The R1 homodimer carries two types of regulatory sites in 
addition to the catalytic site (the active site). Substrates (ADP, CDP, GDP, and UDP) 
bind at the catalytic site. One regulatory site—the substrate specificity site—binds ATP, 
dATP, dGTP, or dTTP, and which of these nucleotides is bound there determines 
which nucleoside diphosphate is bound at the catalytic site. The other regulatory site, 
the overall activity site, binds either the activator ATP or the negative effector dATP; 
the nucleotide bound here determines whether the enzyme is active or inactive. Activity 
depends also on residues Cyst, Cys*?>, and Cys*? in R1. The R2 homodimer is the 
source of the free radical that initiates the ribonucleotide reductase reaction. Its two 
Fe atoms generate this free radical on R2 Tyr!?, which in turn generates a thiyl free 
radical (Cys-S-) on Cys**?. Cys¥9-S- initiates ribonucleotide reduction by abstracting 
the 3'-H from the ribose ring of the nucleoside diphosphate substrate (Figure 26.21) 
and forming a free radical on C-3’. Subsequent dehydration forms the deoxyribonucle- 
otide product. 
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Figure 26.20 E. coli ribonucleotide reductase is an a> tetramer organized as a dimer of R1 and R2 
dimers. 
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Figure 26.21 The free radical mechanism of ribonucleotide reduction. H, designates the C-3' 
hydrogen, and H,, the C-2' hydrogen atom. The polypeptide chain of R1 is represented by the 
green band carrying the Cys residues of R1 involved in the reaction mechanism. 


26.7b Thioredoxin Provides the Reducing Power 
for Ribonucleotide Reductase 


NADPH is the ultimate source of reducing equivalents for ribonucleotide reduction, 
but the immediate source is reduced thioredoxin, a small (12 kDa) protein with reactive 
Cys-sulfhydryl groups situated near one another in the sequence Cys-Gly-Pro-Cys. These 
Cys residues are able to undergo reversible oxidation—reduction between (—S—S—) and 
(—SH HS—) and, in their reduced form, serve as primary electron donors to regenerate 
the reactive —SH pair of the ribonucleotide reductase active site (Figure 26.21). In turn, 
the sulfhydryls of thioredoxin must be restored to the (—SH HS—) state for another 
catalytic cycle. Thioredoxin reductase, an a-type enzyme composed of 58-kDa flavopro- 
tein subunits, mediates the NADPH-dependent reduction of thioredoxin (Figure 26.22). 
Thioredoxin functions in a number of metabolic roles besides deoxyribonucleotide syn- 
thesis, the common denominator of which is reversible sulfide: sulfhydryl transitions. 
The substrates for ribonucleotide reductase are CDP, UDP, GDP, and ADP, and the 
corresponding products are (CDP, dUDP, dGDP, and dADP. Because CDP is not an 
intermediate in pyrimidine nucleotide synthesis, it must arise by dephosphorylation of 
CTP, for instance, via nucleoside diphosphate kinase action. Although uridine nucleotides 
do not occur in DNA, UDP is a substrate. The formation of dUDP is justified because it 
is a precursor to dTTP, a necessary substrate for DNA synthesis (see following discussion). 


(a) 
NADPH FAD SH HS os) SH HS NDP 
+ Ht 
NADPH FADH, E= SH HS s—s dNDP 
Thioredoxin reductase Thioredoxin Ribonucleotide 
reductase 


Figure 26.22 NADPH is the ultimate source of reducing power for ribonucleotide reduction. (a) The 
(—S—S—)/(—SH HS—) oxidation—reduction cycle involving ribonucleotide reductase, thioredoxin, 
thioredoxin reductase, and NADPH. (b) Thioredoxin reductase (œ, dimer) with bound ligands: FAD 
(gold) and NADP analog (blue). Each a-subunit is associated with a thioredoxin molecule (yellow; 
PDB id = 1F6M). 
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26.7c Both the Specificity and the Catalytic Activity 
of Ribonucleotide Reductase Are Regulated 
by Nucleotide Binding 


Ribonucleotide reductase activity must be modulated in two ways to maintain an 
appropriate balance of the four deoxynucleotides essential to DNA synthesis, namely, 
dATP, dGTP, dCTP, and dTTP. First, the overall activity of the enzyme must be turned 
on and off in response to the need for dNTPs. Second, the relative amounts of each 
NDP substrate transformed into dNDP must be controlled so that the right balance of 
dATP:dGTP:dCTP: dTTP is produced. The two different allosteric sites on ribonucle- 
otide reductase, discrete from the substrate-binding catalytic site, are designed to serve 
these purposes. As noted previously, these two allosteric sites are designated the overall 
activity site and the substrate specificity site. Only ATP and dATP are able to bind at 
the overall activity site. ATP is an allosteric activator and dATP is an allosteric inhibi- 
tor, and they compete for the same site. If ATP is bound, the enzyme is active, whereas 
if its deoxy counterpart, dATP, occupies this site, the enzyme is inactive. Although 
ATP is more abundant than dATP, dATP binds to ribonucleotide reductase with 
100-fold greater affinity. When dATP binds, R1 dimers aggregate to form inactive 
hexamers; ATP binding reverses this oligomerization. 

The second allosteric site, the substrate specificity site, can bind either ATP, dTTP, 
dGTP, or dATP, and the substrate specificity of the enzyme is determined by which of 
these nucleotides occupies this site. If ATP is in the substrate specificity site, ribonucleotide 
reductase preferentially binds pyrimidine nucleotides (UDP or CDP) at its active site and 
reduces them to (UDP and dCDP. With dTTP in the specificity-determining site, GDP 
is the preferred substrate. When dGTP binds to the specificity site, ADP becomes the 
favored substrate for reduction. The rationale for these varying affinities is as follows 
(Figure 26.23): High [ATP] is consistent with cell growth and division and, consequently, 
the need for DNA synthesis. Thus, ATP binds in the overall activity site of ribonucleotide 
reductase, turning it on and promoting production of dNTPs for DNA synthesis. Under 
these conditions, ATP is also likely to occupy the substrate specificity site, so UDP and 
CDP are bound at the catalytic site and reduced to (UDP and dCDP. Both of these 
pyrimidine deoxynucleoside diphosphates are precursors to dTTP. Thus, elevation of 
dUDP and dCDP levels leads to an increase in [ATTP]. High dTTP levels increase the 
likelihood that it will occupy the substrate specificity site, in which case GDP becomes 
the preferred substrate and dGTP levels rise. Upon dGTP association with the substrate 
specificity site, ADP is the favored substrate, leading to ADP reduction and the eventual 
accumulation of dATP. Binding of dATP to the overall activity site then shuts the enzyme 
down. In summary, the relative affinities of the three classes of nucleotide-binding sites 
in ribonucleotide reductase for the various substrates, activators, and inhibitors are such 
that the formation of dNDPs proceeds in an orderly and balanced fashion. As these 
dNDPs are formed in amounts consistent with cellular needs, their phosphorylation by 
nucleoside diphosphate kinases produces dNTPs, the actual substrates of DNA synthesis. 
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Figure 26.23 Regulation of deoxynucleotide biosynthesis: the rationale for the various affinities dis- 
played by the two nucleotide-binding regulatory sites on ribonucleotide reductase. 
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26.8 | How Are Thymine Nucleotides Synthesized? 


The synthesis of thymine nucleotides proceeds from other pyrimidine deoxyribonucleo- 
tides. Cells have no requirement for free thymine ribonucleotides and do not synthesize 
them. Small amounts of thymine ribonucleotides do occur in tRNA (tRNA is notable 
for having unusual nucleotides), but these Ts arise via methylation of U residues already 
incorporated into the tRNA. Both dUDP and dCDP can lead to formation of dUMP, 
the immediate precursor for dTMP synthesis (Figure 26.24). Interestingly, formation 
of dUMP from dUDP passes through dUTP, which is then cleaved by dUTPase, a 
pyrophosphatase that removes PP; from dUTP. The action of dUTPase prevents dUTP 
from serving as a substrate in DNA synthesis. An alternative route to (UMP formation 
starts with dCDP, which is dephosphorylated to (CMP and then deaminated by dCMP 
deaminase (Figure 26.25), leaving (UMP. dCMP deaminase provides a second point 
for allosteric regulation of dNTP synthesis; it is allosterically activated by dCTP and 
feedback-inhibited by dTTP. Of the four dNTPs, only dCTP does not interact with 
either of the regulatory sites on ribonucleotide reductase (see Figure 26.20). Instead, it 
acts upon dCMP deaminase. 

Synthesis of dTMP from dUMP is catalyzed by thymidylate synthase (Figure 26.26). 
This enzyme methylates (UMP at the 5-position to create dTMP; the methyl donor is 
the one-carbon folic acid derivative N°,N!°-methylene-THF. The reaction is actually a 
reductive methylation in which the one-carbon unit is transferred at the methylene level 
of reduction and then reduced to the methyl level. The THF cofactor is oxidized at the 
expense of methylene reduction to yield DHF. DHFR then reduces DHF back to THF 
for service again as a one-carbon vehicle (see panel a of the figure in A Deeper Look 
box “Tetrahydrofolate and One-Carbon Units” in Section 26.2). Thymidylate synthase 
sits at a junction connecting dNTP synthesis with folate metabolism. It has become a 
preferred target for inhibitors designed to disrupt DNA synthesis. An indirect approach 
is to employ folic acid precursors or analogs as antimetabolites of dTMP synthesis (see 
panel b of the figure in A Deeper Look box “Folate Analogs as Antimicrobial and 
Anticancer Agents” in Section 26.2). Purine synthesis is affected as well because it is 
also dependent on THF (see Figure 26.3). 


From dUDP: dUDP dUTP > dUMP > dTMP 
From dCDP: dCDP ——_____+ dCMP > dUMP > dTMP 


Figure 26.24 Pathways of dTMP synthesis. dTMP production is dependent on dUMP formation from 
dCDP and dUDP. 
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Figure 26.25 Deamination of dCMP forms dUMP. (a) The dCMP deaminase reaction. (b) Trimeric 
dCMP deaminase. Each chain has a bound dCTP molecule (purple) and a Mg?" ion (orange; PDB id = 
1XS4). 
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Figure 26.26 Thymidylate is synthesized from UMP 
and N°, N! methylene-THF. (a) The thymidylate syn- 
thase reaction. The 5-CH; group is ultimately derived 
from the B-carbon of serine. (b) Thymidylate synthase 
dimer. Each monomer has a bound folate analog 
(green) and dUMP (light blue; PDB id = 1JUJ). 


Human Biochemistry 


Fluoro-Substituted Pyrimidines in Cancer Chemotherapy, 


Fungal Infections, and Malaria 


5-Fluorouracil (5-FU; see part a of the figure) is a thymine analog. 
It is converted in vivo to 5’-fluorouridylate by a PRPP-dependent 
phosphoribosyltransferase and passes through the reactions of 
dNTP synthesis, culminating ultimately as 2'-deoxy-5-fluorouridylic 
acid, a potent inhibitor of dTMP synthase (see A Deeper Look box 
“Fluoro-Substituted Analogs Are Good Trojan Horse Inhibitors” 
in Section 26.8). 5-FU is used as a chemotherapeutic agent in the 
treatment of human cancers. Similarly, 5-fluorocytosine (see part b) 
is used as an antifungal drug because fungi, unlike mammals, can con- 
vert it to 2’-deoxy-5-fluorouridylate. Furthermore, malarial parasites 
can use exogenous orotate to make pyrimidines for nucleic acid syn- 
thesis, whereas mammals cannot. Thus, 5-fluoroorotate (see part c) 
is an effective antimalarial drug because it is selectively toxic to these 
parasites. 


(a) o (b) NH, 
wy ry 
Bo as 


H H 


5-Fluorouracil 5-Fluorocytosine 


(c) 


5-Fluoroorotate 
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A Deeper Look 


Fluoro-Substituted Analogs Are Good Trojan Horse Inhibitors 


Carbon-fluorine bonds are exceedingly rare in nature, and fluorine 
is an uncommon constituent of biological molecules. F has three 
properties attractive to drug designers: (1) It is the smallest replace- 
ment for an H atom in organic synthesis, (2) fluorine is the most elec- 
tronegative element, and (3) the F—C bond is relatively unreactive. 
This steric compactness and potential for strong inductive effects 
through its electronegativity renders F a useful substituent in the 
construction of inhibitory analogs of enzyme substrates. One inter- 
esting strategy is to devise fluorinated precursors that are taken up 
and processed by normal metabolic pathways to generate a potent 
antimetabolite. A classic example is fluoroacetate. FCH,COO7™ is 
exceptionally toxic because it is readily converted to fluorocitrate 
by citrate synthase of the citric acid cycle (see Chapter 19). In 
turn, fluorocitrate is a powerful inhibitor of aconitase. The meta- 
bolic transformation of an otherwise innocuous compound into a 
poisonous derivative is termed lethal synthesis. 5-Fluorouracil and 
5-fluorocytosine are also examples of this strategy. 

Unlike hydrogen, which is often abstracted from substrates as 
H+, electronegative fluorine cannot be readily eliminated as the 
corresponding F*. Thus, enzyme inhibitors can be fashioned in 


Cys —® 


which F replaces H at positions where catalysis involves H removal N°,N!°-methylene- THF 
as H*. Thymidylate synthase catalyzes removal of H from dUMP H 

as H* through a covalent catalysis mechanism. A thiol group on sl 

this enzyme normally attacks the 6-position of the uracil moiety of HoN A NG 
2'-deoxyuridylic acid so that C-5 can act as a carbanion in attack on A | + ale 
the methylene carbon of N°,N!°-methylene-THF (see accompanying H^ NN N“ SCH, 
figure). Regeneration of free enzyme then occurs through loss of the o l |10 
C-5 H atom as H* and dissociation of product dTMP. If F replaces HC — a 
H at C-5 as in 2’-deoxy-5-fluorouridylate (FAUMP), the enzyme is R 


immobilized in a very stable ternary [enzyme : FAUMP: methylene- 
THF] complex and effectively inactivated. Enzyme inhibitors like 
FdUMP whose adverse properties are elicited only through direct 
participation in the catalytic cycle are variously called mechanism- 
based inhibitors, suicide substrates, or Trojan horse substrates. 


The effect of the 5-fluoro substitution on the mechanism of action of 
thymidylate synthase. An enzyme thiol group (from a Cys side chain) 
ordinarily attacks the 6-position of (UMP so that C-5 can react as a 
carbanion with N°, N!°-methylene-THF. Normally, free enzyme is regen- 
erated following release of the hydrogen at C-5 as a proton. Because 
release of fluorine as F* cannot occur, the ternary (three-part) complex of 


[enzyme: fluorouridylate : methylene-THF] is stable and persists, preventing O N S R 
enzyme turnover. (The N°, N-methylene-THF structure is given in abbrevi- og | 
ated form.) Cys —(E) 


Ternary complex 


Foundational Biochemistry Things You Should Know after Reading Chapter 26. 


e Nucleotides are key participants in metabolism and essential building © THF, the active coenzyme form of the B vitamin folate, is a coenzyme 
blocks for RNA and DNA synthesis. in one-carbon transfer reactions. 

e Nearly all organisms can carry out de novo synthesis of purine and © Folate analogs are used as drugs in cancer chemotherapy and as 
pyrimidine nucleotides. antimicrobial agents. 

© The nine atoms that compose the purine ring come from five different © IMP is the first purine nucleotide made by the de novo pathway of 
precursors: glutamine, glycine, CO, aspartate, and N!°-formyl-THF. purine synthesis. 

e 


The atoms that form the purine ring are assembled on a ribose-5-P © Only seven proteins are needed to carry out the 11 reactions of 
platform. purine biosynthesis, because some of these reactions are catalyzed by 
multifunctional polypeptides. 
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e AMP and GMP are synthesized from IMP by distinct two-step meta- 
bolic pathways. 


e Inhumans, purinosomes catalyze de novo purine biosynthesis. 


e The purine biosynthetic pathway is regulated through feedback inhi- 
bition of ribose-5-P pyrophosphokinase (reaction 1) and glutamine- 
PRPP amidotransferase (reaction 2). 


e ATP-dependent kinases form NDPs and NTPs from NMPs. 


e Purine salvage pathways capture purines released in catabo- 
lism and relink them to ribose-5-P through the action of purine 
phosphoribosyltransferases. 


e Some parasitic organisms rely on purine salvage to obtain purine 
nucleotides. 


e The major pathways of purine catabolism lead to uric acid. 


e The purine nucleoside cycle in skeletal muscle acts as an anaplerotic 
route to replenish citric acid cycle intermediates. 


e Xanthine oxidase is a molybdo-enzyme that oxidizes hypoxanthine 
and xanthine to uric acid. 


e Gout is a disease caused by excess uric acid. 
e Different animals oxidize uric acid to form various excretory products. 
e Pyrimidines are synthesized de novo from carbamoyl-P and aspartate. 


e —Incontrast to purine biosynthesis, pyrimidines are synthesized as the free 
base, orotate, and then ribose-5-P is added, using PRPP as the substrate. 


e Both purine and pyrimidine biosynthesis in mammals are prime 
examples of metabolic channeling by multifunctional enzymes. 


UMP is the precursor to both UTP and CTP. 


Pyrimidine biosynthesis is regulated at carbamoyl-P synthetase II in 
mammals and at aspartate transcarbamoylase in bacteria. 


Pyrimidines are degraded to form B-alanine, NH3, and CO2. The 
carbons of B-alanine are the ones contributed by aspartate during 
pyrimidine synthesis. 


Deoxyribonucleotide synthesis involves reduction at the 2’-position 
of the ribose ring of NDPs; the reducing power comes from NADPH. 


Deoxyribonucleotides are synthesized through the action of ribonu- 
cleotide reductase. 


Eukaryotic ribonucleotide reductase has three nucleotide-binding sites: the 
catalytic (or C) site, the specificity (or S) site, and the activity (or A) site. 


The S site and the A site of ribonucleotide reductase are allosteric sites. 


Thioredoxin, reduced by NADPH through the action of thioredoxin 
reductase, is the direct reductant for ribonucleotide reductase. 


Reduction of NDPs to dNDPs by ribonucleotide reductase proceeds 
via a free radical mechanism. 


ATP is an allosteric activator of ribonucleotide reductase and dATP 
is an allosteric inhibitor. 


The substrate for dTMP synthesis is (UMP. 
Two dNDPs lead to formation of (UMP: dUDP and dCDP. 


Thymidylate synthase catalyzes the formation of dTMP from dUMP, 
using N°, N!°-methylene-THF as methyl donor. 


5-Fluoro-substituted pyrimidines are useful drugs in the treatment of 
cancer, fungal infections, and malaria. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Metabolic Origin of the Atoms in Purine and Pyrimidine Rings Draw 
the purine and pyrimidine ring structures, indicating the metabolic 
source of each atom in the rings. 


2. The Energy Cost of Nucleotide Biosynthesis Starting with glutamine, 
aspartate, glycine, CO, and N!°-formyl-THF, how many ATP equiva- 
lents are expended in the synthesis of (a) ATP, (b) GTP, 

(c) UTP, and (d) CTP? 


3. Biosynthesis of AMP or GMP Consider the structures shown and 
organize them in the order they may appear in the biosynthetic path- 
way for AMP or GMP. 


O O 
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\ \ HN \ 
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Ribose-5P Ribose-5P 
Structure D Structure E 


4. 


ATP Consumption in Biosynthesis of AMP and GMP As shown in 
Figures 26.3 and 26.5, the biosynthesis of adenine consumes many 
ATP equivalents. How, then, can a net amount of adenine ever be 
generated? 


Allosteric Regulation of Purine and Pyrimidine Biosynthesis Draw a 
diagram illustrating the key points of regulation in (a) the biosynthe- 
sis of IMP, AMP, and GMP; (b) E. coli pyrimidine biosynthesis; and 
(c) mammalian pyrimidine biosynthesis. 


Inhibition of Purine and Pyrimidine Metabolism by Pharmacological 
Agents Indicate which reactions of purine or pyrimidine metabolism 
are affected by the inhibitors (a) azaserine, (b) methotrexate, 

(c) sulfonamides, (d) allopurinol, and (e) 5-fluorouracil. 


The Metabolic Role of UDP Since dUTP is not a normal component 
of DNA, why do you suppose ribonucleotide reductase has the 
capacity to convert UDP to dUDP? 


Allosteric Regulation of Ribonucleotide Reductase by ATP and Deoxynu- 
cleotides Describe the underlying rationale for the regulatory effects 
exerted on ribonucleotide reductase by ATP, dATP, dTTP, and dGTP. 


Features of Ribonucleotide Reductase (Integrates with Chapters 21 
and 22) 


a. The generation of a tyrosyl radical is crucial in the mechanism 
of ribonucleotide reductase. What other crucially important bio- 
chemical reaction requires the presence of a tyrosyl radical in the 
active site of an enzyme? 

b. Apart from being a source of ATP, what are the (dual) important 
functions of glucose in the generation of deoxynucleotides for 
the synthesis of DNA? 
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20. 


Nucleotide Catabolism as a Source of Metabolic Energy (Integrates 
with Chapters 18-20 and 22.) By what pathway(s) does the ribose 
released upon nucleotide degradation enter intermediary metabolism 
and become converted to cellular energy? How many ATP equivalents 
can be recovered from one equivalent of ribose? 


Convergence of Purine and Histidine Biosynthesis (Integrates with 
Chapter 25.) At which steps does the purine biosynthetic pathway 
resemble the pathway for biosynthesis of the amino acid histidine? 


The Mechanism of Action of Purine Biosynthetic Enzymes Write 
reasonable chemical mechanisms for steps 6, 8, and 9 in purine 
biosynthesis (see Figure 26.3). 


The Anaplerotic Purine Nucleoside Cycle of Skeletal Muscle Write a 
balanced equation for the conversion of aspartate to fumarate by the 
purine nucleoside cycle in skeletal muscle. 


The Catabolism of Uric Acid Write a balanced equation for the oxida- 
tion of uric acid to glyoxylic acid, CO», and NH;, showing each step 
in the process and naming all of the enzymes involved. 


The Allosteric Kinetics of Aspartate Transcarbamoylase (Integrates 
with Chapter 15.) E. coli aspartate transcarbamoylase (ATCase) 
displays classic allosteric behavior. This a58¢ enzyme is activated by 
ATP and feedback-inhibited by CTP. In analogy with the behavior of 
glycogen phosphorylase shown in Figure 15.15, illustrate the allosteric 
v versus [aspartate] curves for ATCase (a) in the absence of effectors, 
(b) in the presence of CTP, and (c) in the presence of ATP. 


The Functional Organization of a Heteromeric Allosteric Enzyme 
(Integrates with Chapter 15.) Unlike its allosteric counterpart glycogen 
phosphorylase (an œz enzyme), E. coli ATCase has a heteromeric 
(a686) organization. The a-subunits bind aspartate and are 

considered catalytic subunits, whereas the B-subunits bind CTP 

or ATP and are considered regulatory subunits. How would you 
describe the subunit organization of ATCase from a functional point 
of view? 


The Energy Cost of dTTP Synthesis (Integrates with Chapter 20.) 
Starting from HCO,, glutamine, aspartate, and ribose-5-P, how 
many ATP equivalents are consumed in the synthesis of dTTP in 

a eukaryotic cell, assuming dihydroorotate oxidation is coupled 

to oxidative phosphorylation? How does this result compare with the 
ATP costs of purine nucleotide biosynthesis calculated in problem 2? 


The Chemistry of dTMP Synthesis and the Substrate-Binding Site of 
Thymidylate Synthase Write a balanced equation for the synthesis 

of dTMP from UMP and N°,N!°-methylene-THF. Thymidylate syn- 
thase has four active-site arginine residues (Arg”, Arg!”*’, Arg!”, 
and Arg”!®) involved in substrate binding. Postulate a role for the side 
chains of these Arg residues. 


Representatives of the PRT Family of Enzymes in Purine Synthesis 
Enzymes that bind phosphoribosyl-5-phosphate (PRPP) have a com- 
mon structural fold, the PRT fold, which unites them as a structural 
family. PRT here refers to the phosphoribosy] transferase activity 
displayed by some family members. Typically, in such reactions, PP; 
is displaced from PRPP by a nitrogen-containing nucleophile. Several 
such reactions occur in purine metabolism. Identify two such reac- 
tions where the enzyme involved is likely to be a PRT family member. 


Explore the Structure of DHFR Reductase The crystal structure of E. 
coli dihydrofolate reductase (DFR) with NADP* and folate bound 
can be found in the Protein Data Bank (https//www.rcsb.org) as file 
7DER. Go to this website, enter “7DFR” in the search line, and click 
on one of the viewers under “Display options” when the 7DFR page 


21. 


22. 


23. 


24. 


25. 
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Problems 


comes up. Explore the graphic of the DFR structure to visualize how 
its substrates are bound. (If you hold down the left button on your 
mouse and move the cursor over the image, you can rotate the struc- 
ture to view it from different perspectives.) Note in particular the 
spatial relationship between the nicotinamide ring of NADP* and 
the pterin ring of folate. How might substrate juxtaposition facilitate 
DFR reductase function? Note also the location of polar groups on 
the two substrates in relation to the DFR structure. 


Explore the Structure of Aspartate Transcarbamoylase, an Allosteric 
Enzyme E. coli aspartate transcarbamoylase is an allosteric enzyme 
(see problem 12) composed of six catalytic (C) subunits and six regu- 
latory (R) subunits. Protein Data Bank file 1RAA shows one-third 
of the ATCase holoenzyme (two C subunits and two R subunits; 
CTP molecules are bound to the R subunits). Explore this structure 
using one of the display options. What secondary structural motif 
dominates the R-subunit structure? Protein Data Bank file 2IPO also 
shows one-third of the ATCase holoenzyme (two C subunits and two 
R subunits), but in this structure molecules of the substrate analog 
N-(2-phosphonoacetyl)-L-asparagine are bound to the C subunits. 
Explore this structure using one of the display options. Note the 
distance separating the ATCase active site from its allosteric site. Inter- 
pret what you see in terms of the Monod—Wyman—Changeux model 
for allosteric regulation (see Chapter 15). Which of these structures 
corresponds to the MWC R-state, and which corresponds to the 
T-state? 


Folic Acid and Pregnancy Pregnant women are recommended to take 
in extra amounts of folic acid to benefit the growing fetus. Give at 
least one reason why this recommendation is made. 


Nucleotide Biosynthesis and Anticancer Agents Analogs of folate or 
fluoro-substituted pyrimidines can be used as anticancer agents. Such 
agents act as potent inhibitors of key enzymes functioning in the 
nucleotide biosynthetic pathways. However, chemotherapy with any 
such agent does lead to serious side effects. Explain why these side 
effects would occur and why, despite the side effects, the chemothera- 
peutic agents are still useful. 


Allosteric Regulation of Purine Biosynthesis Examine Figure 26.6, 
and predict the relative rates of the regulated reactions in the purine 
biosynthetic pathway from ribose-5-P to GMP and AMP under con- 
ditions in which GMP levels are very high. 


Glucose as a Source of Carbon in THF-Mediated 1-Carbon Transfers 
Decide from Figures 18.1, 25.31, 26.26, and the Deeper Look box 
“Tetrahydrofolate and One-Carbon Units” in Section 26.2 which carbon 
atom(s) in glucose would be most likely to end up as the 5-CH; carbon in 
dTMP. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


26. 


Using the PSAFE Module for CTP synthetase, describe how 
glutamine binding occurs, and explain how His” regulates 
ammonium diffusion. 


27. Using the PSPAFE Module for HGPRT, discuss the putative function 


of the flexible loop. Describe the difference between the loop-open 
and loop-closed conformations. 


Think-Pair-Share Question 


Briefly review all previous chapters and come up with as many functions 
of adenine in biochemistry as you can. 
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Metabolic Integration 
and Organ Specialization 


Cells are systems in a dynamic steady state, maintained by a constant flux of nutrients that 
serve as energy sources or as raw material for the maintenance of cellular structures. 
Catabolism and anabolism are ongoing, concomitant processes. What principles underlie the 
integration of catabolism and energy production with anabolism and energy consumption? 
How is metabolism integrated in complex organisms with multiple organ systems? 


E the preceding chapters, we have explored the major metabolic pathways— 
glycolysis, the citric acid cycle, electron transport and oxidative phosphorylation, 
photosynthesis, gluconeogenesis, fatty acid oxidation, lipid biosynthesis, amino acid 
metabolism, and nucleotide metabolism. Several of these pathways are catabolic and 
serve to generate chemical energy useful to the cell; others are anabolic and use this 
energy to drive the synthesis of essential biomolecules. Despite their opposing pur- 
poses, these reactions typically occur at the same time as nutrient molecules are 
broken down to provide the building blocks and energy for ongoing biosynthesis. 
Cells maintain a dynamic steady state through processes that involve considerable 
metabolic flux. The metabolism that takes place in just a single cell is so complex that 
it resists detailed quantitative description. However, an appreciation of overall rela- 
tionships can be achieved by stepping back and considering intermediary metabolism 
at a systems level of organization. 


27.1 | Can Systems Analysis Simplify 


the Complexity of Metabolism? 
The metabolism of a typical heterotrophic cell can be portrayed by a schematic diagram 


consisting of three interconnected functional blocks: (1) catabolism, (2) anabolism, and 
(3) macromolecular synthesis and growth (Figure 27.1). 
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Study of an enzyme, a reaction, or a 
sequence can be biologically relevant only 
if its position in the hierarchy of function 
is kept in mind. 


Daniel E. Atkinson 
Cellular Energy Metabolism and Its Regulation 
(1977) 


< The Washington, D.C., Metro map. The coordinated 
flow of passengers along different transit lines is an apt 
metaphor for metabolic regulation. 


27.1 Can Systems Analysis Simplify 

the Complexity of Metabolism? 
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Is There a Good Index of Cellular 
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Figure 27.1 Block diagram of intermediary a-KG 
metabolism. The beige-shaded box (top) illustrates Succinyl-CoA 
metabolic blocks found in photosynthetic organisms. Oxaloacetate 


1. Catabolism Energy-yielding nutrients are oxidized to CO, and H,O in catabolism, 
and most of the electrons liberated are passed to oxygen via an electron-transport 
pathway coupled to oxidative phosphorylation, resulting in the formation of ATP. 
Some electrons go to reduce NADP* to NADPH, the source of reducing power for 
anabolism. Glycolysis, the citric acid cycle, electron transport and oxidative phos- 
phorylation, and the pentose phosphate pathway are the principal pathways within 
this block. The metabolic intermediates in these pathways also serve as substrates for 
processes within the anabolic block. 


Fat ——- 


Protein —> 


2. Anabolism The biosynthetic reactions that form the many cellular molecules col- 
lectively comprise anabolism. For thermodynamic reasons, the chemistry of anabo- 
lism is more complex than that of catabolism (that is, it takes more energy [and often 
more steps] to synthesize a molecule than can be produced from its degradation). 
Metabolic intermediates derived from glycolysis and the citric acid cycle are the 
precursors for this synthesis, with NADPH supplying the reducing power and ATP, 
the coupling energy. 


3. Macromolecular Synthesis and Growth The organic molecules produced in anabo- 
lism are the building blocks for creating macromolecules. Like anabolism, macromo- 
lecular synthesis is driven by energy from ATP, although indirectly in some cases: 
GTP is the principal energy source for protein synthesis, CTP for phospholipid 
synthesis, and UTP for polysaccharide synthesis. However, keep in mind that ATP is 
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the principal phosphoryl donor for formation of GTP, CTP, and UTP from GDP, 
CDP, and UDP, respectively. Macromolecules are the agents of biological function 
and information—proteins, nucleic acids, lipids that self-assemble into membranes, 
and so on. Growth can be represented as cellular accumulation of macromolecules 
and the partitioning of these materials of function and information into daughter 
cells in the process of cell division. 


27.1a Only a Few Intermediates Interconnect the Major Metabolic Systems 


Despite the complexity of processes taking place within each block, the connections 
between blocks involve a limited number of substances. Just ten or so kinds 
of catabolic intermediates from glycolysis, the pentose phosphate pathway, and the 
citric acid cycle serve as the raw material for most of anabolism: four kinds of sugar 
phosphates (triose-P, tetrose-P, pentose-P, and hexose-P), three a-keto acids (pyru- 
vate, oxaloacetate, and a-ketoglutarate), two coenzyme A derivatives (acetyl-CoA 
and succinyl-CoA), and PEP (phosphoenolpyruvate). 


27.1b ATP and NADPH Couple Anabolism and Catabolism 


Metabolic intermediates are consumed by anabolic reactions and must be continu- 
ously replaced by catabolic processes. In contrast, the energy-rich compounds ATP 
and NADPH are recycled rather than replaced. When these substances are used in 
biosynthesis, the products are ADP and NADP*, and ATP and NADPH are regener- 
ated during catabolism. ATP and NADPH are unique in that they are the only 
compounds whose purpose is to couple the energy-yielding processes of catabolism to 
the energy-consuming reactions of anabolism. Certainly, other coupling agents serve 
essential roles in metabolism. For example, NADH and [FADH,] participate in the 
transfer of electrons from substrates to O» during oxidative phosphorylation. However, 
these reactions are solely catabolic, and the functions of NADH and [FADH)] are 
fulfilled within the block called catabolism. 


27.1c Phototrophs Have an Additional Metabolic 
System—The Photochemical Apparatus 


The systems in Figure 27.1 reviewed thus far are representative only of metabolism as 
it exists in aerobic heterotrophs. The photosynthetic production of ATP and NADPH 
in photoautotrophic organisms entails a fourth block, the photochemical system 
(Figure 27.1). This block consumes H,O and releases O2. When this fourth block oper- 
ates, energy production within the catabolic block can be largely eliminated. Yet an- 
other block, one to account for the fixation of carbon dioxide into carbohydrates, is 
also required for photoautotrophs. The inputs to this fifth block are the products of the 
photochemical system (ATP and NADPH) and CO, derived from the environment. 
The carbohydrate products of this block may enter catabolism but not primarily for 
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Stoichiometry is the measurement of the amounts 

of chemical elements and molecules involved in 
chemical reactions (from the Greek stoicheion, meaning 
“element,” and metria, meaning “measure”). 


energy production. In photoautotrophs, carbohydrates are fed into catabolism to gen- 
erate the metabolic intermediates needed to supply the block of anabolism. Although 
these diagrams are oversimplifications of the total metabolic processes in heterotrophic 
or phototrophic cells, they are useful illustrations of functional relationships between 
the major metabolic subdivisions. This general pattern provides an overall perspective 
on metabolism, making its purpose easier to understand. 


27.2 | What Underlying Principle Relates ATP Coupling 
to the Thermodynamics of Metabolism? 


Virtually every metabolic pathway either consumes or produces ATP. The amount 
of ATP involved—that is, the stoichiometry of ATP synthesis or hydrolysis—lies 
at the heart of metabolic relationships. The overall thermodynamic efficiency of 
any metabolic sequence, be it catabolic or anabolic, is determined by ATP coupling. 
In every case, the overall reaction mediated by any metabolic pathway is energetically 
favorable because of its particular ATP stoichiometry. In the highly exergonic reactions 
of catabolism, much of the energy released is captured in ATP synthesis. In turn, the 
thermodynamically unfavorable reactions of anabolism are driven by energy released 
upon ATP hydrolysis. 


27.2a The Three Types of Stoichiometries in Cells 


The metabolic significance of ATP coupling is revealed by considering the three kinds 
of stoichiometry in cellular processes. The first two are fixed by the laws of chemistry, 
but the third is unique to living systems and reveals a fundamental difference between 
the inanimate world of chemistry and physics and the world of biological function, as 
shaped by evolution—that is, the world of living organisms. The fundamental differ- 
ence is the stoichiometry of ATP coupling. 


1. Reaction Stoichiometry This is simple chemical stoichiometry—the number of each 
kind of atom in any chemical reaction remains the same, and thus equal numbers must 
be present on both sides of the equation. This requirement holds even for a process as 
complex as cellular respiration: 


C6H120¢ + 6 O- —> 6 CO, + 6 HO 


The six carbons in glucose appear as six CO», the 12 H of glucose appear as the 12 H 
in six molecules of water, and the 18 oxygens are distributed between CO, and H30. 


2. Obligate Coupling Stoichiometry Cellular respiration is an oxidation-reduction pro- 
cess, and the oxidation of glucose is coupled to the reduction of NAD* and [FAD]. 
(Brackets here denote that the relevant FAD is covalently linked to succinate dehydro- 
genase; see Chapter 20.) The NADH and [FADH,] thus formed are oxidized in the 
electron-transport pathway: 
(a) CeH}20¢6 + 10 NAD* + 2 [FAD] + 6 H:O —> 

6 CO, + 10 NADH + 10 H* + 2 [FADH;] 


(b) 10 NADH + 10 H* + 2 [FADH,] + 6 O, —> 12 H,O + 10 NAD* + 2 [FAD] 


Sequence (a) accounts for the oxidation of glucose via glycolysis and the citric acid 
cycle. Sequence (b) is the overall equation for electron transport per glucose. The 
stoichiometry of coupling by the biological e~ carriers, NAD* and FAD, is fixed by 
the chemistry of electron transfer; each of the coenzymes serves as ane pair acceptor. 
Reduction of each O atom takes an e~ pair. Metabolism must obey these facts of 
chemistry: Biological oxidation of glucose releases 12 e` pairs, creating a requirement 
for 12 equivalents of e` pair acceptors, which transfer the electrons to 12 O atoms. By 
evolutionary chance, NAD*/NADH and FAD/FADH) carry these electrons, but the 
stoichiometry is fixed by the chemistry. 
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3. Evolved Coupling Stoichiometries The participation of ATP is fundamentally differ- 
ent from the role played by pyridine nucleotides and flavins. The stoichiometry of 
adenine nucleotides in metabolic sequences is not fixed by chemical necessity. Instead, 
the stoichiometries we observe are the consequences of evolutionary design. The over- 
all equation for cellular respiration, including the coupled formation of ATP by oxida- 
tive phosphorylation, is 


Ce6H.0¢ + 6 O2 + 38 ADP + 38 P; —> 6 CO, + 38 ATP + 44 H,O 


The stoichiometry of ATP formation, 38 ADP + 38 P, —>38 ATP + 38 H,O, cannot 
be predicted from any chemical considerations. The value of 38 ATP is an end result 
of biological adaptation. It is a trait that evolved through interactions between chem- 
istry, heredity, and the environment over the course of evolution. Like any evolved 
character, ATP stoichiometry is the result of compromise. The final trait is one par- 
ticularly suited to the fitness of the organism. 

The number 38 is not magical. The value of 38 was established a long time ago in 
evolution, when the prevailing atmospheric conditions and the competitive situation 
were undoubtedly very different from those today. The significance of this number is 
that it provides a high yield of ATP for each glucose molecule, yet the yield is still low 
enough that essentially all of the glucose is metabolized. 

This ATP yield of 38 for cellular respiration is for the reaction within an uncompart- 
mentalized (prokaryotic) cell. In eukaryotes, where much of the cellular respiration is 
compartmentalized within mitochondria, mitochondrial ADP/ATP exchange imposes 
a metabolic cost on the proton gradient of one H* per ATP, so the overall yield of ATP 
per glucose is 32-34, not 38 (see Chapter 20). 


27.2b ATP Coupling Stoichiometry Determines 
the Keq for Metabolic Sequences 


The fundamental biological purpose of ATP as an energy-coupling agent is to drive 
thermodynamically unfavorable reactions. In effect, the energy release accompanying 
ATP hydrolysis is transmitted to the unfavorable reaction so that the overall free en- 
ergy change for the coupled process is negative (that is, favorable). The involvement of 
ATP serves to alter the free energy change for a reaction; or to put it another way, the 
role of ATP is to change the equilibrium ratio of [reactants] to [products] for a reac- 
tion. (See the A Deeper Look box at the end of Section 3.7: “ATP Changes the Keq by 
a Factor of 10%.”) 

Another way of viewing these relationships is to note that, at equilibrium, the con- 
centrations of ADP and P; will be vastly greater than that of ATP because A G°’ for ATP 
hydrolysis is a large negative number. 

Since AG°’ = —30.5 kJ/mol, In Keq = 12.3. So Keg = 2.2 X 10°. Choosing starting 
conditions of [ATP] = 8 mM, [ADP] = 8 mM, and [P;] = 1 mM, we can assume that, at 
equilibrium, [ATP] has fallen to some insignificant value x, [ADP] = approximately 16 
mM, and [P;] = approximately 9 mM. The concentration of ATP at equilibrium, x, then 
calculates to be about 1 nM. 

However, the cell where this reaction is at equilibrium is a dead cell. Living cells 
break down energy-yielding nutrient molecules to generate ATP. It is significant to note 
that the catabolism of glucose by glycolysis requires the investment of two ATP per 
glucose before any energy yields are realized. Further, fatty acid oxidation depends on 
fatty acid activation by acyl-CoA synthetase, a reaction that consumes two ATP 
equivalents. Thus, a cell deficient in ATP is doomed. Nevertheless, catabolism of energy- 
yielding nutrients proceeds with an overall decrease in free energy, and much of this 
energy is captured in ATP synthesis. Kinetic controls over the rates of the catabolic 
pathways are designed to ensure that the [ATP]/([ADP][P;]) ratio is maintained very 
high. The cell, by employing kinetic controls over the rates of metabolic pathways, 
maintains a very high [ATP|/([ADP][P;]) ratio so that ATP hydrolysis can serve as the 
driving force for virtually all biochemical events. 
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Adenylate 
kinase 
ATP + AMP ——— 2apP 


0.2 0.4 0.6 0.8 1.0 
Energy charge 


Figure 27.2 Relative concentrations of AMP, ADP, 
and ATP as a function of energy charge. (This graph 
was constructed assuming that the adenylate kinase 
reaction is at equilibrium and that A G°’ for the 
reaction is —473 J/mol; Keq = 1.2.) 


27.2c ATP Has Two Metabolic Roles 
The role of ATP in metabolism is two-fold: 


1. It serves in a stoichiometric role to establish large equilibrium constants for meta- 
bolic conversions and to render metabolic sequences thermodynamically favorable. 
This is the role referred to when we call ATP the energy currency of the cell. 

2. ATP also serves as an important allosteric effector in the kinetic regulation of 
metabolism. Its concentration (relative to those of ADP and AMP) is an index 
of the energy status of the cell and determines the rates of regulatory enzymes 
situated at key points in metabolism, such as PFK in glycolysis and FBPase in 
gluconeogenesis. 


27.3 | Is There a Good Index of Cellular Energy Status? 


Energy transduction and energy storage in the adenylate system—ATP, ADP, and 
AMP-—lie at the very heart of metabolism. The amount of ATP a cell uses per minute 
is roughly equivalent to the steady-state amount of ATP it contains. Thus, the meta- 
bolic lifetime of an ATP molecule is brief. ATP, ADP, and AMP are all important 
effectors in exerting kinetic control on regulatory enzymes situated at key points 
in metabolism, so uncontrolled changes in their concentrations could have drastic 
consequences. The regulation of metabolism by adenylates in turn requires close 
control of the relative concentrations of ATP, ADP, and AMP. Some ATP-consuming 
reactions produce ADP; PFK and hexokinase are examples. Others lead to the forma- 
tion of AMP, as in fatty acid activation by acyl-CoA synthetases: 


Fatty acid + ATP + coenzyme A —> AMP + PP; + fatty acyl-CoA 


27.3a Adenylate Kinase Interconverts ATP, ADP, and AMP 


Adenylate kinase catalyzes the reversible phosphorylation of AMP by ATP (see 
Chapter 18), linking all three members of the adenylate pool—AMP, ADP, and ATP: 


ATP + AMP ==? ADP 


The free energy of hydrolysis of a phosphoanhydride bond is essentially the same in 
ADP and ATP (see Chapter 3), and the standard free energy change for this reaction 
is close to zero. 


27.3b Energy Charge Relates the ATP Levels 
to the Total Adenine Nucleotide Pool 


The role of the adenylate system is to provide phosphoryl groups at high group-transfer 
potential to drive thermodynamically unfavorable reactions. The capacity of the ade- 
nylate system to fulfill this role depends on how fully charged it is with phosphoric 
anhydrides. Energy charge is an index of this capacity: 


1 2 [ATP] + [ADP] ) 
2\ [ATP] + [ADP] + [AMP] 


The denominator represents the total adenylate pool ([ATP] + [ADP] + [AMP]); the 
numerator is the number of phosphoric anhydride bonds in the pool, two for each 
ATP and one for each ADP. The factor 5 normalizes the equation so that energy 
charge, or E.C., has the range 0 to 1.0. If all the adenylate is in the form of ATP, 
E.C. = 1.0, and the potential for phosphoryl transfer is maximal. At the other extreme, 
if AMP is the only adenylate form present, E.C. = 0. It is reasonable to assume that 
the adenylate kinase reaction is never far from equilibrium in the cell. Then, the 
relative amounts of the three adenine nucleotides are fixed by the energy charge. 
Figure 27.2 shows the relative changes in the concentrations of the adenylates as 
energy charge varies from 0 to 1.0. 


Energy charge = 
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27.3c Key Enzymes Are Regulated by Energy Charge 


Regulatory enzymes typically respond in reciprocal fashion to adenine nucleotides. 
For example, PFK is stimulated by AMP and inhibited by ATP. If the activities of 
various regulatory enzymes are examined in vitro as a function of energy charge, an 
interesting relationship appears. Regulatory enzymes in energy-producing catabolic 
pathways show greater activity at low energy charge, but the activity falls off abruptly 
as E.C. approaches 1.0. In contrast, regulatory enzymes of anabolic sequences are not 
very active at low energy charge, but their activities increase exponentially as E.C. 
nears 1.0. These contrasting responses are termed R, for ATP-regenerating, and U, for 
ATP-utilizing (Figure 27.3). Regulatory enzymes such as PFK and pyruvate kinase in 
glycolysis follow the R response curve as E.C. is varied. Note that PFK itself is an 
ATP-utilizing enzyme, using ATP to phosphorylate fructose-6-phosphate to yield 
fructose-1,6-bisphosphate. Nevertheless, because PFK acts physiologically as the 
valve controlling the flux of carbohydrate down the catabolic pathways of cellular 
respiration that lead to ATP regeneration, it responds as an R enzyme to energy 
charge. Regulatory enzymes in anabolic pathways, such as acetyl-CoA carboxylase, 
which initiates fatty acid biosynthesis, respond as U enzymes. 

The overall purposes of the R and U pathways are diametrically opposite in terms of 
ATP involvement. Note in Figure 27.3 that the R and U curves intersect at a rather high 
E.C. value. As E.C. increases past this point, R activities decline precipitously and U 
activities rise. That is, when E.C. is very high, biosynthesis is accelerated, whereas 
catabolism diminishes. The consequence of these effects is that ATP is used up faster 
than it is regenerated, and so E.C. begins to fall. As E.C. drops below the point of 
intersection, R processes are favored over U. Then, ATP is generated faster than it is 
consumed, and E.C. rises again. The net result is that the value of energy charge 
oscillates about a point of steady state (Figure 27.3). The experimental results obtained 
from careful measurement of the relative amounts of AMP, ADP, and ATP in living 
cells reveals that normal cells have an energy charge in the neighborhood of 0.85 to 0.88. 
Maintenance of this steady-state value is one criterion of cell health and normalcy. ATP 
levels are always substantially greater than ADP or AMP levels. Further, ATP levels in 
cells are virtually constant. The consequence of these considerations is that changes in 
the amounts of ADP and AMP are far more important regulatory signals than [ATP]. 


27.3d Phosphorylation Potential Is a Measure of Relative ATP Levels 


Because energy charge is maintained at a relatively constant value in normal cells, it 
is not an informative index of cellular capacity to carry out phosphorylation reactions. 
The relative concentrations of ATP, ADP, and P; do provide such information, and a 
function called phosphorylation potential has been defined in terms of these 
concentrations: 


ADP + P; — ATP + H-O 


Phosphorylation potential, T, is equal to [ATP]/([ADP][P;]). 

Note that this expression includes a term for the concentration of inorganic phos- 
phate. [P;] has substantial influence on the thermodynamics of ATP hydrolysis. In con- 
trast with energy charge, phosphorylation potential varies over a significant range as the 
actual proportions of ATP, ADP, and P; in cells vary in response to metabolic state. 
T ranges from 200 to 800 M~!, or more, with higher levels signifying more ATP and 
correspondingly greater phosphorylation potential. Since I varies as [ATP], [ADP], and 
[PI ] vary, it is a better index of cellular energy status than energy charge. 


27.4 | How Is Overall Energy Balance Regulated in Cells? 


AMP-activated protein kinase (AMPK) is the cellular energy sensor. Metabolic inputs 
to this sensor determine whether its output, protein kinase activity, takes place. When 
cellular energy levels are high, as signaled by high ATP concentrations, AMPK is 


— 


R (ATP- 
generating) 


Point of metabolic ~ _\ 
steady state 


Reaction rate 


U (ATP-utilizing) 


Energy charge 


Figure 27.3 Responses of regulatory enzymes to 
variation in energy charge. 
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Figure 27.5 Core structure of the mammalian 
AMPK heterotrimer. The a-subunit is green, the 
B-subunit is yellow, and the y-subunit is white. The 
y-subunit has three AMP-binding sites; the three 
molecules of bound AMP are shown, one in red, 
one in pink. ATP and ADP can compete with AMP 
for binding at the AMP sites shown as occupied by 


red and pink AMP molecules. The molecule of AMP 


shown in blue cannot be exchanged for ATP or ADP 
(PDB id = 2V8Q). 


inactive. When cellular energy levels are depleted, as signaled by high [AMP], AMPK 
is allosterically activated and phosphorylates many targets controlling cellular energy 
production and consumption. Recall that, due to the nature of the adenylate kinase 
equilibrium (see Section 18.3), AMP levels increase exponentially as ATP levels 
decrease. AMP is an allosteric activator of AMPK, whereas ATP at high levels acts as 
an allosteric inhibitor by displacing AMP from the allosteric site. Thus, competition 
between AMP and ATP for binding to the AMPK allosteric sites determines the 
activity of AMPK. Activation of AMPK (1) sets in motion catabolic pathways leading 
to ATP synthesis and (2) shuts down pathways that consume ATP energy, such as 
biosynthesis and cell growth. In effect, AMPK acts like a metabolic fuel gauge. 


27.4a The Structure and Regulation of AMPK 


AMPK is an aBy-heterotrimer (Figure 27.4). The a-subunit is the catalytic subunit; it 
has an N-terminal Ser/Thr protein kinase domain and a C-terminal By-binding 
domain. The B-subunit has at its C terminus an ay-binding domain. The y-subunit is 
the regulatory subunit; it has a pair of allosteric sites where adenine nucleotides bind. 
These sites are located toward its C terminus within four CBS domains (so named for 
their homology to cystathionine-B-synthase). These CBS domains act in pairs to form 
structures known as Bateman modules. Bateman modules CBS! and CBS3 reversibly 
bind AMP, ADP, or ATP; module CBS2 is never occupied; and CBS4 is always occu- 
pied by AMP. AMP binding to the CBS1 and CBS3 sites is highly cooperative, such that 
binding of AMP to one module markedly enhances AMP binding at the other. This 
cooperativity renders AMPK exquisitely sensitive to changes in AMP concentration. 

AMP binding to AMPK increases its protein kinase activity by more than 1000-fold. 
The underlying mechanism involves a pseudosubstrate sequence (see Section 15.4b) 
within CBS domain 2 that fits into the a-subunit catalytic site. When AMP binds to the 
Bateman modules, conformational changes in the y-subunit displace the pseudosubstrate 
sequence from the kinase catalytic site, freeing it to act. The structural relationships 
between the AMPK subunits can be seen in the core structure of the mammalian aBy 
complex (Figure 27.5). 

AMP activates AMPK in two ways: First, it is an allosteric activator; second, AMP 
(or ADP) binding favors phosphorylation of Thr!” within the a-subunit kinase domain. 
Phosphorylation of Thr!” is necessary for a-subunit protein kinase activity and is 
achieved by several upstream protein kinases (such as LKB1 and CamK KB), which are 
themselves constitutively active (that is, always active). Whether AMPK remains phos- 
phorylated depends on the activity of phosphoprotein phosphatases, and AMP (or ADP) 
exerts its favorable effects on Thr!” phosphorylation by inhibiting its dephosphorylation 
by these phosphatases. Thr!” lies within the activation loop of the kinase; activation 
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Figure 27.4 Domain structure of the AMP-activated protein kinase subunits. (Adapted from Figure 1 
in Hardie, D. G., Hawley, S. A., and Scott, J. W., 2006. AMP-activated protein kinase: Development 
of the energy sensor concept. Journal of Physiology 574:7-15.) 
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loops are common features of protein kinases whose activation requires phosphoryla- 
tion by other protein kinases. Both of these favorable actions are reversed if ATP dis- 
places AMP or ADP from the CBS1 and CBS3 allosteric sites. 

In actuality, AMPK is acting as an energy charge sensor, because the two exchange- 
able adenine nucleotide-binding sites on the y-subunit can bind either AMP, ADP, or 
ATP. AMP and ATP exert allosteric effects, but ADP does not. However, ADP occupa- 
tion of these sites prevents dephosphorylation of Thr!’?, whereas ATP favors Thr!” 
dephosphorylation. Thus, ADP binding to the y-subunit protects AMPK from dephos- 
phorylation, which would inactivate it. In its ADP-bound, Thr!’?-phosphorylated state, 
AMPK is poised to undergo allosteric activation during energy stress, when AMP levels 
rise to a level where AMP displaces ADP from the allosteric sites. Since the intracellular 
concentrations of ADP normally exceed those of AMP, competition between ADP 
and ATP for binding to the y-subunit is the likely determinant of the state of Thr!” 
phosphorylation. 


27.4b AMPK Targets Key Enzymes in Energy Production and Consumption 


Activation of AMPK leads to phosphorylation of many key enzymes in energy 
metabolism. Those involved in energy production that are activated upon 
phosphorylation by AMPK include phosphofructokinase-2 (PFK-2; see Chapter 22). 
In contrast to protein kinase A phosphorylation of PFK-2, AMPK phosphorylation 
of liver PFK-2 enhances fructose-2,6-bisphosphate synthesis, which in turn stimulates 
glycolysis. Enzymes involved in energy consumption that are down-regulated upon 
phosphorylation by AMPK include glycogen synthase (see Chapter 22), acetyl-CoA 
carboxylase (which catalyzes the committed step in fatty acid biosynthesis; see 
Chapter 24), and 3-hydroxy-3-methylglutaryl-CoA reductase, which carries out the 
key regulatory reaction in cholesterol biosynthesis (see Chapter 24). Further, AMPK 
phosphorylation of various transcription factors leads to diminished expression of 
genes encoding biosynthetic enzymes and elevated expression of catabolic genes. 
Activation of AMPK also enhances energy production by promoting mitochondrial 
biogenesis. 


27.4c AMPK Controls Whole-Body Energy Homeostasis 


Beyond these cellular effects, AMPK plays a central role in energy balance in multi- 
cellular organisms (Figure 27.6). AMPK in skeletal muscle is activated by hormones 
such as adiponectin and leptin, adipocyte-derived hormones that govern eating 
behavior and energy homeostasis (see section 27.6). Physical activity (exercise) also 
activates muscle AMPK. In turn, skeletal muscle AMPK activates glucose uptake, 
fatty acid oxidation, and mitochondrial biogenesis through its phosphorylation of 
metabolic enzymes and transcription factors that control expression of genes involved 
in energy production and consumption. AMPK’s actions in the liver lead to lowered 
ATP (energy) consumption through down-regulation of fatty acid synthesis, choles- 
terol synthesis, and gluconeogenesis. Metformin, a widely used drug for the treatment 
of type 2 diabetes (see the Human Biochemistry box “Metformin—a Diabetes Drug 
with Multiple Actions” in Section 22.1), lowers blood glucose levels through inhibition 
of liver gluconeogenesis; metformin achieves this result through activation of AMPK. 
AMPK blocks insulin secretion by pancreatic B-cells; insulin is a hormone that favors 
energy storage (glycogen and fat synthesis). AMPK is also a master regulator of eat- 
ing behavior through its activity in the hypothalamus, the key center for regulation of 
food intake. These effects of AMPK are described in Section 27.6. 


27.4d AMPK Inhibits mTORCI, the Master Growth Regulator 

of Eukaryotic Cells 
mTOR (mechanistic Target of Rapamycin) is a master coordinator of metabolism in 
multicellular organisms. mTOR is a Ser/Thr protein kinase of the phosphoinositide- 
3-kinase (PIKK) family. Its unusual name stems from its inhibition by rapamycin, a 
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Figure 27.6 AMPK regulation of energy production 
and consumption in mammals. (Adapted from 
Figure 1 in Kahn, B. B., Alquier, T., Carling, D., and 
Hardie, D. G., 2005. AMP-activated protein kinase: 
Ancient energy gauge provides clues to modern 
understanding of metabolism. Cell Metabolism 
1:15-25.) 
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compound produced by bacteria discovered in a soil sample from Easter Island (Rapa 
Nui is the Polynesian name for Easter Island). Rapamycin inhibits the proliferation of 
eukaryotic cells. The physiological role of mTOR is to respond to environmental and 
hormonal signals and decide whether cells should grow and divide. mTOR is the cata- 
lytic subunit in two multisubunit complexes, mTORCI and mTORC2. The human 
mTORCI1 complex (958 kDa) consists of six subunits organized as two trimers 
(Figure 27.7). Each trimer contains an mTOR protein kinase (1349 residues) subunit 
and one each of the non-catalytic polypeptides Raptor (regulatory-associated protein 
of mTOR, 1343 residues) and mLST$8 (326 residues). As the central regulator of cell 
function through its control over anabolism and catabolism, the mTORCI complex is 
responsive to nutrient levels. If nutrient levels are high, mTORCI1 phosphorylates pro- 
teins controlling transcription and translation of genes encoding the enzymes of ana- 
bolic pathways necessary for growth, such as lipid and pyrimidine biosynthesis and 
ribosomal biogenesis. (mTORC2? regulates cell proliferation in a similar matter.) 


Localization to the Lysosomal Membrane Is Essential for mTORCI Activation The response 
of mTORCI to nutrients is complex and indirect. Both glucose and amino acids induce 
localization of mTORCI to the lysosome, the organelle whose array of digestive en- 
zymes is involved in nutrient recycling. Localization of mTORCI! to the cytosolic sur- 
face of lysosomes is an essential step in mTORCI activation and the promotion of cell 
growth in response to abundant nutrient levels. Recruitment of mTORCI to the lyso- 
some is carried out by the lysosomal membrane-associated Rag proteins. These proteins 
are members of the Ras-related GTP-binding protein (Rag) family of GTPases, consist- 
ing of four proteins in higher eukaryotes—RagA (313 residues), RagB (374 residues), 
RagC (399 residues), and Rag D (400 residues). 
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Rag GTPases function as heterodimers in which one monomer of either RagA or 
RagB partners with one monomer of RagC or RagD. When amino acids are abundant, 
RagA is loaded with GTP. If RagC has GDP bound, the RagA:GTP/RagC:GDP com- 
plex recruits mTORC1 to the lysosomal membrane. The reciprocal arrangement— 
RagA:GDP/RagC:GTP—cannot recruit mTORCI to the lysosome. 


The Response of mTORC1 to Amino Acid Availability Amino acid signaling to mTORC1 
is dependent on the lysosome-based GATOR (GTPase-Activating Protein towards 
Rags) complexes, which respond to multiple amino acids, including arginine, leucine, 
and S-adenosylmethionine. When these amino acids are abundant, the GATOR1 
complex inhibits the intrinsic GTPase activity of RagA, so GTP is the favored nucle- 
otide, and mTORCI! is recruited and activated. GATOR2 is an upstream regulator of 
GATORI that restores the GTPase-activating function of GATORI, leading to hy- 
drolysis of RagA-bound GTP, thereby shutting down RagA activation of mTORCI. 


Activation of mTORC]1 Depends on Lysosomal Rheb Once mTORCI becomes associated 
with the lysosome, the protein kinase activity of mMTORCI is allosterically activated 
through association with Rheb. Rheb (“Ras homologue enriched in brain”) is another 
small G protein (184 amino acid residues). Rheb is anchored to the lysosomal mem- 
brane by a C-terminal farnesyl group (see Figure 9.23c). Activation of the mTORCI1 
protein kinase leads to protein phosphorylations that stimulate the anabolic pathways 
driving cellular growth. 


The Response of mTORCI to Glucose When glucose levels are low, the cell’s energy 
status declines. Such conditions trigger activation of AMPK. AMPK disrupts the ac- 
tion of mTORCI through phosphorylation of the heterodimeric protein TSC1/TSC2. 
TSC proteins are tumor suppressor gene products. The TSC heterodimer is a GTPase- 
activating protein for Rheb, converting its bound GTP to GDP, thereby suppressing 
Rheb’s ability to allosterically activate mTORCI. This inhibition of mTORCI by 
AMPK conserves energy by turning off anabolic pathways. Nutrient availability is 
vital to cells, and nutrient signaling to mTORCI is complex and redundant. A diverse 
range of sensors and signaling complexes in addition to AMPK mediate mTORCI 
responses to glucose levels. 


Figure 27.7 The structure of the mTORCI complex. 
(a) The mTORC1 complex is organized as a dimer 
of trimers composed of mTOR (blue), Raptor 
(yellow), and mLSTS8 (gray; PDB id = 6BCX). The 
mTOR kinase active sites are indicated by the 
arrows. See Yang, H., et al., 2017. Structural 
mechanisms of mTORC! activation by RHEB and 
inhibition by PRAS40. Nature 552:368-373. 
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27.5 | How Is Metabolism Integrated 
in a Multicellular Organism? 


In complex multicellular organisms, organ systems have arisen to carry out specific 
physiological functions. Each organ relies on a repertoire of metabolic pathways that 
is consistent with its physiological purpose. Such specialization depends on coordi- 
nation of metabolic responsibilities among organs so that the organism as a whole 
may thrive. Essentially all cells in animals have the set of enzymes common to the 
central pathways of intermediary metabolism, especially the enzymes involved in the 
formation of ATP and the synthesis of glycogen and lipid reserves. Nevertheless, 
organs differ in the metabolic fuels they prefer as substrates for energy production. 
Important differences also occur in the ways ATP is used to fulfill the organs’ 
specialized metabolic functions. To illustrate these relationships, we will consider the 
metabolic interactions among the major organ systems found in humans: brain, 
skeletal muscle, heart, adipose tissue, and liver. In particular, the focus will be on 
energy metabolism in these organs (Figure 27.8). The major fuel depots in animals are 
glycogen in liver and muscle; triacylglycerols (fats) stored in adipose tissue; and protein, 
most of which is in skeletal muscle. In general, the order of preference for the use of 
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Figure 27.8 Metabolic relationships among the 
major human organs. 
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Table 27.1 Energy Metabolism in Major Vertebrate Organs 


Organ Energy Reservoir Preferred Substrate Energy Sources Exported 
Brain None Glucose (ketone bodies None 
during starvation) 
Skeletal muscle (resting) | Glycogen Fatty acids None 
Skeletal muscle None Glucose from glycogen Lactate 
(strenuous exertion) 
Heart muscle Glycogen Fatty acids None 
Adipose tissue Triacylglycerol Fatty acids Fatty acids, glycerol 
Liver Glycogen, Amino acids, Fatty acids, glucose, 
triacylglycerol glucose, fatty acids ketone bodies 


these fuels is the order given: glycogen > triacylglycerol > protein. Nevertheless, the 
tissues of the body work together to maintain energy homeostasis (caloric homeostasis), 
defined as a constant availability of fuels in the blood. 

Humans have a total of 11 organ systems, but only the five earlier mentioned play a 
major role in energy metabolism: the brain, muscles, heart (a specialized muscle), fat 
(adipose tissue), and liver. Table 27.1 summarizes the metabolism of these major human 
organs. Each has a specialized metabolic role. 


27.5a The Brain Accounts for 20% of the Body’s Oxygen Consumption 


The brain has two remarkable metabolic features. First, it has a very high respiratory 
metabolism. In resting adult humans, 20% of the oxygen consumed is used by the 
brain, even though it constitutes only 2% or so of body mass. Interestingly, this level 
of oxygen consumption is independent of mental activity, continuing even during 
sleep. Second, the brain is an organ with no significant fuel reserves—no glycogen, 
expendable protein, or fat. Normally, the brain uses only glucose as a fuel and is to- 
tally dependent on the blood for a continuous, incoming supply. Interruption of glu- 
cose supply for even brief periods (as in a stroke) can lead to irreversible losses in 
brain function. The brain uses glucose to carry out ATP synthesis via cellular respira- 
tion. High rates of ATP production are necessary to power the plasma membrane 
Nat, K*-ATPase so that the membrane potential essential for transmission of nerve 
impulses is maintained. 

During prolonged fasting or starvation, the body’s glycogen reserves are depleted. 
Under such conditions, the brain adapts to use B-hydroxybutyrate (Figure 27.9) as a 
source of fuel, converting it to acetyl-CoA for energy production via the citric acid 
cycle. B-Hydroxybutyrate (see Chapter 23) is formed from fatty acids in the liver. 
Although the brain cannot use free fatty acids or lipids directly from the blood as fuel, 
the conversion of these substances to B-hydroxybutyrate in the liver allows the brain to 
use body fat as a source of energy. The brain’s other potential source of fuel during 
starvation is glucose obtained from gluconeogenesis in the liver (see Chapter 22), using 
the carbon skeletons of amino acids derived from muscle protein breakdown. The 
adaptation of the brain to use B-hydroxybutyrate from fat spares protein from degrada- 
tion until lipid reserves are exhausted. 


27.5b Muscle Metabolism Is Fueled by Glucose and Free Fatty Acids 


Skeletal muscle is responsible for about 30% of the O, consumed by the human body 
at rest. During periods of maximal exertion, skeletal muscle can account for more 
than 90% of the total metabolism. Muscle metabolism is primarily dedicated to the 
production of ATP as the source of energy for contraction and relaxation. Muscle 
contraction occurs when a motor nerve impulse causes Ca?” release from specialized 
endomembrane compartments (the transverse tubules and sarcoplasmic reticulum). 
Ca’ floods the sarcoplasm (the cytosolic compartment of muscle cells), where it binds 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


963 


964 Chapter 27 Metabolic Integration and Organ Specialization 


OH 
O 
| “ 
p-B-Hydroxybutyrate CH3 — C — CHy — ES 
H or 
NAD* 
B-Hydroxybutyrate dehydrogenase 
NADH + H?* 
a O 
Il Z 
Acetoacetate CH; — C— CH — C 
o- 


Succinyl-CoA 
3-Ketoacyl-CoA transferase 


Succinate 


Acetoacetyl-CoA 


HS-CoA Thiolase 


2 Acetyl-CoA 


Figure 27.9 Ketone bodies such as B-hydroxybutyrate provide the brain with a source of acetyl-CoA 
when glucose is unavailable. 


to troponin C, a regulatory protein, initiating a series of events that culminate in the 
sliding of myosin thick filaments along actin thin filaments. This mechanical move- 
ment is driven by energy released upon hydrolysis of ATP (see Chapter 16). The net 
result is that the muscle shortens. Relaxation occurs when the Ca** ions are pumped 
back into the sarcoplasmic reticulum by the action of a Ca**-transporting membrane 
ATPase. Two Ca?" ions are translocated per ATP hydrolyzed. The amount of ATP 
used during relaxation is almost as much as that consumed during contraction. 

Because muscle contraction is an intermittent process that occurs upon demand, 
muscle metabolism is designed to generate energy upon demand. Muscle at rest uses free 
fatty acids, glucose, or ketone bodies as fuel and produces ATP via oxidative phosphoryla- 
tion. Resting muscle also contains about 2% glycogen and about 0.08% phosphocreatine 
by weight (Figure 27.10). When ATP is used to drive muscle contraction, the ADP formed 
can be reconverted to ATP by creatine kinase at the expense of phosphocreatine. Muscle 
phosphocreatine can generate enough ATP to power about 4 seconds of exertion. During 
strenuous exertion, such as a 100 meter sprint, once the phosphocreatine is depleted, mus- 
cle relies solely on its glycogen reserves, making the ATP for contraction via glycolysis. In 
contrast with the citric acid cycle and oxidative phosphorylation pathways, glycolysis is 
capable of explosive bursts of activity, and the flux of glucose-6-phosphate through this 
pathway can increase 2000-fold almost instantaneously. The triggers for this activation are 
Ca?* and the fight-or-flight hormone epinephrine (see Chapters 22 and 32). Little interor- 
gan cooperation occurs during strenuous (anaerobic) exercise. 

Muscle fatigue is the inability of a muscle to maintain power output. During 
maximum exertion, the onset of fatigue takes only 20 seconds or so. Fatigue is not the 
result of exhaustion of the glycogen reserves, nor is it a consequence of lactate 
accumulation in the muscle. Instead, it is caused by a decline in intramuscular pH as 
protons are generated during glycolysis. (The overall conversion of glucose to two 
lactate in glycolysis is accompanied by the release of two H*.) The pH may fall as low 
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as 6.4. Lower pH directly affects the ability of muscle to contract. Further, the decline 
in PFK activity at low pH leads to a lowered flux of hexose through glycolysis and 
inadequate ATP levels. One benefit of PFK inhibition is that the ATP remaining is not 
consumed in the PFK reaction, thereby sparing the cell from the more serious 


consequences of losing all of its ATP. 


During fasting or excessive activity, skeletal muscle protein is degraded to amino 
acids so that their carbon skeletons can be used as fuel. Many of the skeletons are con- 
verted to pyruvate, which can be transaminated back into alanine for export via the 
circulation (Figure 27.11). Alanine is carried to the liver, which in turn deaminates it 
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alanine by glutamate-alanine aminotransferase. 


Human Biochemistry 


Athletic Performance Enhancement with Creatine Supplements? 


The creatine pool in a 70-kg (154-pound) human body is about 
120 grams. This pool includes dietary creatine (from meat) and cre- 
atine synthesized by the human body from its precursors (arginine, 


AP Images/Michael Probst 


AP Images/Kirsty Wigglesworth 


glycine, and methionine). Of this creatine, 95% is stored in the 
skeletal and smooth muscles, about 70% of which is in the form of 
phosphocreatine. Supplementing the diet with 20 to 30 grams of 
creatine per day for 4 to 21 days can increase the muscle creatine 
pool by as much as 50% in someone with a previously low creatine 
level. Thereafter, supplements of 2 grams per day will maintain 
elevated creatine stores. Studies indicate that creatine supplemen- 
tation gives some improvement in athletic performance during 
high-intensity, short-duration events (such as weight lifting), but 
no benefit in endurance events (such as distance running). The 
distinction makes sense in light of phosphocreatine’s role as the 
substrate that creatine kinase uses to regenerate ATP from ADP. 
Intense muscular activity quickly (less than 2 seconds) exhausts 
ATP supplies; [phosphocreatine] muscle is sufficient to restore ATP 
levels for a few extra seconds, but no more. The U.S. Food and 
Drug Administration advises consumers to consult with their doc- 
tors before using creatine as a dietary supplement. 
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back into pyruvate so that it can serve as a substrate for gluconeogenesis. Although 
muscle protein can be mobilized as an energy source, it is not efficient for an organism 
to consume its muscle and lower its overall fitness for survival. Muscle protein repre- 
sents a fuel of last resort. 


27.5c The Heart Prefers Fatty Acids as Fuel 


In contrast with the intermittent work of skeletal muscle, the activity of heart muscle 
is constant and rhythmic. The range of activity in heart is also much less than that in 
muscle. Consequently, the heart functions as a completely aerobic organ and, as such, 
is very rich in mitochondria. Roughly half the cytoplasmic volume of heart muscle 
cells is occupied by mitochondria. Under normal working conditions, the heart 
prefers fatty acids as fuel, oxidizing acetyl-CoA units via the citric acid cycle and 
producing ATP for contraction via oxidative phosphorylation. Heart tissue has 
minimal energy reserves: a small amount of phosphocreatine and limited quantities 
of glycogen. As a result, the heart must be continually nourished with oxygen and free 
fatty acids, glucose, or ketone bodies as fuel. 


27.5d Adipose Tissue Is Hormonally Active 


Adipose tissue is an amorphous tissue that is widely distributed throughout the 
body—around blood vessels, in the abdominal cavity and mammary glands, and 
most prevalently, as deposits under the skin. Long considered merely a storage depot 
for fat, adipose tissue is now appreciated as an endocrine organ responsible for secre- 
tion of a variety of hormones that govern eating behavior and caloric homeostasis. It 
consists principally of cells known as adipocytes that no longer replicate. However, 
adipocytes can increase in number as adipocyte precursor cells divide, and obese 
individuals tend to have more of them. As much as 65% of the weight of adipose tissue 
is triacylglycerol that is stored in adipocytes, essentially as oil droplets. The average 
70-kg man has enough caloric reserve stored as fat to sustain a 6000 kJ/day rate of 
energy production for 3 months, which is adequate for survival, assuming no serious 
metabolic aberrations (such as nitrogen, mineral, or vitamin deficiencies). Despite 
their role as energy storage depots, adipocytes have a high rate of metabolic activity, 
synthesizing and breaking down triacylglycerol so that the average turnover time for 
a triacylglycerol molecule is just a few days. Adipocytes actively carry out cellular 
respiration, transforming glucose to energy via glycolysis, the citric acid cycle, and 
oxidative phosphorylation. If glucose levels in the diet are high, glucose is converted 
to acetyl-CoA for fatty acid synthesis. However, under most conditions, free fatty 
acids for triacylglycerol synthesis are obtained from the liver. Because adipocytes 
lack glycerol kinase, they cannot recycle the glycerol of triacylglycerol but rather de- 
pend on glycolytic conversion of glucose to dihydroxyacetone-3-phosphate (DHAP) 


Human Biochemistry 


Fat-Free Mice—A Model for Diabetes 


Scientists at the U.S. National Institutes of Health have created 
transgenic mice that lack white adipose tissue throughout their 
lifetimes. These mice were created by blocking the normal differ- 
entiation of stem cells into adipocytes so that essentially no white 
adipose tissue can be formed in these animals. These fat-free mice 
have double the food intake and five times the water intake of 
normal mice. Fat-free mice also show decreased physical activity 
and must be kept warm on little heating pads to survive, because 
they lack insulating fat. They are also diabetic, with three times 
the normal blood glucose and triacylglycerol levels and only 5% 
of normal leptin levels; they die prematurely. Like type 2 dia- 


betic patients, fat-free mice have markedly elevated insulin levels 
(50 to 400 times normal) but are unresponsive to insulin. These 
mice serve as an excellent model for the disease /ipoatrophic diabe- 
tes, an inherited disease characterized by the absence of adipose 
tissue and severe diabetic symptoms. Indeed, transplantation of 
adipose tissue into these fat-free mice cured their diabetes. As the 
major organ for triacylglycerol storage, white adipose tissue helps 
control energy homeostasis (food intake and energy expenditure) 
via the release of leptin and other hormonelike substances (see the 
discussion in Section 27.6). Clearly, absence of adipose tissue has 
widespread, harmful consequences for metabolism. 
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and the reduction of DHAP to glycerol-3-phosphate for triacylglycerol biosynthesis. 
Adipocytes also require glucose to feed the pentose phosphate pathway for NADPH 
production. 

Glucose plays a pivotal role for adipocytes. If glucose levels are adequate, glycerol- 
3-phosphate is formed in glycolysis and the free fatty acids liberated in triacylglycerol 
breakdown are re-esterified to glycerol to re-form triacylglycerols. However, if glucose 
levels are low, [glycerol-3-phosphate] falls and free fatty acids are released to the blood- 
stream (see Chapter 23). 


Brown Fat A specialized type of adipose tissue, brown fat, is found in newborns and 
hibernating animals. An abundance of mitochondria, which are rich in cytochromes, 
is responsible for the brown color of this fat. As usual, these mitochondria are very 
active in electron transport—driven proton translocation, but these particular mito- 
chondria contain in their inner membranes a protein, thermogenin, also known as 
uncoupling protein I (see Chapter 20), that creates a passive proton channel, permitting 
the H* ions to reenter the mitochondrial matrix without generating ATP. Instead, the 
energy of oxidation is dissipated as heat. Indeed, brown fat is specialized to oxidize 
fatty acids for heat production rather than ATP synthesis. 


27.5e The Liver Is the Body’s Metabolic Processing Center 


The liver serves as the major metabolic processing center in vertebrates. Except for 
dietary triacylglycerols, which are metabolized principally by adipose tissue, most 
of the incoming nutrients that pass through the intestinal tract are routed via the 
portal vein to the liver for processing and distribution. Much of the liver’s activity cen- 
ters around conversions involving glucose-6-phosphate (Figure 27.12). Glucose- 
6-phosphate can be converted to glycogen, released as blood glucose, used to generate 
NADPH and pentoses via the pentose phosphate cycle, or catabolized to acetyl-CoA 
for fatty acid synthesis or for energy production via oxidative phosphorylation. Most 
of the liver glucose-6-phosphate arises from dietary carbohydrate, from degradation 
of glycogen reserves, or from muscle lactate that enters the gluconeogenic pathway. 


Glycogen 
Glucose-6-phosphate ————> Blood glucose 
Triacylglycerols, 
phospholipids Glycolysis Pentose phosphate 
pathway 
Lipid 
synthesis Cholesterol Na ATP 
Fat i NADPH 
atty acids y a 
Pyruvate pentose 
phosphates 
Fatty acid 
synthesis y 
Acetyl-CoA 
Citric ATP 
acid Oxidative 
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y 
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Figure 27.12 Metabolic conversions of glucose-6- 
phosphate in the liver. 
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Human Biochemistry 


The Metabolic Effects of Alcohol Consumption 


Ethanol metabolism alters the NAD*/NADH ratio. Ethanol is 
metabolized to acetate in the liver by alcohol dehydrogenase and 
aldehyde dehydrogenase: 


CH;CH,OH + NAD+ == CH;CHO + NADH + H+ 
CH;CHO + NAD* + H2O == CH;COO™ + NADH + 2H* 


Acetate is then converted to acetyl-CoA. Excessive conversion 
of available NAD* to NADH impairs NAD*-requiring reac- 
tions, such as the citric acid cycle, gluconeogenesis from lactate, 
and fatty acid oxidation. Accumulation of acetyl-CoA favors 
fatty acid synthesis, which, along with blockage of fatty acid 
oxidation, causes elevated triacylglycerol levels in the liver. Over 


time, these triacylglycerols accumulate as fatty deposits, which 
ultimately contribute to cirrhosis of the liver. Impairment of 
gluconeogenesis leads to buildup of this pathway’s substrate, 
lactate. Lactic acid accumulation in the blood causes acidosis. 
A further consequence is that acetaldehyde can form adducts 
with protein —NH) groups, which may inhibit protein function. 
Because gluconeogenesis is limited, alcohol consumption can 
cause hypoglycemia (low blood sugar) in someone who is under- 
nourished. In turn, hypoglycemia can cause irreversible damage 
to the central nervous system. A high [NADH]/[NAD‘*] ratio also 
impairs SIRT1 function; SIRT1 is a key player in protecting cells 
against stress-induced degenerative disease (see Section 27.7). 


The liver plays an important regulatory role in metabolism by buffering the level of 
blood glucose. Liver has two enzymes for glucose phosphorylation: hexokinase and 
glucokinase (type-IV hexokinase). Unlike hexokinase, glucokinase has a low affinity for 
glucose. Glucokinase requires 5 mM glucose to reach 0.5 Vinax. When blood glucose 
levels are high, glucokinase activity augments hexokinase in phosphorylating glucose as 
an initial step leading to its storage in glycogen. The major metabolic hormones— 
epinephrine, glucagon, and insulin—all influence glucose metabolism in the liver to keep 
blood glucose levels relatively constant (see Chapters 22 and 32). 

The liver is a major center for fatty acid turnover. When the demand for metabolic 
energy is high, triacylglycerols are broken down and fatty acids are degraded in the liver 
to acetyl-CoA to form ketone bodies, which are exported to the heart, brain, and other 
tissues. If energy demands are low, fatty acids are incorporated into triacylglycerols that 
are carried to adipose tissue for deposition as fat. Cholesterol is also synthesized in the 
liver from two-carbon units derived from acetyl-CoA. 

In addition to these central functions in carbohydrate and fat-based energy metabolism, 
the liver serves other purposes. For example, the liver can use amino acids as metabolic 
fuels. Amino acids are first converted to their corresponding a-keto acids by aminotrans- 
ferases. The amino group is excreted after incorporation into urea in the urea cycle. The 
carbon skeletons of glucogenic amino acids can be used for glucose synthesis, whereas 
those of ketogenic amino acids appear in ketone bodies (see Figure 25.41). The liver is also 
the principal detoxification organ in the body. The endoplasmic reticulum of liver cells is 
rich in enzymes that convert biologically active substances such as hormones, poisons, and 
drugs into less harmful byproducts. 

Liver disease leads to serious metabolic derangements, particularly in amino acid 
metabolism. In cirrhosis, the liver becomes defective in converting NH,* to urea for 
excretion, and blood levels of NH,* rise. Ammonia is toxic to the central nervous 
system, and coma ensues. 


27.6 | What Regulates Our Eating Behavior? 


Over 70% of Americans are overweight, and more than one in three Americans is 
clinically obese (overweight by 20% or more). Obesity is the single most important 
cause of type 2 (adult-onset, insulin-independent) diabetes. Research into the regula- 
tory controls that govern our feeding behavior has become a medical urgency with 
great financial incentives, given the epidemic proportions of obesity and widespread 
preoccupation with dieting and weight loss. 
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27.6a The Hormones That Control Eating Behavior 
Come from Many Different Tissues 


Appetite and weight regulation are determined by a complex neuroendocrine system 
that involves hormones produced in the stomach, small intestines, pancreas, adipose 
tissue, and central nervous system. These hormones act in the brain, principally on 
neurons within the arcuate nucleus region of the hypothalamus. The arcuate nucleus 
is an anatomically distinct brain area that functions in homeostasis of body weight, 
body temperature, blood pressure, and other vital functions (Figure 27.13). The 
neurons respond to these signals by activating, or not, pathways involved in eating 
(food intake) and energy expenditure. Hormones that regulate eating behavior can be 
divided into short-term regulators that determine individual meals and long-term 
regulators that act to stabilize the levels of body fat deposits. Two subsets of neurons 
are involved: (1) the NPY/AgRP-producing neurons that release NPY (neuropeptide Y), 
the protein that stimulates the neurons that trigger eating behavior, and (2) the 
melanocortin-producing pro-opiomelanocortin (POMC) neurons, whose products in- 
hibit the neurons initiating eating behavior. AgRP is agouti-related peptide, a protein 
that blocks the activity of melanocortin-producing neurons. Melanocortins are a group 
of peptide hormones that includes a- and B-melanocyte—stimulating hormones (a-MSH 
and B-MSH). Melanocortins act on melanocortin receptors (MCRs), which are mem- 


Hypothalamus 


Food Energy 
intake expenditure 


Arcuate 
nucleus 


Melanocortin 


NPY/PYY3, 36 
receptor Y2R 
Melanocortin 


Leptin receptor 


EBEBEB & 


Figure 27.13 The regulatory pathways that control eating. The pink neurons targeted by NPY/AgRP 
and melanocortin-producing pro-opiomelanocortin (POMC) neurons are likely neurons of the PVH 
(paraventricular region of the hypothalamus). In turn, neurons of the PVH make connections to the 
hindbrain and spinal cord regions more directly involved in regulation of food intake and energy 
expenditure. It’s complicated. (Adapted from Figure | in Schwartz, M. W., and Morton, G. J., 2002. 
Obesity: Keeping hunger at bay. Nature 418:595—597.) 
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bers of the 7-TMS G-protein—coupled receptor (GPCR) family of membrane recep- 
tors; MCRs trigger cellular responses through adenylyl cyclase activation. 


27.6b Ghrelin and Cholecystokinin Are Short-Term 
Regulators of Eating Behavior 


Short-term regulators of eating include ghrelin and cholecystokinin. Ghrelin is an 
appetite-stimulating peptide hormone produced in the stomach. Production of ghre- 
lin is maximal when the stomach is empty, but ghrelin levels fall quickly once food is 
consumed. Cholecystokinin is a peptide hormone released from the gastrointestinal 
tract during eating. In contrast to ghrelin, cholecystokinin signals satiety (the sense 
of fullness) and tends to curtail further eating. Together, ghrelin and cholecystoki- 
nin constitute a meal-to-meal control system that regulates the onset and end of 
eating behavior. The activity of this control system is also modulated by the long- 
term regulators. 


27.6c Insulin and Leptin Are Long-Term Regulators of Eating Behavior 


Long-term regulators of eating behavior include insulin and leptin, both of which in- 
hibit eating and promote energy expenditure. Insulin is the key hormone produced in 
response to food excess and the well-fed state. It signals caloric abundance. Insulin is 
produced in the B-cells of the pancreas when blood glucose levels rise. A major role of 
insulin is to stimulate glucose uptake from the blood into muscle, fat, and other tis- 
sues. Blood insulin levels correlate with body fat amounts. Insulin also stimulates fat 
cells to make leptin. Leptin (from the Greek word lepto, meaning “thin”) is a 16-kDa, 
146-amino acid residue protein produced principally in adipocytes (fat cells). Leptin 
has a four-helix bundle tertiary structure similar to that of cytokines (protein hor- 
mones involved in cell-cell communication; see Figure 27.14). Normally, as fat 
deposits accumulate in adipocytes, more and more leptin is produced in these cells 
and released into the bloodstream. Leptin levels in the blood communicate the status 
of triacylglycerol levels in the adipocytes to the central nervous system so that appro- 
priate changes in appetite take place. Ifleptin levels are low (starvation), appetite 
increases; if leptin levels are high (overfeeding), appetite is suppressed. Leptin also 
regulates fat metabolism in adipocytes, inhibiting fatty acid biosynthesis and stimu- 
lating fat metabolism. In the latter case, leptin induces synthesis of the enzymes in the 
fatty acid oxidation pathway and increases expression of uncoupling protein 2 (UCP2), 
a mitochondrial protein that uncouples oxidation from phosphorylation so that the 
energy of oxidation is lost as heat (thermogenesis). Leptin binding to leptin receptors 
in the hypothalamus inhibits release of NPY. Because NPY is a potent orexic (appetite- 
stimulating) peptide hormone, leptin is therefore an anorexic (appetite-suppressing) 


N 


Figure 27.14 Structures of human leptin (PDB id = 1AX8), neuropeptide Y (PDB id = IRON), and 
PPY3.36 (PDB id = 2DEZ). (The leptin structure shown here is for leptin E100 [a W100E mutant]. 
Leptin E100 is a fully functional form of leptin.) 
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27.7 Can You Really Live Longer by Eating Less? 


agent. Functional leptin receptors are also essential for pituitary function, growth 
hormone secretion, and normal puberty. When body fat stores decline, the circulating 
levels of leptin and insulin also decline. Hypothalamic neurons sense this decline and 
act to increase appetite to restore body fat levels. 

Intermediate regulation of eating behavior is accomplished by the gut hormone 
PYY3.36. PY Y3.35 is produced in endocrine cells found in distal regions of the small 
intestine, areas that receive ingested food some time after a meal is eaten. PYY3.36 
inhibits eating for many hours after a meal by acting on the NPY/AgRP-producing 
neurons in the arcuate nucleus. Clearly, the regulatory controls that govern eating are 
complex and layered. Some believe that defects in these controls are common and biased 
in favor of overeating, an advantageous evolutionary strategy that may have unforeseen 
consequences in these bountiful times. 

To this point, our focus has been on hormonal controls influencing energy homeostasis 
and food intake. Superimposed on these regulatory mechanisms are neural controls origi- 
nating in those cognitive centers of the brain involved in learning, memory, socialization, 
and decision-making processes. For example, hedonistic (pleasure-seeking) mechanisms 
triggered by seeing, smelling, tasting, or even imagining food can override the normal 
hormonal regulation of eating behavior that is driven by nutrient levels in the blood. 
Eating or not is determined by a very complex interplay between energy status and the 
emotional and/or rational disposition of the brain. Our lack of understanding of these 
phenomena hinders our treatment of the obesity epidemic. 


27.6d AMPK Mediates Many of the Hypothalamic 
Responses to These Hormones 


The actions of leptin, ghrelin, and NPY converge at AMPK. Leptin inhibits AMPK 
activity in the arcuate nucleus, and this inhibition underlies the anorexic effects of 
leptin. Leptin action on AMPK depends on a particular melanocortin receptor type 
known as the melanocortin-4 receptor (MC4R). On the other hand, ghrelin and NPY 
activate hypothalamic AMPK, which stimulates food intake and leads, over time, 
to increased body weight. The effects of AMPK in the hypothalamus that lead to 
alterations in eating behavior may be mediated through changes in malonyl-CoA 
levels. Malonyl-CoA is believed to be a key indicator of energy levels in hypothalamic 
neurons. Low [malonyl-CoA] in hypothalamic neurons is associated with increased 
food intake, and elevated malonyl-CoA levels are associated with suppression of 
eating. The inhibition of acetyl-CoA carboxylase (and thus, malonyl-CoA synthesis) as 
a result of phosphorylation by AMPK plays an important part in the regulation of our 
eating behavior. Dysregulation of the malonyl-CoA and AMPK circuit is implicated in 
the development of metabolic syndrome, a suite of metabolic aberrations associated 
with obesity, including insulin resistance, high levels of insulin in the blood, abnormal 
blood lipid profiles, inflammation, and the likelihood to develop type 2 diabetes. 


27.7 | Can You Really Live Longer by Eating Less? 


Nutritional studies published in 1935 showed that rats fed a low-calorie, but balanced 
and nutritious diet, lived nearly twice as long as rats with unlimited access to food 
(2.4 years versus 1.3 years). Subsequent research over the ensuing 85 years has shown 
that the relationship between diet and longevity is a general one for organisms from 
yeast to mammals. 


27.7a Caloric Restriction Leads to Longevity 


To achieve this effect of caloric restriction (CR) on longevity, animals are given a level 
of food that amounts to 60% to 70% of what they would eat if they were allowed free 
access to food. In animals, CR results in lower blood glucose levels, declines in glyco- 
gen and fat stores, enhanced responsiveness to insulin, lower body temperature, and 
diminished reproductive capacity. The extended life span given by CR offers a definite 
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evolutionary advantage: Any animal that could slow the aging process and postpone 
reproduction in times of food scarcity and then resume reproduction when food be- 
came available would out-compete animals without such ability. 

Another remarkable feature of CR is that it diminishes the likelihood for development 
of many age-related diseases, such as cancer, diabetes, and atherosclerosis. Is this benefit 
simply the result of lowered caloric intake, or does CR lead to significant regulatory 
changes that affect many aspects of an organism’s physiology? The answer to this and 
other questions is emerging from vigorous research efforts toward understanding CR. 

In this regard, experiments show that mTOR plays a role in the aging process. Either 
genetic disruption of mTOR signaling or rapamycin inhibition of mTOR activity ex- 
tends lifespan in experimental animals (nematodes, flies, and mice). Beyond its effects on 
aging, rapamycin delays the onset of diseases associated with aging, such as Alzheimer’s 
and cancer. 


27.7b Mutations in the SIR2 Gene Decrease Life Span 


Many important clues about the mechanism of CR came from genetic investigations. 
Deletion of a gene termed S/R2 (for silent information regulator 2) abolished the abil- 
ity of CR to lengthen life span in yeast and roundworms, implicating the S/R2 gene 
product in longevity. S/R2 originally was discovered through its ability to silence the 
transcription of genes that encode rRNA. S/R2-related genes are found in some pro- 
karyotes and virtually all eukaryotes, including yeast, nematodes, fruit flies, and hu- 
mans. Humans have seven such genes, designated S[RT/ through SIRT7, SIRT being 
an abbreviation of sirtuin, the generic name for proteins encoded by S/R2 genes. 
SIRTs differ in their cellular localization, with SIRT1 cycling between the cytosol 
and nucleus, and SIRTs 3, 4, and 5 confined to the mitochondria. SIRT1 is the best 
studied of the human sirtuins. Sirtuins are NAD*-dependent protein deacetylases 
(Figure 27.15). Cleavage of acetyl groups from acetylated lysine side chains in proteins 
is an exergonic reaction, as is cleavage of the N-glycosidic bond in NAD* (AG°’ = —34 
kJ/mol). Thus, involvement of NAD* in the reaction is not a thermodynamic neces- 
sity. However, NAD* involvement does couple the reaction to an important signal of 
metabolic status, namely, the availability of NAD*. So oxidative metabolism, which 
drives conversion of NADH to NAD*, enhances sirtuin activity. One adaptive re- 
sponse found with CR is increased mitochondrial biogenesis in liver, fat, and muscle, 
which is a response that would raise the NAD* concentration. 

Sirtuin-catalyzed removal of acetyl groups from lysine residues of histones, the core 
proteins of nucleosomes, would allow the nucleosomes to interact more strongly with 
DNA, making transcription more difficult. Perhaps more significantly, SIRT-catalyzed 
removal of acetyl groups from K residues in transcription factors and metabolic enzymes 
is an effective mechanism for transcriptional and metabolic regulation in response to 
changes in the NAD*/NADH ratio. Almost all the enzymes of energy metabolism are 
acetylated and thus potentially regulated by SIRT deacetylation. Thus, sirtuins are en- 
ergy sensors that integrate metabolism and the regulation of gene expression. 
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Figure 27.15 The NAD*-dependent protein deacetylase reaction of sirtuins. The enzymes catalyzing 
this reaction are classified as lysine (K) deacetylases (K DACs). Acetylated peptides include N-acetyl 
lysine side chains of histone H3 and H4 and acetylated p53 (p53 is the protein product of the p53 
tumor suppressor gene). 
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27.7 Can You Really Live Longer by Eating Less? 


Recent research suggests that sirtuins might function as general protein deacylases, 
capable of removing any acyl group from a protein and targeting a wide range of acylated 
proteins. In addition, suggesting that SIRT1 acts to promote longevity may be misleading. 
Instead, it seems more likely that the actions of SIRT1 are directed toward mobilizing 
pathways that protect against stress. Given that chronic diseases and aging are a direct 
result of the ravaging effects of stress, this hypothesis proposes that sirtuins promote 
health maintenance, not longevity, and that longevity is a secondary effect. 


27.7¢ SIRTI Is a Key Regulator in Caloric Restriction 


Asakey regulator in CR, the human sirtuin protein SIRT1 connects nutrient availability 
to the expression of metabolic genes. A striking feature of CR is the loss of fat stores 
and reduction in white adipose tissue (WAT). SIRT1 participates in the transcriptional 
regulation of adipogenesis through interaction with PPARy (peroxisome proliferator- 
activator receptor-y), a nuclear hormone receptor that activates transcription of genes 
involved in adipogenesis and fat storage. SIRT1 binding to two PPAR; corepressors, 
NCoR and SMRT, prevents transcription of genes controlled by PPARy, leading to loss 
of fat stores. Because adipose tissue functions as an endocrine organ, this loss of fat has 
significant hormonal consequences for energy metabolism (Figure 27.15). 

In liver, SIRT1 interacts with and deacetylates PGC-la (peroxisome proliferator- 
activator receptor-y coactivator-1), a transcriptional regulator of genes involved in 
energy metabolism and mitochondrial biogenesis. Thus, CR leads to increased tran- 
scription of the genes encoding the enzymes of gluconeogenesis and repression of genes 
encoding glycolytic enzymes. Acting in these roles, SIRT1 connects nutrient availability 
to the regulation of major pathways of energy storage (glycogen and fat) and fuel 
utilization. 


27.74 SIRT1, MTORC1, AMPK, CR, and Metabolic Syndrome 


SIRTI is also regulated through reversible phosphorylation. Phosphorylation of 
SIRTI Ser” by the master regulator mTORCI inhibits SIRT1. Recall that mTORC1 
is active when nutrient levels are high. In contrast, when cAMP-dependent PKA 
phosphorylates Ser** within the SIRTI catalytic domain, SIRTI is activated. Thus, 
activation of SIRTI is a response to B—adrenergic hormones like epinephrine. 
Adrenergic hormones are released as a response to cold stress, and when they acti- 
vate SIRTI, it promotes fatty acid oxidation and thermogenesis in brown fat. Further, 
when glucose levels are depleted, as in CR, AMPK activates SIRT1 through phos- 
phorylation of T°”. 

Certain generalizations are clear. AMPK is a key regulator of metabolism through 
its role as the sensor of energy status. The NAD*-dependent deacetylase SIRT1 be- 
comes activated when nutrients are limiting, as in CR. So, both AMPK and SIRT! re- 
spond to nutrient limitation. AMPK responds by down-regulating anabolic processes 
and activating ATP-producing catabolic processes. AMPK accomplishes these ends 
through phosphorylation of metabolic enzymes and transcription factors. The effects 
of SIRT1 are achieved through its deacetylation of metabolic enzymes and transcrip- 
tion factors, and depending on the transcription factor, gene expression may be en- 
hanced or diminished. AMPK and SIRT! functions are coordinated to ensure that the 
cellular responses to changing conditions of nutrient availability are appropriate. 

As mentioned in Section 27.6, metabolic syndrome is a suite of symptoms that pre- 
cede the development of type 2 diabetes; the symptoms of metabolic syndrome are the 
exact opposite of the benefits of caloric restriction. The symptoms include high blood 
pressure, elevated blood triglyceride levels, low blood HDL-cholesterol levels, high 
blood glucose levels, and central obesity (accumulation of body fat around the waist). 
The relationships between the altered metabolism seen in metabolic syndrome and the 
effects of CR are intriguing. Caloric restriction reverses all the symptoms of metabolic 
syndrome. A deeper understanding of the relationships between nutrient intake and 
changes in metabolic regulation is key to health and wellbeing. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 27. 


Systems analysis can simplify the complexity of metabolism. 


The metabolism of a heterotrophic cell can be portrayed as three 
interconnected functional blocks: catabolism, anabolism, and macro- 
molecular synthesis/growth. 


Only a few intermediates connect the systems of catabolism and anabo- 
lism: four sugar-Ps, three a-keto acids, two coenzyme A derivatives, 
and PEP. 


ATP and NADPH couple catabolism and anabolism. 


Phototrophs have a fourth functional block consisting of photosyn- 
thesis and CO, fixation. 


In phototrophs, ATP and NADPH are produced by photosynthesis. 
Virtually every metabolic pathway either produces or consumes ATP. 


The overall thermodynamic efficiency of any metabolic sequence is 
determined by its ATP stoichiometry. 


Catabolic pathways release energy that is captured as ATP. 


The thermodynamically unfavorable reactions of anabolism are driven 
by ATP hydrolysis. 


Obligate coupling stoichiometry: Chemical considerations fix the 
stoichiometry of electron-pair carriers such as NADH and [FADH,] 
in metabolic pathways. 


The stoichiometry of ATP coupling is not fixed by chemistry; instead, 
it is an evolved characteristic of living systems. 


ATP coupling stoichiometry determines the Kq for a metabolic 
sequence. 


ATP has two metabolic roles: (1) ATP serves a stoichiometric role that 
renders metabolic pathways emphatically favorable, and (2) ATP has 
an allosteric role in the kinetic regulation of metabolism. 


Energy charge is a concept used to quantify cellular energy status. 


Energy charge is based on the ratio of the sum of the phosphoric 
anhydride bonds in ADP and ATP divided by the total adenine 
nucleotide pool. 


The adenylate kinase reaction connects the three principal adenine 
nucleotides: AMP, ADP, and ATP. 


Because of the reciprocal relationship between adenine nucleotide 
regulation of catabolic and anabolic pathways, the energy charge 
of cells is tightly regulated to remain fairly constant at high energy 
charge. 


Phosphorylation potential (the ratio of [ATP]/[ADP][Pj]) changes as 
ATP levels change. 


AMP-activated protein kinase (AMPK) is the cellular energy sensor. 


ATP is an allosteric inhibitor of AMPK; AMP is an allosteric activator 
of AMPK. 


When cellular energy status is high ([ATP] is high), AMPK is inactive. 
When cellular energy status is low ([AMP] is high), AMPK is activated. 


AMPK targets key enzymes in energy-producing and energy- 
consuming pathways, turning on those that lead to ATP synthesis and 
turning off those that consume ATP. 


mTOR (mechanistic target of rapamycin) is a master coordinator of 
metabolism in multicellular organisms. 


AMPK disrupts the action of mTORC1 through phosphorylation of 
a heterodimeric protein called TSC1/2. 


When nutrient levels are high, the mTORC1 complex phosphorylates 
proteins controlling transcription and translation of genes that en- 
code anabolic pathways. 


Metabolism is integrated in multicellular organisms. 
The tissues of the body work together to maintain energy homeostasis. 


The organ systems of multicellular organisms have specialized meta- 
bolic roles. 


The brain has very high respiratory metabolism, accounting for 20% 
of the total oxygen consumption by the body. 


The brain has no stored fuel, so it depends on the bloodstream for a 
constant supply of glucose. 


Under conditions of starvation, the brain can use ketone bodies instead 
of glucose for fuel. 


Muscle is responsible for about 30% of O, consumption at rest, and 
more than 90% during strenuous exercise. 


Almost as much ATP is used to pump Ca?* ions back into the sarco- 
plasmic reticulum as was used in muscle contraction. 


Creatine kinase allows muscle to use phosphocreatine to restore ATP 
levels that have been depleted. 


During fasting or starvation, muscle protein is degraded to form a-keto 
acids that can be used as energy sources or to fuel gluconeogenesis. 


Heart muscle prefers fatty acids as fuel. 


Adipose tissue is not merely a repository for triglycerides; it is also an 
important endocrine organ, secreting hormones such as leptin that 
regulate eating behavior and caloric homeostasis. 


Brown fat is adipose tissue rich in mitochondria. 


Thermogenesis is the metabolic role of brown fat: The mitochondria 
in brown fat have uncoupling proteins that dissipate the proton gradi- 
ent as heat. 


The liver is the major metabolic processing center in animals. 


The liver is responsible for glucose homeostasis through maintenance 
of constant blood [glucose]. 


The liver is the principal site of fatty acid turnover. 


The liver can use amino acids as metabolic fuel, for gluconeogenesis, 
or for ketone body synthesis. 


The hormones that control eating behavior come from many different 
organs—fat, stomach, pancreas, intestines, and the central nervous 
system. 


Two subsets of neurons located in the arcuate nucleus of the hypothala- 
mus regulate eating behavior: the NPY/AgRP-producing neurons that 
release NPY, a hormone that triggers eating, and the melanocortin- 
producing POMC neurons whose products inhibit eating behavior. 


Ghrelin and cholecystokinin are short-term regulators of eating 
behavior. 


Leptin and insulin are long-term regulators of eating behavior. 


The gut hormone PPY3.3. is an intermediate regulator of eating be- 
havior. 


AMPK mediates many of the hypothalamic responses to these hor- 
mones. 


Superimposed on these biochemical mechanisms are neuronal con- 
trols arising from cognitive centers in the brain that can override 
metabolic regulation. 
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© Caloric restriction (CR) leads to longevity. 


e The SIRT gene products, referred to as SIRTs, are sirtuins: NAD*- 
dependent protein deacetylases (NAD*-dependent KDACs). 


e Almost all metabolic enzymes are acetylated and thus potentially 
regulated by the sirtuins. 


e SIRTI deacetylates metabolic enzymes, histones, and certain trans- 
cription factors. 


e The metabolic role of SIRT1 is to integrate metabolism and gene 
expression with the availability of calories. 


e SIRTI is a key regulator in caloric restriction. 
e Thus, both AMPK and SIRT1 respond to nutrient limitation. 


e AMPK accomplishes these ends through phosphorylation of metabolic 
enzymes and transcription factors. 


e SIRTI activity is dependent on the availability of NAD* and, thus, 
oxidative metabolism. 
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The effects of SIRT1 are achieved through its deacetylation of meta- 
bolic enzymes and transcription factors. 


AMPK and SIRTI! functions are coordinated so that cellular responses 
to changing nutrient availability are appropriate. 


mTORCI phosphorylation of SIRT1 Ser*’ inhibits its deacetylase 
activity. 


PKA phosphorylation of Ser+* within the SIRTI catalytic domain 
activates SIRT1. 


AMPK and SIRTI act to ensure that cellular responses to changes in 
of nutrient availability are appropriate. 


Sirtuins promote health maintenance by mobilizing pathways that 
protect against physiological stress. 


Caloric restriction reverses the symptoms of metabolic syndrome. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. The Energetics of Opposing Metabolic Sequences (Integrates with 
Chapters 3, 18, and 22.) The conversion of PEP to pyruvate by 
pyruvate kinase (glycolysis) and the reverse reaction to form PEP 
from pyruvate by pyruvate carboxylase and PEP carboxykinase 
(gluconeogenesis) represent a substrate cycle. The direction of net 
conversion is determined by the relative concentrations of allosteric 
regulators that exert kinetic control over pyruvate kinase, pyruvate 
carboxylase, and PEP carboxykinase. Recall that the last step in 
glycolysis is catalyzed by pyruvate kinase: 


PEP + ADP == pyruvate + ATP 


The standard free energy change is —31.7 kJ/mol. 

a. Calculate the equilibrium constant for this reaction. 

b. If [ATP] = [ADP], by what factor must [pyruvate] exceed [PEP] 
for this reaction to proceed in the reverse direction? 


The reversal of this reaction in eukaryotic cells is essential to gluco- 
neogenesis and proceeds in two steps, each requiring an equivalent of 
nucleoside triphosphate energy: 


Pyruvate carboxylase 
Pyruvate + CO, + ATP —> oxaloacetate + ADP + P; 


PEP carboxykinase 
Oxaloacetate + GTP —> PEP + CO, + GDP 


Net: Pyruvate + ATP + GTP —> PEP + ADP + GDP + P; 


c. The AG” for the overall reaction is +0.8 kJ/mol. What is the 
value of Keq? 

d. Assuming [ATP] = [ADP], [GTP] = [GDP], and [Pj] = 1 mM 
when this reaction reaches equilibrium, what is the ratio of 
[PEP]/[pyruvate]? 

e. Are both directions in the substrate cycle likely to be strongly 
favored under physiological conditions? 


2. Calculating Energy Charge and Phosphorylation Potential (Integrates 
with Chapter 3.) Assume the following intracellular concentrations in 
muscle tissue: [ATP] = 8 mM, [ADP] = 0.9 mM, [AMP] = 0.04 mM, 
[Pi] = 8 mM. What is the energy charge in muscle? What is the 
phosphorylation potential? 


3. Evolved Stoichiometry in ATP Production (Integrates with Chapter 20) 


Section 27.2 highlights the evolved nature of ATP coupling coeffi- 
cients in metabolic pathways. Revisit the electron transport/oxidative 
phosphorylation process in Chapter 20 and describe at least three 
aspects of this process that could affect the stoichiometry of ATP 
production. 


. How Long Do ATP and Phosphocreatine Supplies Last During Muscle 


Contraction? Strenuous muscle exertion (as in the 100 meter dash) 
rapidly depletes ATP levels. How long will 8 mM ATP last if 1 gram of 
muscle consumes 300 umol of ATP per minute? (Assume muscle is 70% 
water.) Muscle contains phosphocreatine as a reserve of phosphoryla- 
tion potential. Assuming [phosphocreatine] = 40 mM, [creatine] = 4 
mM, and AG” (phosphocreatine + H,O =— creatine + Pi) = —43.3 
kJ/mol, how low must [ATP] become before it can be replenished by the 
reaction: phosphocreatine + ADP =— ATP + creatine? [Remember, 
AG" (ATP hydrolysis) = —30.5 kJ/mol.] 


. What Is the Metabolic Value of NADPH in ATP Equivalents? 


(Integrates with Chapter 20.) The standard reduction potentials for 
the (NAD*t/NADH) and (NADP*/NADPH) couples are identical, 
namely, —320 mV. Assuming the in vivo concentration ratios NAD*/ 
NADH = 20 and NADP*/NADPH = 0.1, what is AG for the 
following reaction? 


NADPH + NAD* == NADP* + NADH 


Assuming standard state conditions for the reaction, ADP + P; —> 
ATP + H,O, calculate how many ATP equivalents can be formed 
from ADP + P; by the energy released in this reaction. 


. NAD*t/NADH in Catabolism or Anabolism NAD*/NADH is largely 


used in the catabolic block of metabolism. Name an anabolic 
pathway that uses NAD*/NADH. 


. Adenylate Kinase Is an Amplifier (Integrates with Chapter 3.) Assume 


the total intracellular pool of adenylates (ATP + ADP + AMP) = 
8 mM, 90% of which is ATP. What are [ADP] and [AMP] if the 
adenylate kinase reaction is at equilibrium? Suppose [ATP] drops 
suddenly by 10%. What are the concentrations now for ADP and 
AMP, assuming that the adenylate kinase reaction is at equilibrium? 
By what factor has the AMP concentration changed? 


. Regulating the Flux Through Metabolic Pathways (Integrates with 


Chapters 18 and 22.) The reactions catalyzed by PFK and FBPase 
constitute another substrate cycle. PFK is AMP activated; FBPase 
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is AMP inhibited. In muscle, the maximal activity of PFK (mmol of 
substrate transformed per minute) is ten times greater than FBPase 
activity. If the increase in [AMP] described in problem 5 raised 

PFK activity from 10% to 90% of its maximal value but lowered 
FBPase activity from 90% to 10% of its maximal value, by what 
factor is the flux of fructose-6-P through the glycolytic pathway 
changed? (Hint: Let PFK maximal activity = 10, FBPase maximal 
activity = 1; calculate the relative activities of the two enzymes at 
low [AMP] and at high [AMP]; let J, the flux of F-6-P through the 
substrate cycle under any condition, equal the velocity of the PFK 
reaction minus the velocity of the FBPase reaction.) 


. The Effects of Leptin on Fat Metabolism (Integrates with Chapters 23 


and 24.) Leptin induces synthesis of fatty acid oxidation enzymes and 
uncoupling protein 2 in adipocytes as well as causes inhibition of 
acetyl-CoA carboxylase, resulting in a decline in fatty acid biosynthe- 
sis. This effect on acetyl-CoA carboxylase, as an additional conse- 
quence, enhances fatty acid oxidation. Explain how leptin-induced 
inhibition of acetyl-CoA carboxylase might promote fatty acid 
oxidation. 


Ethanol as a Source of Metabolic Energy (Integrates with Chapters 19 
and 20.) Acetate produced in ethanol metabolism can be transformed 
into acetyl-CoA by the acetyl thiokinase reaction: 


Acetate + ATP + CoASH —> acetyl-CoA + AMP + PP; 


Acetyl-CoA then can enter the citric acid cycle and undergo oxidation 
to two CO;. How many ATP equivalents can be generated in a liver 
cell from the oxidation of one molecule of ethanol to two CO, by this 
route, assuming oxidative phosphorylation is part of the process? (As- 
sume all reactions prior to acetyl-CoA entering the citric acid cycle 
occur outside the mitochondrion.) Per carbon atom, which is a better 
metabolic fuel, ethanol or glucose? That is, how many ATP equiva- 
lents per carbon atom are generated by combustion of glucose versus 
ethanol to CO,? 


ATP Coupling Coefficients and the Thermodynamics of Opposing 
Metabolic Sequences | (Integrates with Chapter 23.) Assuming each 
NADH is worth three ATP, each FADH; is worth two ATP, and each 
NADPH is worth four ATP: How many ATP equivalents are 
produced when one molecule of palmitoyl-CoA is oxidized to eight 
molecules of acetyl-CoA by the fatty acid B-oxidation pathway? How 
many ATP equivalents are consumed when eight molecules of 
acetyl-CoA are transformed into one molecule of palmitoyl-CoA by 
the fatty acid biosynthetic pathway? Can both of these metabolic 
sequences be metabolically favorable at the same time if AG for ATP 
synthesis is +50 kJ/mol? 


ATP Coupling Coefficients and the Thermodynamics of Opposing 
Metabolic Sequences II (Integrates with Chapters 18-21.) If each 
NADH is worth three ATP, each FADH; is worth two ATP, and each 
NADPH is worth four ATP, calculate the equilibrium constant for 
cellular respiration, assuming synthesis of each ATP costs 50 kJ/mol 
of energy. Calculate the equilibrium constant for CO, fixation under 
the same conditions, except here ATP will be hydrolyzed to ADP + P; 
with the release of 50 kJ/mol. Comment on whether these reactions 
are thermodynamically favorable under such conditions. 


ATP as Regulator (Integrates with Chapter 15) Discuss at least three 
ways in which ATP acts, directly or indirectly, as a potent regulator of 
enzymes or proteins. 


Acetylation of Lysine and DNA/Histone Interactions Sirtuin catalyzes 
the removal of acetyl groups from lysine residues of histone proteins 
in the nucleosomes. This allows the nucleosomes to interact more 
strongly with DNA, limiting transcription. Explain how the deacety- 
lation of lysine residues on histone proteins leads to stronger 
interactions between the histones and DNA. 


Metabolic Inhibitors as a Diabetes Treatment Strategy (Integrates with 
Chapter 22.) In type 2 diabetics, glucose production in the liver is not 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


appropriately regulated, so glucose is overproduced. One strategy to 
treat this disease focuses on the development of drugs targeted against 
regulated steps in glycogenolysis and gluconeogenesis, the pathways 
by which liver produces glucose for release into the blood. Which 
enzymes would you select for as potential targets for such drugs? 


Treatment of Diabetes Diabetes is characterized by chronic elevated 
blood levels of glucose due to lack of insulin (for type 1 diabetes) or 
unresponsiveness to insulin (for type 2 diabetes). Independent of the 
type of diabetes, lifestyle factors like diet control and regular exercise 
can help patients to manage the condition. Explain how regular 
exercise can help a patient manage blood glucose levels. (Hint: See 
section 27.4b.) 


Drug Targets to Regulate Eating Behavior As chief scientist for drug 
development at PhatFarmaceuticals, Inc., you want to create a series 
of new weight-loss drugs. You have a grand plan to design drugs that 
might limit production of some hormones or promote the production 
of others. Which hormones are on your “limit production” list and 
which are on your “raise levels” list? 


Leptin Injections as an Obesity Therapy The existence of leptin was 
revealed when the ob/ob genetically obese strain of mice was 
discovered. These mice have a defective leptin gene. Predict the effects 
of daily leptin injections into ob/ob mice on food intake, fatty acid 
oxidation, and body weight. Similar clinical trials have been conduct- 
ed on humans, with limited success. Suggest a reason why this therapy 
might not be a miracle cure for overweight individuals. 


Hormones That Regulate Eating Behavior Would it be appropriate to 
call neuropeptide Y (NPY) the obesity-promoting hormone? What 
would be the phenotype of a mouse whose melanocortin-producing 
neurons failed to produce melanocortin? What would be the 
phenotype of a mouse lacking a functional MC3R gene? What 
would be the phenotype of a mouse lacking a functional leptin 
receptor gene? 


Consequences of Alcohol Consumption on the NAD* /NADH 

Ratio The Human Biochemistry box, “The Metabolic Effects 

of Alcohol Consumption,” points out that ethanol is metabolized 
to acetate in the liver by alcohol dehydrogenase and aldehyde 
dehydrogenase: 


CH;CH,0H + NAD* == CH;CHO + NADH + H+ 
CH;CHO + NAD* + HO == CH;COO- + NADH + 2H* 


These reactions alter the NAD*/NADH ratio in liver cells. From your 
knowledge of glycolysis, gluconeogenesis, and fatty acid oxidation, 
what might be the effect of an altered NAD*/NADH ratio on these 
pathways? What is the basis of this effect? 


The Phenotype of a T172D Mutation in AMPK A T172D mutant of 
AMPK is locked in a permanently active state. Explain. 


Malonyl-CoA as a Key Indicator of Nutrient Availability 


a. Some scientists support the malonyl-CoA hypothesis, which 
suggests that malonyl-CoA is a key indicator of nutrient 
availability and the brain uses its abundance to assess whole- 
body energy homeostasis. Others have pointed out that malonyl- 
CoA is a significant inhibitor of carnitine acyltransferase-1 (see 
Figure 24.16). Thus, malonyl-CoA may be influencing the levels 
of another metabolite whose concentration is more important as 
a signal of energy status. What metabolite might that be? 

b. Another test of the malonyl-CoA hypothesis was conducted 
through the creation of a transgenic strain of mice that lacked 
functional hypothalamic fatty acid synthase (see Chapter 24). 
Predict the effect of this genetic modification on cellular 
malonyl-CoA levels in the hypothalamus, the eating behavior of 
these transgenic mice, their body fat content, and their physical 
activity levels. Defend your predictions. 
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23. Confirming the Function of Leptin 

a. Leptin is an anorexic (appetite-suppressing) protein. A congeni- 
tally obese strain of mice (ob/ob mice) lacks both copies of the 
gene encoding leptin. Propose an experiment to confirm the 
function of leptin. 

b. A second strain of obese mice (db/db mice) produces leptin in 
abundance but fails to respond to it. Assuming the db mutation 
leads to loss of function in a protein, what protein is likely to be 
nonfunctional or absent? How might you test your idea? 


24. Preferred Metabolic Fuels Consult Figure 27.6, and answer the 
following questions: Which organs use both fatty acids and glucose as 
a fuel in the well-fed state, which rely mostly on glucose, which rely 
mostly on fatty acids, which one never uses fatty acids, and which one 
produces lactate. 


25. R versus U Enzymes in Response to Energy Charge Figure 27.3 
illustrates the response of R (ATP-regenerating) and U (ATP-using) 
enzymes to energy charge. 

a. Would hexokinase be an R enzyme or a U enzyme? Would 
glutamine-PRPP amidotransferase, the second enzyme in purine 
biosynthesis, be an R enzyme or a U enzyme? 
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b. If energy charge = 0.5: Is the activity of hexokinase high or low? 
Is ribose-5-P pyrophosphokinase activity high or low? 

c. If energy charge = 0.95: Is the activity of hexokinase high or 
low? Is ribose-5-P pyrophosphokinase activity high or low? 


PSAFE Problem 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 

Examine the PSAFE for resistin, and describe the formation of the 
resistin hexamer. 


Think-Pair-Share Question 


Consider the regulation of energy metabolism in the organs shown in 
Figure 27.8. 

Predict which arrows in this figure would be activated or inhibited by: 
insulin, glucagon, leptin, AMPK, or epinephrine. Are there any differ- 
ences in regulation between the organs shown? Refer to any previous 
chapter to justify your answer. 
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DNA Metabolism: Replication, 
Recombination, and Repair 


DNA is the physical repository of genetic information and the material of heredity that is passed 
on to progeny. How is this genetic information in the form of DNA replicated, how does 
rearrangement of this information occur, and how is its integrity maintained in the face of damage? 


Hu: which we can define generally as the tendency of an organism to possess the 
characteristics of its parent(s), is clearly evident throughout nature and since the 
dawn of history has served to justify the classification of organisms according to shared 
similarities. The basis of heredity, however, was a mystery. Early in the twentieth century, 
geneticists demonstrated that genes, the elements or units carrying and transferring 
inherited characteristics from parent to offspring, are contained within the nuclei of cells 
in association with the chromosomes. Yet the chemical identity of genes remained un- 
known, and genetics was an abstract science. Even the realization that chromosomes are 
composed of nucleic acids and proteins did little to define the molecular nature of 
the gene because, at the time, no one understood either of these substances. 

The material of heredity must have certain properties. It must be very stable so that 
genetic information can be stored in it and transmitted countless times to subsequent 
generations. It must be capable of precise copying or replication so that its information 
is not lost or altered. And, although stable, it must also be subject to change to account, 
in the short term, for the appearance of mutant forms and, in the long term, for evolu- 
tion. DNA is the material of heredity. 


DNA Metabolism 


The chapters that precede this one have reviewed the major pathways of intermediary 
metabolism, pointing out the synthesis and breakdown of the major classes of cellular 
molecules. When we consider DNA from the perspective of metabolism, its role as the 
hereditary substance focuses our inquiry. While DNA catabolism contributes little to 
energy metabolism in most cells, DNA synthesis is a central aspect of cellular existence. 
New DNA molecules are synthesized by copying existing DNA in a manner that 
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Part IV 
Information Transfer 


Heredity 


I am the family face; 

Flesh perishes, I live on, 
Projecting trait and trace 
Through time to times anon, 
And leaping from place to place 
Over oblivion. 


The years-heired feature that can 
In curve and voice and eye 
Despise the human span 

Of durance—that is I; 

The eternal thing in man, 

That heeds no call to die. 


Thomas Hardy 
(in Moments of Vision and 
Miscellaneous Verses, 1917) 


< An illustration of DNA, the substance of heredity. 


28.8 
28.9 


How Is DNA Replicated? 

What Are the Functions of DNA 
Polymerases? 

Why Are There So Many DNA 
Polymerases? 

How Is DNA Replicated in 
Eukaryotic Cells? 

How Are the Ends of Chromosomes 
Replicated? 

How Are RNA Genomes Replicated? 
How Is the Genetic Information 
Rearranged by Genetic 
Recombination? 

Can DNA Be Repaired? 


What Is the Molecular Basis of 
Mutation? 


28.10 Can Genetic Recombination Create 


Protein Diversity? 
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A template is something whose edge is shaped in 

a particular way so that it can serve as a guide in 
making a similar object with a corresponding contour. 
In DNA replication, each of the original strands acts as 
a template by specifying the order of nucleotides in the 
newly synthesized strand according to the base-pairing 
rules of Watson and Crick. 


Old New New Old 
Emerging progeny DNA 


Figure 28.1 DNA replication: Strand separation 
followed by the copying of each strand. 


reproduces its information content through a process called DNA replication. As the 
genetic material, DNA persists over the lifetime of the cell and appears essentially un- 
altered in subsequent generations. Occasionally, its information is shuffled as regions 
of DNA are exchanged between chromosomes, a process known as recombination. 
Recombination creates genetic variation upon which evolution can act. As a chemical, 
DNA is susceptible to reactions that modify its bases or disrupt its sugar-phosphate 
backbone. Because its effective lifetime exceeds the lifetime of cells, DNA repair is nec- 
essary to preserve the fidelity of its genetic information. This chapter addresses these 
dominant features of DNA metabolism: replication, recombination, and repair. 


28.1 | How Is DNA Replicated? 


Transfer of genetic information from generation to generation requires the faithful 
reproduction of the parental DNA. DNA reproduction produces two identical copies of 
the original DNA double helix through DNA replication. The mechanism for DNA repli- 
cation is strand separation followed by the copying of each strand. In the process, each 
separated strand acts as a template for the synthesis of a new complementary strand whose 
nucleotide sequence is fixed by the base-pairing rules proposed by Watson and Crick (see 
Chapter 10). Strand separation is achieved by unwinding the double helix (Figure 28.1). 
Base pairing then dictates the proper sequence of nucleotide addition to achieve an 
accurate replication of each original strand. Thus, each original strand ends up paired with 
a new complementary partner, and two identical double-stranded DNA molecules are 
formed from one. This mode of DNA replication is referred to as semiconservative because 
one of the two original strands is conserved in each of the two progeny molecules. 


28.1a DNA Replication Is Bidirectional 


Replication of DNA molecules begins at one or more specific regions called the 
origin(s) of replication and, excepting certain bacteriophage chromosomes and plas- 
mids, proceeds in both directions from this origin (Figure 28.2). For example, replica- 
tion of E. coli DNA begins at oriC, a unique 245-bp chromosomal site that contains 
11 GATC tetranucleotide sequences along its length. From oriC, replication advances 
in both directions around the circular chromosome. That is, bidirectional replication 
involves two replication forks that move in opposite directions. Bidirectional replica- 
tion predicts that, if radioactively labeled nucleotides are provided as substrates for 
new DNA synthesis, both replication forks will become radioactively labeled. The 
experiment illustrated in Figure 28.2(b) confirms this prediction. 


28.1b Replication Requires Unwinding of the DNA Helix 


Semiconservative replication depends on unwinding the DNA double helix to expose 
single-stranded templates to polymerase action. For a double helix to unwind, it must 
either rotate about its axis (while the ends of its strands are held fixed), or positive 
supercoils must be introduced, one for each turn of the helix unwound (see Chapter 
11). If the chromosome is circular, as in E. coli, only the latter alternative is possible. 
Because DNA replication in E. coli proceeds at a rate approaching 1000 nucleotides 
per second and there are about 10 bp per helical turn, the chromosome would accu- 
mulate 100 positive supercoils per second! In effect, the DNA would become too 
tightly supercoiled to allow unwinding of the strands. 

DNA gyrase, a type II topoisomerase, acts to overcome the torsional stress imposed 
upon unwinding; DNA gyrase introduces negative supercoils at the expense of ATP hydro- 
lysis. The unwinding reaction is driven by helicases (see Section 16.4), a class of proteins 
that catalyze the ATP-dependent unwinding of DNA double helices. Unlike topoisomer- 
ases that alter the linking number of dsDNA through phosphodiester bond breakage and 
reunion (see Chapter 11), helicases simply disrupt the base-pairing interactions that hold 
the two strands of duplex DNA together. A helicase molecule requires a single-stranded 
region for binding. It then moves along the single strand, unwinding the double-stranded 
DNA in an ATP-dependent process. SSB (single-stranded DNA-binding protein) binds to 
the unwound strands, preventing their re-annealing. At least 10 distinct DNA helicases 
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(a) Labeled DNA 
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Figure 28.2 Bidirectional replication. 

(a) Comparison of labeling during unidirectional 
versus bidirectional replication. (b) An autoradiogram 
of E. coli chromosome replication in the presence 
of radioactive thymidine confirms bidirectional 
replication. The replicating DNA was obtained from 
detergent-treated cells, spread out on a microscope 
slide and dried. See Prescott, D. M., and Kuempel, 
P. L., 1972. Bidirectional Replication of the 
Chromosome in Escherichia coli. Proceedings of the 
National Academy of Sciences, USA 69:2842-2845. 
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Chemistry in History 


Slayton A. Evans Jr 


Dr. Slayton Evans Jr (1943-2001) was a distinguished 
professor of chemistry at the University of North 
Carolina at Chapel Hill (UNC-CH). He rose from 
modest beginnings to become the Kenan Professor 
of Chemistry at UNC-CH and an internationally 
known expert in organophosphorus chemistry. 

Dr. Evans was born in Chicago, but was raised 
in a segregated housing project in Meridian, 
Mississippi. He was fascinated by plants and insects 
at an early age and kept a notebook that recorded 
the characteristics of insects in his collection. He 
paid for his tuition in a segregated Roman Catholic 
grade school by mowing lawns and (in 8" grade) 
working as a janitor at the school. He won both 
academic and athletic scholarships to Tougaloo 
College, where he excelled at chemistry. He did 
graduate work at the Illinois Institute of Technology 


and received the Ph.D. in Chemistry from Case 
Western Reserve University. 

Following postdoctoral work at the University 
of Texas at Arlington and the University of Notre 
Dame, he took a position as the first African- 
American assistant professor of chemistry at 
UNC-CH. Within 10 years, he had risen to the rank 
of Professor of Chemistry and had established a 
research laboratory of graduate and undergraduate 
students and postdocs from around the world. He 
championed the recruitment of minority students 
to UNC-CH and served on and chaired committees 
for the American Chemistry Society, the National 
Institutes of Health and the National Science 
Foundation. Following his death from cancer in 
2001, a lectureship and a research award were 
established in his name. 


involved in different aspects of DNA and RNA metabolism have been found in E. coli 
alone. DnaB is the DNA helicase acting in E. coli DNA replication. DnaB helicase assem- 
bles as a hexameric (œ) doughnut-shaped protein ring, with DNA passing through its hole. 


28.1c DNA Replication Is Semidiscontinuous 


As shown in Figure 28.2, both parental DNA strands are replicated at each advancing 
replication fork. The enzyme that carries out DNA replication is DNA polymerase. 
This enzyme uses single-stranded DNA (ssDNA) as a template and makes a comple- 
mentary strand by polymerizing deoxynucleotides in the order specified by their pair- 
ing with bases in the template. DNA polymerases synthesize DNA only in a 5’->3’ 
direction, reading the antiparallel template strand in a 3'—>5' sense. A dilemma arises: 
How does DNA polymerase copy the parent strand that runs in the 5'—3' direction 
at the replication fork? It turns out that replication is semidiscontinuous (Figure 28.3). 
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(a) 3'-55’ parental strand Figure 28.3 The semidiscontinuous model for DNA replication. Newly 
synthesized DNA is shown as red. (a) Leading- and lagging-strand 
synthesis. Note that the leading strand is synthesized continuously in the 
5'—3' direction as the DNA polymerase reads along the 3'—5' parental 
template for the leading strand in the 3'—5' direction. Since DNA 
polymerases can only move along any template in a 3'—5' direction, the 
other parental strand, the one labeled “5'—>3’” is copied, by necessity, in 


] F a discontinuous fashion to form short, 5'—>3' fragments (lagging strand 
5-53! synthesis). (b) Synthesis of both strands carried out by a dimeric DNA 
parental strand Okazaki fragments Movement of polymerase situated at the replication fork. Because DNA polymerase 
> replication fork must read the template strand in the 3'—5' direction, the 5'—>3' parental 
(b) strand must wrap around in trombone fashion. (c) Bidirectional 


replication, showing how half of each new strand is replicated by leading- 


A SAANEN OO TH strand synthesis and half by lagging-strand synthesis. 
` Leading strand 
3 Lagging strand 
5! d Ila L Li 


Okazaki fragments Movement of 


replication fork 


> 


(c) 


Okazaki fragments 
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Okazaki fragments 


As the DNA helix is unwound during its replication, the 3'—>5' strand (as defined by 
the direction that the replication fork is moving) can be copied continuously by DNA 
polymerase synthesizing in the 5'—>3' direction behind the replication fork. The other 
parental strand is copied only when a sufficient stretch of its sequence has been ex- 
posed for DNA polymerase to read it in the 3'—5' sense. Thus, one parental strand is 
copied continuously to give a newly synthesized copy, called the leading strand, at 
each replication fork. The other parental strand is copied in an intermittent, or dis- 
continuous, mode to yield a set of fragments 1000 to 2000 nucleotides in length, called 
the Okazaki fragments (Figure 28.3a). These fragments are then joined to form an 
intact lagging strand. Because both strands are synthesized in concert by a dimeric 
DNA polymerase situated at the replication fork, the 5'—>3' parental strand must 
wrap around in trombone fashion so that the unit of dimeric DNA polymerase 
replicating it can move along it in the 3’—5’ direction (Figure 28.3b). Because 
DNA replication is bidirectional, each of the two DNA duplexes produced in DNA 
replication contains one old and one new DNA strand, and half of the new strand was 
formed by leading strand synthesis and the other half by lagging strand synthesis. 


28.1d The Biochemical Evidence for Semidiscontinuous DNA Replication 


In 1968, Tsuneko and Reiji Okazaki provided biochemical verification of the semidis- 
continuous pattern of DNA replication just described. The Okazakis exposed a rapidly 
dividing E. coli culture to *H-labeled thymidine for 30 seconds, quickly collected the 
cells, and found that half of the label incorporated into nucleic acid appeared in short 
ssDNA chains just 1000 to 2000 nucleotides in length. (The other half of the radioactiv- 
ity was recovered in very large DNA molecules.) Subsequent experiments demonstrated 
that with time, the newly synthesized short ssDNA Okazaki fragments covalently joined 
to form longer polynucleotide chains, in accord with a semidiscontinuous mode of rep- 
lication. The generality of this mode of replication has been corroborated with electron 
micrographs of DNA undergoing replication in eukaryotic cells. 
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28.le Initiation of DNA Replication 


The initiation of DNA replication must be timed to serve the cell’s metabolic state—that 
is, it must take place when, and only when, the cell is ready to divide. When that time 
comes, the DNA double helix must be melted at origin-of-replication (ori) sites and the 
protein complexes that catalyze DNA replication must be recruited there and as- 
sembled into a functional replication complex, called a replicon. The proteins that bind 
to ori sites and recruit helicases to unwind the DNA double helix are AAA+ ATPases, 
an evolutionarily ancient family of proteins involved in a variety of energy-dependent 
unfolding and remodeling of proteins and nucleic acids (see Section 16.4). In E. coli, the 
oriC DNA sequence is composed of conserved repeats that are recognized by DnaA, 
an AAA+ ATPase replication initiator. In addition, oriC has an easily melted AT-rich 
region known as the DUE (DNA unwinding element) where the double helix is un- 
wound to form the single-stranded replication bubble. The hexameric helicase DnaB is 
recruited to the replication bubble by DnaA with the assistance of DnaC, another 
AAA+ ATPase, and loaded onto the DNA single strand that will become the lagging 
strand during DNA replication. This process of AAA+ ATPase proteins binding at ori- 
gins of replication, DNA melting, and recruitment of helicases is fundamental in all 
cells, whether prokaryotic or eukaryotic. However, the specific mechanisms and the 
proteins that drive them vary across bacteria, archaea, and eukaryotes. 


28.2 | What Are the Functions of DNA Polymerases? 


The enzymes that replicate DNA are called DNA polymerases. All DNA polymer- 
ases, whether from prokaryotic or eukaryotic sources, share the same fundamental 
catalytic mechanism: 


1. The incoming base is selected within the DNA polymerase active site, as deter- 
mined by Watson—Crick geometric interactions with the corresponding base in the 
template strand. 

2. Chain growth is in the 5’-s3’ direction and is antiparallel to the template strand. 

3. DNA polymerases cannot initiate DNA synthesis de novo—all require a primer 
oligonucleotide with a free 3’-OH to build upon. 


The material discussed in this section applies to all DNA polymerases, prokaryotic 
and eukaryotic. 


28.2a Biochemical Characterization of DNA Polymerases 


DNA polymerases can catalyze the synthesis of DNA if provided with all four deoxy- 
nucleoside-5’-triphosphates (dATP, dTTP, dCTP, dGTP), a template DNA strand to 
copy, and a primer. A primer is essential because DNA polymerases can elongate only 
preexisting polynucleotide chains; they cannot join two deoxyribonucleoside-S’- 
phosphates together to make the initial phosphodiester bond. The primer base pairs 
with the template DNA, forming a short, double-stranded region. This primer must 
possess a free 3’-OH end to which an incoming deoxynucleoside monophosphate is 
added. In vivo, a short RNA strand is invariably the primer; it is synthesized by 
a DNA-dependent RNA polymerase activity known as primase. Like other RNA 
polymerases, primase can synthesize oligonucleotide chains de novo. Unlike DNA 
polymerases, it does not require an oligonucleotide precursor. 

DNA replication begins when DNA polymerase binds at the 3’-end of the primer. 
The appropriate one of the four dNTPs is selected as substrate, pyrophosphate (PP;) is 
released, and the dNMP is linked to the 3'-OH of the primer chain through formation 
of a phosphoester bond (Figure 28.4). The deoxynucleotide selected as substrate is cho- 
sen through its geometric fit with the template base to form a Watson—Crick base pair. 
As DNA polymerase catalyzes the successive addition of deoxynucleotide units to the 
3’-end of the primer, the chain is elongated in the 5’—3’ direction, forming a poly- 
nucleotide sequence that is antiparallel and complementary to the template. Cells 
contain a variety of different DNA polymerases (see Table 28.1) and they differ in how 
many rounds of deoxynucleotide addition they can catalyze before dissociating from the 
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Figure 28.4 The chain elongation reaction catalyzed 
by DNA polymerase. The 3’-OH of the terminal 
nucleotide in the primer strand carries out a 
nucleophilic attack on the a-phosphory! group of 
the incoming dNTP to form a phosphoester bond, 
and PP; is released. The subsequent hydrolysis of PP; 
by inorganic pyrophosphatase renders the reaction 
effectively irreversible. 


TO 


| Table 23.1 | 7) Properties of the DNA Polymerases of E. coli 


Property Pol | Pol Il Pol IIl (core)* 

Mass (kDa) 103 88 130(a), 27.5(e), 8.6(0) 
Molecules/cell 400 40 
Turnover number* 20 40 1000 
Polymerization 5’—>3’ Yes Yes Yes 
Exonuclease 3’—>5' Yes Yes Yes 
Exonuclease 5’—>3' Yes No No 


*q-, e-, and 0-subunits. 
tNucleotides polymerized at 37°C/second/molecule of enzyme. 


template strand. The degree to which a particular DNA polymerase remains associated 
with the template is referred to as its processivity. Some DNA polymerases, such as 
those whose primary function is DNA repair rather than DNA replication, are only 
modestly processive, synthesizing a DNA strand only 3 to 200 bases long before falling 
off. On the other hand, replicative DNA polymerases are extraordinarily processive. The 
E. coli replicative DNA polymerase, known as DNA polymerase III holoenzyme, can 
replicate an entire strand of the E. coli chromosome (4.6 megabases) without falling off! 


28.2b E. coli Cells Have Several Different DNA Polymerases 


Table 28.1 compares the properties of the principal DNA polymerases in E. coli. 
These enzymes are nicknamed pol and numbered I through V in order of their 
discovery. DNA polymerases I, II, and V function principally in DNA repair; DNA 
polymerase III is the chief DNA-replicating enzyme of E. coli. Only 40 or so copies of 
this enzyme are present per cell. 


28.2c E. coli DNA Polymerase III Holoenzyme Replicates 
the E. coli Chromosome 


In its holoenzyme form, DNA polymerase III is the enzyme responsible for replication of 
the E. coli chromosome. The simplest form of DNA polymerase HI showing any DNA- 
synthesizing activity in vitro, core DNA polymerase III, is 165 kDa in size and consists of 
three polypeptides: a (130 kDa), e (27.5 kDa), and 0 (8.6 kDa). In vivo, core DNA poly- 
merase III functions as part of a multisubunit complex, the DNA polymerase III holo- 
enzyme, which is composed of 10 different kinds of subunits (Table 28.2). The various 
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| Table 28.2 | kW) Subunits of E. coli DNA Polymerase IIl Holoenzyme 


Subunit Mass (kDa) Function 
a 130 Polymerase 
è 245 3'-Exonuclease 
0 8.6 e-Subunit stabilization 
T 71 DNA template binding; core enzyme dimerization 
B 41 Sliding clamp, processivity 
y 47.5 Part of the y-complex* 
6 39 Part of the y-complex* 
ô' 37 Part of the y-complex* 
X 17 Interaction with SSB and the y-complex 
Y 15 Interaction with y and the y-complex 


*Subunits 7, y, 6, 6’, x, and y form the so-called y-complex responsible for adding B-subunits (the sliding clamp) to 
DNA and anchoring the sliding clamp to the two core DNA polymerase III structures. The y-complex is referred to as 
the clamp loader. 


auxiliary subunits increase both the polymerase activity of the core enzyme and its pro- 
cessivity. DNA polymerase III holoenzyme synthesizes DNA strands at a speed of nearly 
1 kb/sec. DNA polymerase HI holoenzyme is organized in the following way: Two core 
(ae0) DNA polymerase II units and one y-complex are attached to DnaB helicase via two 
z-subunits to form a structure known as DNA polymerase III*. (A y-complex consists of 
one copy each of the y-subunit, the 6-subunit, and the 6’-subunit.) In turn, each core poly- 
merase within DNA polymerase III* binds to a B-subunit dimer to create DNA poly- 
merase III holoenzyme, a 17-subunit ((a&0),2B,7yd6'yW) complex (Figure 28.5). The 
17-subunit DNA polymerase III holoenzyme structure is sometimes referred to as a repli- 
some. The y-complex is responsible for assembly of the DNA polymerase III holoenzyme 
complex onto DNA. The y-complex of the holoenzyme acts as a clamp loader by catalyzing 
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Figure 28.5 DNA polymerase III holoenzyme 

is a dimeric polymerase. One unit of polymerase 
synthesizes the leading strand, and the 

other synthesizes the lagging strand. Because 
DNA synthesis always proceeds in the 5'—3' 
direction as the template strand is read in the 
3'’—>5' direction, lagging-strand synthesis must 
take place on a looped-out template. This single- 
stranded loop is more than 1000 bases long. 
Lagging-strand synthesis requires repeated 
priming. Primase bound to the DnaB helicase 
carries out this function, periodically forming new 
RNA primers on the lagging strand. All single- 
stranded regions of DNA are coated with SSB 
(not shown). 
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Figure 28.6 Structure of the E. coli B>-sliding 
clamp. (a) Ribbon diagram of the B-subunit dimer 
of the DNA polymerase III holoenzyme on 
B-DNA, viewed down the axis of the DNA. One 
monomer of the B-subunit dimer is colored blue 
and the other yellow. The centrally located DNA 
is multicolored. (b) Space-filling model of the 
B-subunit dimer on B-DNA. The hole formed by 
the B-subunits (diameter ~ 3.5 nm) is large enough 
to easily accommodate the product of DNA 
polymerase III holoenzyme action, double- 
stranded DNA (diameter ~ 2.5 nm) with no steric 
repulsion (PDB id = 2POL). The rest of 
polymerase III holoenzyme (core polymerase + 
y-complex) associates with this sliding clamp to 
form the replicative polymerase (not shown). 


the ATP-dependent transfer of a pair of B-subunits to each strand of the DNA template. 
Each B-subunit dimer forms a closed ring around a DNA strand and acts as a tight clamp 
that can slide along the DNA (Figure 28.6). Each B)-sliding clamp tethers a core poly- 
merase to the template, accounting for the great processivity of the DNA polymerase 
holoenzyme. This complex can replicate an entire strand of the E. coli genome (more than 
4.6 megabases) without dissociating. Compare this to the processivity of DNA poly- 
merase I, which is only 20 bases! 

The core polymerase synthesizing the lagging strand must release from the DNA 
template when synthesis of an Okazaki fragment is completed and rejoin the template 
at the next RNA primer to begin synthesis of the next Okazaki fragment. The r-subunit 
serves as a “processivity switch” that accomplishes this purpose. The r-subunit is usually 
“off” and is turned “on” only on the lagging strand and only when synthesis of an 
Okazaki fragment is completed. When activated, 7 ejects the B>-sliding clamp bound to 
the lagging strand core polymerase. Almost immediately, the lagging strand core poly- 
merase is reloaded onto a new B>-sliding clamp at the 3’-end of next RNA primer, and 
synthesis of the next Okazaki fragment commences. 


PDB id = 4ESV 
DnaB bound to ssDNA 28.2d A DNA Polymerase IlI Holoenzyme Sits at Each Replication Fork 


Figure 28.7 and Table 28.3 present a snapshot of the enzymatic apparatus assembled 
at a replication fork. DNA gyrase (topoisomerase) and DnaB helicase unwind the 


Dimeric replicative 
DNA polymerase 


. B-Subunit 
Newly synthesized “sliding clamp” 
leading strand Leading strand 


template 


Okazaki 
DNA polymerase I fragment 


Previous Okazaki 
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DNA ligase Primer 


Primer Primer 
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Figure 28.7 General features of a replication fork. The DNA duplex is unwound by the action of 
DNA gyrase and helicase, and the single strands are coated with SSB (ssDNA-binding protein). 
Primase periodically primes synthesis on the lagging strand. Each half of the dimeric replicative 
polymerase is a core polymerase bound to its template strand by a B-subunit sliding clamp. DNA 
polymerase I and DNA ligase act downstream on the lagging strand to rrmove RNA primers, replace 
them with DNA, and ligate the Okazaki fragments. 
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28.2 What Are the Functions of DNA Polymerases? 


| Table 28.3 | It ÆJ Proteins Involved in DNA Replication in E. coli 


Protein Function 

DNA gyrase Unwinding DNA 

SSB Single-stranded DNA binding 

DnaA Initiation factor; origin-binding protein 
DnaB 5’—>3’ helicase (DNA unwinding) 
DnaC DnaB chaperone; loading DnaB on DNA 
Primase (DnaG) Synthesis of RNA primer 

DNA polymerase III holoenzyme Elongation (DNA synthesis) 

DNA polymerase I Excises RNA primer, fills in with DNA 
DNA ligase Covalently links Okazaki fragments 
Tus Termination 


DNA double helix, and the unwound, single-stranded regions of DNA are maintained 
through interaction with SSB. SSB removes any secondary structure from ssDNA 
and prevents endonucleolytic cleavage of the DNA. Primase (DnaG) synthesizes an 
RNA primer on the lagging strand; the leading strand, which needs priming only 
once, was primed when replication was initiated. The lagging strand template is 
looped around, and each replicative DNA polymerase moves 5’—>3’ relative to its 
strand, copying template and synthesizing a new DNA strand. Each replicative poly- 
merase is tethered to the DNA by its B-subunit sliding clamp. The DNA polymerase 
HI y-complex periodically unclamps and then reclamps B-subunits on the lagging 
strand as the primer for each new Okazaki fragment is encountered. Downstream on 
the lagging strand, DNA polymerase I excises the RNA primer and replaces it with 
DNA, and DNA ligase seals the remaining nick. 


28.2e DNA Polymerase | Removes the RNA Primers 
and Fills in the Gaps 


E. coli DNA polymerase I, the first DNA polymerase discovered (in 1957 by Arthur 
Kornberg), acts to remove the RNA primers and replace them with DNA. DNA poly- 
merase I also acts in DNA repair. It fulfills these roles because, in addition to its 
5S'—3' polymerase activity, E. coli DNA polymerase I has two other catalytic func- 
tions: a 3’—95’ exonuclease (3’-exonuclease) activity and a 5’—3’ exonuclease 
(5’-exonuclease) activity. The three distinct catalytic activities of DNA polymerase I 
reside in separate active sites in the enzyme. The 5’-exonuclease activity is not a com- 
monly found property of DNA polymerases. This 5’-exonuclease activity is the mech- 
anism for removal of the RNA primers. 


28.2f DNA Polymerase | Also Plays a Role in 
DNA Double-Strand Break Repair 


DNA polymerase I can bind to double-stranded DNA where there is a break or gap in 
one of the strands. It then uses its 5’-exonuclease activity to remove mononucleotides 
and oligonucleotides, degrading the broken strand from its exposed 5’-end as it moves 
in a 5'—3 direction. One function of the 5’-exonuclease is to remove distorted 
(mispaired) segments lying in the path of an advancing polymerase. Indeed, the bio- 
logical roles of DNA polymerase I depend on its ability to bind at nicks (single- 
stranded breaks) in dsDNA and move in the 5'—3' direction, removing successive 
nucleotides with its 5'-exonucleolytic activity. (This overall process is known as nick 
translation because the nick is translated [that is, moved] down the DNA.) This 
5'-exonuclease activity plays an important role in primer removal during DNA repli- 
cation, as we’ve noted. The second E. coli DNA polymerase, DNA polymerase IT, is a 
DNA repair enzyme expressed in response to DNA damage. 
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28.2g DNA Ligase Seals the Nicks Between Okazaki Fragments 


DNA ligase seals nicks in double-stranded DNA where a 3’-OH and a 5’-phosphate are 
juxtaposed (see Chapter 12). This enzyme is responsible for joining Okazaki fragments 
together to make the lagging strand a covalently contiguous polynucleotide chain. 


Template Mismatched 


28.2h DNA Polymerase Is Its Own Proofreader 


DNA polymerases possess a 3’-exonuclease activity, a function that enhances the 
accuracy of DNA replication. This 3’-exonuclease, a property of the e-subunit, 
removes nucleotides from the 3’-end of the growing chain (Figure 28.8). By this means, 
the 3’-exonuclease negates the action of the polymerase. Its purpose, however, is to 
remove incorrect (mismatched) bases. Although the 3’-exonuclease works slowly com- 
pared to the polymerase, the polymerase cannot elongate an improperly base-paired 
primer terminus. Thus, the relatively slow 3’-exonuclease has time to act and remove 
the mispaired nucleotide. Therefore, the polymerase active site is a proofreader, and 
the 3’-exonuclease activity is an editor. This check on the accuracy of base pairing 
enhances the overall precision of the process. 


Exonuclease 
hydrolysis site 


DNA 
polymerase I 


Figure 28.8 The 3'—5' exonuclease activity of 
DNA polymerase I removes nucleotides from the 
3’-end of the growing DNA chain. The newly 
synthesized strand is shown as a purple ribbon. 


28.2i DNA Replication Terminates at the Ter Region 


Located diametrically opposite from oriC on the E. coli circular chromosome is a ter- 
minus region, the Ter, or t, locus. The oppositely moving replication forks meet here, 
and replication is terminated. The Ter region contains a number of short DNA se- 
quences, with a consensus core element 5’-GTGTGTTGT. These Ter sequences act as 
terminators; clusters of three or four Ter sequences are organized into two sets inversely 
oriented with respect to one another. One set blocks the clockwise-moving replication 
fork, and its inverted counterpart blocks the counterclockwise-moving replication fork. 
Termination requires binding of a specific replication termination protein, Tus protein, 
to Ter. Tus protein is a contrahelicase. That is, Tus protein prevents the DNA duplex 
from unwinding by blocking progression of the replication fork and inhibiting the ATP- 
dependent DnaB helicase activity. Final synthesis of both duplexes is completed. 


A Deeper Look 


A Mechanism for All Polymerases 


Thomas A. Steitz suggested that biosynthesis of nucleic acids pro- 
ceeds by an enzymatic mechanism that is universal among all DNA 
and RNA polymerases. His suggestion is based on structural studies 
indicating that DNA polymerases use a two-metal-ion mechanism 
to catalyze nucleotide addition during elongation of a growing poly- 
nucleotide chain (see accompanying figure). The incoming nucleotide 
has two Mg”* ions (shown here as Me”*, a generic representation for Prey o 

divalent cations) coordinated to its phosphate groups. These metal 
ions interact with two aspartate residues that are highly conserved Q -0 ome OH 

in DNA (and RNA) polymerases. These residues in phage T7 DNA Si ü ö gO 
polymerase are D705 and D882. One metal ion, designated A, inter- C i I NU Tie a 
acts with the O atom of the free 3’-OH group on the polynucleotide S we, ; 
chain, lowering its affinity for its hydrogen. A water molecule in the 5 S o-0x o ľ : -e 
active site (not shown) likely assists in this deprotonation. This interac- @----- (Me) 2) KO g \ 
tion promotes nucleophilic attack of the 3'-O on the phosphorus atom A, D882 US SL? aa 
in the a-phosphate of the incoming nucleotide. The second metal ion > i o 
(B in the figure) assists departure of the product pyrophosphate group 
from the incoming nucleotide. Together, the two metal ions stabilize 
the pentacovalent transition state on the a-phosphorus atom. 
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28.3 Why Are There So Many DNA Polymerases? 


28.2) DNA Polymerases Are Immobilized in Replication Factories 


Chromosomal DNA replication begins when two replisomes are assembled at ori 
sequences to form two replication forks (one replisome per fork). One replisome is 
depicted in Figure 28.7. The traditional description of what takes place next is a pro- 
cess whereby the replication forks diverge in opposite directions as DNA replication 
proceeds. In this view, the replicative DNA polymerases are pictured as locomotives 
moving in opposite directions along DNA railroad tracks (as might be imagined for 
Figure 28.2). In actuality, in vivo studies reveal that the two replication forks remain 
linked together. Such co-joined replisomes (one for each replication fork) represent a 
structure aptly described as a replication factory (Figure 28.9). The DNA is fed through 
the DNA polymerases within the replication factory, much like tape is fed past the 
heads of a tape player, with all four newly replicated strands of DNA looping out from 
this fixed structure (Figure 28.9). Note that each newly replicated strand is paired 
with a strand of parental DNA. Replication factories, also called replicons, are thought 
to be a feature of chromosomal DNA replication across all domains of life. 


28.3 | Why Are There So Many DNA Polymerases? 


Table 28.1 listed the many DNA polymerases found in Æ. coli cells. Their existence 
raises the question: Why do cells have multiple DNA polymerases? 


28.3a Cells Have Different Versions of DNA Polymerase, 
Each for a Particular Purpose 


A host of different DNA polymerases have been discovered, and even simple bacteria 
such as E. coli have seven distinct DNA polymerases. Yeast (Saccharomyces cerevi- 
siae) has 9, and 16 have been identified in humans. Based on sequence homology, 
polymerases can be grouped into seven different families. The families differ in terms 
of the biological function served by family members. For example, family A includes 
DNA polymerases principally involved in DNA repair (E. coli DNA polymerase I, 
mammalian DNA-repair DNA polymerase 0, and mitochondrial DNA polymerase y, 
which carries out mitochondrial DNA replication as well as repair). Family B polymer- 
ases include the eukaryotic DNA polymerases predominantly involved in replication 
of chromosomal DNA (the mammalian replicative DNA polymerases a, ô, and £, as 
well as the eukaryotic DNA-repair DNA polymerase ¢ and E. coli DNA polymerase II). 
Family C has the bacterial DNA-replicating enzymes (including E. coli DNA poly- 
merase III). Members of families X and Y act in DNA repair pathways. Family X 
members include the eukaryotic DNA polymerases $, o, A, and u, and others. Family 
Y comprises the translesion DNA polymerases, low-fidelity enzymes that allow the 
replication fork to bypass chemically damaged regions of DNA and continue on. 
Family Y includes eukaryotic DNA polymerases t, k, and 7 (7 is also known as Revl). 
Family RT designates the DNA polymerases of retroviruses (such as HIV) and the 
telomerases that renew the ends of eukaryotic chromosomes. RT polymerases are 
novel in that they use RNA as the template (see Section 28.6). 


28.3b The Common Architecture of DNA Polymerases 


Despite sequence variation, the various DNA polymerase structures more or less follow a 
common architectural pattern that is reminiscent of a right hand, with distinct struct- 
ural domains referred to as fingers, palm, and thumb (Figure 28.10). The active site of 
the polymerase, where deoxynucleotide addition to the growing chain is catalyzed, is lo- 
cated in the crevice within the palm domain that lies between the fingers and thumb do- 
mains. The fingers domain acts in deoxynucleotide recognition and binding, and the 
thumb is responsible for DNA binding, in the following manner: When the DNA 
polymerase binds to template-primer duplex DNA, its thumb domain closes around the 
DNA so that the DNA is bound in a groove formed by the thumb and palm. A dNTP 
substrate is then selected by the polymerase, and dNTP binding induces a conformational 
change in the fingers, which now rotate toward the polymerase active site in the palm. 
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Figure 28.9 Model of the replication factory. The 
replication factory (or replicon) is composed of two 
replisomes that extrude loops of newly synthesized 
DNA as the parental DNA duplex is fed in. The 
replication factory is indicated by the purple-shaded 
region. The replisomes (orange) define the replication 
fork and the movement of parental and newly 
synthesized dsDNA is indicated by arrows. The green 
bands represent double-stranded DNA, where one 
strand is parental and the other newly synthesized. 
The double-stranded DNA represented by the green 
band looping out above the replication factory is one 
of the two progeny dsDNA molecules produced by 
the replication factory from the parental DNA. The 
green band looping out below the replication factory 
is the other progeny dsDNA molecule. Adapted from 
Figure | in Li, H., Yao, N. Y., and O’Donnell, M. E., 
2020. Anatomy of a DNA replication factory. 
Biochemical Society Transactions 48:2769-2778. 
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Figure 28.10 A structural paradigm for DNA 
polymerases, bacteriophage RB69 DNA polymerase. 
Ternary complex formed between the RB69 DNA 
polymerase, DNA, and dNTP. The N-terminal 


domain of the protein (residues 1-108 and 340-382) is 


in yellow, the exonuclease domain (residues 109-339) 
is in red, the palm (residues 383-468 and 573-729) is 
magenta, the fingers (residues 469-572) are blue, and 
the thumb (residues 730-903) is green. The DNA is 
given in stick representation, with the primer in gold 
and the template in b/ue-purple. A dNTP substrate 
(red) is shown at the active site, as are the two Ca?* 
ions (light blue spheres that are necessary for the two- 
metal-ion DNA polymerase catalytic mechanism). 
Note also the calcium ion (blue sphere) at the 
exonuclease active site. (PDB id = 11G9. Adapted 


from Figure | in Franklin, M. C., Wang, J., and Steitz, 


T. A., 2001. Structure of the replicating complex of a 
Pol a family DNA polymerase. Cell 105:657—-667.) 
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Figure 28.11 The eukaryotic cell cycle. The stages of 
mitosis and cell division define the M phase (M for 
mitosis). G, (G for gap, not growth) is typically the 
longest part of the cell cycle; G; is characterized by 
rapid growth and metabolic activity. Cells that are 
quiescent, that is, not growing and dividing (such as 
neurons), are said to be in Gy. The S phase is the 
time of DNA synthesis. S is followed by G», a 
relatively short period of growth when the cell 
prepares for cell division. Cell cycle times vary from 
less than 24 hours (rapidly dividing cells such as the 
epithelial cells lining the mouth and gut) to 
hundreds of days. 
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The enzymatic reaction of phosphodiester bond formation now takes place as a dNMP is 
added to the 3’-end of the growing primer strand. Pyrophosphate is released, and the 
polymerase translocates one base farther along the template strand. In essence, all DNA 
polymerases are molecular motors that synthesize DNA, using dNTP substrates to add 
dNMP units to the primer strand, as they move along the template strand, reading its base 
sequence. 


28.4 | How Is DNA Replicated in Eukaryotic Cells? 


DNA replication in eukaryotic cells shows strong parallels with prokaryotic DNA repli- 
cation, but it is vastly more complex. First, eukaryotic DNA is organized into chromo- 
somes that are compartmentalized within the nucleus. Furthermore, these chromosomes 
must be duplicated with high fidelity once (and only once!) each time the cell divides. For 
example, in a dividing human cell, a carefully choreographed replication of 6 billion bp 
of DNA distributed among 46 chromosomes occurs. The events associated with cell 
growth and division in eukaryotic cells fall into a general sequence called the cell cycle. 
The cell cycle has four distinct phases: M, G;, S, and G (Figure 28.11). (Fully differenti- 
ated cells in multicellular organisms typically enter a quiescent, nondividing state after 
G; that is called Go. Such cells may remain in Go indefinitely.) Eukaryotic cells have 
solved the problem of replicating their enormous genomes in the few hours allotted to the 
S phase by initiating DNA replication at multiple origins of replication distributed along 
each chromosome. Eukaryotic origins of replication seem to be defined more by pecu- 
liarities in the local structure of chromatin and DNA than by any sequence context. 
Depending on the organism and cell type, replication origins are DNA regions 0.5 to 2 
kbp in size that occur every 3 to 300 kbp (for example, an average human chromosome 
has several hundred replication origins). Since eukaryotic DNA replication proceeds 
concomitantly throughout the genome, each eukaryotic chromosome must contain 
many replicons. 


28.4a The Cell Cycle Controls the Timing of DNA Replication 


Progression through the cell cycle is regulated through a series of checkpoints that 
control whether the cell continues into the next phase. These checkpoints are situated 
to ensure that a// the necessary steps in each phase of the cycle have been satisfacto- 
rily completed before the next phase is entered. If conditions for advancement to the 
next phase are not met, the cycle is arrested until they are. 
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Checkpoints, Cyclins, and CDKs Checkpoints depend on cyclins and cyclin-dependent 
protein kinases (CDKs). Cyclin is the name given to a class of proteins synthesized at 
one phase of the cell cycle and degraded at another. Thus, cyclins appear and then 
disappear at specific times during the cell cycle. Cyclins are larger than the small CDK 
protein kinase subunits to which they bind. The various CDKs are inactive unless 
complexed with their specific cyclin partners. In turn, these CDKs control events at 
each phase of the cycle by targeting specific proteins for phosphorylation. Destruction 
of the phase-specific cyclin at the end of the phase inactivates the CDK. 


Initiation of Replication DNA replication is the defining characteristic of the S phase of 
the cell cycle. The switch from G1 to S is triggered by phosphorylation events carried 
out by two protein kinases: the Dbf4-dependent kinase DDK (consisting of the Cdc7 
protein kinase and its regulatory subunit Dbf4) and the S-phase cyclin-dependent pro- 
tein kinase S-CDK. Eukaryotic cells initiate DNA replication at multiple origins, and 
two replication forks are established at each origin. Initiation of replication depends on 
the origin recognition complex, or ORC, a protein complex that binds to replication ori- 
gins. Indeed, eukaryotic replication origins are defined as nucleotide sequences that 
bind ORC. Stable maintenance of the eukaryotic genome demands that DNA replica- 
tion occurs only once per cell cycle. This demand is met by dividing initiation of DNA 
replication into two steps: (1) the licensing of replication origins during late M or early 
G1, and (2) the activation of replication at the origins during S phase through the action 
of DDK and S-CDK. 


Licensing of Replication Licensing involves the highly regulated assembly of prerepli- 
cation complexes (pre-RCs) on origins of replication. Early in G1 (just after M), the 
ORC (a heterohexameric complex of Orcl-6) serves as a landing pad for proteins es- 
sential to replication control. Binding of these proteins to ORC establishes a pre-RC, 
but only within this window of opportunity during G1. The crescent-shaped ORC 
complex binds DNA along its concave face, bending and unwinding the DNA in an 
ATP-dependent process. ORC then recruits Cdc6 (in its phosphorylated form, Cdc6p), 
and the MCM proteins. Cdc6p and Cdtl are replication activation proteins. 

The MCM proteins are a family of six related proteins. MCM proteins are also 
known as replication licensing factors because they license, or permit, DNA replication 
to occur. The MCM proteins assemble as hexameric helicases that render the chromo- 
somes competent for replication. Two MCM complexes are active within each origin, 
one for each replication fork. The pre-RC therefore consists of ORC, Cdc6, Cdt1, the 
MCM complexes, and other proteins. 


Assembly of the DNA Replication Factory Construction of the eukaryotic DNA replica- 
tion apparatus begins when S-CDK and DDK phosphorylate initiation factors Sld2 
and Sld3. In effect, the phosphorylations catalyzed by S-CDK and DDK trigger the 
passage of the cell from G1 to S and the activation of replication initiation. The initia- 
tion factor Dpbl1 interacts with Sld2p and Sld3p to form a ternary complex (the Sld2p/ 
Sld3p/Dbp11 complex) which then associates with the origin of replication and recruits 
proteins Cdc45 and the GINS complex. While the Sld2p/Sld3pDpbl1 complex is es- 
sential for the initiation of DNA replication, it is not retained once replication has be- 
gun. In addition, Cdc6 is degraded following replication initiation, thereby precluding 
the possibility for errant replication initiation events until after mitosis, when Cdc6 
accumulates again. 

Cdc45 and GINS bind MCM2-7, completing formation of the CMG (Cdce45-MCM- 
GINS) helicase, the active helicase that unwinds the DNA duplex so that DNA replica- 
tion can proceed. The CMG helicase and the Sld2p/Sld3pDpb1 1complex also establish 
contact with the DNA polymerases that replicate the DNA: DNA polymerase 6 is the 
lagging-strand DNA polymerase and DNA polymerase e is the leading-strand poly- 
merase (see Table 28.4). The Cdc45 protein is a key factor in converting the pre-RC into 
a functional initiation complex; the GINS protein complex binds single-stranded DNA 
and aids in recruiting the polymerase that synthesizes the lagging-strand RNA primers 
(primase). 


How Is DNA Replicated in Eukaryotic Cells? 
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Figure 28.12 The eukaryotic replicon. In this view, 
the two replisomes of the replicon are bound 
together by Ctf4 (pink). Each is extruding newly 
synthesized DNA below as parental DNA duplex is 
fed in from the top. The CMG helicase (blue) 
consists of Cdc45, Mcm2-7 hexamer, and GINS 
complex. Each CMG helicase creates a replication 
fork as it unwinds the parental DNA entering it. As 
the helicase separates the double-stranded parental 
DNA to expose the single strands as templates, 
DNA polymerases carry out leading and lagging 
strand synthesis. Note that the two replisomes share 
the single primase (green), labeled Pol a-primase 
here. As indicated by the black arrows above and 
below it, this single primase alternately serves each 
replication fork by synthesizing primers on each of 
the lagging strands. The eukaryotic sliding clamps 
(red) are trimeric proteins called PCNA. Eukaryotic 
DNA polymerase 6 (yellow) is the lagging-strand 
DNA polymerase; eukaryotic DNA polymerase e is 
the leading-strand DNA polymerase (orange). 
Adapted from Figure 4 in Li, H., Yao, N. Y., and 
O’Donnell, M. E., 2020. Anatomy of a DNA 
replication factory. Biochemical Society Transactions 
48:2769-2778. 


| Table 28.4 | 7") Biochemical Properties of the Principal Human DNA Polymerases 


Subunits 
Polymerase Localization and Function (mass in kDa) Subunit Function 
DNA polymerase a 
Nuclear; initiation of nuclear DNA 180 Catalytic subunit 
replication 68 Protein—protein interactions 
55 Primase 
48 Primase 
DNA polymerase 5 
Nuclear; principal polymerase in lagging- 125 Catalytic subunit 
strand synthesis; highly processive 66 Structural 
50 Interaction with PCNA 
12 Protein—protein interactions 
DNA polymerase £ 
Nuclear; leading-strand synthesis, sensor 261 Catalytic subunit 
of DNA damage checkpoint control 59 Multimerization 
17 Protein—protein interactions 
12 Protein—protein interactions 
DNA polymerase y 
Mitochondria; mitochondrial DNA 140 Catalytic subunit 
replication 55 Processivity 
DNA polymerase B 
DNA repair 39 


It is important to stress that two replisomes are assembled at each origin of replica- 
tion and together they constitute the replicon. Establishment of the replicon begins 
when two CMG helicases form a dimer in the presence of Ctf4 (Chromosome transmis- 
sion fidelity protein 4), itself a homotrimer. Each Ctf4 trimer acts as the hub of its repli- 
some (Figure 28.12). It also serves as the platform for the primase (DNA polymerase a). 
Thus, Ctf4 trimers serve to organize replicons containing two CMG helicases and one 
centrally located primase (Figure 28.12). 


Parental Parental 


Ctf4 Pola 


primase 
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The CMG helicases use the energy of ATP hydrolysis to unwind the duplex DNA 
and separate the strands. As each CMG helicase opens the DNA duplex, two replica- 
tion forks are created, one for each CMG helicase. DNA replication commences at each 
replication fork (Figure 28.12). The DNA strand that runs through each CMG helicase 
is the template for leading-strand synthesis by DNA polymerase e. The strand not run- 
ning through the CMG helicase is the lagging strand. It is primed by the Pol a—primase 
and copied by DNA polymerase 6 (Figure 28.12). 


28.4b Proteins of the Prereplication Complex 
Are AAA+ ATPase Family Members 


Cdc6, the Orc proteins, and MCM proteins are members of the AAA+ ATPase family, 
a group of proteins characterized by sequence and structural homology, ATPase 
activity, and a general function as molecular chaperones. The binding of both ORC 
and Cdcé6 to chromatin in the process of pre-RC assembly is ATP-dependent. ORC 
with ATP bound can bind to origins of replication and recruit Cdc6 and Cdtl to form 
a pre-RC in which both ORC and Cdc6 have bound ATP. Cdcé6 is required for the 
recruitment of the MCM proteins, specifically, a complex of MCM2-7 with ATP- 
dependent helicase activity. To establish the pre-RC, MCM2-7 must be stably associ- 
ated with the origin, and this stability is achieved following ATP hydrolysis, first by 
Cdc6 and then by ORC. 


28.4c Geminin Provides Another Control Over Replication Initiation 


Geminin is a protein that provides another level of regulatory control over DNA replica- 
tion. Geminin inhibits DNA replication by preventing the incorporation of MCM com- 
plexes into the pre-RC. Geminin is active during S, G}, and M phases, but its destruction 
during mitosis permits replication initiation in G. Geminin binds to Cdtl, preventing it 
from recruiting MCM proteins to the pre-RC. Geminin exists as a parallel coiled-coil 
homodimer. It interacts with Cdtl in two ways: It has an array of glutamate residues on 
its surface that interact with positive charges on Cdtl, and an adjacent region on gemi- 
nin interacts independently with the N-terminal region of Cdtl. 


28.4d Eukaryotic Cells Also Contain a Number 
of Different DNA Polymerases 


At least 19 different DNA polymerases have been described in eukaryotic cells thus far. 
These various polymerases have been assigned Greek letters in the order of their dis- 
covery (Table 28.4 lists the principal ones). Multiple DNA polymerases participate in 
leading- and lagging-strand synthesis, but three—a, ô, and e—share the major burden. 
DNA polymerase @ has an associated primase subunit and functions in the initiation of 
nuclear DNA replication. Given a template, it not only synthesizes an RNA primer of 
about 10 nucleotides, but it then adds 20 to 30 deoxynucleotides to extend the chain in 
the 5’-3’ direction. DNA polymerase 6, a heterotetrameric enzyme, is the lagging- 
strand DNA polymerase in eukaryotic DNA replication. It interacts with PCNA pro- 
tein (PCNA stands for proliferating cell nuclear antigen). Through its association with 
PCNA, DNA polymerase 6 carries out highly processive DNA synthesis. PCNA is the 
eukaryotic counterpart of the E. coli B.-sliding clamp; it clamps DNA polymerase 6 to 
the DNA. Like 62, PCNA encircles the double helix, but in contrast to the prokaryotic 
B>-sliding clamp, PCNA is a homotrimer, not a homodimer (Figure 28.13; see also 
Figure 28.7). DNA polymerase e has an acidic C-terminal extension lacking in other 
DNA polymerases, and evidence suggests that this domain is a sensor for DNA dam- 
age checkpoint control, halting DNA replication until the damage is repaired. DNA 
polymerase y is the DNA-replicating enzyme of mitochondria; DNA polymerase B 
functions in DNA repair. The more recently discovered eukaryotic DNA polymerases 
(including @, n, t, K, and Revl) are more error-prone, resulting in lower fidelity of DNA 
replication. Nevertheless, they have the important ability to function in DNA replica- 
tion and repair when damaged regions of DNA are encountered. 
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Figure 28.13 Structure of the human PCNA 
homotrimer. (a) Ribbon representation of the PCNA 
trimer (PDB id = 1AXC) with an axial view of a 
B-form DNA duplex in its center. The molecular mass 
of each PCNA monomer is 37 kDa. (b) Molecular 
surface of the PCNA trimer-DNA complex. 
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Other proteins involved in eukaryotic DNA replication include replication protein A 
(RPA), an ssDNA-binding protein that is the eukaryotic counterpart of SSB, and 
replication factor C (RFC). RFC loads the PCNA sliding clamp onto replicating DNA, 
thus acting as the eukaryotic counterpart of the prokaryotic y-complex. 


28.5 | How Are the Ends of Chromosomes Replicated? 


Eukaryotic chromosomes are linear. The ends of chromosomes have specialized 
structures known as telomeres. The telomeres of virtually all eukaryotic chromo- 
somes consist of short (5-8 bp), tandemly repeated, GC-rich nucleotide sequences 
that form protective caps | to 12 kbp long at the ends of chromosomal DNA. For 
example, the telomeres of human germline cells (sperm and eggs) contain between 
1000 and 1700 copies of the hexameric repeat TTAGGG (Figure 28.14a). Telomeres 
aid in chromosome maintenance and stability by protecting against DNA degrada- 
tion or rearrangement. DNA polymerases cannot replicate the extreme 5’-ends of 
chromosomes because these enzymes require a template and a primer and replicate 
only in the 5’ 3’ direction. Thus, lagging strand synthesis at the 3’-ends of chromo- 
somes is primed by RNA primase to form Okazaki fragments, but these RNA prim- 
ers are subsequently removed, resulting in gaps in the progeny 5’-terminal strands 
at each end of the chromosome after each round of replication (primer gap; see 
Figure 28.14). 

Telomeres are added to the ends of chromosomal DNA by an RNA-dependent 
DNA polymerase known as telomerase. It maintains chromosome length by restoring 
telomeres at the 3’-ends of chromosomes. Telomerase is a specialized reverse transcrip- 
tase containing a catalytic subunit, referred to as TERT, for TElomerase Reverse 
Transcriptase, and an RNA component that serves as the template for telomere syn- 
thesis, referred to variously as RT, for RNA Template, or TER, for TEmplate-contain- 
ing telomerase RNA. In human telomerase, the RNA part is 451 nucleotides long. Its 
template sequence is CUAACCCUAAC (base-pairs with TTAGGG). During the S- 
phase of the cell cycle, telomerase uses the 3’-end of the DNA as a primer and adds 
successive TTAGGG repeats to it, employing its RNA as template over and over again 
(Figure 28.14). The G-rich ends of telomeres can arrange into four-stranded G-qua- 
druplex structures (see Figure 11.20). 

The single-stranded 3'-overhangs at the telomeric ends of chromosomes are a hazard 
because they resemble damaged DNA. Such damage has the potential to recruit DNA 
repair enzymes whose action at telomeres would do more harm than good. Shelterin, a 
protein complex with six distinct subunits, is the telomere end-protection protein. It binds 
to telomeres and folds the DNA strands so that no single-stranded ends are exposed to 
DNA repair systems. In addition, several of the shelterin subunits act to recruit telomer- 
ase when it’s time for the telomeres to be replicated. 


28.6 | How Are RNA Genomes Replicated? 


Many viruses have genomes composed of RNA, not DNA. How then is the informa- 
tion in these RNA genomes replicated? In 1964, Howard Temin noted that inhibitors 
of DNA synthesis prevented infection of cells in culture by RNA tumor viruses such 
as avian sarcoma virus. On the basis of this observation, Temin proposed that DNA 
is an intermediate in the replication of such viruses; that is, an RNA tumor virus can 
use viral RNA as the template for DNA synthesis. 


RNA viral chromosome —> DNA intermediate —> RNA viral chromosome 


In 1970, Temin and David Baltimore independently discovered a viral enzyme capa- 
ble of mediating such a process, namely, an RNA-directed DNA polymerase or, as it is 
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(a) 


Site of telomerase 
DNA polymerase 
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Telomerase 
protein 


Figure 28.14 Telomere replication. (a) Telomeres on human chromosomes. TTAGGG tandem repeats 
are found at the 3’-ends of the DNA strands, paired with the complementary sequence 
3'-AATCCC-S’ on the other DNA strand. Thus, a G-rich region is created at the 3’-end of each 
DNA strand, and a C-rich region is created at the 5’-end of each DNA strand. Typically, at each end 
of the chromosome, the G-rich strand protrudes 12 to 16 nucleotides beyond its complementary 
C-rich strand. (b) The ribonucleic acid of human telomerase serves as the template for the DNA 
polymerase activity of telomerase. Nucleotides 46 to 56 of this RNA are CUAACCCUAAC and 
provide the template function for the telomerase-catalyzed addition of TTAGGG units to the 3’-end 
of a DNA strand. (c) In replication of the lagging strand, short RNA primers are added (pink) 

and extended by DNA polymerase. When the RNA primer at the 5'-end of each strand is removed, 
there is no nucleotide sequence to read in the next round of DNA replication. The result is a gap 
(primer gap) at the 5’-end of each strand (only one end of a chromosome is shown in this figure). 

(d) Asterisks indicate sequences at the 3’-end that cannot be copied by conventional DNA replication. 
Synthesis of telomeric DNA by telomerase extends the 5’-ends of DNA strands, allowing the strands 
to be copied by normal DNA replication. 
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A Deeper Look 


RNA as Genetic Material 


The genetic material of cells is dsDNA but virtually all plant 
viruses, several bacteriophages, and many animal viruses have 
genomes consisting of RNA. Viruses with RNA genomes can 
evolve more quickly to changing circumstances because viral RNA 
polymerases are more error-prone. (They lack the proofreading 
mechanisms that DNA polymerases have.) As a consequence, 
RNA viruses are better equipped to evade ever-evolving host 
defense mechanisms. In most cases, this RNA is single stranded. 
Viruses with single-stranded genomes use the single strand as a 


template for synthesis of a complementary strand, which can then 
serve as template in replicating the original strand. Retroviruses 
are an interesting group of eukaryotic viruses with single-stranded 
RNA genomes that replicate through a dsDNA intermediate. 
Furthermore, the life cycle of retroviruses includes an obligatory 
step in which the dsDNA is inserted into the host cell genome in a 
transposition event (see Section 28.7). Retroviruses are responsible 
for many diseases, including tumors and other disorders. HIV-1, 
the human immunodeficiency virus that causes AIDS, is a retrovirus. 


(a) 


(b) 


Figure 28.15 The structures of AZT (3’-azido-2’, 
3'-dideoxythymidine) and 3TC (2’,3’-dideoxy-3'- 
thiacytidine). These 3’-substituted dideoxynucleosides 


are phosphorylated in vivo to form 


dideoxynucleoside-5'-triphosphate substrate analogs 
for HIV reverse transcriptase. AZT is a drug known 
as Zidovudine or Retrovir, and 3TC is known as 


Lamivudine or Epivir. 


usually called, reverse transcriptase. All RNA tumor viruses contain such an enzyme 
within their virions (viral particles). RNA viruses that replicate their RNA genomes via 
a DNA intermediate are classified as retroviruses. 

Like other DNA and RNA polymerases, reverse transcriptase synthesizes polynucle- 
otides in the 5'—»3’ direction, and like all DNA polymerases, reverse transcriptase 
requires a primer. Interestingly, the primer is a specific tRNA molecule captured by the 
virion from the host cell in which it was produced. The 3’-end of the tRNA is base- 
paired with the viral RNA template at the site where DNA synthesis initiates, and its 
free 3'-OH accepts the initial deoxynucleotide once transcription commences. Reverse 
transcriptase then transcribes the RNA template into a complementary DNA (cDNA) 
strand to form a double-stranded DNA:RNA hybrid. 


28.6a The Enzymatic Activities of Reverse Transcriptases 


Reverse transcriptases possess three enzymatic activities, all of which are essential to 
viral replication: 


1. RNA-directed DNA polymerase activity, for which the enzyme is named (see 
Figure 12.11). This enzymatic activity creates a single-stranded DNA copy of 
the RNA genome of the virus. 

2. RNase H activity. Recall that RNase H is a nuclease that specifically degrades RNA 
chains in DNA:RNA hybrids (see Figure 12.11). The RNase H function of reverse 
transcriptase is an exonuclease activity that degrades the template genomic RNA 
and also removes the priming tRNA after DNA synthesis is completed. 

3. DNA-directed DNA polymerase activity. This activity replicates the ssDNA 
remaining after RNase H degradation of the viral genome, yielding a DNA 
duplex. This DNA duplex directs the remainder of the viral infection process or 
becomes integrated into the host chromosome, where it can lie dormant for many 
years as a provirus. Activation of the provirus restores the infectious state. 


HIV reverse transcriptase is of great clinical interest because it is the enzyme for 
replication of the AIDS virus. DNA synthesis by HIV reverse transcriptase is blocked 
by dideoxynucleoside analogs such as AZT and 3TC (Figure 28.15). HIV reverse tran- 
scriptase incorporates these analogs into growing DNA chains in place of dTMP (in the 
case of AZT) or dCMP (in the case of 3TC). Once incorporated, these analogs block 
further chain elongation because they lack a 3’-OH where the next incoming dNTP can 
be added. HIV reverse transcriptase is error-prone: It incorporates the wrong base at a 
frequency of 1 per 2000 to 4000 nucleotides polymerized. This high error rate during 
replication of the HIV genome means that the virus is ever changing, a feature that 
makes it difficult to devise an effective vaccine. 
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28.7 | How Is the Genetic Information Rearranged 
by Genetic Recombination? 


Genetic recombination is the natural process by which genetic information is rear- 
ranged to form new associations. For example, compared to their parents, progeny 
may have new combinations of traits because of genetic recombination. At the 
molecular level, genetic recombination is the exchange (or incorporation) of one 
DNA sequence with (or into) another. When recombination involves reaction 
between very similar sequences (homologous sequences) of DNA, the process is 
called homologous recombination. When very different nucleotide sequences recom- 
bine, it’s nonhomologous recombination. Transposition—the enzymatic insertion of a 
transposon (a mobile segment of DNA, see Figure 28.24) into a new location in the 
genome—and nonhomologous recombination (incorporation of a DNA segment 
whose sequence differs greatly from the DNA at the point of insertion) are two types 
of recombination that play a significant evolutionary role. Nonhomologous recom- 
bination occurs at a low frequency in all cells and serves as a powerful genetic force 
that reshapes the genomes of all organisms. Homologous recombination involves an 
exchange of DNA sequences between homologous chromosomes, resulting in the 
arrangement of genes into new combinations. Homologous recombination is gener- 
ally used to fix the DNA so that information is not lost. For example, large lesions in 
DNA are repaired via recombination of the damaged chromosome with a homolo- 
gous chromosome. 

The process underlying homologous recombination is termed general recombination 
because the enzymatic machinery that mediates the exchange can use essentially any 
pair of homologous DNA sequences as substrates. Homologous recombination occurs 
in all organisms and is particularly prevalent during the production of gametes (meiosis) 
in diploid organisms. In higher animals—that is, those with immune systems— 
recombination also occurs in the DNA of somatic cells responsible for expressing pro- 
teins of the immune response, such as the immunoglobulins. This somatic recombination 
rearranges the immunoglobulin genes, dramatically increasing the potential diversity of 
immunoglobulins available from a fixed amount of genetic information. Homologous 
recombination can also occur in bacteria. Indeed, even viral chromosomes undergo 
recombination. For example, if two mutant viral particles simultaneously infect a host 
cell, a recombination event between the two viral genomes can lead to the formation of 
a virus chromosome that is wild-type. 


28.7a General Recombination Requires Breakage 
and Reunion of DNA Strands 


Recombination occurs by the breakage and reunion of DNA strands so that a 
physical exchange of parts takes place. Matthew Meselson and Jean-Jacques Weigle 
demonstrated this in 1961 by coinfecting E. coli with two genetically distinct 
bacteriophage A strains, one of which had been density-labeled by growth in !3C- 
and !°N-containing media (Figure 28.16). The phage progeny were recovered and 
separated by CsCl density gradient centrifugation. Phage particles that displayed 
recombinant genotypes were distributed throughout the gradient while parental 
(nonrecombinant) genotypes were found within discrete heavy and light bands 
in the density gradient. The results showed that recombinant phage contained 


Figure 28.16 Meselson and Weigle’s experiment. Density-labeled, heavy phage, symbolized as ABC 
phage, was used to coinfect bacteria along with light, unlabeled phage, the abc phage. The progeny 
from the infection were collected and subjected to CsCl density gradient centrifugation. Parental-type 
ABC and abc phage were well separated in the gradient, but recombinant phage (ABc, Abc, aBc, 
aBC, and so on) were distributed diffusely between the two parental bands because they contained 
chromosomes constituted from fragments of both heavy and light DNA. 
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DNA derived in varying proportions from both parents. The obvious explanation 
is that these recombinant DNAs arose via the breakage and rejoining of DNA 
molecules. 


28.7b Homologous Recombination Proceeds According 
to the Holliday Model 


In 1964, Robin Holliday proposed a model for homologous recombination in fungi 
that has been proven to be a generally valid depiction of homologous recombination 
in bacteriophages, viruses, prokaryotes, and eukaryotes (Figure 28.17). The two 
homologous DNA duplexes are first juxtaposed so that their sequences are aligned. 
This process of chromosome pairing is called synapsis (Figure 28.17a). Holliday 
suggested that recombination begins with the introduction of single-stranded nicks 
in the DNA at homologous sites on the two paired chromosomes (Figure 28.17b). 
The two duplexes partially unwind, and the free, single-stranded end of one duplex 
begins to base-pair with its nearly complementary, single-stranded region along the 
intact strand in the other duplex, and vice versa (Figure 28.17c). This strand invasion 
is followed by ligation of the free ends from different duplexes to create a cross- 
stranded intermediate known as a Holliday junction (Figure 28.17d). The cross- 
stranded junction can now migrate in either direction (branch migration) by 
unwinding and rewinding of the two duplexes (Figure 28.17e). Branch migration 
results in strand exchange; heteroduplex regions of varying length are possible. For 
the joint molecule formed by strand exchange to be resolved into two DNA duplex 
molecules, another pair of nicks must be introduced. Resolution can be represented 
best if the duplexes are drawn with the chromosome arms bent up or down to give a 
planar representation (Figure 28.17f). Nicks then take place, either at E and W, that 
is, in the — strands that were originally nicked (see Figure 28.17b), or at N and S, that 
is, in the + strands (the strands not previously nicked). Duplex resolution is most 
easily kept straight by remembering that + strands are complementary to — strands 
and any resultant duplex must have one of each. Nicks made in the strands originally 
nicked lead to DNA duplexes in which one strand of each remains intact. Although 
these duplexes contain heteroduplex regions, they are not recombinant for the 
markers (AZ, az) that flank the heteroduplex region; such heteroduplexes are called 
patch recombinants (Figure 28.17g). Nicks introduced into the two strands not 
previously nicked yield DNA molecules that are both heteroduplex and recombinant 
for the markers A/a and Z/z; these heteroduplexes are termed splice recombinants 
(Figure 28.17h). Although this Holliday model explains the outcome of recombination, 
it provides no mechanistic explanation for the strand exchange reactions and other 
molecular details of the process. 


28.7c The Enzymes of General Recombination Include 
RecA, RecBCD, RuvA, RuvB, and RuvC 


To illustrate recombination mechanisms, we focus on general recombination as it 
occurs in E. coli. The principal players in the process are the RecBCD enzyme 
complex, which initiates recombination; the RecA protein, which binds single- 
stranded DNA, forming a nucleoprotein filament capable of strand invasion and 
homologous pairing; and the RuvA, RuvB, and RuvC proteins, which drive branch 
migration and process the Holliday junction into recombinant products. Eukary- 
otic homologs of these prokaryotic recombination proteins have been identified, 
indicating that the fundamental process of general recombination is conserved 
across all organisms. 
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Figure 28.17 The Holliday model for homologous recombination. Above: Molecular graphic showing 
the structure of a Holliday junction. Each of the four DNA strands is colored differently (PDB id = 
1DCW). Right: The + and — signs label strands of like polarity. For example, assume that the two 
strands running 5'—>3' as read left to right are labeled +; and the two strands running 3'—>5' as read 
left to right are labeled —. Only strands of like polarity exchange DNA during recombination. (See 
text for detailed description.) 
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28.7d The RecBCD Enzyme Complex Unwinds dsDNA 
and Cleaves Its Single Strands 


The RecBCD complex is composed of the proteins RecB (140 kDa; 1180 amino 
acids), RecC (130 kDa; 1122 amino acids), and RecD (67 kDa; 608 amino acids). This 
multifunctional enzyme complex has both helicase and nuclease activity and initiates 
recombination by attaching to the end of a DNA duplex (or at a double-stranded 
break in the DNA) and using its ATP-dependent helicase function to unwind the 
dsDNA (Figure 28.18a). RecB and RecD are helicases powered by ATP hydrolysis; 
each consumes an ATP per base pair of DNA traversed. The RecD motor is faster 
than the RecB motor and leads the way. The greater speed of RecD causes the DNA 
to loop out between RecD and RecB. The RecB subunit also contains a nuclease 
domain, which is tethered to the RecB helicase domain by a 30-residue-long linker. 
As RecBCD progresses along unwinding the duplex, the RecB nuclease activity 
cleaves both of the newly formed single strands (although the strand that provided 
the 3’-end at the RecBCD entry site is cut more frequently than the 5’-terminal strand 
[Figure 28.18b]). SSB (and some RecA protein) readily binds to the emerging single 
strands. Sooner or later, RecBCD encounters a particular nucleotide sequence, a 
so-called Chi (crossover hotspot instigator, symbolized by y) site, characterized by 
the sequence 5’-GCTGGTGG-3’. These y sites are recombinational hot spots; 
1009 y sites have been identified in the E. coli genome (on average, about one every 
4.5 kbp of DNA). When a RecBCD complex approaches a y sequence from its 3’ 
side (the...G-3’side), RecBCD cleaves the y-bearing DNA strand four to six bases 
to the 3’ side of y (Figure 28.18c). RecBCD flips so that the RecB motor leads the 
way, and the RecBCD complex no longer expresses nuclease activity against the 
3’-terminal strand, but nuclease activity against the 5’-terminal strand increases 
(Figure 28.18d). 

Resuming its helicase function, RecBCD unwinds the dsDNA, and collectively these 
processes generate an ssDNA tail bearing a y site at its 3’-terminal end. This ssDNA 
may reach several kilobases in length. RecA protein now binds to the 3’-terminal strand 
to form a nucleoprotein filament (Figure 28.18e). This filament is active in pairing and 
strand invasion with a homologous region in another dsDNA molecule. A molecular 
graphic of RecBCD bound to DNA is shown in Figure 28.19. 

RecBCD also plays a vital role in DNA repair (see Section 28.8). Because DNA 
damage is potentially lethal, RecBCD can readily displace other tightly bound DNA- 
binding proteins (such as RNA polymerase), which allows RecBCD function to 
supercede any other aspect of DNA metabolism. Consistent with its importance, a 
cell contains 10 to 50 RecBCD molecules, so multiple DNA breaks per cell can be 
readily repaired. 


28.7e The RecA Protein Can Bind ssDNA and Then 
Interact with Duplex DNA 


The RecA protein, or recombinase, is a multifunctional protein that acts in general 
recombination (Figure 28.20a). RecA mediates the ATP-dependent DNA strand 
exchange reaction leading to formation of a Holliday junction (Figure 28.17b-f). In 
the presence of ATP and ssDNA, RecA forms a right-handed helical filament hav- 
ing six monomers per turn, with each monomer spanning about three nucleotides 
of DNA. The RecA nucleoprotein filament serves as a scaffold upon which the 
events of recombination take place. This filament has a deep spiral groove large 
enough to encompass three strands of DNA. Although RecA-bound ssDNA is 
relatively stretched and underwound, with about 18.5 nucleotides per turn, its local 
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Figure 28.18 Model of RecBCD-dependent initiation of recombination. (a) RecBCD binds to a 
duplex DNA end, and its helicase activity begins to unwind the DNA double helix. Rabbit ears of 
ssDNA loop out from RecBCD because the rate of DNA unwinding exceeds the rate of ssDNA 
release by RecBCD. (b) As it unwinds the DNA, SSB (and some RecA) bind to the single-stranded 
regions; the RecBCD endonuclease activity randomly cleaves the ssDNA, showing a greater tendency 
to cut the 3’-terminal strand rather than the 5’-terminal strand. (c) When RecBCD encounters a 
properly oriented y site, the 3’-terminal strand is cleaved just below the 3’-end of x. (d) RecBCD now 
directs the binding of RecA to the 3’-terminal strand, as RecBCD endonuclease activity now acts 
more often on the 5’-terminal strand. (e) A nucleoprotein filament consisting of RecA-coated 
3'-strand ssDNA is formed. This nucleoprotein filament is capable of homologous pairing with a 
dsDNA and strand invasion. 
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Figure 28.21 Model for homologous recombination 
as promoted by RecA enzyme. (a) RecA protein 
(and SSB) aid strand invasion of the 3’-ssDNA into 
a homologous DNA duplex, (b) forming a D-loop. 
(c) The D-loop strand that has been displaced by 
strand invasion pairs with its complementary strand 
in the original duplex to form a Holliday junction as 
strand invasion continues. 


Figure 28.20 The structure of RecA, the ATP-dependent protein that mediates DNA strand 
exchange in general recombination. (a) RecA, a 352-residue, 38-kDa protein, with dATP in the 
ATP-binding site (PDB id = 20DN). (b) Structure of a fused RecA hexamer with bound ssDNA 
(as poly(dT))s, magenta). Each of the six RecA units is a different color, and each has a bound 
ADP molecule (marine blue). The adenine nucleotide-binding sites lie at the interfaces between 
RecA units (PDB id = 3CMU). 


conformation resembles B-DNA (Figure 28.20b). Thus, when dsDNA adds to the 
complex, the ssDNA is poised to search for the sequence homology that leads to 
strand exchange. When the ssDNA encounters a region of homology that is 8 to 20 
base pairs in length, DNA strand exchange begins. 

Procession of strand separation of dsDNA and the re-pairing into hybrid strands 
along the DNA duplex initiates strand invasion (Figure 28.21b). Strand invasion drives 
the displacement of the homologous DNA strand from the DNA duplex and its 
replacement with the ssDNA strand, a process known as single-strand assimilation 
(or single-strand uptake). Strand assimilation does not occur if there is no sequence 
homology between the ssDNA and the invaded DNA duplex. The DNA strand dis- 
placed by the invading 3’-terminal ssDNA is free to anneal with the 5'-terminal strand 
in the original DNA, a step that is also mediated by RecA protein and SSB 
(Figure 28.21c). The result is a Holliday junction, the classic intermediate in genetic 
recombination. 


28.7f RuvA, RuvB, and RuvC Proteins Resolve the Holliday 
Junction to Form the Recombination Products 


The Holliday junction is processed into recombination products by RuvA (203 amino 
acids), RuvB (336 amino acids), and RuvC (173 amino acids). Specifically, RuvA and 
RuvB work together as a Holliday junction—specific helicase complex that dissociates 
the RecA filament and catalyzes branch migration. An RuvA tetramer (Figure 28.22a) 
fits precisely within the junction point (Figure 28.22b), which has a square-planar ge- 
ometry, and this RuvA tetramer mediates the assembly of RuvB around opposite arms 
of the DNA junction. The RuvB protein binds to form two oppositely oriented, hexa- 
meric [(RuvB)<¢] ring structures encircling the dsDNAs, one on each side of the Holliday 
junction. Rotation of the dsDNAs by the RuvB hexameric rings pulls the dsDNAs 
through (RuvB), and unwinds the DNA strands across the “spool” of RuvA, 
which threads the separated single strands into newly forming hybrid (recombinant) 
duplexes (Figure 28.22b). The RuvA tetramer is a disclike structure, one face of which 
(Figure 28.22c) has four negatively charged central pins, each contributed by a pair of 
residues (Glu® and Asp*) from each RuvA monomer. These four pins fit neatly into 
the hole at the center of the Holliday junction. The negatively charged sugar—phosphate 
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Figure 28.22 Model for the resolution of a Holliday junction in E. coli by the RuvA, RuvB, and 
RuvC proteins. (a) Ribbon diagram of the RuvA tetramer. RuvA monomers have an overall L shape 
(PDB id = 1CUK). (b) Model for RuvA/RuvB action. (left) The RuvA tetramer fits snugly within the 
Holliday junction point. (center) Oppositely facing RuvB hexameric rings assemble on the 
heteroduplexes, with the DNA passing through their centers. These RuvB hexamers act as motors to 
promote branch migration by driving the passage of the DNA duplexes through themselves. (right) 
Binding of RuvC at the Holliday junction and strand scission by its nuclease activity. The locations of 
the RuvC active sites are indicated by the scissors. (c) Charge distribution on the concave surface of 
an RuvA tetramer. Blue indicates positive charge and red, negative charge. Note the four red 
(negatively charged) pins at its center. (d) Structural model for the interaction of (RuvA), with the 
hypothesized square-planar Holliday junction center (PDB id = 1C7Y). (Adapted from Figures 1, 2, 
and 3 in Rafferty, J. B., et al., 1996. Crystal structure of DNA recombination protein RuvA and a 
model for its binding to the Holliday junction. Science 274:415-421.) 


backbones of the four DNA duplexes of the Holliday junction are threaded along 
grooves in the positively charged RuvA face, with the negatively charged central pins 
appropriately situated to transiently separate the dsDNA molecules into their compo- 
nent single strands through repulsive electrostatic interactions with the phosphate 
backbones of the DNA. The separated strands of each parental duplex are then 
channeled into grooves in the RuvA face, where they are led into hydrogen-bonding 
interactions with bases contributed by strands of the other parental DNA to form the 
two daughter hybrid duplexes flowing out from the RuvAB complex (Figure 28.22b). 
Figure 28.22d illustrates a model for the RuvA tetramer with the square-planar 
Holliday junction. 

Depending on how the strands in the Holliday junction are cleaved and resolved, 
patch or splice recombinant duplexes result (Figure 28.17g and h). RuvC is an endo- 
nuclease that resolves Holliday junctions into heteroduplex recombinant products 
(RuvC resolvase). An RuvC dimer binds at the Holliday junction and cuts pairs of DNA 
strands of similar polarity (Figure 28.22b); whether a patch or a splice recombinant 
results depends on which DNA pair is cleaved. 

RuvB hexamers are AAA*-ATPase-type, DNA-driving molecular motors; similar 
motors act during DNA replication to propel strand separation and initiate DNA syn- 
thesis. Thus, the RuvABC system for processing Holliday junctions may represent a 
general paradigm for DNA manipulation in all cells. 


28.7g The Three Rs of Genomic Manipulation: Replication, 
Recombination, and Repair 


DNA replication, recombination, and repair were once considered to be separate aspects 
of DNA metabolism. However, it is now clear that DNA replication is an essential com- 
ponent of both DNA recombination and DNA repair processes. Further, recombination 
mechanisms play an absolutely vital role in restarting replication forks that become 
halted at breaks or other lesions in the DNA strands. 
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A Deeper Look 


Knockout Mice: A Method to Investigate the Biological Role of a Gene 


The function of a particular gene is not always obvious. Targeted 
inactivation (knockout) of the gene followed by analysis of the cell 
or organism may reveal defects that identify the purpose of a gene. 
Such studies with mice lead to knockout mice—mice having a selec- 
tively inactivated gene. One way to inactivate a target gene is to use 
homologous recombination to replace a gene with a nonfunctional 
version of itself. For example, inactivation can be accomplished by 
inserting a foreign gene, such as neo, a gene encoding resistance 
to the drug G4/8, within one of the exons of a copy of the gene 
of interest. Homologous recombination between the neo-bearing 
transgene and DNA in wild-type mouse ES cells replaces the target 


gene with the inactive transgene (see accompanying figure). ES cells 
in which homologous recombination has occurred will be resistant 
to G418, and such cells can be selected. These recombinant ES 
cells can then be injected into early-stage mouse embryos, where 
they have a chance of becoming the germline cells of the newborn 
mouse. If they do, an inactivated target gene is then present in the 
gametes of this mouse. Mating between male and female mice with 
inactive target genes yields a generation of homozygous knock- 
out mice—mice lacking a functional copy of the targeted gene. 
Characterization of these knockout mice reveals the physiological 
functions of the gene. 
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the neo cassette and one below, lead to insertion of neo 
within the host gene. The neo gene encodes an enzyme 
that inactivates the antibiotic G418. Cells carrying the 
3 neo gene are resistant to G418 and can be identified by 
this antibiotic resistance. 


It is likely that most replication forks that begin at the ŒE. coli oriC initiation sites (or 
analogous initiation sites in eukaryotes) are derailed by nicks or more extensive DNA damage 
lying downstream. However, DNA replication can be reinitiated (and genome replication can 
be completed) following replication fork restart. Recombinational repair of stalled replication 
forks requires the action of enzymes from every aspect of DNA metabolism: replication, 
recombination, and repair. The initial steps in recombination-dependent replication restart 
depend on the recombination proteins RecA and RecBCD and the formation of a D-loop 
(Figure 28.21). The ability of RecA to mediate recombinational repair of stalled replication 
forks is undoubtedly the reason for the presence of RecA-type proteins in virtually all organ- 
isms. Indeed, the recombination system, a feature common to all cells, evolved to carry out 
this essential repair purpose. (A diagram illustrating recombination-dependent restart of a 
stalled replication fork is shown in the section on DNA repair [Section 28.8, Figure 28.26].) 

It is significant to note that the organization of DNA in a D-loop is analogous to 
that in a replication fork, where the leading strand has a free 3’-end, just like the invad- 
ing strand in the D-loop, and the lagging strand is coated with SSB (see Figure 28.19b). 
Subsequent steps in recombination-dependent replication restart depend on PriA, 
a multifunctional DNA helicase with two DNA-binding domains located at its 
N-terminus and an SSB-binding domain at its C-terminus (Figure 28.23). PriA can also 
rescue a stalled replication fork. The 3’-BD DNA-binding domain of PriA binds tightly 
to the 3’-end of invading-strand DNA (or its counterpart in a stalled replication fork, 
the 3’-end of leading-strand DNA). PriA’s WH DNA-binding domain recognizes the 
three DNA arms of a D-loop (homologous DNA duplex, invading-strand:homologous 
DNA duplex, and 5'-end ssDNA strand; see Figure 28.21b) or the analogous structures 
in a stalled replication fork (parental, leading-strand, and lagging-strand). Once bound 
to DNA, PriA uses its CTD (C-terminal domain) to interact with SSB in order to 
identify single-stranded DNA regions of the 5’-, or lagging, strand. Then, the helicase 
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domains of PriA bind the ssDNA, displacing SSB from these regions and reloading 
DnaB helicase as the initial step in re-establishing a functional replication fork. 


28.7h Homologous Recombination in Eukaryotes Helps to Prevent Cancer 


As in prokaryotes, eukaryotic homologous recombination eliminates double-stranded 
breaks and other deleterious lesions from chromosomes. Since the accumulation of DNA 
lesions is a necessary step in the development of cancer, eukaryotic homologous recom- 
bination is crucial to the prevention of cancers in humans. As in prokaryotes, eukaryotic 
recombination is mediated by recombinases. The eukaryotic counterpart to RecA is 
Rad51, an ATP-dependent, single-stranded DNA-binding protein. Rad51 assembles on 
ssDNA in a manner similar to RecA, except Rad51 is present in solution as a heptamer, 
not a hexamer. The ssDNA:Rad51 nucleoprotein complex in eukaryotes is referred to as 
a presynaptic filament. Rad51 assembles on ssDNA, forming a right-handed helical nu- 
cleoprotein complex that can extend for thousands of bases. Each helical turn of the fila- 
ment contains 18 or 19 bases of DNA and 6 Rad51 monomers. Thus, Rad51 holds the 
DNA in a highly extended conformation. Assembly of the Rad51:DNA filament is de- 
pendent on proteins that mediate the Rad51:DNA interaction, including BRCA2, the 
protein product of the BRCA2 gene whose mutation may lead to breast cancer. BRCA2 
and other mediators, such as Rad52, are needed because RPA, the eukaryotic counter- 
part of the prokaryotic DNA replication protein SSB, competes with Rad51 for binding 
to ssDNA. Mediators such as BRCA2 displace RPA, allowing Rad51 to bind. Strand 
invasion and D-loop formation require other proteins, including Hop2-Mnd1. Rad54 
acts in the ATP hydrolysis-driven strand exchange reaction that leads to formation of a 
Holliday junction. Specific endonucleases then cleave the DNA strands to resolve the 
Holliday junction. BRCA2 can also mediate the binding of RAD51 to DNA regions that 
have become single-stranded as a result of DNA damage or DNA replication problems. 


28.71 Transposons Are DNA Sequences That Can 
Move from Place to Place in the Genome 


In 1950, Barbara McClintock reported the results of her studies on an activator gene 
in maize (Zea mays, or as it’s usually called, corn) that was recognizable principally 
by its ability to cause mutations in other genes. Activator genes were thus an internal 
source of mutation. A most puzzling property was their ability to move relatively 
freely about the genome. Since scientists had labored to establish that genes occupied 
fixed locations on chromosomes, the idea of genes moving around was viewed as 
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Figure 28.23 PriA protein (PDB id = 4NL4). 

(a) Structural domains (color-coded) within PriA (its 
N-terminus is at the top). PriA’s two DNA-binding 
domains (3'-BD and WH, or winged helix) are at 
the N-terminal end. The helicase domains dominate 
the structure. The C-terminal domain (CTD) is the 
site of the SSB-binding domain of PriA. (b) Two 
views of the tertiary structure of PriA (rotated 180° 
relative to each other. (Adapted from Figure 1 in 
Bhattacharyya, B., George, N. P., Thurmes, T. M., 
Zhou, R., Jani, N., Wessel, S. R., et al. 2014. 
Structural mechanisms of PriA-mediated DNA 
replication restart. Proceedings of the National 
Academy of Sciences, U.S.A. 111:1373-1378.) 
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Human Biochemistry 


The Breast Cancer Susceptibility Genes BRCA1 and BRCA2 
Are Involved in DNA Damage Control and DNA Repair 


Mutations in the BRCA1 and BRCA2 genes cause increased likelihood 
of breast, ovarian, and other cancers. The BRCA1 protein functions in 
regulation of the cell cycle in response to DNA damage control. Phos- 
phorylation of BRCA1 by DNA damage-response proteins controls 
the expression, phosphorylation, and cellular localization of specific 
cyclin-CDKs involved in the cell cycle G/M checkpoint and the onset 
of mitosis. Activation of these cyclin-CDKs leads to arrest of the 
cell cycle in G, until the damage to DNA is repaired. Furthermore, 
BRCAI aids in homologous recombination repair of DSBs during 
S/G2 by loading the Rad51 recombinase at the DSB. Mutations in 
BRCAI that impair its function allow the cell cycle to enter mitosis 
and DNA damage to accumulate, raising the risk of cancer. 

The BRCA2 protein participates in the pathway for DNA repair by 
homologous recombination. The BRCA2 protein (3418 amino acids) 
is a very large protein with 8 conserved sequence motifs of about 30 
amino acids each, known as the BRC repeats. These repeats act as bind- 
ing sites for RADS1, the eukaryotic analog of RecA. BRCA2 transfers 


RADS1 to a ssDNA strand coated with RPA (the eukaryotic counter- 
part of SSB), allowing formation of the RAD51-ssDNA nucleopro- 
tein filament that is an essential intermediate in eukaryotic homologous 
recombination (see Figure 28.20b). Mutations that impair BRCA2 
function prevent DNA repair by homologous recombination, leading 
to accumulation of DNA damage and a greater likelihood of cancer. 
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incredible. The recognition that McClintock so richly deserved for her explanation of 
this novel phenomenon had to await verification by molecular biologists. In 1983, 
Barbara McClintock was finally awarded the Nobel Prize in Physiology or Medicine. 
The mobility of McClintock’s activator genes, or so-called jumping genes led to 
their designation as mobile elements, transposable elements, or, simply, transposons. 
Transposons are found in all kingdoms of life and constitute a significant portion of 
genomes. In humans, 54% of the genome is transposon DNA. 

Transposons are segments of DNA that are moved enzymatically from place to place 
in the genome (Figure 28.24). Transposons range in size from several hundred base 
pairs to more than 8 kbp. Transposons contain a gene encoding an enzyme called 
transposase. This is the enzyme responsible for transposon mobility. It binds to the end 
of a transposon, and catalyzes a cut-and-paste mechanism that inserts the transposon 
into a chromosome or remobilizes the transposon to a different location in the genome. 
These movements are termed transposition events. The smallest transposons are called 
insertion sequences, or ISs, signifying their ability to insert apparently at random in the 
genome. Insertion into a new site can cause a mutation if a gene or regulatory region at 
the site is disrupted. Because transposition events can move genes to new places or lead 
to the duplication of existing genes, transposition is a major force in evolution. 


28.8 | Can DNA Be Repaired? 


Biological macromolecules are susceptible to chemical alterations that arise from envi- 
ronmental damage or errors during synthesis. For RNAs, proteins, or other cellular 
molecules, most consequences of such damage are avoided by replacement of these 
molecules through normal turnover (synthesis and degradation). However, the integrity 
of DNA is vital to cell survival and reproduction. Its information content must be pro- 
tected over the life span of the cell and preserved from generation to generation. Safe- 
guards include (1) high-fidelity replication systems and (2) repair systems that correct 
DNA damage that might alter its information content. DNA is the only molecule that, 
if damaged, is repaired by the cell. Usually, accurate repair is possible because the in- 
formation content of duplex DNA is inherently redundant; the nucleotide sequence in 
one strand is directly related to the sequence in the other. DNA damage may arise from 
endogenous processes or from exogenous agents, such as UV light, ionizing radiation, 
or mutagenic chemicals. The most common forms of endogenous DNA damage arise 
from chemical reactions (oxidation, alkylation, or deamination of bases) or loss of 
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Inverted repeats Figure 28.24 The typical transposon has 


inverted nucleotide-sequence repeats at its 
~ D termini, represented here as the 12-bp sequence 
Transposon ACGTACGTACGT (a). It acts at a target 


sequence (shown here as the sequence CATGC) 

within host DNA by creating a staggered cut (b) 

whose protruding single-stranded ends are then 
ligated to the transposon (c). The gaps at the 

(a) target site are then filled in, and the filled-in 
strands are ligated (d). Transposon insertion thus 

generates direct repeats of the target site in the 


host DNA, and these direct repeats flank the 
inserted transposon. 
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bases due to cleavage of N-glycosidic bonds. Exogenous agents can damage DNA in a 
variety of ways, including UV-induced free-radical generation and crosslinking of ad- 
jacent pyrimidines, breakage of the polynucleotide backbone by ionizing radiation, and 
base modifications through chemical reactions. Cells have extraordinarily diverse and 
effective systems to repair these lesions in DNA so that the genetic information is not 
lost or altered. The human genome has 150 or so genes associated with DNA repair. 
DNA repair systems include direct reversal damage repair, single-strand damage re- 
pair, double-stranded break (DSB) repair, and translesion DNA synthesis. 

Chemical reactions that reverse the damage, returning the DNA to its proper state, 
are called direct reversal repair systems. Examples include methyltransferases to remove 
methyl groups from chemically modified bases and photolyase, which repairs thymine 
dimers, as discussed in Section 28.8 (see Figure 28.28). However, if the nucleotide 
sequence has been altered, other systems of DNA repair are required. Single-strand 
damage repair relies on the intact complementary strand to guide repair. Systems repair- 
ing this sort of DNA damage include mismatch repair (MMR), base excision repair 
(BER), and nucleotide excision repair (NER). Such systems will be described further on. 


28.8a Double-Strand DNA Breaks 


Double-strand DNA breaks (DSBs) are the most dangerous form of DNA damage. 
They are a particular threat to genome stability because lost sequence information can- 
not be recovered from the same DNA double helix. DSBs arise in all cells, prokaryotic 
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Figure 28.25 A model for NHEJ repair of 

DSBs. (a) Double-stranded break. (b) Ku70/80 
heterodimers (yellow) recognize the broken ends 

in a DSB and bind. (c) Ku70/80 heterodimers bind 
DNA-PK catalytic subunits (DNA-PKcs, blue), 
leading to formation of DNA-PK holoenzymes 
(Ku70/80: DNA-PKcs complexes) at each end. 

(d) Interactions between the C-terminal domains 
of the Ku80 subunits drives dimerization of the 
DNA-PK holoenzymes, bringing the broken ends 
close to one another. (e) Formation of the 
DNA-PK dimer creates a platform for recruitment 
of nucleases (red), DNA ligase 4 (green), and other 
factors essential for processing the broken ends and 
re-ligating them together. (f) Repaired DSB. 


or eukaryotic, but they can be repaired. The most reliable repair mechanism is to use 
the information in a homologous chromosome to direct the repair and restore the fidel- 
ity of the broken dsDNA. Diploid cells can always provide a homologous dsDNA, but 
so can haploid cells such as bacteria, if the break occurs when such cells are transiently 
diploid, as they are during replication. DSBs are common in eukaryotic cells; estimates 
suggest a frequency of 10 DSBs per cell per day. Because the threat posed by DSBs is so 
severe, cells have several systems to repair them. The two most prominent pathways of 
DSB repair are: nonhomologous DNA end-joining (NHEJ) and homologous recombina- 
tion (HR). 

In NHEJ, the broken ends at the break site are rejoined. While NHEJ is the simplest 
way to repair a DSB, the lack of a proper DNA template means that NHEJ is error-prone 
(Figure 28.25). NHEJ is the dominant DSB repair pathway during most of the cell cycle 
and arose early in evolution as the principal DSB repair pathway. A key problem in NHEJ 
is to bring the ends near one another so that they can be linked together again. A hetero- 
dimeric protein, Ku70/80, composed of 70-kDa Ku70 and 83-kDa Ku80 subunits, binds 
to the broken DNA ends and moves onto the DNA like a nut screwing on a bolt 
(Figure 28.25). A Ku70/80 heterodimer at each broken end then binds the 470-kDa 
catalytic subunit of DNA-protein kinase (DNA-PKcs), a PI3K protein kinase family 
member. Assembly of Ku70/80 with DNA-PKcs completes the formation of the DNA-PK 
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holoenzyme at each end, and the two ends come together through dimerization of the 
DNA-PK holoenzymes, driven by interactions between the C-terminal domains of the 
Ku80 subunits in each DNA-PK holoenzyme. This DNA-PK holoenzyme dimer serves 
as a platform for recruitment of proteins (including nucleases and DNA ligase 4) that 
process the damaged DNA ends and religate them. (Some proteins involved in repairing 
DSBs also function in immunoglobulin gene rearrangements; see Figure 28.39.) 

DSBs that arise as the cell progresses from S into G» during the cell cycle can be 
repaired through homologous recombination (Figure 28.26). DSB repair via homolo- 
gous recombination is much less error-prone than other alternatives. The intact sister 
chromatid of the damaged DNA duplex guides the process. Processing of the DSB 
creates single-stranded tails that become substrates for RecA-mediated nucleoprotein 
filament formation and homology recognition within the sister chromatid. DNA syn- 
thesis propagated from the D-loop and migration of the Holliday junction, followed by 
strand cleavage within the Holliday junction by resolvase, either by cleaving the crossed 
strands (Figure 28.26(g), left) or the noncrossed strands of the Holliday junctions 
(Figure 28.26(g), right). 

What is effectively a DSB can arise during DNA replication if DNA damage causes the 
replication fork to stall (Figure 28.27). Suppose a lesion in the leading strand (circle, 
Figure 28.27b) causes leading-strand synthesis to stall. Continued lagging-strand synthesis 
results in a single-stranded region on the leading-strand template. Leading-strand invasion 
of the new lagging-strand duplex (mediated by RecA) creates a D-loop (Figure 28.27c), 
and leading-strand synthesis directed by the lagging-strand template (Figure 28.27d) 
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Figure 28.26 DSB repair through homologous 
DNA recombination. (a) Two homologous 
chromatids or DNA regions, their double- 

stranded DNA symbolized by a pair of lines, one 
chromatid orange/red, its sister chromatid black/blue. 
(b) a double-stranded break in the orange/red 
chromatid. Homologous recombination creates a 
D-loop (c), and sister chromatid-directed DNA 
replication restores the information content of 

the damaged duplex (d—-f). Depending on how 

the Holliday junctions are resolved, the products 

(g) are either (/eft) noncrossover or (right) crossover 
recombinants. (Adapted from Figure 1 in Hiom, K., 
2010. Coping with DNA double strand breaks. DNA 
Repair 9:1256-1263.) 
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Figure 28.27 Restarting a stalled replication fork 
through homologous DNA recombination. A lesion 
in the DNA is symbolized by a circle; in this case, 
the lesion is in the leading-strand template (a). 
Leading-strand synthesis halts because of the lesion 
(b). Lagging-strand synthesis (red) continues, and 
the Okazaki fragments are ligated (c). When the 
leading strand invades the new DNA duplex formed 
by lagging-strand synthesis, a D-loop is formed and 
strand exchange occurs. Using the lagging strand as 
a template, synthesis of the leading strand (black) 
resumes (d), and the replication fork is reestablished 
(e). (Adapted from Figure 3 in Wyman, C., and 
Kanaar, R., 2006. DNA double-strand break repair: 
All’s well that ends well. Annual Review of Genetics 
40:363-383.) 
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reestablishes a competent replication fork (Figure 28.27e). The site of the lesion is repaired 
later, usually by nucleotide excision repair (see Section 28.8c). Homologous recombination- 
mediated restarting of replication forks is believed to be the evolutionary driving force for 
the emergence of homologous recombination. Even if the NHEJ or homologous recombi- 
nation repair systems fail to act, the genome may be preserved if an error-prone mode of 
replication allows the lesion to be bypassed. Such translesion DNA synthesis is more of a 
tolerance mechanism than a repair mechanism because it allows replication without neces- 
sarily repairing the damage. Specialized translesion DNA polymerases, such as DNA poly- 
merase IV in E. coli and DNA polymerase n (n = eta) in humans, substitute for the replica- 
tive DNA polymerase in this process. Although translesion DNA polymerases tend to be 
error-prone, they have the advantage of allowing DNA replication to continue. 

Human DNA replication has an error rate of about three base-pair mistakes dur- 
ing copying of the 6 billion base pairs in the diploid human genome. The low error 
rate is due to those DNA repair systems that review and edit the newly replicated 
DNA. Furthermore, about 10* bases (mostly purines) are lost per cell per day from 
spontaneous breakdown in human DNA; repair systems must replace these bases to 
maintain the fidelity of the encoded information. 


28.8b Mismatch Repair Corrects Errors Introduced During DNA Replication 


The mismatch repair system corrects errors introduced when DNA is replicated. It 
scans newly synthesized DNA for mispaired bases, excises the mismatched region, 
and then replaces it by DNA polymerase—mediated local replication. The key to such 
replacement is to know which base of the mismatched pair is correct. 

The E. coli methyl-directed pathway of mismatch repair relies on methylation pat- 
terns in the DNA to determine which strand is the newly synthesized one and which one 
was the parental (template) strand. DNA methylation, often an identifying and charac- 
teristic feature of a prokaryote’s DNA, occurs just after DNA replication. During meth- 
ylation, methyl groups are added to certain bases along the new DNA strand. However, 
a window of opportunity exists between the end of replication and the start of methyl- 
ation, when only the parental strand of a dsDNA is methylated. This window in time 
provides an opportunity for the mismatch repair system to review the dsDNA for mis- 
matched bases that arose as a consequence of replication errors. By definition, the newly 
synthesized strand is the one containing the error, and the methylated strand is the one 
having the correct nucleotide sequence. When the methyl-directed mismatch repair sys- 
tem encounters a mismatched base pair, it searches along the DNA—through thou- 
sands of base pairs if necessary—until it finds a methylated base. 

The system identifies the strand bearing the methylated base as parental, assumes its 
sequence is the correct one, and replaces the entire stretch of nucleotides within the new 
strand from this recognition point to and including the mismatched base. Mismatch 
repair does this by using an endonuclease to cut the new, unmethylated strand and an 
exonuclease to remove the mismatched bases, creating a gap in the newly synthesized 
strand. DNA polymerase III holoenzyme then fills in the gap, using the methylated 
strand as template. Finally, DNA ligase reseals the strand. 

In eukaryotes, mismatch repair is similar, but it does not rely on the absence of methyl 
groups to identify the newly synthesized strand. Instead, strand discrimination is deter- 
mined by the presence of DNA replication-associated nicks in the daughter strand. 


28.8c Damage to DNA by UV Light or Chemical Modification 
Can Also Be Repaired 


The bases in DNA are electron-rich (see Section 10.1). As such, they absorb UV light and 
they are susceptible to chemical modifications. Absorption of UV light can lead to 
formation of thymine dimers if two T residues are next to one another. These dimers 
disrupt the information transfer reactions implicit to transcription and translation. Bases 
that have been modified through unwanted chemical reaction also disrupt information 
transfer through improper base pairing. DNA repair mechanisms are available to reverse 
these modifications and maintain the integrity of the information stored in DNA. 
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A Deeper Look 


Transgenic Animals Are Animals Carrying Foreign Genes 


Experimental advances in gene transfer techniques have made it 
possible to introduce genes into animals by transfection of fertilized 
eggs. Transfection is defined as the uptake or injection of plasmid 
DNA into recipient cells. Animals that have acquired new genetic 
information as a consequence of the introduction of foreign genes 
are termed transgenic. Plasmids carrying the gene of interest are 
injected into the nucleus of an oocyte or fertilized egg, and the 
egg is then implanted into a receptive female. The technique has 
been perfected for mice (see figure, part a). In a small number of 
cases—10% or so—the mice that develop from the injected eggs 
carry the transfected gene integrated into a single chromosomal site. 
The gene is subsequently inherited by the progeny of the transgenic 
animal as if it were a normal gene. Expression of the donor gene in 
the transgenic animals is variable because the gene is randomly inte- 
grated into the host genome and gene expression is often influenced 
by chromosomal location. Nevertheless, transfection of animals has 
produced some startling results, as in the case of transgenic mice 
carrying the gene for the rat growth hormone (rGH). The trans- 
genic mice grew to nearly twice the normal size (see figure, part b). 
Growth hormone levels in these animals were several hundred times 
greater than normal. Similar results were obtained in transgenic 
mice transfected with the human growth hormone (hGH) gene. The 
biotechnology of transfection has been extended to farm animals, 
(a) 


Injected 
into 


nucleus Egg implanted 


into mouse 

Plasmid carrying 

growth hormone 
gene from rat 


and transgenic chickens, cows, pigs, rabbits, sheep, and even fish 
have been produced. 

The first animal cloned from an adult cell, a sheep named Dolly, 
represented a milestone in cloning technology. Subsequent accom- 
plishments include incorporation of the human gene encoding blood 
coagulation factor IX into sheep. Fetal sheep fibroblast cells were 
transfected with the human factor IX gene, nuclei from the trans- 
fected cells were transferred into sheep oocytes lacking nuclei, and 
these transgenic oocytes were placed in the uterus of receptive female 
sheep, which subsequently gave birth to transgenic lambs. The intro- 
duced factor IX transgene was specifically designed so that factor IX 
protein, a medically useful product for the treatment of hemophili- 
acs, would be expressed in the milk of the transgenic sheep. Similar 
successes in cows, which produce much more milk, has brought the 
potential for commercial production of virtually any protein into the 
realm of reality. 

Transfection technology is one mechanism for gene therapy by 
replacing defective genes in animals with functional genes (see 
Chapter 12). Problems concerning delivery, integration and regula- 
tion of the transgene including its appropriate expression in the 
right cells at the proper time during development and growth of 
the organism, must be brought under control before gene therapy 
becomes commonplace in humans. 


(b) 


Courtesy of Ralph L. Brinster, School of Veterinary Medicine, 
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(a) Transfection can introduce new genes into animals. The rat growth hormone gene carried on a plasmid is injected into a 
mouse oocyte or fertilized egg that is then implanted in a receptive female mouse. Integration of the plasmid into the mouse 
genome can be ascertained by Southern blot analysis of DNA from the newborn mouse. Expression of the foreign gene can 


be determined by assaying for the gene product, in this case, rat growth hormone. (b) Photograph showing a transgenic 
mouse with an active rat growth hormone gene (left). This transgenic mouse is twice the size of a normal mouse (right). 


Repair of Pyrimidine Dimers Formed by UV Light UV irradiation promotes the forma- 
tion of covalent bonds between adjacent thymine residues in a DNA strand, creating 
a cyclobutyl ring (Figure 28.28). Because the C—C bonds in this ring are shorter than 
the normal 0.34 nm base stacking in B-DNA, the DNA is distorted at this spot and 
is no longer a proper template for either replication or transcription. Photolyase (also 
called photoreactivating enzyme), a FAD-dependent enzyme, binds at the dimer and 
uses the energy of visible light to break the cyclobutyl ring, restoring the pyrimidines 


to their original form. 
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Figure 28.29 Base excision repair. A damaged base 
(m) is excised from the sugar-phosphate backbone by 
DNA glycosylase, creating an AP site. Then, an 
apurinic/apyrimidinic endonuclease severs the DNA 
strand, and an excision nuclease removes the AP site 
and several nucleotides. DNA polymerase I and 
DNA ligase then repair the gap. 
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Figure 28.28 UV irradiation causes dimerization of adjacent thymine bases. A cyclobutyl ring is 
formed between carbons 5 and 6 of the pyrimidine rings. Normal base pairing is disrupted by the 
presence of such dimers. 


Excision Repair Replacement of chemically damaged or modified bases occurs via two 
fundamental excision repair systems—base excision and nucleotide excision. As their 
names imply, these repair systems differ in whether just the base or the nucleoside at 
the point of damage is replaced. 

Base excision repair acts on single bases that have been damaged through oxidation 
or other chemical modifications during normal cellular processes. The damaged base is 
removed by DNA glycosylase, which cleaves the glycosidic bond, creating an AP site 
where the sugar—phosphate backbone is intact but a purine (apurinic site) or a pyrimi- 
dine (apyrimidinic site) is missing. An AP endonuclease then cleaves the backbone, an 
exonuclease removes the deoxyribose-P and a number of additional residues, and the 
gap is repaired by DNA polymerase and DNA ligase (Figure 28.29). The information 
of the complementary strand is used to dictate which bases are added in refilling the 
gap. In E. coli, DNA polymerase I binds at the gap and moves in the 5’-3’ direction, re- 
moving nucleotides with its 5’-exonuclease activity. The 5'—>3' DNA polymerase activity 
of DNA polymerase I fills in the sequence behind the 5’-exonuclease action. No net 
synthesis of DNA results, but this action of DNA polymerase I “edits out” sections of 
damaged DNA. Excision is coordinated with 5'—>3' polymerase-catalyzed replacement 
of the damaged nucleotides so that DNA of the right sequence is restored. 

Nucleotide excision repair recognizes and repairs larger regions of damaged DNA 
than base excision repair. The nucleotide excision repair (NER) pathway in humans 
consists of more than 30 proteins engaged in DNA damage verification, excision, gap 
filling, and ligation. The NER pathway cuts the sugar—-phosphate backbone of a 
DNA strand in two places, one on each side of the lesion, and removes the region. An 
oligonucleotide stretch 27 to 29 units long is removed. The resultant gap is then filled 
in using DNA polymerase (DNA polymerase I in prokaryotes or DNA polymerase 6 
or e and PCNA plus RFC in eukaryotes), and the sugar—phosphate backbone is co- 
valently closed by DNA ligase. 

In mammalian cells, nucleotide excision repair is the main pathway for removal of 
carcinogenic (cancer-causing) lesions caused by sunlight or other mutagenic agents. 
Such lesions are recognized by XPA protein, named for xeroderma pigmentosum, an 
inherited human syndrome whose victims suffer serious skin lesions if exposed to sun- 
light. XPA, a DNA-binding dimer of 31-kDa subunits, verifies that DNA damage has 
occurred and serves as a platform for recruitment of the multitude of other NER fac- 
tors to the damaged strand through protein-protein interactions. 


28.9 | What Is the Molecular Basis of Mutation? 


Genes are normally transmitted unchanged from generation to generation, owing 
to the great precision and fidelity with which genes are copied during chromosome 
duplication. However, on rare occasions, genetically heritable changes (mutations) 
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occur and result in alterations to the DNA. Most mutated genes function less effec- 
tively than the unaltered, wild-type allele, but occasionally, mutations arise that give 
the organism a selective advantage. When this occurs, they may be propagated for 
many generations. Together with recombination, mutation provides for genetic vari- 
ability within species and, ultimately, the evolution of new species. 

Mutations change the sequence of bases in DNA, either by the substitution of one 
base pair for another (so-called point mutations) or by the insertion or deletion of one 
or more base pairs (insertions and deletions). 


28.9a Point Mutations Arise by Inappropriate Base-Pairing 


Point mutations arise when a base pairs with an inappropriate partner. The two pos- 
sible kinds of point mutations are transitions, in which one purine (or pyrimidine) is 
replaced by another, as in A—>G (or T—>C), and transversions, in which a purine is 
substituted for a pyrimidine, or vice versa. 

Point mutations arise by the pairing of bases with inappropriate partners during 
DNA replication, by the introduction of base analogs into DNA, or by chemical muta- 
gens. Bases may rarely mispair (Figure 28.30), either because of their tautomeric prop- 
erties (see Chapter 10) or because of other influences. Even in mispairing the C;’—C,’ 
distances between bases must still be close to that of a Watson—Crick base pair (11 nm 
or so; see Figure 11.9) to maintain the mismatched base pair in the double helix. In 
tautomerization, for example, an amino group (—NH)), usually an H-bond donor, 
can tautomerize to an imino form (=NH) and become an H-bond acceptor. Or a keto 
group (C=O), normally an H-bond acceptor, can tautomerize to an enol C—OH, an 
H-bond donor. Proofreading mechanisms operating during DNA replication catch 
most mispairings. The frequency of spontaneous mutation in prokaryotes and eukary- 
otes (including humans) is about 1078 per base pair per generation. 
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Figure 28.30 Point mutations due to base mispairings. (a) An example based on tautomeric 
properties. The rare imino tautomer of adenine base pairs with cytosine rather than thymine. (1) The 
normal A-T base pair. (2) The A*-C base pair is possible for the adenine tautomer in which a proton 
has been transferred from the 6-NH, of adenine to N-1. (3) Pairing of C with the imino tautomer of 
A (A*) leads to a transition mutation (A-T to G-C) appearing in the next generation. (b) A in the syn 
conformation pairing with G (G is in the usual anti conformation). (c) T and C form a base pair by 
H-bonding interactions mediated by a water molecule. 
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Figure 28.32 (a) 2-Aminopurine normally base- 
pairs with T but (b) may also pair with cytosine 
through a single hydrogen bond. 
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Figure 28.33 Oxidative deamination of adenine in 
DNA yields hypoxanthine, which base-pairs with 
cytosine, resulting in an A-T to G-C transition. 
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Figure 28.31 5-Bromouracil usually favors the keto tautomer that mimics the base-pairing properties 
of thymine, but it frequently shifts to the enol form, whereupon it can base-pair with guanine, causing 
a T-A to C-G transition. 


28.9b Mutations Can Be Induced by Base Analogs 


Base analogs that become incorporated into DNA can induce mutations through 
changes in base-pairing possibilities. Two examples are 5-bromouracil (5-BU) and 
2-aminopurine (2-AP). 5-Bromouracil is a thymine analog and becomes inserted into 
DNA at sites normally occupied by T; its 5-Br group sterically resembles thymine’s 
5-methyl group. However, because 5-BU frequently assumes the enol tautomeric form 
and pairs with G instead of A, a point mutation of the transition type may be induced 
(Figure 28.31). Less often, 5-BU is inserted into DNA at cytosine sites, not T sites. 
Then, if it base-pairs in its keto form, mimicking T, a C—G to T-A transition ensues. 
The adenine analog, 2-aminopurine (recall that adenine is 6-aminopurine) normally 
behaves like A and base-pairs with T. However, 2-AP can form a single H bond of 
sufficient stability with cytosine (Figure 28.32) that occasionally C replaces T in DNA 
replicating in the presence of 2-AP. Hypoxanthine (Figure 28.33) is an adenine analog 
that arises in situ in DNA through oxidative deamination of A. Hypoxanthine base- 
pairs with cytosine, creating an A-T to G-C transition. 


28.9c Chemical Mutagens React with the Bases in DNA 


Chemical mutagens are agents that chemically modify bases so that their base-pairing 
characteristics are altered. For instance, nitrous acid (HNO;>) causes the oxidative de- 
amination of primary amine groups in adenine and cytosine. Oxidative deamination 
of cytosine yields uracil, which base-pairs the way T does and gives a C-G to T-A 
transition (Figure 28.34a). Hydroxylamine specifically causes C—G to T-A transitions 
because it reacts specifically with cytosine, converting it to a derivative that base-pairs 
with adenine instead of guanine (Figure 28.34c). Alkylating agents (Figure 28.34e) are 
also chemical mutagens. Alkylation of reactive sites on the bases to add methyl or 
ethyl groups alters their H bonding and hence base pairing. For example, methylation 
of O° on guanine (giving O°-methylguanine) causes this G to mispair with thymine, 
resulting in a G—C to A-T transition (Figure 28.34d). Alkylating agents can also in- 
duce point mutations of the transversion type. Alkylation of N’ of guanine labilizes its 
N-glycosidic bond, which leads to elimination of the purine ring, creating a gap in the 
base sequence. An enzyme, AP endonuclease, then cleaves the sugar—phosphate back- 
bone of the DNA on the 5’-side, and the gap can be repaired by enzymatic removal of 
the 5’-sugar—phosphate and insertion of a new nucleotide (see Figure 28.29). A trans- 
version results if a pyrimidine nucleotide is inserted in place of the purine during 
enzymatic repair of this gap. 


28.9d Insertions and Deletions 


The addition or removal of one or more base pairs leads to insertion or deletion muta- 
tions, respectively. If the number of base pairs inserted or deleted is not divisible by 
three, the codon triplet reading frame will be shifted. Such shifts cause frameshift 
mutations (misincorporation of all subsequent amino acids) in the protein encoded by 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


28.9 What Is the Molecular Basis of Mutation? 


(a) (b) Generic structure of nitrosoamines 
H H 
\ / Rı 
N—H O» H—N 
N—N=0 
HNO 7 
2 —H-N \ Ry 
N N Gi 
VA / 
C l 1 G i 1 
Cytosine Uracil Adenine 
(c) 
H H H 
\ / \ 
N N=H N Ove —N N 
A =o A f <H aN 
c, N Cy N= N 
oo We A a) 
HE Gy 
Adenine Hypoxanthine Cytosine Cytosine Adenine 
(d) (e) 
a2. joe o N, Nitrosoamines: 
Pairs 
normally J... H— Dimethylnitrosamine Diethylnitrosamine 
with Ne,’ 
cytosine 1 CON 
pakansi B= Guanine N—N~O N—N+}O 
i me omon, 
Alkylating agent 
Nitrosoguanidine: 
Hs 
o O=N—N H 
I Z 
‘ eee C—N 
Sometimes (............. N S N-nethyl-N’ -nitro- | \ 
pairs Gi N-nitrosoguanidine NO» 
with NH 
thymin N, S H — N 
H 
Nit i 
P mèthylguanine itrosourea ° Foros 
Ethyl nitrosourea HN — C — N 
Figure 28.34 Chemical mutagens. (a) HNO, (nitrous N=O 
acid) converts cytosine to uracil and adenine to 
hypoxanthine. (b) Nitrosoamines, organic compounds 
that react to form nitrous acid, also lead to the oxidative Alkyl Sulfates: 
deamination of A and C. (c) Hydroxylamine (NHOH) oO oO 
reacts with cytosine, converting it to a derivative that Ethylmethane | Dimethyl | 
base-pairs with adenine instead of guanine. The result sulfonate HsC — S$ —O —CH,CHs sulfate ie o—sS—O —'CH, 
is a C-G to T-A transition. (d) Alkylation of | | 
G residues to give O°-methylguanine, which base-pairs O Oo 
with T. (e) Alkylating agents include nitrosoamines, 
nitrosoguanidines, nitrosoureas, alkyl sulfates, and : 
nitrogen mustards. Note that nitrosoamines are Nitrogen Mustard: 
mutagenic in two ways: They can react to yield HNO,, or y, CH= CH, — Cl 
they can act as alkylating agents. The nitrosoguanidine, HC — N 
N-methyl-N'-nitro-N-nitrosoguanidine, is a very potent \ 
mutagen used in laboratories to induce mutations in NH, — CH — Cl 
experimental organisms such as Drosophila melanogaster. 


Ethylmethane sulfonate (EMS) and dimethyl sulfate are 
also favorite mutagens among geneticists. 
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the gene. Such mutations can arise if flat, aromatic molecules such as acridine orange 
insert themselves between successive bases in one or both strands of the double helix. 
This insertion or, more aptly, intercalation, doubles the distance between the bases as 
measured along the helix axis (see Figure 11.15). This distortion of the DNA results in 
inappropriate insertion or deletion of bases when the DNA is replicated. Disruptions 
that arise from the insertion of a transposon within a gene also fall in this category of 
mutation (see Figure 28.24). 


28.10 | Can Genetic Recombination Create 
Protein Diversity? 


Animals have evolved a way to exploit genetic recombination in order to generate 
protein diversity. This development was crucial to the evolution of the immune sys- 
tem. For example, the immunoglobulin genes are a highly evolved system for maxi- 
mizing protein diversity from a finite amount of genetic information. This diversity 
is essential for gaining immunity to the great variety of infectious organisms and 
foreign substances that cause disease. 


28.10a Cells Active in the Immune Response Are 
Capable of Gene Rearrangement 


Only vertebrates show an immune response. If a foreign substance, called an antigen, 
gains entry to the bloodstream of a vertebrate, the animal responds via a protective 
system called the immune response. The immune response involves production of pro- 
teins capable of recognizing and destroying the antigen. This response is mounted by 
certain white blood cells—the B- and T-cell lymphocytes and the macrophages. B-cells 
are so named because they mature in the bone marrow; T-cells mature in the thymus 
gland. Each of these cell types is capable of gene rearrangement as a mechanism for 
producing proteins essential to the immune response. Antibodies, which can recognize 
and bind antigens, are immunoglobulin proteins secreted from B-cells. Because anti- 
gens can be almost anything, the immune response must have an incredible repertoire 
of structural recognition. Thus, vertebrates must have the potential to produce immu- 
noglobulins of great diversity in order to recognize virtually any antigen. 


28.10b Immunoglobulin G Molecules Contain Regions 
of Variable Amino Acid Sequence 


Immunoglobulin G (IgG or y-globulin) is the major class of antibody molecules found 
circulating in the bloodstream. IgG is a very abundant protein, amounting to 12 mg 
per mL of serum. It is a 150-kDa a2B>-type tetramer. The a or H (for heavy) chain is 
50 kDa; the B or L (for light) chain is 25 kDa. A preparation of IgG from serum is het- 
erogeneous in terms of the amino acid sequences represented in its L and H chains. 
However, the IgG L and H chains produced from any given B lymphocyte are homoge- 
neous in amino acid sequence. L chains consist of 214 amino acid residues and are or- 
ganized into two roughly equal segments: the V; and C, regions. The V, designation 
reflects the fact that L chains isolated from serum IgG show variations in amino acid 
sequence over the first 108 residues, Vy, symbolizing this variable region of the L poly- 
peptide. The amino acid sequence for residues 109 to 214 of the L polypeptide is con- 
stant, as represented by its designation as the “constant light,” or CL, region. The heavy, 
or H, chains consist of 446 amino acid residues. Like L chains, the amino acid sequence 
for the first 108 residues of H polypeptides is variable, ergo its designation as the Vy 
region, while residues 109 to 446 are constant in amino acid sequence. This constant 
heavy region consists of three quite equivalent domains of homology designated Cyl, 
Cy2, and Cy3. Each L chain has two intrachain disulfide bonds: one in the V, region 
and the other in the Ci region. The C-terminal amino acid in L chains is cysteine, and 
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Figure 28.35 Diagram of the organization of the IgG 
molecule. Two identical L chains are joined with two 
identical H chains. Each L chain is held to an H chain 
via an interchain disulfide bond. The variable regions 
(purple) of the four polypeptides lie at the ends of the 
arms of the Y-shaped molecule. These regions are 
responsible for the antigen recognition function of the 
antibody molecules. The actual antigen-binding site is 
constituted from hypervariable residues within the Vg 
and Vy, regions. For purposes of illustration, some 
features are shown on only one or the other L chain or 
H chain, but all features are common to both chains. 


it forms an interchain disulfide bond to a neighboring H chain. Each H chain has four 
intrachain disulfide bonds, one in each of the four regions. Figure 28.35 presents a dia- 
gram of IgG organization. Within the variable regions of the L and H chains, certain 
positions are hypervariable with regard to amino acid composition. These hypervari- 
able residues occur at positions 24 to 34, 50 to 55, and 89 to 96 in the L chains and at 
positions 31 to 35, 50 to 65, 81 to 85, and 91 to 102 in the H chains. The hypervariable 
regions are also called complementarity-determining regions, or CDRs, because it is 
these regions that form the structural site that is complementary to some part of an 
antigen’s structure, providing the basis for antibody—antigen recognition. 

The secondary structure of immunoglobulin chains is dominated by B-strands. In 
terms of its tertiary structure, the IgG molecule is composed of 12 discrete collapsed 
B-barrel domains. Within each domain, alternating B-strands are antiparallel to one 
another, a pattern known by the name Greek key motif. The characteristic structure of 
this domain is referred to as the immunoglobulin fold (Figure 28.36). Each of IgG’s two 
heavy chains contributes four of these domains and each of its light chains contributes 
two. In the immunoglobulin genes, the arrangement of exons correlates with IgG struc- 
tural organization. The four variable-region domains (one on each chain) are encoded 
by multiple exons, but the eight constant-region domains are each the product of a 
single exon. All of these constant-region exons are derived from a single ancestral exon 
encoding an immunoglobulin fold. The major variable-region exon probably derives 
from this ancestral exon also. Contemporary immunoglobulin genes are a consequence 
of multiple duplications of the ancestral exon. 

The discovery of variability in amino acid sequence in otherwise identical polypep- 
tide chains was surprising and almost heretical to protein chemists. For geneticists, it 
presented a genuine enigma. They noted that mammals, which can make millions of 
different antibodies, don’t have millions of different antibody genes. How can the mam- 
malian genome encode the diversity seen in L and H chains? 


28.10c The Immunoglobulin Genes Undergo Gene Rearrangement 


The answer to the enigma of immunoglobulin sequence diversity is found in the orga- 
nization of the immunoglobulin genes. The genetic information for an immunoglobu- 
lin polypeptide chain is scattered among multiple gene segments along a chromosome 
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Figure 28.36 The characteristic collapsed B-barrel domain known as the immunoglobulin fold. 
The B-barrel structures for both (a) variable and (b) constant regions are shown. (c) A schematic 
diagram of the 12 collapsed B-barrel domains that make up an IgG molecule. CHO indicates the 
carbohydrate addition site; Fa denotes one of the two antigen-binding fragments of IgG, and F,, 
the proteolytic fragment consisting of the pairs of Cy2 and Cy3 domains. 


in germline cells (sperm and eggs). During vertebrate development and the formation 
of B lymphocytes, these segments are brought together and assembled by DNA 
rearrangement (that is, genetic recombination) into complete genes. DNA rearrange- 
ment, or gene reorganization, provides a mechanism for generating a variety of pro- 
tein isoforms from a limited number of genes. DNA rearrangement occurs in only a 
few genes, namely, those encoding the antigen-binding proteins of the immune re- 
sponse—the immunoglobulins and the T-cell receptors. The gene segments encoding 
the amino-terminal portion of the immunoglobulin polypeptides are also unusually 
susceptible to mutation events. The result is a population of B-cells whose antibody- 
encoding genes collectively show great sequence diversity even though a given cell can 
make only a limited set of immunoglobulin chains. Hence, at least one cell among the 
B-cell population likely will be capable of producing an antibody that will specifically 
recognize a particular antigen. Both the T-cell receptor proteins, which recognize 
antigens presented by the major histocompatibility complex (MHC) of other cells, and 
the genes that encode these T-cell receptor proteins are organized similarly to the IgG 
proteins and genes. Thus, T-cells also show great diversity in antigen recognition. 
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28.10d DNA Rearrangements Assemble an L-Chain 
Gene by Combining Three Separate Genes 


The organization of various immunoglobulin gene segments in the human genome is 
shown in Figure 28.37. L-chain variable-region genes are assembled from two kinds 
of germline genes: Vg and J, (J stands for joining). In mammals, there are two differ- 
ent families of L-chain genes: the K, or kappa, gene family and the A, or lambda, gene 
family; each family has V and J members. These families are on different chromo- 
somes. Humans have 40 functional V genes and 5 functional J genes for the x light 
chains and 31 V genes and 4 J genes for the A light chain. The V and J genes lie up- 
stream from the single C, gene that encodes the L-chain constant region. Each V, 
gene has its own L, segment for encoding the L-chain leader peptide that targets the 
L chain to the endoplasmic reticulum for IgG assembly and secretion. (This leader 
peptide is cleaved once the L chain reaches the ER lumen.) The A family of L-chain 
genes is organized similarly. In different mature B-lymphocyte cells, V, and J, genes 
have joined in different combinations, and along with the C—V, gene, form complete 
L-V, chains with a variety of V, regions. However, any given B lymphocyte expresses 
only one V,—J,, combination. Construction of the mature B-lymphocyte L-chain gene 
has occurred by DNA rearrangements that combine three genes (L—V,..,, Jia, Cra) to 
make one polypeptide! 


28.10e DNA Rearrangements Assemble an H-Chain 
Gene by Combining Four Separate Genes 


The first 98 amino acids of the 108-residue, H-chain variable region are encoded by a 
Vu gene. Each Vy gene has an accompanying Ly gene that encodes its essential leader 
peptide. It is estimated that there are 200 to 1000 Vy genes and that they can be 
subdivided into eight distinct families based on nucleotide sequence homology. The 
members of a particular Vy family are grouped together on the chromosome, sepa- 
rated from one another by 10 to 20 bp. In assembling a mature H-chain gene, a Vy 
gene is joined to a D gene (D for diversity), which encodes amino acids 99 to 113 of the 
H chain. These amino acids comprise the core of the third CDR in the variable region 
of H chains. The V-D gene assemblage is linked in turn to a Jy gene, which encodes 
the remaining part of the variable region of the H chain. The Vy, D, and Jy genes are 


(a) 
VK, Vko Vkg Vky Jk Ck i , 
HHH- HEL ie 
l | 
Vki VKo Vks Te Ck 


Hi Hal H ee 


(b) 


Germline 
locus 


rearranged 


rearranged 


1021 


Figure 28.37 Organization of human 
immunoglobulin gene segments. Green, orange, 
blue, or purple colors indicate the exons of a 
particular V; or Vy gene. (a) L-chain gene 
assembly: During B-lymphocyte maturation in the 
bone marrow, one of the 40 V genes combines with 
one of the 5 J genes and is joined with a C gene. 
During the recombination process, the intervening 
DNA between the gene segments is deleted (see 
Figure 28.39). These rearrangements occur by a 
mostly random process, giving rise to many 
possible light-chain sequences from each gene 
family. (b) H-chain gene assembly: H chains are 
encoded by V, D, J, and C genes. In H-chain gene 
rearrangements, a D gene joins with a J gene and 
then one of the V genes adds to the DJ assembly. 
(Adapted from Figure 2b and c in Nossal, G. J. V., 
2003. The double helix and immunology. Nature 
421:440-444.) 
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Figure 28.38 Consensus elements are given above 
and below germline variable-region genes that 
recombine to form genes encoding immunoglobulin 
chains. These consensus elements are 
complementary and are arranged in a heptamer- 
nonamer, 12 to 23-bp spacer pattern. (Adapted 
from Tonewaga, S., 1983. Somatic generation of 
antibody diversity. Nature 302:575.) 


Chapter 28 DNA Metabolism: Replication, Recombination, and Repair 


grouped in three separate clusters on the same chromosome. The four Jy genes lie 
7 kb upstream of the eight C genes, the closest of which is C,,. Any of four C genes may 
encode the constant region of IgG H chains: Cy, Cy2a, C22, and Cy. Each C gene is 
composed of multiple exons (only C, is shown in Figure 28.37, none of the other C 
genes). Ten to 20 D genes are found 1 to 80 kb farther upstream. The Vy genes lie even 
farther upstream. In B lymphocytes, the variable region of an H-chain gene is com- 
posed of one each of the Ly—-Vy genes, a D gene, and a Jy gene joined head to tail. 
Because the H-chain variable region is encoded in three genes and the joinings can 
occur in various combinations, the H chains have a greater potential for diversity 
than the L-chain variable regions that are assembled from just two genes (for example, 
L,-V, and J,). In making H-chain genes, four genes have been brought together and 
reorganized by DNA rearrangement to produce a single polypeptide! 


28.10f V-] and V-D-}] Joining in Light- and Heavy-Chain 
Gene Assembly Is Mediated by the RAG Proteins 


Specific nucleotide sequences adjacent to the various variable-region genes suggest 
a mechanism in which these sequences act as joining signals. All germline V and D 
genes are followed by a trio of segments consisting of a consensus CACAGTG hep- 
tamer, a short spacer, and a consensus ACAAAAACA nonamer. Similarly, all D and J 
genes are preceded by a trio of segments, in this case, a consensus GGTTTTTGT nona- 
mer, a short spacer, and a consensus CACTGTG heptamer. The spacers in V,., J,, and D 
genes are 12 bp long, whereas the Vy, Jk, Vn, and Jy spacers are 23 bp (Figure 28.38). 
Note that the conserved consensus elements (namely, the heptamer and nonamer se- 
quences) downstream of a gene are complementary to those upstream from the gene 
with which it recombines. Indeed, it is these complementary consensus sequences that 
serve as recombination signal sequences (RSSs) and determine the site of recombination 
between variable-region genes. Functionally meaningful recombination happens only 
where one has a 12 bp spacer and the other has a 23 bp spacer, a relationship referred to 
as the 12/23 rule (Figure 28.39). 

Lymphoid cell-specific recombination-activating gene proteins 1 and 2 (RAGI and 
RAG2) mediate V(D)J gene recombination, functioning together as the V(D)J recombi- 
nase, also called the RAG recombinase. RAG1 (1040 residues) and RAG2 (527 residues) 
form RAGI1/RAG2 dimers. RAGI has a DNA-binding domain specific for the nanomer 
elements in RSSs, and RAG2 enhances DNA binding by RAGI and participates in 
DNA cleavage (Figure 28.39). One RAGI/RAG2 dimer binds at each nonamer in an 
RSS (Figure 28.39b). HMGBI, an abundant nuclear protein involved in DNA organi- 
zation and transcriptional regulation, plays an important role in formation of the cata- 
lytically active RAGI/RAG2 recombinase. When HMGBI binds to DNA, it forces the 
DNA to bend, facilitating dimerization of the two RAGI/RAG?2 dimers into the cata- 
lytically active (RAGI/RAG2), heterotetramer, looping out of the 12- and 23-bp spacers 
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Figure 28.39 A model for V(D)J recombination. Each DNA strand in the immunoglobulin gene is 
shown as a white bar with color-coded regions. (a) Diagram of RSS consensus elements between V 
(yellow) and J (purple) immunoglobulin gene-coding segments, with complementary RSS heptamer 
repeats (magenta) and nonamer repeats (green). (b) RAG1/RAG2 heterodimers bind to the nonamer 


RSSs and in the presence of HMGB1, DNA bending and formation of (RAGI/RAG2), 


heterotetramers juxtaposes the complementary heptamer and nonamer RSSs, looping out the 12- and 
23-bp spacers. (c) This process of synapsis brings the V and J coding regions into proximity. (d) The 
(RAG1/RAG2), complex nuclease activities cleave the DNA at the borders of V and J protein-coding 
sequences and the heptamer repeats. The products of (RAGI/RAG2), DNA cleavage are novel: The 


DNA bearing the recombination signal sequences has blunt ends, whereas the both the V- and 


J-coding DNA has hairpin ends. That is, the two strands of the V- and J -coding DNA segments are 


covalently joined. (e) In the next step of the recombination process, the DSB at the RSS ends 


(magenta) are precisely joined to make a covalently closed circular dsDNA (the signal joint), but the 
hairpins at the V- and J-coding ends are opened. Opening these hairpins makes a second double- 
stranded break (DSB). The two ends of the coding DNA at this DSB undergo further processing, 


with the addition or removal of nucleotides. This processing means that joining of the V- and 


J-coding ends will be imprecise, providing an additional means for introducing antibody diversity. 


(f) Finally, the V- and J- coding segments are joined to create a recombinant immunoglobulin- 


encoding gene. The processing and joining reactions require the proteins involved in NHEJ DSB 
repair, namely, Ku70/80, DNA-PK, DNA ligase 4, and others (see Figure 28.25). 
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and aligning the homologous heptamer and nonamer regions in process called synapsis 
(Figure 28.39c). Each RAGI has a DNA cleavage catalytic site, with RAG2 contribut- 
ing to active-site formation. Together they recognize and cleave the DNA at the junctions 
between the recombination signal sequences and the V- and J-coding regions. Specifi- 
cally, (RAGI/RAG2), nuclease activity nicks one strand of V-(and J-) DNA at the RSS- 
coding region junction, creating a free 3'-OH that attacks the phosphodiester backbone 
of the complementary V- or J-strand to form the hairpin. DNA strand nicking and V- 
and J-hairpin formation causes formation of double-stranded breaks (DSBs) at the 
heptamer:V-and J-junctions (Figure 28.39d). In the next steps in the recombination pro- 
cess, the two RSS ends are precisely joined to make a covalently closed circular ds DNA 
(called the signal joint). In contrast, the hairpins at the V- and J-coding ends are opened 
(Figure 28.39e). Opening these hairpins is equivalent to making a double-stranded 
break. The two ends of the coding DNA at the DSB undergo further processing, with 
the addition or removal of nucleotides. This processing means that joining of the V- and 
J-coding ends will be imprecise, an additional mechanism for generating antibody diver- 
sity. Finally, the V- and J-coding segments are covalently joined to create a recombinant 
immunoglobulin-encoding gene. These processing and joining reactions are carried out 
by the same proteins involved in NHEJ DSB repair, namely, Ku70/80, DNA-PK, DNA 
ligase 4, and others (see Figure 28.25). The molecular structure of the mammalian 
(RAGI1/RAG2), heterotetramer is shown in Figure 28.40. 


Figure 28.40 Molecular graphic of the vertebrate RAG1/RAG2 heterotetramer (or more 
appropriately, the (RAGI/RAG2), dimer of dimers) bound to DNA. RAGI/RAG2 is a member of 
the DDE family of polynucleotide transferases that includes transposases and integrases. The RAGI 
subunits are orange and teal; the RAG2 subunits are above their RAGI partners (red and blue). The 
DNA, shown in green, consists of double-stranded oligonucleotides with sequences corresponding to 
12- and 23-RSS substrates carrying consensus heptamer and nonamer repeats. Binding of (RAGI/ 
RAG2), to the RSS nonamers drives conformational changes in RAG] that allow the fit of only one 
12-RSS and one 23-RSS, providing a molecular explanation for the 12/23 rule. Each RAGI contains a 
RNase H-like motif midway along its amino acid sequence. These RAGI RNase H domains are the 
(RAG1/RAG2), endonuclease sites responsible for nicking the DNA, initiating the recombinant 
events. (PDB id = 6DBI. Adapted from Figure 2 in Ru, H., et al., 2018. DNA melting initiates the 
RAG catalytic pathway. Nature Structural and Molecular Biology 25:732-742.) 
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28.10g Imprecise Joining of Immunoglobulin Genes 
Creates New Coding Arrangements 


Joining of the ends of the immunoglobulin-coding regions during gene reorganiza- 
tion is somewhat imprecise, as discussed above and illustrated in Figure 28.39. This 
imprecision actually leads to even greater antibody diversity because new coding 
arrangements result. Position 96 in x chains is typically encoded by the first triplet in 
the J, element. Most x chains have one of four amino acids here, depending on which 
J, gene was recruited in gene assembly. However, occasionally only the second and 
third bases or just the third base of the codon for position 96 is contributed by the J, 
gene, with the other one or two nucleotides supplied by the V, segment (Figure 28.41). 
So, the precise point where recombination occurs during gene reorganization can vary 
over several nucleotides, creating even more diversity. 


28.10h Antibody Diversity Is Due to Immunoglobulin 
Gene Rearrangements 


Taking as an example the mouse with perhaps 300 V, genes, 4 J, genes, 200 Vy genes, 
12 D genes, and 4 Jy genes, the number of possible combinations is given by 
300 x 4 X 200 xX 12 x 4. Thus, more than 10’ different antibody molecules can be 
created from roughly 500 or so different mouse variable-region genes. Including the 
possibility for V,—J,, joinings occurring within codons adds to this diversity, as does 
the high rate of somatic mutation associated with the variable-region genes. (Recall 
that somatic mutations are mutations that arise in diploid cells and are transmitted to 
the progeny of these cells within the organism, but not to the offspring of the organ- 
ism.) Clearly, gene rearrangement is a powerful mechanism for dramatically enhanc- 
ing the protein-coding potential of genetic information. 
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Figure 28.41 Recombination between the V, and J, 
genes can vary by several nucleotides, giving rise to 
variations in amino acid sequence, and hence, 
diversity in immunoglobulin L chains. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 28. 


DNA is the material of heredity. 


The three Rs of DNA metabolism—teplication, recombination, and 
repair—account for the transmission, rearrangement, and stability of 
genetic information. 


DNA replication is semiconservative: The DNA strands are separated 
and then the nucleotide sequence of each strand is copied by base- 
pairing. 


DNA replication is bidirectional, accomplished by two replication 
forks moving in opposite directions. 


DNA gyrases and helicases unwind the DNA double helix so the 
individual strands can be copied. 


Single-stranded DNA-binding protein (SSB) prevents DNA reanneal- 
ing and protects the single strands from nuclease attack. 


DNA replication is semidiscontinuous: One parental strand, the 
3'—>5' strand, is copied in a continuous manner to form the lead- 
ing strand. The other, the 5'—3' strand, is copied in an intermittent 
fashion to form the lagging strand. 


The products of lagging-strand synthesis are several kilobases in 
length and are called Okazaki fragments. 


DNA polymerases select the proper base for replication of a strand 
through Watson—Crick geometric interactions. 


DNA synthesis by DNA polymerase always proceeds in the 5'—>3' 
direction, copying the template strand in the 3'—5' direction. 


DNA polymerases require a primer strand with a free 3'-OH to build 
upon. 


Prokaryotes and eukaryotes have multiple DNA polymerases, each 
serving specific functions in DNA replication and repair. 


E. coli DNA polymerase HI holoenzyme replicates the ŒE. coli 
chromosome. 


E. coli DNA polymerase III holoenzyme consists of 10 different kinds 
of subunits: 2 core (we?) DNA polymerase III units, one clamp loader 
(y—complex), and a B>-sliding clamp to tether the holoenzyme to DNA. 


Each E. coli DNA polymerase III holoenzyme is associated with a 
hexameric DnaB helicase that unwinds the DNA duplex and three 
primase subunits that synthesize RNA primers. 


E. coli DNA polymerase III holoenzyme is highly processive: It can rep- 
licate the entire E. coli chromosome without dissociating from the DNA. 


A DNA polymerase HI holoenzyme sits at each replication fork. 
E. coli DNA polymerase I removes the RNA primers and fills in the gaps. 
DNA polymerase is its own proofreader. 


E. coli DNA polymerase I activity is important in double-strand 
break repair (DSB repair). 


E. coli DNA ligase seals the links between Okazaki fragments. 
E. coli DNA replication terminates at the Ter region. 
DNA polymerases are localized in replication factories, called replicons. 


DNA polymerases across all organisms have a common architecture 
and mechanism of action. 


DNA polymerases can be organized into families based on their 
sequence homology. 


In eukaryotic cells, the cell cycle determines the time of DNA replica- 
tion through the action of checkpoints, CDKs and cyclins. 


Initiation of replication in eukaryotes depends on ORC (the origin 
of replication complex), the CMG helicases (replication licensing fac- 
tors), Cdc6, Cdt1, and other proteins. 


The MCM proteins, the Ore proteins, and Cdc6 are AAA+ ATP 
family members. 


Geminin inhibits DNA replication by preventing MCM recruitment 
to the prereplication complex. 


The two replisomes in eukaryotic replicons are bound together by 
Ctf4. 


DNA polymerase ô synthesizes the lagging strand, and DNA poly- 
merase &, the leading strand. 


Telomerase, an RNA-dependent DNA polymerase, replicates telo- 
meres, the structures at the ends of eukaryotic chromosomes. 


Telomerase consists of a DNA polymerase catalytic subunit (TERT) 
and an RNA subunit (RT or TER) that serves as the template for 
RNA-dependent synthesis of new telomeres. 


RNA genomes are replicated by reverse transcriptase, an RNA- 
dependent DNA polymerase. 


Reverse transcriptases possess RNA-directed DNA polymerase activ- 
ity, RNase H activity, and DNA-directed DNA polymerase activity. 


Genetic information is rearranged by recombination. 


Recombination between very similar DNA sequences is called 
homologous recombination or general recombination. 


Recombination between dissimilar DNA sequences is called nonho- 
mologous recombination. 


General recombination requires the breakage and reunion of DNA 
strands. 


Homologous recombination proceeds according to the Holliday 
model, with formation of a Holliday junction. 


Resolution of the Holliday junction leads to formation of patch 
recombinants or splice recombinants. 


The enzymes of general recombination include RecA, RecBCD, 
RuvA, RuvB, and RuvC. 


The RecBCD complex unwinds double-stranded DNA and cleaves 
its single strands. 


RecA can bind single-stranded DNA and then interact with double- 
stranded DNA. 


RecA mediates the ATP-dependent strand exchange reaction leading 
to formation of a Holliday junction. 


RecA binding to ssDNA leads to formation of a right-handed helical 
nucleoprotein filament in which the ssDNA is stretched and under- 
wound. 


RuvA, RuvB, and RuvC resolve the Holliday junction to form recom- 
bination products. 


Recombination-dependent DNA replication restarts DNA replication 
at stalled replication forks. 


Transposons are DNA sequences that can move from place to place 
in the genome by recombination. 


DNA is susceptible to damage by chemical reactions, UV light, or 
ionizing radiation. 


DNA can be repaired by a number of different mechanisms, including 
mismatch repair (MMR), base excision repair (BER), and double- 
strand break repair (DSB repair). 
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DSB repair can be carried out by either nonhomologous end-joining ° 
(NHEJ) or homologous recombination. 


NHEJ DSB repair is mediated Ku70/80, DNA-PK, DNA ligase 4, 
along with other proteins. 


Mismatch repair corrects errors introduced during DNA replication. 
Pyrimidine dimers formed by UV light are repaired by photolyase. 


Base excision repair involves removal of the damaged base by DNA œ 
glycosylase, cleavage of the backbone by AP endonuclease, and gap 
repair by DNA polymerase. 


Nucleotide excision repair recognizes and removes larger regions of e 
damaged DNA than BER. 


Point mutations in DNA arise by inappropriate base-pairing. 


Mutations can be induced by base analogs. 


Problems 1027 
Chemical mutagens react with bases in DNA. 

Insertions or deletions lead to frameshift mutations. 

Protein diversity can be generated through genetic recombination. 


Antibody molecules such as immunoglobulins G and T-cell recep- 
tors are an excellent examples of protein diversity through genetic 
recombination. 


The variable regions of immunoglobulin G light and heavy chains are 
regions of variable amino acid sequence created by genetic recombi- 
nation within the immunoglobulin genes. 


V-J and D-J joining in light- and heavy-chain gene assembly is medi- 
ated by the (RAGI/RAG2), heterotetramer and the proteins of the 
NHEJ DSB repair pathway. 


Imprecise joining of IgG genes creates new coding arrangements. 


Answers to all problems are at the end of this book. Detailed solutions are 8. 


available in the Student Solutions Manual. 


1. 


Semiconservative or Conservative DNA Replication '°N-labeled E.coli 9. 


DNA has a density of 1.724 g/mL and '4N-labeled E. coli DNA has 
a density of 1.710 g/mL. E. coli cells grown for many generations on 
4NH,°* as nitrogen source are transferred to media containing 
'SNH,°* as the sole source of nitrogen. (a) What will be the density of 
the DNA after one generation, assuming replication is semiconserva- 
tive? (b) Suppose replication took place by a conservative mechanism 
in which the parental strands remained together and the two progeny 
strands were paired. Design an experiment that could distinguish 
between semiconservative and conservative modes of replication. 


The Enzymatic Activities of DNA Polymerase | (a) What are the 
respective roles of the 5’-exonuclease and 3’-exonuclease activities of 
DNA polymerase I? (b) What might be a feature of an E. coli strain 
that lacked DNA polymerase I 3’-exonuclease activity? 


12. 
. Multiple Replication Forks in E. coli | (a) Assuming DNA replication 


proceeds at a rate of 750 base pairs per second, calculate how long it 
will take to replicate the entire E. coli genome. (b) Under optimal 
conditions, E. coli cells divide every 20 minutes. What is the minimal 
number of replication forks per E. coli chromosome to sustain such a 
rate of cell division? 


Multiple Replication Forks in E. coli Il On the basis of Figure 28.2, 
draw a simple diagram illustrating replication of the circular E. coli 
chromosome (a) at an early stage, (b) when one-third completed, 

(c) when two-thirds completed, and (d) when almost finished, 
assuming the initiation of replication at oriC has occurred only once. 
Then, draw a diagram showing the E. coli chromosome in problem 3 
where the E. coli cell is dividing every 20 minutes. 


. Molecules of DNA Polymerase III per Cell vs. Growth Rate It is 


estimated that there are 40 molecules of DNA polymerase III per 
E. coli cell. Is it likely that the growth rate of E. coli is limited by 
DNA polymerase II availability? 


15. 


The Role of ATP in Replication DNA replication is an energy- 
consuming process. What steps in the overall process of DNA 
replication require ATP hydrolysis? 


Number of Okazaki Fragments in E. coliand Human DNA Replication 
Approximately how many Okazaki fragments are synthesized in the 
course of replicating an E. coli chromosome? How many in replicat- 
ing an average human chromosome? 


10. 


13. 


14. 


16. 


The Roles of Helicases and Gyrases How do DNA gyrases and 
helicases differ in their respective functions and modes of action? 


Gyrases vs Helicases Why might the action of gyrases be considered 
riskier in terms of DNA integrity than the action of helicases? 


Human Genome Replication Rate Assume DNA replication proceeds 
at a rate of 100 base pairs per second in human cells and origins of 
replication occur every 300 kbp. Assume also that human DNA 
polymerases are highly processive and only two molecules of DNA 
polymerase are needed per replication fork. How long would it take to 
replicate the entire diploid human genome? How many molecules of 
DNA polymerase does each cell need to carry out this task? 


. Heteroduplex DNA Formation in Recombination From the informa- 


tion in Figures 28.17 and 28.18, diagram the recombinational event 
leading to the formation of a heteroduplex DNA region within a 
bacteriophage chromosome. 


Homologous Recombination, Heteroduplex DNA, and Mismatch 
Repair Homologous recombination in £. coli leads to the formation 
of regions of heteroduplex DNA. By definition, such regions contain 
mismatched bases. Why doesn’t the mismatch repair system of E. coli 
eliminate these mismatches? 


RecA:DNA Nucleoprotein Helix/B-DNA Comparison If RecA protein 
unwinds duplex DNA so that there are about 18.6 bp per turn, what 
is the change in Ad, the helical twist of DNA, compared with its 
value in B-DNA? 


Terminology Determine whether the following proteins or protein 
complexes play a role in DNA replication (prokaryotic and eukary- 
otic), DNA repair (including stalled replication forks), or DNA 
recombination (including Ig recombination) & transposition: 

Rad54; Cdc45; DUE; Ku70/80; XPA protein; S-CDK; RecA; photoly- 
ase; Sld2 and Sld3; PriA; geminin; Rad51; Cdc6; DNA glycosylase; 
Dpb11; SSB; GINS complex; RAGI and RAG2; RuvC; Tus; DNA 
ligase 4; BRCA; MCM proteins. 


Resolution of a Holliday Junction by Resolvase Diagram a Holliday 
junction between two duplex DNA molecules, and show how the 
action of resolvase might give rise to either patch- or splice recombi- 
nant DNA molecules. 


Chemical Mutagenesis of DNA Bases Show the nucleotide sequence 
changes that might arise in a dsDNA (coding strand segment 
GCTA) upon mutagenesis with (a) HNO,, (b) bromouracil, and 

(c) 2-aminopurine. 
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17. Transitions and Transversions Consider the original DNA sequence 
shown and the versions of this sequence (a—d) underneath. Indicate 
where a transition and/or a transversion has occurred in each 
version. 


Original: 
5’-AGTTGACATAGCATGCATGGATTACGA 
CAGTACGATA-3’ 


3’-TCAACTGTATCGTACGTACCTAATGCTG 
TCATGCTAT-' 


a S'-AGTTGCCATAGCATGCATGGATCACG 
ACAGTACAATA-3' 
3'-TCAACTGTATCGTACGTACCTAATAC 
TGTCATGCTAT-S' 

b S'-AGTTGACATAGCATGGATGGATTACG 
ACAGTACAATA-3’ 
3’-TCAAATGTATCGTACGTACCTAATGC 
TGTCATGCTAT-S’ 

c. S'-AGTTGACATAGCATCCATGGATTACG 
ACAGTATGATA-3’ 
3'-TCAATTGTATCGTACGTACCTAATGCT 
ATCATGCTAT-S’ 

d. S'-AGTTGACACAGCATGCATGGATTACG 
ACAGTACGACA-3' 
3'-TCAACTGTATCGTAAGTACCTAATGCT 
ATCATGCTAT-S' 


Transposons as Mutagens Transposons can act as mutagenic agents. 
How do they do this? 


18. 


19. Recombination in Immunoglobulin Genes If recombination between a 
V, and J, gene formed a CCA codon at codon 95 (Figure 28.41), 


which amino acid would appear at this position? 


20. IgG in ER (Integrates with Chapter 7) Post-synthesis, IgG proteins are 
transported to the endoplasmic reticulum for further processing. 
From a review of Figure 28.36, give at least one type of processing 
IgG proteins undergo in the ER. Discuss potential roles for IgG 


processing in the ER. 


21. Helicase Unwinding of the E. coli Chromosome Hexameric helicases, 
such as DnaB, the MCM proteins, and papilloma virus E1 helicase 
(illustrated in Figures 16.22 to 16.26), unwind DNA by passing one 
strand of the DNA duplex through the central pore, using a 
mechanism based on ATP-dependent binding interactions with 
the bases of that strand. The genome of E. coli K12 consists of 
4,686,137 nucleotides. Assuming that DnaB functions like papilloma 
virus El helicase, from the information given in Chapter 16 on 
ATP-coupled DNA unwinding, calculate how many molecules 
of ATP would be needed to completely unwind the E. coli K12 
chromosome. 


22. Translesion DNA Polymerase IV Experiment Asako Furukohri, Myron 


F. Goodman, and Hisaji Maki wanted to discover how the translesion 


DNA polymerase IV takes over from DNA polymerase III at a stalled 
replication fork (see Journal of Biological Chemistry 283:11260-11269, 
2008). They showed that DNA polymerase IV could displace DNA 
polymerase III from a stalled replication fork formed in an in vitro 
system containing DNA, DNA polymerase III, the B-clamp, and SSB. 
Devise your own experiment to show how such displacement might be 
demonstrated. (Hint: Assume that you have protein identification 
tools that allow you to distinguish easily between DNA polymerase 
III and DNA polymerase IV.) 


Functional Consequences of Y-Family DNA Polymerase Structure The 
eukaryotic translesion DNA polymerases fall into the Y family of 
DNA polymerases. Structural studies reveal that their fingers and 
thumb domains are small and stubby (see Figure 28.10). In addition, 
Y-family polymerase active sites are more open and less constrained 
where base pairing leads to selection of a dNTP substrate for the 
polymerase reaction. Discuss the relevance of these structural 
differences. Would you expect Y-family polymerases to have 
3'-exonuclease activity? Explain your answer. 


23. 


24. The Eukaryotic Cell Cycle and Checkpoints Figure 28.11 depicts the 

eukaryotic cell cycle. Many cell types exit the cell cycle and don’t 

divide for prolonged periods, a state termed Go; some, for example 

neurons, never divide again. 

a. In what stage of the cell cycle do you suppose a cell might be 
when it exits the cell cycle and enters Go? 

b. The cell cycle is controlled by checkpoints, cyclins, and CDKs. 
Describe how biochemical events involving cyclins and CDKs 
might control passage of a dividing cell through the cell cycle. 


25. Recombination Between V, and J, Genes and IgG Amino Acid 
Sequence Diversity Figure 28.41 gives some examples of recombina- 
tion in IgG codons 95 and 96, as specified by the V, and J, genes. List 
the codon possibilities and the amino acids encoded if recombination 


occurred in codon 97. Which of these possibilities is less desirable? 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


26. Using the PSAFE module for reverse transcriptase, describe the 
function of the two magnesium ions in the active site. 


27. Using the PSAFE module for Replication Factor C (RFC), describe 
how RFC interacts with PCNA to recognize the primer-template 
junction during DNA replication. 


Think-Pair-Share Question 


Detecting Mutagens and Their Effects on DNA In broad terms, describe 
experiments designed to evaluate whether a particular environmental 
factor (chemical, radiation, etc.) induces mutations in DNA. Focus on 
detecting mutations that 1) affect DNA in general or 2) affect a specific 
gene. (Review chapters 11 and 12 for some techniques related to DNA or 
genome analysis.) 
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Transcription 29 
and the Regulation 
of Gene Expression 


Now that we have all this useful information, 
it would be nice to do something with it. 


From the Unix Programmer’s Manual 


Po m or 


+ vite = 


Di: ae ean heer 
YN SA ALI oe es 


TROCHE NNA 


Xare kolani arty feys 4A ia vfo ksn SPEY 
Jiem PZR le Ciu YA egen (ay J 
Reinet aa e <r Sona ur ti nobia) vfs hree forms: hieroglyphs, Demotic (the 
% kiya. L lea = J, 1 } ‘Wo? tear noya LSA s ano conventional Egyptian script of the time), 
% ORB Sz > S22 tal Nofs 2 Sgr and Greek (the Greeks ruled Egypt in 
re veh ANd be oe > ns Ag tuhat 96 B.c.). The Rosetta stone represents 
Ww US [ZAS JJoue- “ue at Aa Lats 3.{% he transcription of hieroglyphic symbols 
28 hy Ejen uZ <ia js sy TAn ALRT nye ajre into two living languages. Shown here is 


Sn len <a> paints oe vole. a Ae pseu bit part of the interface where hieroglyphs 
and Demotic meet. 


< The Rosetta stone, inscribed in 196 B.c. 
The writing on the Rosetta stone is in 


The Rosetta Stone, from Fort St. Julien, El-Rashid (Rosetta) 196 BC (see also 138897), Egyptian 


Ptolemaic Period (332-30 BC)/British Museum, London, UK/The Bridgeman Art Library 


Expression of the information encoded in DNA depends on transcription of that information 29.1 How Are Genes Transcribed 
into RNA. How are the genes of prokaryotes and eukaryotes transcribed to form RNA? in Bacteria? 
29.2 How Is Transcription Regulated 

in Bacteria? 
29.3 How Are Genes Transcribed 
Il cells contain three major classes of RNA that participate in protein synthesis: mes- in Eukaryotes? 

senger RNA (mRNA), ribosomal RNA (rRNA), and transfer RNA (tRNA) (see 29.4 How Do Gene Regulatory Proteins : 
Chapter 10). Further, all RNAs are synthesized from DNA templates by DNA-dependent Recognize aprons DNA sequences: 
RNA polymerases in the process known as transcription. However, only mRNAs direct 29.5 How Are Eukaryotic Transcripts 

; : : i : è : Processed and Delivered 
the synthesis of proteins. Protein synthesis occurs via the process of translation, wherein j f 
: . . : à to the Ribosomes for Translation? 

the instructions encoded in the sequence of bases in mRNA are translated into a specific 


i ; i : í 29.6 Can Gene Expression Be Regulated 
amino acid sequence by ribosomes, the workbenches of polypeptide synthesis. cai py 


Once the Transcript Has Been 


Transcription is tightly regulated in all cells. In prokaryotes, only 3% or so of the genes Synthesized? 
are undergoing transcription at any given time. The metabolic conditions and the growth 29.7 Can We Propose a Unified Theory 
status of the cell dictate which gene products are needed at any moment. Similarly, dif- of Gene Expression? 
ferentiated eukaryotic cells express only a small percentage of their genes in fulfilling 29.8 RNA Degradation by Exosomes 


their biological functions, not the full genetic potential encoded in their chromosomes. 


29.1| How Are Genes Transcribed in Bacteria? 


In prokaryotes, virtually all RNA is synthesized by a single species of DNA-dependent 
RNA polymerase. The only exception is the short RNA primers formed by primase 
during DNA replication. RNA polymerase uses ribonucleoside 5'-triphosphates (ATP, 
GTP, CTP, and UTP, represented generically as NTPs) as substrates, linking them in an 
order specified by base pairing with a DNA template: 
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A Deeper Look 


Conventions Used in Expressing the Sequences of Nucleic Acids and Proteins 


Certain conventions are useful in tracing the course of information 
transfer from DNA to protein. The strand of duplex DNA that 
is read by RNA polymerase is termed the template strand. Thus, 


is followed, DNA sequences are written in terms that correspond 
directly to mRNA sequences, which correspond in turn to the amino 
acid sequences of proteins as read beginning with the N-terminus. 


the strand that is not read is the nontemplate strand. Because the 


template strand is read by the RNA polymerase moving 3'—5' GclCATG, 

along it, the RNA product, called the transcript, grows in the 5’>3’ DNAS A NANN] | ii Nontemplate strand 
direction (see accompanying figure). Note that the nontemplate 3' Tadd tet LO Template strand 
strand has a nucleotide sequence and direction identical to those ce RNA polymerase 


of the RNA transcript, except that the transcript has U residues 
in place of T. Portions of the RNA transcript may eventually be 
translated into the amino acid sequence of a protein by a process 
in which successive triplets of bases (termed codons), read 5'—3', Y 
specify a particular amino acid. Polypeptide chains are synthesized 
in the N—C direction, and the 5'-end of mRNA encodes the 
N-terminus of the protein. 

By convention, when the order of nucleotides in DNA is shown 
as a single strand, it is the 5'—>3' sequence of nucleotides in the 
nontemplate strand that is presented. Consequently, if convention 


Transcription 


RNA transcript 5’ 


Translation 


Y 
Protein N-D- E-E-E- + + — Crerminal 


The enzyme moves along a DNA strand in the 3'—5' direction, selecting the appropriate 
NTP by base-pairing rules, and linking its 5’-phosphate to the 3’-OH of the previous 
residue. Thus, the RNA chain grows 5'—>3' during transcription, just as DNA chains do 
during replication. Subsequent hydrolysis of the PP; product by the pyrophosphatases 
present in all cells makes the polymerase reaction thermodynamically favorable. 


29.1a Bacterial RNA Polymerases Use Their Sigma Subunits to Identify 
Sites Where Transcription Begins 


Transcription is initiated in bacteria by RNA polymerase holoenzyme, a complex multi- 
meric protein (about 400 kDa) large enough to be visible in the electron microscope. Its 
subunit composition is a,8B'o. After two a-subunits (35 kDa each) dimerize, one recruits 
B and the other recruits B’ to form the clawlike core polymerase structure (Figure 29.1). 
The two largest subunits, 8B’ (171 kDa) and £ (124 kDa), perform most of the enzymatic 
functions. The B-subunit forms most of the upper jaw of the claw (cyan in Figure 29.1) and 
contains the catalytic Mg?*-binding site; B’ forms the lower jaw (yellow in Figure 29.1). 
DNA passes through a 2.7-nm channel between the jaws of the claw. Nucleotide sub- 
strates reach the catalytic center through a secondary channel entering on the back side of 
the enzyme. Binding of the o-subunit to B’ allows the RNA polymerase to recognize dif- 
ferent DNA sequences that act as promoters. A number of related promoter-recognizing 
proteins, the sigma (o) factors, can serve as the o-subunit. Promoters are nucleotide 
sequences that identify the location of transcription start sites, where transcription begins. 
Both 6 and B’ contribute to formation of the catalytic site for RNA synthesis. Dissocia- 
tion of the o-subunit from the holoenzyme leaves the core polymerase (a288'), which can 
transcribe DNA into RNA but is unable to recognize promoters and initiate transcrip- 
tion. Bacteriophage T7 expresses a simpler (monomeric) RNA polymerase (Figure 29.2) 
that shares the functional characteristics of prokaryotic RNA polymerases. 


29.1b The Process of Transcription Has Four Stages 


Transcription can be divided into four stages: (1) binding of RNA polymerase holo- 
enzyme at promoter sites, (2) initiation of polymerization, (3) chain elongation, and 
(4) chain termination. 
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Figure 29.1 Structure of the core RNA polymerase 
a2BB' (PDB id = 2051) from the eubacterium 
Thermus thermophilus. The template DNA strand 
is shown in green, the nontemplate DNA strand in 
blue, and the RNA transcript in hot pink. The two 
a chains are orange, the B chain is cyan, and the B' 
chain is yellow. The active-site Mg?* is shown as a 
red sphere. 


Binding of RNA Polymerase to Template DNA The process of transcription begins 
when the o-subunit of RNA polymerase recognizes a promoter sequence (Figure 29.3), 
and RNA polymerase holoenzyme and the promoter form a closed promoter complex 
(Figure 29.3, Step 2). At this stage, the RNA polymerase-DNA interaction is referred to 
as the closed promoter complex because the dsDNA has not yet been opened (unwound) 
so that the RNA polymerase can read the base sequence of the DNA template strand 
and transcribe it into a complementary RNA sequence. 

Once the closed promoter complex is established, the RNA polymerase holoen- 
zyme unwinds about 14 base pairs of DNA (base pairs located at positions —12 to +2, 
relative to the transcription start site), forming the very stable open promoter complex 
(Figure 29.3, Step 3). In this complex, the B’ subunit has closed around the DNA, 
forming a clamp. This clamp remains closed during transcription initiation and 
elongation. Promoter sequences can be identified in vitro by DNA footprinting (see 
a Deeper Look box on page 1037): RNA polymerase holoenzyme is bound to a puta- 
tive promoter sequence in a DNA duplex, and the DNA-protein complex is treated 
with DNase I. DNase I cleaves the DNA at sites not protected by bound protein, and Figure 29.2 Bacteriophage T7 RNA polymerase 
the set of DNA fragments left after DNase I digestion reveals the promoter (by defini- | (PDB id = IMSW) in the act of transcription. 
tion, the promoter is the RNA polymerase holoenzyme binding site). T7 RNA polymerase is a 99-kDa monomeric pro- 

RNA polymerase binding typically protects a nucleotide sequence spanning the tein: The DNA is showin entering the enzyme from 

i a i the right. The template strand is green, the nontem- 
region from —70 to +20, where the +1 position is defined as the transcription start site: plate strand is blue, the RNA transcript is hot pink. 
the base in the nontemplate DNA strand that is identical with the first base in the RNA 
transcript. The next base, +2, specifies the second base in the transcript. Nontemplate 
strand bases in the 5’, or minus, direction from the transcript start site are numbered 
—1, —2, and so on. (Note that there is no zero.) Nontemplate nucleotides in the minus 
direction are said to lie upstream of the transcription start site, whereas nucleotides in 
the 3’, or plus, direction are downstream of the transcription start site. The transcript 
start site on the template strand is usually a pyrimidine, so most transcripts begin with 
a purine. RNA polymerase binding protects 90 bp of DNA, equivalent to a distance of 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1036 Chapter 29 Transcription and the Regulation of Gene Expression 


© 


Recognition of promoter 5 
by o; binding of polymerase 3 
holoenzyme to DNA; 
migration to promoter 


DNA template 
Formation of an RNA 5 3! 
polymerase-closed promoter 3 5' 
complex 
Unwinding of DNA at p z 
promoter and formation 3' 5 


of open promoter complex 


Purine NTP 


RNA polymerase initiates 5! 
mRNA synthesis, usually 3! 
with a purine 


RNA polymerase holoenzyme- 5, 
catalyzed elongation of 3! 


mRNA by about 4 more 
nucleotides 


| 


VAJ Release of o-subunit as core 5’ 3" 
RNA polymerase proceeds 3 =a 5' 
down the template, elongating 
RNA transcript (PPP pra 

5 


Figure 29.3 Sequence of events in the initiation and elongation phases of transcription as it occurs 
in prokaryotes. Nucleotides in this region are numbered with reference to the base at the transcription 
start site, which is designated +1. 


30 nm along B-DNA. Because RNA polymerase is only 16 nm in its longest dimension, 
the DNA must be wrapped around the enzyme. 


Properties of Bacterial Promoters Promoters recognized by the principal o factor, 0”, 
serve as the paradigm for prokaryotic promoters. These promoters vary in size from 20 
to 200 bp but typically consist of a 40-bp region located on the 5’-side of the trans- 
cription start site. Within the promoter are two consensus sequence elements. A con- 
sensus sequence can be defined as the bases that appear with highest frequency at each 
position when a series of sequences believed to have common function is compared. The 
two promoter consensus sequence elements are the Pribnow box! near — 10, whose con- 
sensus sequence is the hexameric TATAAT, and a sequence in the —35 region containing 


'Named for David Pribnow, who, along with David Hogness, first recognized the importance of this 
sequence element in transcription. 
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A Deeper Look 


DNA Footprinting—ldentifying the Nucleotide 
Sequence in DNA Where a Protein Binds 


DNA footprinting is a widely used technique to DNA:protein complex Naked DNA 
identify the nucleotide sequence within DNA 
where a specific protein binds, such as the promoter 
sequence(s) bound by RNA polymerase holoen- 
zyme. In this technique, the protein is incubated 
with a labeled (*) DNA fragment containing the 
nucleotide sequence where the protein is believed 


to bind. (The DNA fragment is labeled at only (A arrows indicate DNase I 

one end.) Then, a DNA cleaving agent, such as cleavage sites) 

DNase I, is added to the solution containing the Pe Denature dsDNA: —_ 
DNA-protein complex. DNase I cleaves the DNA —— sets of labeled fragments — »———— 
backbone in exposed regions—that is, wherever n 

the presence of the DNA-binding protein does aa 

not prevent DNase I from binding. A control x 


sample containing only DNA, with no DNA- 
binding proteins, is also treated with DNase I. 


When these DNase I digests are analyzed by Gel 
gel electrophoresis, the set of labeled frag- electrophoresis 


ments from the DNA:protein complex show 
a different electrophoretic pattern than the la- 
beled fragments from the control DNA set. The 
absence of certain fragments in the digest of the 
DNA-protein complex reveals the location of 


the protein-binding site on the DNA (see accom- Protein- 
panying figure). binding 

site 
Reference 


Rhodes, D., and Fairall, L., 1997. Analysis of sequence- 
specific DNA-binding proteins. In Protein Function: 
A Practical Approach, Creighton, T. E., ed., Oxford: 
IRL Press at Oxford University Press. 


TTI ||| 


the hexameric consensus TTGACA (Figure 29.4). The Pribnow box and the —35 region 
are separated by about 17 bp of nonconserved sequence. RNA polymerase holoenzyme 
uses its o-subunit to bind to the conserved sequences, and the more closely the —35 
region sequence corresponds to its consensus sequence, the greater is the efficiency of 
transcription of the gene. The highly expressed rrn genes in E. coli that encode rRNA 
have a third sequence element in their promoters, the upstream element (UP element), 
located about 20 bp immediately upstream of the —35 region. (Transcription from the 
rrn genes accounts for more than 60% of total RNA synthesis in rapidly growing E. coli 
cells.) The o-subunit recognizes the — 10 and —35 elements, but the C-terminal domains 
(CTD) of the a-subunits of RNA polymerase recognize and bind the UP element. 

In order for transcription to begin, the DNA duplex must be opened so that RNA 
polymerase has access to single-stranded template. The efficiency of initiation is inversely 
proportional to the melting temperature, Tm, in the Pribnow box, suggesting that the 
A:T-rich nature of this region is aptly suited for easy melting of the DNA duplex and 
creation of the open promoter complex (see Figure 29.3). Negative supercoiling facili- 
tates transcription initiation by favoring DNA unwinding. 

The RNA polymerase o-subunit is directly involved in melting the dsDNA. Inter- 
action of the o-subunit with the nontemplate strand maintains the open complex 
formed between RNA polymerase and promoter DNA, with the o-subunit acting as a 
sequence-specific single-stranded DNA-binding protein. Association of the o-subunit 
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Gene -35 region Pribnow box Initiation 
(-10 region) site (+1) 
araBAD =GGATCCT TTATCGCAACTCTC TTCTCCATACCCGTTTTT 
araC GCCGTGAT TGTTACGCGTTTT GGCTTTGGTCCCGCTTTG 
bioA TTCCAAAACG TTAATTCGGTG TGTAAINCCTAAATCTTTT 
bioB CATAATCG AATTGAAAAGATT TACAAGTCMACACCGAAT 
galP2 ATTTATTCC TCGCATCTTTGT ATGGTTATTTCATACCAT 
lac ACCCCAG ATGCTTCCGGCTCG GTGTG ATTGTGAGCGG 
lacl CCATCGAA TTTCGCGGTATGG AGCGCCCGGAAGAGAGTC 
rrnA1 AAAATAAAT AGCGGGAAGGCG CACACC GCGCCGCTG 
rrnD1 CAAAAAAAT AATTGGGATCCC GCGCCTCCGTTGAGACGA 
rrnE1 CAATTTTTC CGGAGAACTCCC GCGCCTCCATCGACACGG 
RNAS" CAACGTAACG CGCGTCATTTGA GCGCCCCGCTTCCCGATA 
trp AAATGAGC ATCATCGAACTAG AGTACGCAAGTTCACGTA 
Initiation 

-35 region Pribnow box site 
Consensus 
See ---[11-15 bp] - TS -[5-8 bp]--- 


42 38 82 84 79 64 53 45 41 


79 95 44 59 51 96 


Figure 29.4 The nucleotide sequences of representative E. coli promoters. (In accordance with conven- 
tion, these sequences are those of the nontemplate strand where RNA polymerase binds.) Consensus 
sequences for the —35 region, the Pribnow box, and the initiation site are shown at the bottom. The 
numbers represent the percent occurrence of the indicated base. (Note: The —35 region is only roughly 
35 nucleotides from the transcription start site; the Pribnow box [the —10 region] likewise is located 

at approximately position —10.) In this figure, sequences are aligned relative to the Pribnow box. The 
purple-shaded box at the right shows that in 51% of the genes, A is at the +1 site and G is there in 42%. 
Further, C is at the —1 position in 55% of the genes and T is at the +2 position in 48% of the genes. 


with the nontemplate strand stabilizes the open promoter complex and leaves the bases 
along the template strand available to the catalytic site of the RNA polymerase. 


Initiation of Polymerization Unlike DNA polymerase, RNA polymerase does not 
require a primer: It can catalyze the de novo synthesis of polynucleotides. RNA poly- 
merase has two binding sites for NTPs: the initiation site and the elongation site. The 
first nucleotide binds at the initiation site, base-pairing with the +1 base exposed within 
the open promoter complex (see Figure 29.3, Step 4). The second incoming nucleotide 
binds at the elongation site, base-pairing with the +2 base. The ribonucleotides are then 
covalently linked when the 3’-O of the first nucleotide makes a phosphoester bond with 
the a-phosphorus atom of the second nucleotide, and PP; is eliminated. Note that the 
5'-end of the transcript starts out with a triphosphate attached to it. Movement of RNA 
polymerase along the template strand (translocation) to the next base prepares the RNA 
polymerase to add the next nucleotide (see Figure 29.3, Step 5). Once an oligonucle- 
otide 9 to 12 residues long has been formed, the o-subunit dissociates from RNA poly- 
merase, signaling the completion of initiation (see Figure 29.3, Step 6). The core RNA 
polymerase is highly processive and goes on to synthesize the remainder of the mRNA. 
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As the core RNA polymerase progresses, advancing the 3'-end of the RNA chain, the 
DNA duplex is unwound just ahead of it. About 12 base pairs of the growing RNA 
remain base-paired to the DNA template at any time, with the RNA strand becoming 
displaced as the DNA duplex rewinds behind the advancing RNA polymerase. 


Chain Elongation Once a short oligonucleotide chain has been synthesized, the o-subunit 
dissociates, and elongation of the RNA transcript is catalyzed by the core polymerase that 
remains. The accuracy of transcription is such that about once every 10* nucleotides, an 
error is made and the wrong base is inserted. Because many transcripts are made per gene 
and most transcripts are smaller than 10 kb, this error rate is acceptable. 

Two possibilities can be envisioned for the path of the growing RNA chain. In one, the 
RNA chain is wrapped around the DNA as the RNA polymerase follows the template 
strand around the axis of the DNA duplex, but this possibility seems unlikely due to its 
potential for tangling the nucleic acid strands (Figure 29.5a). In reality, transcription involves 
supercoiling of the DNA, so positive supercoils are created ahead of the transcription bubble 
and negative supercoils are created behind it (Figure 29.5b). To prevent torsional stress from 
inhibiting transcription, gyrases introduce negative supercoils (and thereby remove positive 
supercoils) ahead of RNA polymerase. To compensate, topoisomerases remove negative 
supercoils behind the DNA segment undergoing transcription (Figure 29.5b). 


Chain Termination Two types of transcription termination mechanisms operate in 
bacteria: one that is dependent on a specific ring-shaped motor protein called Rho 
termination factor (for the Greek symbol, p) and another, intrinsic termination, that is 
Rho-independent. Intrinsic termination is determined by specific sequences in the DNA 
called termination sites. These sites are not indicated by particular bases showing where 
transcription halts. Instead, they consist of three structural features whose base-pairing 
possibilities lead to termination: 


1. Inverted repeats, which are typically G:C-rich, so a stable stem-loop structure can 
form in the transcript via intrachain base-pairing (Figure 29.6) 

2. A nonrepeating segment that punctuates the inverted repeats 

3. Arun of six to eight As in the DNA template, coding for Us in the transcript 


Termination then occurs as follows: A G:C-rich stem-loop structure, or hairpin, forms 
in the transcript. The hairpin apparently causes the RNA polymerase to pause, where- 
upon the A:U base pairs between the transcript and the DNA template strand are 


polymerase 


(b) Topoisomerase 
removing negative 
supercoils 


polymerase 


Figure 29.5 Supercoiling versus transcription. (a) If the RNA polymerase followed the template 
strand around the axis of the DNA duplex, no supercoiling of the DNA would occur but the RNA 
chain would be wrapped around the double helix once every 10 bp. This possibility seems unlikely 
because it would be difficult to untangle the transcript from the DNA duplex. (b) Alternatively, gyrases 
and topoisomerases could remove the torsional stresses induced by transcription. 


How Are Genes Transcribed in Bacteria? 


Gyrase introducing 
negative supercoils 
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mRNA terminus 


Direction of transcription 


Inverted repeat Inverted repeat 
ee Transcription 
DNA $, 
a ay 
G-C rich A-T rich 
terminus 
Template Last base 
strand transcribed 

Figure 29.6 The termination site for the E. coli trp operon (the trp operon encodes the enzymes of 

tryptophan biosynthesis). The inverted repeats give rise to a stem-loop, or hairpin, structure ending in 

a series of U residues. 

displaced through formation of somewhat more stable A:T base pairs between the 

template and nontemplate strands of the DNA. The result is spontaneous dissociation 

of the nascent transcript from DNA. 

The alternative mechanism of termination—Rho factor-dependent termination— 
is less common and mechanistically more complex. Rho factor is an ATP-dependent 
hexameric helicase (50-kDa subunits) that catalyzes the unwinding of RNA:DNA 
hybrid duplexes (or RNA:RNA duplexes). The rho factor recognizes and binds to 
C-rich regions in the RNA transcript. These regions must be unoccupied by translating 
ribosomes for rho factor to bind. Once bound, rho factor advances in the 5’—>3’ 
direction until it reaches an RNA polymerase stalled at a Rho-dependent termination 
site (a genomic site downstream from a C>G bubble, defined as a region rich in C 
residues and poor in G residues) (Figure 29.7). There, Rho mediates the release of the 
RNA transcript and the dissociation of RNA polymerase from the DNA. 

(a) 

p factor mRNA RNA 

Figure 29.7 The Rho factor mecha- 
nism of transcription termination. 
Rho factor (a) attaches to a recogni- 
tion site on mRNA and (b) moves 
along it behind RNA polymerase. 
(c) When RNA polymerase pauses 
at the termination site, Rho factor 
unwinds the DNA:RNA hybrid in €3 
the transcription bubble, (d) releasing 
the nascent mRNA. mRNA 
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Figure 29.8 The general organization of operons. Operons consist of transcriptional control 
regions and a set of related structural genes, all organized in a contiguous linear array along the 
chromosome. The transcriptional control regions are the promoter and the operator, which lie 
next to, or overlap, each other, upstream from the structural genes they control. Operators may 
lie at various positions relative to the promoter, either upstream or downstream. Expression of 
the operon is determined by access of RNA polymerase to the promoter, and occupancy of the 
operator by regulatory proteins influences this access. Induction activates transcription from the 
promoter; repression prevents it. 


29.2 | How Is Transcription Regulated in Bacteria? 


In bacteria, genes encoding the enzymes of a particular metabolic pathway are often grouped 
adjacent to one another in a cluster on the chromosome. This pattern of organization allows 
all of the genes in the group to be expressed in a coordinated fashion through transcription 
into a single polycistronic mRNA encoding all the enzymes of the metabolic pathway. A poly- 
cistronic mRNA is a single RNA transcript that encodes more than one polypeptide. Cistron 
is a genetic term for a DNA region representing a protein: Cistron and gene are essentially 
equivalent terms. 

A regulatory sequence lying adjacent to the DNA being transcribed determines 
whether transcription takes place. This sequence is termed the operator (Figure 29.8). 
The operator is located next to a promoter. Interaction of a regulatory protein with the 
operator controls transcription of the gene cluster by controlling access of RNA poly- 
merase to the promoter. (Although this is the paradigm for bacterial gene regulation, it 
must be emphasized that many bacterial genes do not contain operators and are regu- 
lated in ways that do not involve protein; operator interactions.) Such co-expressed 
gene clusters, together with the operator and promoter sequences that control their 
transcription, are called operons. 


29.2a Transcription of Operons Is Controlled by 
Induction and Repression 


In bacteria, gene expression is often responsive to small molecules serving as signals of 
the nutritional or environmental conditions confronting the cell. Increased synthesis of 
enzymes in response to the presence of a particular substrate is termed induction. For 
example, lactose (Figure 29.9) can serve as both carbon and energy source for E. coli. 
Metabolism of lactose depends on hydrolysis into its component sugars, glucose and 
galactose, by the enzyme B-galactosidase. In the absence of lactose, E. coli cells contain 
very little B-galactosidase (less than 5 molecules per cell). However, lactose availability 
induces the synthesis of B-galactosidase by activating transcription of the lac operon. 
One of the genes in the Jac operon, lacZ, is the structural gene for B-galactosidase. 
When its synthesis is fully induced, 6-galactosidase can amount to almost 10% of the 
total soluble protein in E. coli. When lactose is removed from the culture, synthesis of 
B-galactosidase halts. 

The alternative to induction—namely decreased synthesis of enzymes in response to 
a specific metabolite—is termed repression. For example, the enzymes of tryptophan 
biosynthesis in Æ. coli are encoded in the trp operon. If sufficient Trp is available to the 
growing bacterial culture, the trp operon is not transcribed, so the Trp biosynthetic 
enzymes are not made; that is, their synthesis is repressed. Repression of the trp operon 


3 
EE] 
A 
Lactose 
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CHOH 
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Figure 29.9 The structure of lactose, a 
B-galactoside. 
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Isopropyl 8-thiogalactoside (IPTG) 


CHOH CH 
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Figure 29.10 The structure of IPTG (isopropyl 
B-thiogalactoside), a gratuitous inducer of the lac 
operon. 


in the presence of Trp is an eminently logical control mechanism: If the end product of 
the pathway is present, why waste cellular resources making unneeded enzymes? 

Induction and repression are two faces of the same phenomenon. In induction, a 
substrate activates enzyme synthesis. Substrates capable of activating synthesis of the 
enzymes that metabolize them are called co-inducers, or, often, simply inducers. Some 
substrate analogs can induce enzyme synthesis even though the enzymes are incapable 
of metabolizing them. These analogs are called gratuitous inducers. A number of thio- 
galactosides, such as IPTG (isopropyl B-thiogalactoside, Figure 29.10), are excellent 
gratuitous inducers of B-galactosidase activity in E. coli. In repression, a metabolite, 
typically an end product, depresses synthesis of its own biosynthetic enzymes. Such 
metabolites are called co-repressors. 


29.2b The lac Operon Is a Paradigm of Operons 


In 1961, François Jacob and Jacques Monod proposed the operon hypothesis to 
account for the coordinate regulation of related metabolic enzymes. The operon was 
considered to be the unit of gene expression, consisting of two classes of genes: the 
structural genes for the enzymes and regulatory genes that control expression of the 
structural genes. The two kinds of genes could be distinguished by mutation. Muta- 
tions in a structural gene would abolish one particular enzymatic activity, but muta- 
tions in a regulatory gene would affect all of the different enzymes under its control. 
Mutations of both kinds were known in Æ. coli for lactose metabolism. Bacteria with 
mutations in either the /JacZ gene or the /acY gene (Figure 29.11) could no longer 
metabolize lactose—the lacZ mutants (/acZ~ strains) because B-galactosidase activ- 
ity was absent, the lac Y mutants because lactose was no longer transported into the 
cell. Other mutations defined another gene, the /acI gene. /acI mutants were differ- 
ent because they both expressed B-galactosidase activity and immediately trans- 
ported lactose, without prior exposure to an inducer. That is, a single mutation led to 
the expression of lactose metabolic functions independently of inducer. Expression of 
genes independently of regulation is termed constitutive expression. Thus, /ac/ had the 
properties of a regulatory gene. The lac operon includes the regulatory gene /ac/; its 
promoter p; and three structural genes, lacZ, lac Y, and lacA, with their own promoter 
Piac and operator O (Figure 29.11). 


Polypeptide Amino acids 360 1023 417 203 

kDa 38.6 116.4 46.5 2257 

Protein Structure Tetramer Tetramer Membrane Dimer 
protein 

kDa 154.4 465 46.5 45.4 

Funcuon Repressor B-Galactosidase Permease pane 


acetylase 


Figure 29.11 The Jac operon. The operon consists of two transcription units. In one unit, there 
are three structural genes, lacZ, lacY, and /acA, under control of the promoter, Puo and the 
operator O. In the other unit, there is a regulator gene, /acl, with its own promoter, Piac. lacI 
encodes a 360-residue, 38.6-kDa polypeptide that forms a tetrameric lac repressor protein. lacZ 
encodes B-galactosidase, a tetrameric enzyme of 116-kDa subunits. lac Y is the B-galactoside 
permease structural gene, a 46.5-kDa integral membrane protein active in B-galactoside transport 
into the cell. The remaining structural gene encodes a lactose transacetylase, whose function 
might allow excess lactose to be transported out of the cell by a multidrug resistance permease 
that exports acetylated substrates. 
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29.2c lac Repressor Is a Negative Regulator of the lac Operon 


The structural genes of the /ac operon are controlled by negative regulation. That is, they 
are transcribed to give an mRNA unless turned off by the /acI gene product. This gene 
product is the lac repressor, a tetrameric protein (Figure 29.12). Unlike inducers and co- 
repressors, which are small molecules/metabolites, repressors are proteins. The /ac repres- 
sor has an N-terminal DNA-binding domain; the rest of the protein functions in inducer 
binding and tetramer formation. In the absence of an inducer, lac repressor blocks lac 
gene expression by binding to the operator DNA site upstream from the /ac structural 
genes. The Jac operator is a palindromic DNA sequence (Figure 29.13). Palindromes, 
or inverted repeats, provide a twofold, or dyad, symmetry, a structural feature common 
at sites in DNA where proteins specifically bind. Despite the presence of lac repressor, 
RNA polymerase can still initiate transcription at the promoter (Puc), but lac repres- 
sor blocks elongation of transcription, so initiation is aborted. In /acI mutants, the lac 
repressor is absent or defective in binding to operator DNA, lac gene transcription is 


How Is Transcription Regulated in Bacteria? 
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Figure 29.12 The mode of action of lac 
repressor. The structure of the lac repressor 
tetramer with bound IPTG (purple) is also 
shown (PDB id = 1LBH). 
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Repressor for DNA* 


DNA Repressor 

lac operator 2x 108 M~! 
All other DNA 2 x 10° M7! 
Specificity? 107 


*Values for repressor: DNA binding are given as association 


The Affinity of lac 


Repressor + 
Inducer 


2x 10 M~! 
2x 106 M~! 
104 


constants, Ka, for the formation of DNA: repressor com- 
plex from DNA and repressor. 
Specificity is defined as the ratio (K; for repressor binding 
to operator DNA)/(Ka for repressor binding to random 


DN 


A). 


(a) DNA 
Axis of = 
AAT TT 
(b) Protected bases TT ATA T fe I au Was 
| | | | | | | | 
-10 -5 +1 +5 +10 +15 +20 +25 


La mRNA transcription 


Figure 29.13 The nucleotide sequence of the lac operator. (a) This sequence comprises 36 bp showing 
nearly palindromic symmetry. The inverted repeats that constitute this approximate twofold symmetry 
are shaded in rose. (b) Position of the /ac operator, relative to the transcription start site. The bases are 
numbered relative to the +1 start site for transcription. The G:C base pair at position +11 represents 
the axis of symmetry. In vitro studies show that bound /ac repressor protects a 26-bp region from —5 
to +21 against nuclease digestion. Bases that interact with bound /ac repressor are indicated below the 
operator. Note the symmetry of protection at +1 through +4 TTAA to +18 through +21 AATT. 


not blocked, and the lac operon is constitutively expressed. Note that /ac/ is normally 
expressed constitutively from its promoter, so lac repressor protein is always available to 
fill its regulatory role. About 10 molecules of lac repressor are present in an E. coli cell. 

Derepression of the /ac operon occurs when appropriate B-galactosides occupy the 
inducer site on lac repressor, causing a conformational change in the protein that lowers 
the repressor’s affinity for operator DNA. As a tetramer, lac repressor has four inducer 
binding sites, and its response to inducer shows cooperative allosteric effects. Thus, as a 
consequence of the inducer-induced conformational change, the inducer: lac repres- 
sor complex dissociates from the DNA, and RNA polymerase transcribes the structural 
genes (see Figure 29.12). Induction reverses rapidly: Jac mRNA has a half-life of only 
3 minutes, and once the inducer is used up through metabolism by the enzymes, free lac 
repressor reassociates with the operator DNA, transcription of the operon is halted, 
and any residual lac mRNA is degraded. 

In the absence of inducer, /ac repressor binds nonspecifically to duplex DNA with 
an association constant, Ka, of 2 X 10° M`! (Table 29.1) and to the lac operator DNA 
sequence with much higher affinity, Ka = 2 X 1013 M~!. Thus, lac repressor binds 
10’ times better to Jac operator DNA than to any random DNA sequence. IPTG binds to 
lac repressor with an association constant of about 10° M~!. The IPTG: lac repressor com- 
plex binds to operator DNA with an association constant, Ką = 2 X 10!° M~!. Although 
this affinity is high, it is three orders of magnitude /ess than the affinity of inducer-free 
repressor for lac operator. There is no difference in the affinity of free lac repressor and lac 
repressor with IPTG bound for nonoperator DNA. The /ac repressor apparently acts by 
binding to DNA and sliding along it, testing sequences in a one-dimensional search until 
it finds the /ac operator. The /ac repressor then binds there with high affinity until inducer 
causes this affinity to drop by three orders of magnitude (Table 29.1). 


29.2d CAP Is a Positive Regulator of the lac Operon 


Transcription by RNA polymerase from some promoters proceeds with low efficiency 
unless assisted by an accessory protein that acts as a positive regulator. One such pro- 
tein is CAP, or catabolite activator protein. Its name derives from the phenomenon of 
catabolite repression in E. coli. Catabolite repression is a global control that coordi- 
nates gene expression with the total physiological state of the cell: As long as glucose is 
available, E. coli catabolizes it in preference to any other energy source, such as lactose 
or galactose. Catabolite repression ensures that the operons necessary for metabolism 
of these alternative energy sources, that is, the /ac and gal operons, remain repressed 
until the supply of glucose is exhausted. Catabolite repression overrides the influence 
of any inducers that might be present. 

Catabolite repression is maintained until the E. coli cells become starved of glu- 
cose. Glucose starvation leads to activation of adenylyl cyclase, and the cells begin to 
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Quantitative Evaluation of lac Repressor: DNA Interactions 


The affinity of lac repressor for random DNA ensures that essen- 
tially all repressor is DNA bound. Assume that E. coli DNA has a 
single specific lac operator site for repressor binding and 4.64 x 10° 
base pairs. Assume also that any nucleotide sequence even one base 
out of phase with the operator constitutes a nonspecific binding site. 
Thus, there are 4.64 X 10° nonspecific sites for repressor binding. 

The binding of repressor to DNA is given by the association 
constant, Ka: 


[repressor: DNA] 


[repressor][ DNA] 


where [repressor: DNA] is the concentration of repressor: DNA 
complex, [repressor] is the concentration of free repressor, and 
[DNA] is the concentration of nonspecific binding sites. Rearrang- 
ing gives the following: 


[repressor] 1 
[repressor: DNA] K,[DNA] 


If the number of nonspecific binding sites is 4.64 X 10°, there are 
(4.64 X 10°)/(6.022 x 1073) = 0.77 X 107!7 moles of binding sites 
contained in the volume of a bacterial cell (roughly 10~} liters). 
Therefore, [DNA] = (0.77 X 10710/10715) = 0.77 X 107? M. Since 
K, =2 X 10° M`! (Table 29.1), 


[repressor] 1 1 
[repressor: DNA] (2 X 10°)(0.77 X 10°) (1.54 X 104) 


So, the ratio of free repressor to DNA-bound repressor is 6.5 X 1075. 
Less than 0.01% of Jac repressor is not bound to DNA, or, stated 
another way, more than 99.99% of repressor is bound to DNA! 
The behavior of lac repressor is characteristic of DNA-binding 
proteins. These proteins bind with low affinity to random DNA 
sequences, but with much higher affinity to their unique target sites 
(Table 29.1). 


make cAMP. (In contrast, glucose uptake is accompanied by deactivation of adenylyl 
cyclase.) The action of CAP as a positive regulator is cAMP-dependent. cAMP is 
a small-molecule inducer for CAP, and cAMP binding enhances CAP’s affinity for 


Q 


Inactive CAP 


DNA. CAP, also referred to as CRP (for cAMP receptor protein), is a dimer of iden- 
tical 22.5-kDa polypeptides. The N-terminal domains bind cAMP; the C-terminal 


domains constitute the DNA-binding site. Two molecules of cAMP are bound per 


Glucose —> — [cAMP] | 


dimer. The CAP-(cAMP), complex binds to specific target sites near the promoters of 


operons (Figure 29.14). Binding of CAP-(cAMP), to DNA causes the DNA to bend 
more than 80° (Figure 29.15). This CAP-induced DNA bending near the promoter 


Figure 29.15 Binding of CAP-(cAMP), induces a severe bend in DNA about the center of dyad sym- 
metry at the CAP-binding site. The CAP dimer with two molecules of cAMP bound interacts with 

27 to 30 base pairs of duplex DNA. Two a-helices of the CAP dimer insert into the major groove 

of the DNA at the dyad-symmetric CAP-binding site. The two cAMP molecules bound by the CAP 
dimer are indicated in yellow. Binding of CAP-(cAMP), to its specific DNA site involves H bonding 
and ionic interactions between protein functional groups and DNA phosphates, as well as H-bonding 
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Figure 29.14 The mechanism of catabolite repres- 
sion and CAP action. Glucose instigates catabolite 
repression by lowering cAMP levels. cAMP is neces- 
sary for CAP binding near promoters of operons 
whose gene products are involved in the metabolism 
of alternative energy sources such as lactose, galac- 
tose, and arabinose. The binding sites for the CAP- 
(cAMP), complex are consensus DNA sequences 
containing the conserved pentamer TGTGA and a 
less well conserved inverted repeat, TCANA (where 
N is any nucleotide). 


interactions in the DNA major groove between amino acid side chains of CAP and DNA base pairs 


(PDB id = 1CGP). 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1046 


Chapter 29 Transcription and the Regulation of Gene Expression 


Induction 


Repression 


assists RNA polymerase holoenzyme binding and closed promoter complex forma- 
tion. Contacts made between the CAP-(cAMP), complex and the a-subunit of RNA 
polymerase holoenzyme activate transcription. 


29.2e Negative and Positive Control Systems 
Are Fundamentally Different 


Negative and positive control systems operate in fundamentally different ways 
(although in some instances both govern the expression of the same gene). Genes under 
negative control are transcribed unless they are turned off by the presence of a repres- 
sor protein. Often, transcription activation is merely the release from negative control. 
In contrast, genes under positive control are expressed only if an active regulator pro- 
tein is present. The /ac operon illustrates these differences. The action of /ac repressor is 
negative. It binds to operator DNA and blocks transcription; expression of the operon 
occurs only when this negative control is lifted through the release of the repressor. In 
contrast, regulation of the lac operon by CAP is positive: Transcription of the operon 
by RNA polymerase is stimulated by CAP’s action as a positive regulator. 

Operons can also be classified as inducible, repressible, or both, depending on how 
they respond to the small molecules that mediate their expression. Repressible oper- 
ons are expressed only in the absence of their co-repressors. Inducible operons are 
transcribed only in the presence of small-molecule co-inducers (Figure 29.16). 


29.2f The araBAD Operon Is Both Positively 

and Negatively Controlled by AraC 
E. coli can use the plant pentose l-arabinose as sole source of carbon and energy. Arabi- 
nose is metabolized via conversion to D-xylulose-5-P (a pentose phosphate pathway 
intermediate and transketolase substrate [see Chapter 22]) by three enzymes encoded 
in the araBAD operon. Transcription of this operon is regulated by both catabolite 
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Figure 29.16 Control circuits governing the expression of genes. These circuits can be either negative 
or positive, inducible or repressible. 
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repression and arabinose-mediated induction. CAP functions in catabolite repression; 
arabinose induction is achieved via the product of the araC gene, which lies next to 
the araBAD operon on the E. coli chromosome. The araC gene product, the protein 
AraC, is a 292-residue protein consisting of an N-terminal domain (residues | to 170) that 
binds arabinose and acts as a dimerization motif and a C-terminal (residues 178 to 292) 
DNA-binding domain. (Proteins are often named for the genes encoding them. By con- 
vention, the name of the protein is capitalized but not italicized.) Regulation of araBAD 
by AraC is novel in that it acts both negatively and positively. The ara operon has three 
binding sites for AraC: araQ,, located at nucleotides —106 to — 144 relative to the araBAD 
transcription start site; araO, (spanning positions —265 to —294); and aral, the araBAD 
promoter. The aral site consists of two half-sites; aral, (nucleotides —56 to —78) and arah 
(—35 to — 51). (The araO, site contributes minimally to ara operon regulation.) 

The details of araBAD regulation are as follows: When AraC protein levels are low, 
the araC gene is transcribed from its promoter p, (adjacent to araO,) by RNA poly- 
merase (Figure 29.17). araC is transcribed in the direction away from araBAD. When 
cAMP levels are low and arabinose is absent, an AraC protein dimer binds to two sites, 
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Figure 29.17 Regulation of the araBAD operon by the combined action of CAP and AraC protein. 
(a) The organization of the araBCD operon, the proteins it encodes, and their roles in the pathway of 
arabinose metabolism. (b) In the absence of L-arabinose, araC monomers bound at aral and araO, 
dimerize, creating a loop in the araBCD operon that restricts transcription. (c) When L-arabinose is 
present and [cAMP] is high, L-arabinose binds to araC, causing release of araOQz, opening of the DNA 
loop and association of araC with araL,. CAP-(cAMP), binds to the CAP site located between araO, 
and aral. Protein-protein interactions between araC/L-arabinose, CAP-(cAMP)», and RNA poly- 
merase result in transcription of the araBCD operon and metabolism of arabinose. 
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araO) and the aral, half-site, forming a DNA loop between them and restricting trans- 
cription of araBAD (Figure 29.17). In the presence of L-arabinose, the monomer of 
AraC bound to the araQ; site is released from that site; it then associates with the unoc- 
cupied aral half-site, arah. L-Arabinose thus behaves as an allosteric effector that alters 
the conformation of AraC. In the arabinose-liganded conformation, the AraC dimer 
interacts with CAP-(cAMP), to activate transcription by RNA polymerase. Thus, 
AraC protein is both a repressor and an activator. 

Positive control of the araBAD operon occurs in the presence of L-arabinose and 
cAMP. Arabinose binding by AraC protein causes the release of araO , opening of the 
DNA loop, and association of AraC with arah. CAP-(cAMP), binds at a site between 
araO, and aral, and together the AraC-(arabinose), and CAP-(cAMP), complexes influ- 
ence RNA polymerase through protein: protein interactions to create an active transcrip- 
tion initiation complex. Supercoiling-induced DNA looping may promote protein: protein 
interactions between DNA-binding proteins by bringing them into juxtaposition. 


29.2g The trp Operon Is Regulated Through a Co-Repressor—Mediated 
Negative Control Circuit 


The trp operon of E. coli encodes the five polypeptides, trpE through trpA 
(Figure 29.18), that assemble into the three enzymes catalyzing tryptophan synthesis 
from chorismate (Chapter 25). Expression of the trp operon is under the control of Trp 
repressor, a dimer of 108-residue polypeptide chains. When tryptophan is plentiful, Trp 
repressor binds two molecules of tryptophan and associates with the trp operator that is 
located within the trp promoter. Trp repressor binding excludes RNA polymerase from 
the promoter, preventing transcription of the trp operon. When Trp becomes limiting, 
repression is lifted because Trp repressor lacking bound Trp (Trp apo-repressor) has a 
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Figure 29.18 The ¢rp operon of E. coli. The trp operon is transcribed as a single polycistronic mRNA 
encoding the leader peptide (trpL), anthranilate synthase component I (trpE), anthranilate synthase 
component II (trpD), the bifunctional N-(5’-phosphoribosy] anthranilate isomerase/indole-3-glycerol 
phosphate synthase (trpC), tryptophan synthase B-subunit (¢rpB), and tryptophan synthase a-subunit 
(trpA). 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 


Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


29.2 How Is Transcription Regulated in Bacteria? 


lowered affinity for the trp promoter. Thus, the behavior of Trp repressor corresponds 
to a co-repressor—mediated, negative control circuit (see Figure 29.16). Trp repressor 
not only is encoded by the trpR operon but also regulates expression of the trpR 
operon. This is an example of autogenous regulation (autoregulation): regulation of 
gene expression by the product of the gene. 


29.2h Attenuation Is a Prokaryotic Mechanism for Post-Transcriptional 
Regulation of Gene Expression 


In addition to repression, expression of the trp operon is controlled by transcription 
attenuation. Unlike the mechanisms discussed thus far, attenuation regulates transcrip- 
tion after it has begun. Charles Yanofsky, the discoverer of this phenomenon, defined 
attenuation as any regulatory mechanism that manipulates transcription termination or 
transcription pausing to regulate gene transcription downstream. In prokaryotes, trans- 
cription and translation are coupled (see Figure 10.19), and the translating ribosome 
is affected by the formation and persistence of secondary structures in the mRNA. In 
many operons encoding enzymes of amino acid biosynthesis, a transcribed 150- to 300- 
bp leader region is positioned between the promoter and the first major structural gene. 
Such regions encode a short leader peptide containing multiple codons for the pertinent 
amino acid. For example, the leader peptide of the leu operon has four leucine codons, 
the trp operon has two tandem tryptophan codons, and so forth (Figure 29.19). Transla- 
tion of these codons depends on an adequate supply of the relevant aminoacyl-tRNA, 
which in turn rests on the availability of the amino acid. When tryptophan is scarce, the 
entire trp operon from ¢rpL to trpA is transcribed to give a polycistronic mRNA. But, 
as [Trp] increases, more and more of the trp transcripts consist of only a 140-nucleo- 
tide fragment corresponding to the 5'-end of trpL. Tryptophan availability is causing 
premature termination of trp transcription, that is, transcription attenuation. Although 
attenuation occurs when tryptophan is abundant, attenuation is blocked when levels of 
tryptophan are low and little tryptophanyl-tR NA is available. The secondary structure 
of the 160-bp leader region transcript is the principal control element in transcrip- 
tion attenuation (Figure 29.20). This RNA segment includes the coding region for the 
14-residue leader peptide. Three critical base-paired hairpins can form in this RNA: the 
1-2 pause structure, the 3-4 terminator, and the 2-3 antiterminator. The 1-2 pause, 3-4 
terminator, and the 2-3 antiterminator represent mutually exclusive alternatives. A sig- 
nificant feature of this coding region is the tandem UGG tryptophan codons. 
Transcription of the trp operon by RNA polymerase begins and progresses until 
position 92 is reached, whereupon the | :2 hairpin is formed, causing RNA polymerase 
to pause in its elongation cycle. While RNA polymerase is paused, a ribosome begins 
to translate the leader region of the transcript. Translation by the ribosome releases 
the paused RNA polymerase, and transcription continues, with RNA polymerase and 
the ribosome moving in unison. As long as tryptophan is plentiful enough that 
tryptophanyl-tRNA"®? is not limiting, the ribosome is not delayed at the two tryp- 
tophan codons and follows closely behind RNA polymerase, translating the message 


Operon Amino acid Sequence 

his Met — Thr — Arg— Val — Gln —Phe — Lys — His —His —His— His — His—His—His— Pro—Asp 

ilu Met —Thr — Ala— Leu —Leu — Arg—Val—Ile— Ser -Leu —Val—Val—Ile— Ser—Val —Val—Val—Ile — Ile —Ile —Pro—Pro—Cys—Gly—Ala—Ala —Leu— Gly—Arg— Gly— Lys —Ala 
leu Met —Ser— His—Ile—Val— Arg— Phe— Thr — Gly —Leu— Leu— Leu—Leu— Asn—Ala—Phe —Ile —Val —Arg—Gly—Arg—Pro —Val— Gly— Gly —Ile —Gln—His 

pheA Met —Lys — His —Ile — Pro ~Phe — Phe— Phe —Ala —Phe — Phe— Phe —Thr—Phe Pro 

thr Met —Lys — Arg—Ile — Ser—Thr—Thr — le —Thr—Thr—Thr— Ile —Thr—Ile —Thr —Thr — Gln — Asn —Gly —Ala — Gly 

trp Met —Lys —Ala—Ile — Phe —Val—Leu — Lys —Gly —Trp — Trp — Arg —Thr —Ser 


Figure 29.19 Amino acid sequences of leader peptides in various amino acid biosynthetic operons 
regulated by attenuation. Color indicates amino acids synthesized in the pathway catalyzed by the 
operon’s gene products. (The ilv operon encodes enzymes of isoleucine, leucine, and valine biosynthesis.) 
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Figure 29.20 Alternative 

secondary structures for the 
leader region (trpL mRNA) 
of the trp operon transcript. 
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soon after it is transcribed. The presence of the ribosome atop segment 2 blocks 
formation of the 2-3 antiterminator hairpin, allowing the alternative 3-4 terminator 
hairpin to form (Figure 29.21). Stable hairpin structures followed by a run of Us are 
features typical of Rho-independent transcription termination signals, so the RNA 
polymerase perceives this hairpin as a transcription stop signal and transcription is 
terminated at this point. On the other hand, limiting levels of tryptophan and, hence, 
low availability of tryptophanyl-tR NA™? causes the ribosome to stall on segment 1. 
This leaves segment 2 free to pair with segment 3 and to form the 2-3 antiterminator 
hairpin in the transcript. Because this hairpin precludes formation of the 3-4 termina- 
tor, termination is prevented and the entire operon is transcribed. Thus, transcription 
attenuation is determined by the availability of tryptophanyl-tRNA™®? and its transi- 
tory influence over the formation of alternative secondary structures in the mRNA. 


29.2i DNA: Protein Interactions and Protein: Protein Interactions 
Are Essential to Transcription Regulation 


Although a variety of control mechanisms regulate transcription, they share common 
features. First, DNA: protein interactions are a central feature in transcriptional 
control, and the DNA sites where regulatory proteins bind commonly display 
at least partial dyad symmetry or inverted repeats. Furthermore, DNA-binding 
proteins themselves are generally even-numbered oligomers (for example, dimers, tetra- 
mers) that have an innate, twofold rotational symmetry. Second, protein: protein interac- 
tions are an essential component of transcriptional activation. We see this latter feature 
in the activation of RNA polymerase by CAP-(cAMP),, for example. Third, the regu- 
lator proteins receive cues that signal the status of the environment (for example, Trp, 
lactose, cAMP) and act to communicate this information to the genome, typically via the 
medium of conformational changes and DNA: protein interactions. 


29.2j Proteins That Activate Transcription Work Through Protein : Protein 
Contacts with RNA Polymerase 
Although transcriptional control is governed by a variety of mechanisms, 


protein: protein interactions are a universal feature in transcriptional activation. Tran- 
scriptional activation can take place when a transcriptional activator protein [such 
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as CAP-(cAMP),] bound to DNA makes protein: protein contacts with RNA poly- 
merase, and the degree of transcriptional activation is proportional to the strength of 
the protein:protein interaction. Generally speaking, a nucleotide sequence that provides 
a binding site for a DNA-binding protein can serve as an activator site if the DNA- 
binding protein bound there can interact with promoter-bound RNA polymerase. 
These interactions can involve either the a-, B-, B’-, or o-subunits of RNA polymerase. 
Moreover, if the DNA-bound transcriptional activator makes contacts with two differ- 
ent components of RNA polymerase, there is a synergistic response, seen as markedly 
elevated transcription. Thus, transcriptional activation at specific genes relies on the 
presence of one or more activator sites where one or more transcriptional activator 
proteins can bind and make contacts with RNA polymerase bound at the promoter of 
the gene. Indeed, transcriptional activators may facilitate the recruitment and binding 
of RNA polymerase to the promoter. This general principle applies to transcriptional 
activation in both prokaryotic and eukaryotic cells. In eukaryotes, transcriptional acti- 
vators typically have discrete domains of protein structure dedicated to DNA binding 
(DB domains) and transcriptional activation (TA domains). 


29.2k DNA Looping Allows Multiple DNA-Binding 
Proteins to Interact with One Another 


Because transcription must respond to a variety of regulatory signals, multiple proteins are 
essential for appropriate regulation of gene expression. These regulatory proteins are the 
sensors of cellular circumstances, and they communicate this information to the genome 
by binding at specific nucleotide sequences. However, DNA is virtually a one-dimensional 
polymer, and there is little space for a lot of proteins to bind at (or even near) a transcrip- 
tion initiation site. DNA looping permits additional proteins to convene at the initiation 
site and to exert their influence on creating and activating an RNA polymerase initia- 
tion complex (Figure 29.22). The number of participants in transcriptional regulation is 
greatly expanded by DNA looping. It is important to note that, although illustrations 
on a printed page are two-dimensional, DNA looping occurs in the three dimensions of 
space. If you conceptualize DNA looping as three-dimensional, it becomes apparent that 
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Figure 29.21 The mechanism of attenuation in the 
trp operon. 
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Figure 29.22 Formation of a DNA loop delivers DNA-bound transcriptional activator to RNA poly- 
merase positioned at the promoter. Protein:protein interactions between the transcriptional activator 
and RNA polymerase activate transcription. 


many different proteins, each bound to a DNA site remote from the others, could come 
together in three-dimensional space to recruit RNA polymerase to a promoter. Interac- 
tions between them creates the potential for exquisite control over expression of the gene. 


29.3 | How Are Genes Transcribed in Eukaryotes? 


Although the mechanism of transcription in bacteria and eukaryotes is fundamentally 
similar, transcription is substantially more complicated in eukaryotes. One significant 
difference is that the DNA of eukaryotes (and, interestingly, of Archaea as well) is 
wrapped around histones to form nucleosomes, and the nucleosomes are further orga- 
nized into chromatin (see Chapter 11). Nucleosomes control gene expression by con- 
trolling access of the transcriptional apparatus to genes. The presence of nucleosomes 
represses gene expression. Two classes of transcriptional co-regulators are necessary to 
overcome nucleosome repression: (1) enzymes that covalently modify the nucleosome 
histone proteins and thereby loosen histone: DNA interactions and (2) ATP-depen- 
dent chromatin-remodeling complexes. However, gene activation depends not only on 
relief from nucleosome repression but also on interaction of RNA polymerase with the 
promoter. Only those genes activated by specific positive regulatory mechanisms are 
transcribed. A general understanding of transcription in eukaryotes depends on the 
understanding of: 


e The three classes of RNA polymerase in eukaryotes: RNA polymerases I, II, and HI 
e The structure and function of RNA polymerase I, the mRNA-synthesizing RNA 
polymerase 
e Transcription regulation, including: 
— General features of gene regulatory sequences: promoters, enhancers, and response 
elements 
— Transcription initiation by RNA polymerase II 
The recruitment of RNA polymerase I to promoters and the assembly of 
transcription initiation complexes there. 
Alleviation of the repressive effects of nucleosomes through chromatin re- 
modeling and histone modifications. 
e A general model for eukaryotic gene activation. 


Below, we review these various features of eukaryotic transcription. 
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29.3a Eukaryotes Have Three Classes of RNA Polymerases 


Eukaryotic cells have three classes of RNA polymerase, each of which synthesizes a 
different class of RNA. All three enzymes are found in the nucleus. RNA polymerase 
I is localized to the nucleolus and transcribes the major ribosomal RNA genes. RNA 
polymerase II transcribes most noncoding RNA genes and all protein-encoding genes, 
and thus it is responsible for the synthesis of mRNA. RNA polymerase III transcribes 
tRNA genes, the ribosomal RNA genes encoding 5S rRNA, and a variety of other 
small RNAs, including several involved in mRNA processing and protein transport. 

All three RNA polymerase types are large, complex multimeric proteins (500 to 
700 kDa), consisting of ten or more types of subunits. Although the three differ in 
overall subunit composition, they have several smaller subunits in common. Further- 
more, all possess two large subunits (each 140 kDa or greater) having sequence simi- 
larity to the large B- and B’-subunits of E. coli RNA polymerase, indicating that the 
fundamental catalytic site of RNA polymerase is conserved among its various forms. 

In addition to their different functions, the three classes of RNA polymerase can 
be distinguished by their sensitivity to a-amanitin (Figure 29.23), a bicyclic octapep- 
tide produced by the poisonous mushroom Amanita phalloides (the destroying angel 
mushroom). a-Amanitin blocks RNA chain elongation. Although RNA polymerase 
I is resistant to this compound, RNA polymerase II is very sensitive and RNA poly- 
merase III is less sensitive. 

The existence of three classes of RNA polymerases acting on three distinct sets of 
genes means that there must be three distinct categories of promoters. Eukaryotic 
promoters are very different from prokaryotic promoters. All three eukaryotic RNA 
polymerases interact with their promoters via so-called transcription factors—DNA- 
binding proteins that recognize and accurately initiate transcription at specific promoter 
sequences. For RNA polymerase I, its templates are the rRNA genes. Ribosomal RNA 
genes are present in multiple copies. Optimal expression of these genes requires the first 
150 nucleotides in the immediate 5'-upstream region. 

RNA polymerase III interacts with transcription factors TFIIA, TFIIB, and 
TFIIIC. Interestingly, TFIIA and/or TFHIC bind to specific recognition sequences 
that in some instances are located within the coding regions of the genes, not in the 
5'-untranscribed region upstream from the transcription start site. TFIIIB associates 
with TFIIA or TFIIIC already bound to the DNA. RNA polymerase III then binds to 
TFIIIB to establish an initiation complex. 


29.3b RNA Polymerase II Transcribes Protein-Coding Genes 


As the enzyme responsible for the regulated synthesis of mRNA, RNA polymerase 
II has aroused greater interest than RNA polymerases I and III. RNA polymerase II 
must be capable of transcribing a great diversity of genes, yet it must carry out its 
function at any moment only on those genes whose products are appropriate to the 
needs of the cell in its ever-changing metabolism and growth. The RNA polymerase II 
enzymes from eukaryotes have been extensively characterized and show a marked 
similarity in structure, whether from yeast or humans (Figure 29.24). Both yeast and 
human RNA polymerase II consist of 12 different polypeptides, designated RPB1 
through RPB12 and ranging in size from 192 to 8 kDa (Table 29.2). RPB3, RPB4, and 
RPB7 are unique to RNA polymerase II, whereas RPB5, RPB6, RPB8, and RPB10 are 
common to all three eukaryotic RNA polymerases. 

RNA polymerases adopt a clawlike structure, grasping the DNA duplex as shown 
in Figure 29.24 for yeast RNA polymerase II. The DNA strands are unwound and 
separated, and the template strand enters the active site, where template-directed NTP 
substrate selection and NMP addition to the growing RNA transcript occur. NTP 
substrates access the active site through a channel below the active site. The DNA:RNA 
hybrid duplex exits at a right angle from the active site. As the hybrid duplex emerges 
from the protein structure, the RNA transcript is separated from the DNA template by 
RPBI residue Phe?’, which acts as a wedge, splitting the position —10 RNA:DNA base 
pair (the base pair located 10 bases from the active site). The template strand is now free 
to re-anneal with the nontemplate strand to re-establish the dsDNA structure. 
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Figure 29.23 The structure of a-amanitin, one of a 
series of toxic compounds known as amatoxins that 
are found in the mushroom Amanita phalloides. 
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CTD of RPB1 


Figure 29.24 Structure of the yeast RNA 
polymerase II (PDB id = 1Y77). The template 
DNA strand is shown in green, the nontemplate 
DNA strand in blue. The RNA transcript (hot pink) 
is emerging at the bottom of the structure. RPB1, 
the largest polypeptide chain, is shown in orange, its 
C-terminal domain (CTD) is to the upper right. The 
active-site Mg?* is shown as a red sphere in the center 
of the globular part of the enzyme. The atoms of a 
ribonucleotide substrate analog (GMPCP) are shown 
as dark blue spheres. RNA polymerase II is often 
referred to more simply as “RNA pol II.” 


IEJ CY LYA Eukaryotic RNA Polymerase II Subunits 


Subunit Size (kDa) Features Prokaryotic Homolog 
RPBIt 192 YSPTSPS CTD B' 
RPB2 139 NTP binding B 
RPB3 35 Core assembly a 
RPB4 25 Promoter recognition o 
RPBS 25 In polymerases I, II, and HI 

RPB6 18 In polymerases I, I, and III 

RPB7 19 Unique to polymerase II 

RPB8 17 In polymerases I, II, and III 

RPB9 14 Nonessential 

RPB1O 8 In polymerases I, I, and III 

RPB11 14 

RPB12 8 In polymerases I, II, and III 


TRPB stands for RNA polymerase B; RNA polymerases |, II, and III are sometimes called RPA, RPB, and RPC. 
Adapted from Myer, V. E., and Young, R. A., 1998. RNA polymerase II holoenzymes and subcomplexes. The Journal of 
Biological Chemistry 273:27757-27760. 
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The RPBI subunit has an unusual structural feature not found in prokaryotes: Its 
C-terminal domain (CTD) contains 27 repeats of the amino acid sequence YSPTSPS. 
The analogous subunit in human RNA polymerase II has this heptapeptide tandemly 
repeated 52 times. Note that the side chains of five of the seven residues in this repeat 
have —OH groups, endowing the CTD with considerable hydrophilicity and multiple 
sites for phosphorylation. A number of CTD kinases have been described, targeting 
different residues at different stages of the transcription process. The CTD domain may 
project more than 50 nm from the surface of RNA polymerase II. The phosphorylation 
possibilities represented by the YSPTSPS repeats within the CTD domain indicate the 
existence of a phosphorylation code that regulates mRNA synthesis and processing. 
In addition, an arginine residue (R*!°) within the CTD can be methylated. Reversible 
methylation provides a selection mechanism for determining specific gene expression. 

The CTD is essential to RNA polymerase II function. Only RNA polymerase 
II whose CTD is not phosphorylated can initiate transcription. However, tran- 
scription elongation proceeds only after protein phosphorylation within the CTD, 
suggesting that phosphorylation triggers the conversion of an initiation complex 
into an elongation complex. Such a mechanism would allow protein phosphoryla- 
tion to regulate gene expression. Following termination of transcription, a phospha- 
tase recycles RNA polymerase II to its unphosphorylated form. The CTD also plays 
a prominent role in orchestrating subsequent events in the transcription process. 
A multitude of additional proteins are essential to the formation of a translatable 
mRNA from the primary RNA polymerase IJ transcript; these proteins (described 
later in this chapter) include 5’-capping enzymes, splicing factors, and 3’-polyadeny]l- 
ylation complexes. Recruitment of these proteins is dependent upon phosphorylation 
of Ser residues at positions 2 and 5 in the CTD heptapeptide repeat. Phosphorylation 
of these Ser residues is also a prerequisite for interactions between the CTD and the 
histone methyltransferases capable of remodeling nucleosomes into a transcription- 
ally permissive state. 


29.3c The Regulation of Gene Expression Is More Complex in Eukaryotes 


Transcription in eukaryotes, like the process in prokaryotes, has four stages: (1) pro- 
moter recognition and RNA polymerase IT binding, (2) initiation, (3) elongation, and 
(4) termination. In eukaryotes, not only metabolic activity and cell division but also 
complex patterns of embryonic development and cell differentiation must be coordi- 
nated through the regulation of gene expression. All this coordinated regulation takes 
place in cells with a large quantity and sequence diversity of DNA. A typical mam- 
malian cell has 1500 times as much DNA as an E. coli cell. The structural genes of 
eukaryotes are rarely organized in clusters akin to operons. Each eukaryotic gene typi- 
cally possesses a discrete set of regulatory sequences appropriate to the requirements 
for regulating its transcription. Certain of these sequences provide sites of interaction 
for general transcription factors, whereas others endow the gene with great specificity 
in expression by providing targets for specific transcription factors. 


29.3d Gene Regulatory Sequences in Eukaryotes Include Promoters, 
Enhancers, Insulators, and Silencers 


Gene expression in eukaryotes is precisely regulated in a time-specific and cell-specific 
fashion. This regulation depends on coordinated interactions between proteins bound 
to DNA at cis-regulatory elements (CREs, where cis denotes that the sequence element 
is on the same DNA molecule). Proteins that bind to specific CREs are termed trans- 
acting factors. Over 1600 gene-specific trans-acting factors are known, some of which 
are master transcription regulators that control genes whose expression determines 
cell type. CREs serve two purposes: (1) they identify promoters and transcription start 
sites, and (2) they act as platforms for binding of proteins that regulate and execute 
gene expression. CREs can be located at or near the transcription start site (TSS, the 
nucleotide where transcription begins), or many thousands of base pairs away (dis- 
tal CREs). Distal CREs include enhancers, insulators, and silencers, described below. 
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Figure 29.25 The Inr and TATA box in selected 
eukaryotic genes. The consensus sequence of a num- 
ber of such promoters is presented in the lower part 
of the figure, the numbers giving the percent occur- 
rence of various bases at the positions indicated. 


Efficient transcription of a protein-coding gene by RNA pol II depends on interac- 
tions with proteins bound at two distinct CRE sequences: sequences at or near the TSS, 
termed promoters, and more distantly located enhancers or silencers. 


Promoters The promoters of eukaryotic protein-encoding genes are quite complex and 
variable. Their location is defined by the core promoter, a 50- to 100-bp DNA sequence 
within which lies the transcription start site (TSS). The core promoter serves as a plat- 
form for the assembly of the transcriptional apparatus, which is composed of RNA pol 
II, the general transcription factors (GTFs), and other proteins. The assembled apparatus 
is megadaltons in size and occupies more than 100 bp around the TSS. About half of 
the core promoters for human protein-coding genes have an Inr element, while about 
20% have a TATA box. The Inr element has the consensus sequence: C/G/T-C/G/T-C-A- 
C/c/T-A/T, with the bold A at position +1, defining the TSS. The TATA box (consen- 
sus sequence T-A-T-A-A-T1/A-A/T-A/G) is usually located at position —31 from the TSS 
(Figure 29.25). The A:T-rich nature of the TATA box facilitates promoter melting 
and access of the transcription apparatus to the template strand. Another consensus 
sequence, the downstream promoter element (DPE; A/G-G-A/T-C/T-A/C/G-T) is often 
found in genes with an Inr motif but lacking a TATA box. 

DPEs are embedded in the coding region, positioned at +28 to +33 relative to the 
TSS. The TATA box and Inr serve distinct sets of genes. The TATA box is typically a 
feature of tissue-specific genes, whereas the Inr is found in housekeeping genes: genes 
that encode proteins commonly present in all cells and essential to normal function. 
Housekeeping genes are typically transcribed at more-or-less steady levels, a state 
referred to as constitutive expression. Sequence elements other than the Inr motif, 
TATA box, and the DPE also occur as promoters for RNA pol II-transcribed genes, 
but less frequently. Such regulatory elements include short nucleotide sequences called 
response elements (REs) found near the promoter (the promoter—proximal region). 
Response elements bind certain specific transcription factors, such as proteins that 
trigger expression of a related set of genes in response to some physiological signal 
(hormone) or challenge (temperature shock). 


Enhancers and silencers Enhancers and silencers are CREs located quite distant from 
the genes they regulate, ranging from several kilobase pairs to nearly a megabase pair in 
some human genes. The typical enhancer has a highly variable DNA sequence less than 
100-bp-long and is located in a region of chromatin accessible to DNA-binding proteins 
because nucleosomes are sparse. These distantly located sequences are recognized by spe- 
cific DNA-binding proteins that activate transcription above basal levels. The sequences 
where transcriptional activators bind are termed enhancers; silencers are sequences 
where transcriptional repressors bind. Searches through the human genome by the Ency- 
clopedia of DNA Elements (ENCODE) Consortium reveal hundreds of thousands of 
putative enhancers and silencers, far more than the number of protein-coding genes. 

As nucleotide sequences distant from the promoter that regulate transcription, 
enhancers and silencers share many attributes. The focus here is on enhancers and their 


Inr 
Adenovirus GGGGOPATAANANGGGGGTGGGGGCGCGTTCGTCCTCACTC 
late 


Chicken GAGGCHATATATTCCCCAGGGCTCAGCCAGTGTCTOTACA 
ovalbumin 


Mouse Bglobin GAGCAMAMAMIGET GAGGTAGGATCAGTTGCTCCTCRCATTT 
major 

Rabbit TTGCC@AMAMMMcccAGAGCAGGGCAGCTGCTGCTGCTARICACT 
B-globin N 

+1 


Transcription 
start site 
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activation of transcription, but silencers differ only in that they repress transcription. 
Enhancers (called upstream activation sequences, or UASs in yeast) assist transcrip- 
tion initiation. Enhancers differ from promoters in two fundamental ways. First, the 
location of enhancers relative to the transcription start site is not fixed. Enhancers may 
be several thousand nucleotides away from the promoter, and they act to enhance tran- 
scription initiation even if positioned downstream from the gene. Second, enhancer 
sequences are bidirectional in that they function in either orientation. That is, enhanc- 
ers can be removed and then reinserted in the reverse sequence orientation without 
impairing their function. Like promoters, enhancers represent modules of consensus 
sequence. Enhancers are non-specific and they stimulate transcription from any pro- 
moter that happens to be in their vicinity. Nevertheless, enhancer function is dependent 
on recognition by a specific transcription factor. A specific transcription factor bound 
at an enhancer element stimulates transcription by interacting with promoter-bound 
RNA pol II. Looping of chromatin must happen in order for this interaction to occur. 
Repressors share these enhancer properties, except they repress rather than enhance 
transcription. 

Insulators are DNA sequences of 0.1—1 kb that act in two ways: (1) to shield regions 
of chromatin active in transcription from inactive regions, and (2) to block an enhancer 
from stimulating transcription from a specific promoter. An insulator sequence can act 
as an enhancer blocker if it is located somewhere along the DNA sequence between the 
enhancer and the promoter. 

Many genes are subject to multiple regulatory influences. Regulation of such genes 
is achieved through the presence of an array of different regulatory elements. The 
metallothionein gene is a good example (Figure 29.26). Metallothionein is a metal- 
binding protein that protects cells against metal toxicity by binding excess amounts 
of heavy metals and removing them from the cell. This protein is always present at 
low levels, but its concentration increases in response to heavy metal ions such as 
cadmium or in response to glucocorticoid hormones. The metallothionein gene pro- 
moter consists of two general promoter elements, namely, a TATA box and a GC box, 
two basal-level enhancers, four metal response elements (MREs, consensus sequence 
GCNCCCGGCNCN) where the MRE-binding factor MTF binds, and one glucocor- 
ticoid response element (GRE, consensus sequence T@GTACAAATGTTCT). When 
MTF binds heavy metals such as cadmium and zinc, it activates metallothionein gene 
expression by binding to the MREs. Similarly, in the presence of the stress hormone 
cortisol, growth hormone (GR) binds to the metallothionein gene GRE, activating 
transcription. Nearly 30% of human genes have a GRE. Elements like MRE and GRE 
function independently of one another; any one can activate transcription of the gene. 


29.3e Transcription Initiation by RNA Polymerase II Requires TBP and the GTFs 


Transcription initiation in eukaryotes proceeds through assembly of a preinitia- 
tion complex (PIC) on the promoter DNA. The PIC is composed of more than 100 
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Figure 29.26 The metallothionein gene possesses several constitutive elements in its promoter (the TATA 
and GC boxes) as well as specific response elements such as MREs and a GRE. The BLEs are elements 
involved in basal level expression (constitutive expression). TRE is a tumor response element activated in 
the presence of tumor-promoting phorbol esters such as TPA (tetradecanoyl phorbol acetate). 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1057 


1058 Chapter 29 Transcription and the Regulation of Gene Expression 


polypeptides, including RNA polymerase II, and a set of general transcription fac- 
tors (TFIIA, TFIIB, TFIID, TFITE, TFIIF, TFITH, and Mediator, a 1.5-megadalton 
complex that mediates interactions between RNA pol II and proteins bound at dis- 
tal CREs.) Although the core promoter can trigger transcription initiation, transcrip- 
tional activity is very weak without stimulatory input from distal enhancers (Table 
29.3). TFIID consists of the TBP (the TATA binding protein), which directly recognizes 
the TATA box within the core promoter and a set of TBP-associated factors (TAFs). 
This TFIID complex is the first GTF to recognize and bind to the promoter. TBP is 
the TATA box-binding protein that recognizes the promoter core, making contacts 
with the DNA minor groove and distorting and bending the DNA so that sequences 
upstream and downstream of the TAT box come into closer proximity (Figure 29.27a). 
TFIIA stimulates transcription by stabilizing the interaction of TFIID and the TATA 
box. TFIIB serves as a bridge between promoter-bound TFIID and RNA pol II, posi- 
tioning the RNA polymerase active site over the promoter. TFITE and TFIIF anchor 
the TFITH CDK kinase module within the preinitiation complex, enhancing various 
aspects of promoter:PIC interaction. The helicase activity of TFITH aids in creating 
the transcription bubble. The transition from a state where the PIC is situated over the 
promoter (the closed initiation complex) occurs when TFIIB opens the promoter and 
the template strand of DNA enters the pol II active-site cleft. The assemblage of RNA 
polymerase II and these other proteins at the promoter is now termed the initiation 
complex. However, transcription initiation in eukaryotes also requires Mediator, which 
in humans is composed of 26 subunits, the Med proteins. 

The role of general transcription factor TFIID as a general cofactor for PIC forma- 
tion can also be assumed by SAGA, a megacomplex of about 20 proteins. SAGA is 
named for its Spt, Ada, and Gen5 subunits. Spt is a family of proteins involved in gene 
expression; Ada is a histone acetyltransferase (HAT, a histone-modifying enzyme); and 
Gcn5 is a lysine acetyltransferase (KAT). SAGA also contains TAF (TBP-associated 
proteins) subunits. Thus, SAGA, like TFIID, can recognize TATA boxes. SAGA is 
required for the expression of most RNA pol II-transcribed genes. 


29.3f The Role of Mediator in Transcription Activation and Repression 


Mediator serves as a dynamic bridge between gene-specific transcription co-activators 
bound to enhancers and the RNA pol II/GTF transcription machinery bound at the 
promoter. Once DNA becomes accessible through the action of chromatin remod- 
eling complexes (discussed in the following sections), a transcription co-activator 


| Table 29.3 | Ye) General Transcription Initiation Factors from Human Cells 


Factor Number of Subunits 
TFIID 

TBP 1 

TAFs 13 
TFHA 2 
TFIIB 1 
TFUF 2 
RNA pol II 13 
TFHE 2 
TFUH 10 
TFIS 34 


Function 


Core promoter recognition (TATA); platform for recruitment of TFIIA, TFIIB, and TAFs 
Core promoter recognition (non-TATA elements); positive and negative regulatory functions; HAT (histone 
acetyltransferase) activity 


Stabilization of TBP binding; stabilization of TAF—DNA interactions 


Binding TBP, RNA polymerase II, and promoter DNA; determines transcription start site. 


Promoter targeting of polymerase II; destabilization of nonspecific RNA polymerase II-DNA interactions 


Enzymatic synthesis of RNA; TFIIE recruitment 


TFIIH recruitment; modulation of TFIIH helicase, ATPase, and kinase activities; promoter melting 


Promoter melting using helicase activity; promoter clearance via CTD phosphorylation (2 subunits of TFHII 
are Cyclin H and Cdk7, forming a cyclin-CDK pair) 


Elongation 


Adapted from Table 1 in Roeder, R. G., 1996. The role of general initiation factors in transcription by RNA polymerase II. Trends in Biochemical Sciences 21:327-335; Reese, J. C., 2003. 
Basal transcription factors. Current Opinion in Genetics and Development 13:114-118; Table 1 in Hahn, S., 2004. Structure and mechanism of the RNA polymerase II transcription 
machinery. Nature Structural and Molecular Biology 11:394-403, and Box 3 in Allen, B. L., and Taatjies, D. J., 2015. The Mediator complex: a central integrator of transcription. Nature 


Reviews Molecular Cell Biology 16:155-166. 
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Figure 29.27 Protein complexes key to transcription initiation. (a) Model of 
the TATA-binding protein (TBP) in complex with a DNA TATA sequence 
(PDB id = 1YTB). The saddle-shaped TBP (go/d) is unusual in that it binds in 
the minor groove of DNA, sitting on the DNA like a saddle on a horse. TBP 
binding pries open the minor groove, creating a 100° bend in the DNA axis 
and unwinding the DNA within the TATA sequence. The other subunits of the 
TFIID complex (see Table 29.3) sit on TBP, like a person on a saddle. All 
known eukaryotic genes (those lacking a TATA box as well as those 
transcribed by RNA polymerase I or III) rely on TBP. (b) Front and side views 
of the overall Mediator-RNA polymerase II-TFIIF complex, as obtained 
through cryo-electron microscopy, are shown as blue mesh. The location of 
RNA polymerase II is shown in red. The locations of TBP, TFIID, TFIIA, 
TFIIB, TFIIF, TFUE, and TFIIH are indicated. The likely path of upstream 
and downstream promoter DNA is shown by the dashed line. This relationship 
of upstream to downstream DNA is established by the severe bend that TBP 
forces on the DNA. Adapted from Figure 5 in Bernecky, C., Grob, P., Ebmeier, 
C. C., Nogales, E. and Taatjes, D. J., 2011. Molecular architecture of the 
human Mediator-RNA polymerase II-TFIIF assembly. PLoS Biology 
9:e1000603. doi:10.1371/journal.pbio. 1000603. Part (a) courtesy of Dylan J. 
Taatjes. (c) The relationship of RNA pol II, the various GTFs, and the 
Mediator complex to the promoter in the Pre-Initiation Complex (PIC). Note 
that, in terms of this illustration, the downstream DNA is oriented up and 
thus, transcript would proceed in a vertical direction, as indicated by the arrow 
near IIH. Adapted from Box 3 in Allen, B. L., and Taatyies, D. J., 2015. The 
Mediator complex: a central integrator of transcription. Nature Reviews 
Molecular Cell Biology 16:155-166. 


Promoter 


binds to an enhancer and recruits Mediator. Mediator has been described as the ulti- 
mate regulator of transcription since it integrates and communicates information from 
enhancers and transcription co-activators to RNA polymerase at the promoter. 

Mediator is a crescent-shaped structure, with a head, middle, and tail (Figure 29.27c). 
The tail section recognizes and binds the transcription co-activator. Both the head 
region and the middle region of Mediator bind RNA pol II through interactions with 
its CTD, with the middle region also associating with general transcription factor 
TFIIE (Figure 29.28a). 

Mediator is a global regulator essential to transcription of virtually every RNA pol II- 
dependent gene. It is even required for basal transcription of these genes. Mediator also 
displays HAT activity, which may aid in exposing promoters and subsequent binding of 
GTFs and RNA pol II. Taken together, the results indicate that Mediator provides a scaf- 
fold for assembly of the PIC at the promoter just prior to transcription initiation. Media- 
tor apparently acts as the gateway by which RNA pol II gains access to the promoter. 

Transition from the initiation to the elongation phase of RNA synthesis is trig- 
gered when TFIIH phosphorylates Ser’ within the RNA pol II CTD YSPTSPS hepta- 
peptide repeats. CTD phosphorylation also leads to dissociation of Mediator from the 
TFIID:promoter complex, as well as escape of RNA pol II, allowing it to fully transcribe 
the gene. Two of the Mediator subunits are a cyclin-CDK pair (CyceC/CDK8 in yeast or 
CCNC/CDK8 in vertebrates). The cyclin-CDK pair, together with subunits Med12 and 
Med13, form the four-subunit Mediator kinase module. This module phosphorylates the 
CTD as elongation commences. It also serves as a bridge between cell signaling, metabo- 
lism, and transcription through phosphorylation of TFs. Phosphorylation of CTD resi- 
due Tyr! during elongation blocks recruitment of transcription termination factors to 
pol II. During elongation, the rate of RNA synthesis by pol II is 30 nucleotides/s. 
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enhancer 


29.3g Mediator as a Repressor of Transcription 


Mediator apparently acts in repression of transcription as well as activation. How can 
Mediator serve two opposing regulatory functions? Perhaps the explanation lies in the 
ability of a Mediator subcomplex (the MED12 and MED13 proteins, plus CycC/CDK8) 
to interact with co-repressor bound to a silencer (see Figure 29.28). This Mediator sub- 
complex is a repressive module that stabilizes Mediator in a conformation that cannot 
recruit RNA polymerase II to the promoter as long as CycC/Cdk§8 is associated with the 
subcomplex (Figure 29.28b). 


29.3h Chromatin-Remodeling Complexes and Histone-Modifying 
Enzymes Alleviate the Repression Due to Nucleosomes 


The central structural unit of nucleosomes, the histone core octamer (see Figure 11.29), 
is constructed from the eight histone-fold protein domains of the eight various histone 
monomers comprising the octamer. Successive histone octamers are linked via histone 
H1, which is not part of the octamer. Each histone monomer in the core octamer has 
an unstructured N-terminal tail that extends outside the core octamer. Interactions 
between histone tails contributed by core histones in adjacent nucleosomes are an impor- 
tant influence in establishing the higher orders of chromatin organization. Activation of 
eukaryotic transcription is dependent on two sets of circumstances: (1) relief from the 
repression imposed by chromatin structure and (2) interaction of RNA polymerase II 
with the promoter and transcription regulatory proteins. Typically, nucleosome occu- 
pancy of enhancer and promoter regions is quite low, such that enhancers and promoters 
occur in nucleosome-free regions (NFRs). NFRs sometimes exist because of the intrinsic 
properties of certain DNA sequences or they may arise through the action of chromatin- 
remodeling complexes. If nucleosomes are present, relief from repression requires fac- 
tors that can reorganize the chromatin and then alter the nucleosomes so that promoters 
become accessible to the transcriptional machinery. Two sets of factors are important 
here: chromatin-remodeling complexes that mediate ATP-dependent conformational 
(noncovalent) changes in nucleosome structure and histone-modifying enzymes that intro- 
duce covalent modifications into the N-terminal tails of the histone core octamer. Chro- 
matin remodeling and histone modification are closely linked processes. 


DNA 


(b) silencer 


Mediator 


promoter promoter 


Figure 29.28 Simple models of Mediator in the regulation of eukaryotic gene transcription. In these 
models, conformational changes generated through enhancer or silencer recognition are propagated 
across the entire Mediator structure from tail to head. These conformational changes determine 
whether or not RNA pol II is bound and transcription occurs. (a) Mediator as a transcription 
activator. Mediator regions are highlighted in color: green for the tail, yellow for the middle, and red 
for the head. RNA polymerase II and the GTFs are blue. The transcription co-activator is orange. 
DNA is shown as a black line. (b) Mediator as a repressor. If Mediator interacts with a co-repressor 
(aqua) bound to a silencer and then binds the repressive subcomplex (MED12-MED13-CycC/CDK8), 
shown in pink here, it fails to recruit RNA polymerase II and the GTFs to the promoter, and 
expression from the promoter is repressed. (Adapted from Figures 1 and 2 in Bjorklund, S., and 
Gustafson, C. M., 2005. The yeast Mediator complex and its regulation. Trends in Biochemical 
Sciences 30:240-244.) 
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29.31 Chromatin-Remodeling Complexes Are Nucleic 
Acid—Stimulated Multisubunit ATPases 


Chromatin-remodeling complexes are protein assemblies containing ATP-dependent 
enzymes that loosen the DNA: protein interactions in nucleosomes by sliding, ejecting, 
inserting, or otherwise restructuring core octamers. In the process, about 50 bp of DNA 
are peeled away from the edge of the nucleosome, creating a bulge that allows RNA pol 
I, GTFs, and other transcription factors to access the DNA. Chromatin-remodeling 
complexes always contain proteins of the SNF2 family of DEAD/H-box-containing, 
nucleic acid-stimulated ATPases. (DEAD signifies the Asp-Glu-Ala-Asp tetrapeptide 
signature sequence of this protein family; in some of these proteins, a histidine residue 
(H) replaces the second D in the box.) The various SNF2 chromatin remodelers are 
assigned to one of four subfamilies, SWI/SNF, ISWI, CHD, and INO80, based on their 
subunit composition and recognition of various histone modifications: SWI2 types 
with bromodomains that bind acetylated lysines, ISWI types that have separate domains 
for histone tail and linker DNA binding, CHD types with chromodomains that bind 
methylated lysines, and INO80 types with DNA-binding domains. 

During the elongation phase of transcription, RNA pol II must pass by each 
nucleosome, and such passage is believed to result in the loss and regain of an H2A/ 
H2B dimer from the core octamer. FACT (for facilitates chromatin transcription) is a 
heterodimer that acts on some nucleosomes to catalyze H2A/H2B removal and read- 
dition; its activity is markedly increased by covalent modification of H2B Lys!”°. 


29.3} Covalent Modification of Histones 


Chromatin is also remodeled through the action of enzymes that covalently modify 
side chains on histones within the core octamer. These modifications either directly 
decrease DNA: histone associations through disruption of electrostatic interactions or 
introduce substitutions that can recruit binding of new protein participants that dis- 
rupt DNA:histone interactions. 

Initial events in transcriptional activation include acetyl-CoA-dependent acetyla- 
tion of -amino groups on lysine residues in histone tails by histone acetyltransferases 
(HATs) (Figure 29.29). The histone transacetylases responsible are essential components 
of several megadalton-size complexes known to be required for transcription co-activation 


Figure 29.29 A schematic diagram of the 
nucleosome illustrating the various covalent 
modifications on the N-terminal tails of histones. 
acK = acetylated lysine residue; meK = methylated 
lysine residue; meR = methylated arginine residue; 
PS = phosphorylated serine residue. The numbers 
indicate the positions of the amino acids in the 
amino acid sequences. Note the prevalence of 
modifiable sites, particularly acetylatable lysines, on 
the N-terminal tails of histones H2B, H3, and H4. 
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(co-activation in the sense that they are required along with RNA polymerase IT and other 
components of the transcriptional apparatus). Examples of such complexes include the 
TFIID (some of whose TAFys have HAT activity), the SAGA complex (which also con- 
tains TAFys), and the ADA complex. N-Acetylation suppresses the positive charge in 
histone tails, diminishing their interaction with the negatively charged DNA. 

Phosphorylation of Ser residues and methylation of Lys residues in histone tails 
also contribute to transcription regulation (Figure 29.29). Lysine methylation is cata- 
lyzed by S-adenosylmethionine—dependent lysine methyltransferases (KMTs) that can 
add one, two, or three methyl groups to the e-amino groups of K residues. Mono-, 
di-, and trimethyllysines are referred to as Kmel, Kme2, or Kme3, respectively. Lysine 
demethylases (KDMs) are O>-dependent enzymes that release the K methyl groups as 
either CO, or formaldehyde. The nomenclature for K methylation is straightforward: 
H3K4me3 signifies that lysine 4 on histone H3 is trimethylated. Attachment of small 
proteins to histone C-terminal lysine residues is an additional form of covalent modi- 
fication found in nucleosomes. Collectively, these modifications create binding sites for 
proteins that modulate chromatin structure, such as the chromatin-remodeling com- 
plexes with bromodomains that interact specifically with acetylated lysine residues and 
chromodomains that bind to methylated lysine residues. 


29.3k Covalent Modification of Histones Suggests 
the Existence of a Histone Code 


A code based on histone-tail covalent modifications has been proposed as a mechanism 
that governs gene expression through selective recruitment of proteins. Proteins that 
cause chromatin compaction (heterochromatin formation) lead to repression; proteins 
giving easier access to DNA through relaxation of histone: DNA interactions enhance 
gene expression. 

The prominent forms of histone covalent modification are lysine acetylation, lysine 
methylation, serine phosphorylation, lysine ubiquitination, and lysine sumoylation. The 
lysine residue at position 9 (K9) in the histone H3 amino acid sequence is methylated 
in heterochromatin, the compacted, repressed state of chromatin. In contrast, lysine 4 
(K4) of histone H3 typically is methylated in chromatin where gene expression is active. 
Different proteins are recruited to these two methylated forms of histone H3. Methyl- 
ated K9 recruits heterochromatin protein 1 (HP1), which binds via its chromodomain. 
On the other hand, methylated K4 binds CHD1, a chromatin remodeling protein 
with two chromodomains. (CHD1 is an acronym for chromodomain, helicase, DNA- 
binding.) Ubiquitination of Lys!™ in the C-terminal tail of H2B favors methylation 
(and thus transcription activation), whereas ubiquitination of Lys!!? in the C-terminal 
tail of H2A favors repression. Sumoylation of Lys residues tends to repress transcrip- 
tion, since sumoylation antagonizes acetylation. 


29.31 Methylation and Phosphorylation Act 
as a Binary Switch that Regulates Chromatin Dynamics 


As cells enter mitosis, the chromatin becomes condensed and histone H3 is not only 
methylated at K9 but also phosphorylated at the adjacent serine residue, S10. S10 
phosphorylation triggers the dissociation of HP1 from the heterochromatin. Thus, 
phosphorylation of the residue neighboring K9 trumps HP! binding. Similarly, phos- 
phorylation of the threonine residue (T3) neighboring K4 in the histone H3 tail evicts 
CHD 1 from its site on the methylated K4. Apparently, lysine methylation is the on 
position for the binary switch that recruits specific proteins to histone tails, and phos- 
phorylation at a neighboring residue turns the switch to the off position by ejecting the 
bound proteins. There are at least 16 instances of serines or threonines immediately 
flanking lysine residues in the four histones that constitute the histone core octamer 
of the nucleosome. The methylation-phosphorylation binary switch may be a general 
phenomenon in the regulation of chromatin dynamics. 


Histone Code or Histone Crosstalk? The effects of histone modifications are more com- 
plex and nuanced than the concept of a histone code might suggest. That is, the order of 
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addition and removal of histone modifications, as well as any preexisting modifications 
that might be present, are important in determining the transcriptional fate of the gene 
associated with the nucleosome. The influence of one modification on the likelihood 
of another is referred to as histone crosstalk. For example, phosphorylation of histone 
H3 S10 by different kinases at different times can affect whether or not acetylation of 
histone H3 occurs, and whether transcription ensues. Whether code or crosstalk, chro- 
matin structure is dynamically regulated through the multiple possibilities for covalent 
modification of histone tails. As a consequence of these structural changes, transcrip- 
tion of a particular gene can be profoundly influenced. 


29.3m Chromatin Deacetylation Leads to Transcription Repression 


Deacetylation of histones is a biologically relevant matter, and enzyme complexes 
that carry out such reactions have been characterized. Known as histone deacetylase 
complexes, or HDACs, they catalyze the removal of acetyl groups from lysine residues 
along the histone tails, restoring the chromatin to a repressed state. Beyond these effects 
on transcription, histone modifications determine whether significant cellular events 
involving DNA allocation through mitosis and meiosis may occur. 


29.3n Nucleosome Alteration and Interaction of RNA Polymerase II with 
the Promoter Are Essential Features in Eukaryotic Gene Activation 


Gene activation (the initiation of transcription) can be viewed as a process requiring 
two principal steps: (1) alterations in nucleosomes (and thus, chromatin) that relieve 
the general repressed state imposed by chromatin structure, followed by (2) the inter- 
action of RNA pol II and the GTFs with the promoter. Transcription activators ini- 
tiate the process by recruiting chromatin-altering proteins (the chromatin-remodeling 
complexes and histone-modifying enzymes described previously). Once these altera- 
tions have occurred, promoter DNA is accessible to TBP: TFIID, the other GTFs, 
and RNA pol II. Transcription activation, however, requires communication between 
RNA pol II and the transcription activator for transcription to take place. Mediator 
fulfills this function. Mediator interacts with both the transcription activator and the 
CTD of RNA pol II. This Mediator bridge provides an essential interface for com- 
munication between enhancers and promoters, triggering RNA pol II to begin tran- 
scription. A simple model for transcription initiation is shown in Figure 29.30. Once 
transcription begins, Mediator is replaced by another complex called Elongator. Elon- 
gator has HAT subunits whose activity remodels downstream nucleosomes as RNA 
pol II progresses along the chromatin-associated DNA. 


Promoter-Proximal Pausing The interactions described thus far emphasize regulation of 
gene expression at the level of RNA pol II recruitment to promoters. However, whole- 
genome analyses show that, for many genes, RNA pol II is already situated at promoters 
and appears to be paused there, awaiting signals that will activate the elongation phase 
of transcription. Thus, the expression of many genes may be regulated at the level of 
transcription elongation. What distinguishes between these possibilities? It seems that 
RNA polymerase pausing occurs more often on highly regulated genes because the pro- 
moters of these genes might otherwise be blocked by nucleosomes. Polymerase pausing 
is a mechanism to outcompete nucleosome occlusion of the promoter, so that the gene 
is poised for expression, as decided by regulatory signals. Promoter-proximal pausing by 
RNA pol II also provides a window of opportunity to ensure that proper regulation of 
gene expression has occurred and that RNA quality-control mechanisms can verify the 
fidelity of transcription. 


Activation of Elongation Along with TFIIS and other transcription elongation factors, 
p-TEFb (positive transcription elongation factor b) brings an end to promoter-proximal 
pausing, allowing RNA pol II to move along the DNA template, elongating the tran- 
script. The key event is the phosphorylation of Ser? in the RNA pol II CTD by the 
CDK kinase subunit of p-TEFb. 
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Acetylase 


Deacetylase 


Figure 29.30 A model for the transcriptional regulation of eukaryotic genes. The DNA is a green 
ribbon wrapped around disclike nucleosomes. A specific transcription factor (TF, pink) is bound to a 
regulatory element (either an enhancer or silencer). RNA polymerase II and its associated GTFs (blue) 
are bound at the promoter. The N-terminal tails of histones are shown as wavy lines (blue) emanating 
from the nucleosome discs. A specific transcription factor that is a transcription activator stimulates 
transcription through interaction with a co-activator whose HAT activity renders the DNA more 
accessible and through interactions with the Mediator complex, which in turn recruits RNA pol II to 
the promoter. A specific transcription factor that is a repressor interacts with a co-repressor that has 
HDAC activity that deacetylates histones, restructuring the nucleosomes into a repressed state. 


Most Mammalian Genes Lack Core Promoters The promoters of genes that are consti- 
tutively expressed, such as housekeeping genes, are relatively nucleosome-free. Signifi- 
cantly, about 70% of the human genome is transcribed, even though protein-encoding 
genes represent only 2% of the genome. Further, most mammalian genes lack core 
promoter elements, yet they recruit the complete transcriptional apparatus and are suc- 
cessfully transcribed. And, surprisingly, transcription initiation from most mammalian 
promoters proceeds in both directions, upstream and downstream, a phenomenon called 
divergent transcription. Genes lacking core promoters are instead rich in upstream CpG 
islands, and these CpG islands are able to recruit TBP and transcription factors and 
initiate transcription, but in divergent directions. The RNA products of divergent tran- 
scription tend to be short and not very stable. In contrast, genes having core promoter 
elements are transcribed uni-directionally, as described in this section. 

The fact is the process of transcription in cells has a lot of noise: Most DNA is 
transcribed, but only a small portion of it is represented by protein-coding genes whose 
transcripts are translated into proteins. Some of the noncoding RNA produced arises 
from RNA pol II transcription of enhancer DNA and is referred to as eRNA. Some of 
these eRNAs participate in the regulation of RNA pol II transcription. ENCODE has 
identified tens of thousands of eR NAs in human cells. 

Beyond these considerations, various mechanisms regulate gene expression through 
events that take place subsequent to transcription. Post-transcriptional gene regula- 
tion mediated by microRNAs (see Section 29.6), as well as alternative RNA splicing and 
nucleotide changes introduced through RNA editing (as described in this chapter), are 
mechanisms targeting transcripts. Post-translational modifications of proteins also play a 
major role in the regulation of gene expression, as assessed at the level of biological activity. 


29.4 | How Do Gene Regulatory Proteins Recognize 
Specific DNA Sequences? 


Proteins that recognize nucleic acids do so by the basic rule of macromolecular rec- 
ognition. That is, such proteins present a three-dimensional shape or contour that is 
structurally and chemically complementary to the surface of a DNA sequence. When 
the two molecules come into contact, the numerous atomic interactions that underlie 
recognition and binding can take place. Nucleotide sequence-specific recognition by 
the protein involves a set of atomic contacts with the bases and the sugar—phosphate 
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Human Biochemistry 


Storage of Long-Term Memory Depends on Gene Expression 
Activated by CREB-Type Transcription Factors 


Learning can be defined as the process whereby new information 
is acquired and memory as the process by which this information 
is retained. Cognition is the acquisition of knowledge and is a re- 
sult of learning. The biochemical processes that establish learning 
and memory were for a long time a mystery. It is now clear that 
macromolecular changes underpin both learning and memory. 
Short-term memory (which lasts minutes or hours) requires only 
the covalent modification of preexisting proteins, but long-term 
memory (which lasts days, weeks, or a lifetime) depends on gene 
expression, protein synthesis, and the establishment of new neuro- 
nal connections. Research by Eric Kandel of the Howard Hughes 
Medical Institute was the first to reveal the biochemical basis for 
memory storage in neurons. He was awarded the 2000 Nobel Prize 
in Physiology or Medicine for, among other things, his discovery 


of the role of transcription factors in long-term memory. His work 
opened a new arena in molecular biology, the molecular biology 
of cognition. 

The macromolecular synthesis underlying long-term memory 
storage requires cAMP-response element-binding (CREB) protein- 
related transcription factors and the activation of cAMP-dependent 
gene expression. Serotonin (5-hydroxytryptamine, or 5-HT, a hor- 
mone implicated in learning and memory) acting on neurons pro- 
motes cAMP synthesis, which in turn stimulates protein kinase A 
to phosphorylate CREB protein-related transcription factors that 
activate transcription of cAMP-inducible genes. These genes are 
characterized by the presence of CRE (cAMP response element) 
consensus sequences containing the 8-bp TGACGTCA palindrome. 
CREB transcription factors are bZ/P-type proteins. 


backbone. Direct interactions between protein side chains and specific bases in DNA is 
termed direct readout, since the protein is, in effect, reading the DNA sequence. Hydro- 
gen bonding is critical for recognition, with amino acid side chains providing most 
of the critical contacts with DNA. Protein contacts with the bases of DNA usually 
occur within the major groove (but not always). Protein contacts with the DNA back- 
bone involve both H bonds and salt bridges with electronegative oxygen atoms of the 
phosphodiester linkages. Structural studies on regulatory proteins that bind to specific 
DNA sequences have revealed that the majority of such proteins can be assigned to one 
of three principal classes based on their possession of one of three kinds of small, dis- 
tinctive structural motifs: the helix-turn-helix (or HTH), the zinc finger (or Zn-finger), 
and the leucine zipper-basic region (or bZIP). The latter two motifs are found only in 
DNA-binding proteins from eukaryotic organisms. 

In addition to their DNA-binding domains, these proteins commonly possess other 
structural domains that function in protein:protein interactions recognitions essential 
to oligomerization (for example, dimer formation), DNA looping, transcriptional acti- 
vation, and signal reception (for example, effector binding). 


29.4a a-Helices Fit Snugly into the Major Groove of B-DNA 


A recurring structural feature in DNA-binding proteins is the presence of a-helical 
segments that fit directly into the major groove of B-form DNA. The diameter of an 
a-helix (including its side chains) is about 1.2 nm. The dimensions of the major groove 
in B-DNA are 1.2 nm wide by 0.6 to 0.8 nm deep. Thus, one side of an a-helix can fit 
snugly into the major groove. Further, the particular amino acid side chains arrayed 
along the a-helix find H-bonding partners presented by the edges of the base pairs 
lining the floor of the major groove. This direct readout of DNA sequence by the pro- 
tein underlies sequence-specific DNA binding. Although examples of B-sheet DNA 
recognition elements in proteins are known, the a-helix and B-form DNA are the pre- 
dominant structures involved in protein-DNA interactions. Significantly, proteins can 
recognize specific sites in normal B-DNA; the DNA need not assume any unusual, 
alternative conformation (such as Z-DNA). 


29.4b Proteins with the Helix-Turn-Helix Motif 
Use One Helix to Recognize DNA 


Figure 29.31 An HTH motif protein: Antp 
monomer bound to DNA. Helix 3 (yellow) is locked 
into the major groove of the DNA by helix 2 
(magenta) (PDB id = 9ANT). 


The HTH motif is a protein structural domain consisting of two successive a-helices 
separated by a sharp -turn (Figure 29.31). Within this domain, the a-helix situated 
more toward the C-terminal end of the protein, the so-called helix 3, is the DNA 
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recognition helix; it fits nicely into the major groove, with several of its side chains 
touching DNA base pairs. Helix 2, the helix at the beginning of the HTH motif, creates 
a stable structural domain through hydrophobic interactions with helix 3 that locks 
helix 3 into its DNA interface. Proteins with HTH motifs bind to DNA as dimers. 
In the dimer, the two helix 3 cylinders are antiparallel to each other, such that their 
N-—>C orientations match the inverted relationship of nucleotide sequence in the 
dyad-symmetric DNA-binding site. An example is Antp. Antp is a member of a family 
of eukaryotic proteins involved in the regulation of early embryonic development that 
have in common an amino acid sequence element known as the homeobox domain. The 
homeobox is a DNA motif that encodes a related 60—amino acid sequence (the homeo- 
box domain) found among proteins of virtually every eukaryote, from yeast to man. 
Genes containing homeoboxes are master control genes regulating cell differentiation, 
morphogenesis, and development in animals. Embedded within the homeobox domain 
is an HTH motif. Homeobox domain proteins act as sequence-specific transcription 
factors. Typically, the homeobox portion comprises only 10% or so of the protein’s 
mass, with the remainder of the protein serving in protein: protein interactions essen- 
tial to transcription regulation. Other DNA-binding proteins with HTH motifs are lac 
repressor, trp repressor, and the C-terminal domain of CAP. 


How Does the Recognition Helix Recognize Its Specific DNA-Binding Site? The edges of 
base pairs in dsDNA present a pattern of hydrogen-bond donor and acceptor groups 
within the major and minor grooves, but only the pattern displayed on the major- 
groove side is distinctive for each of the four base pairs A:T, T:A, C:G, and G:C. 
(You can get an idea of this by inspecting the structures of the base pairs in Figure 
11.9.) Thus, the base-pair edges in the major groove act as a recognition matrix identifi- 
able through H bonding with a specific protein, so it is not necessary to melt the base 
pairs to read the base sequence. Although formation of such H bonds is very important 
in DNA: protein recognition, other interactions also play a significant role. For exam- 
ple, the C-5-methyl groups unique to thymine residues are nonpolar knobs projecting 
into the major groove. 


Proteins Also Recognize DNA via Shape Readout Direct readout of the base sequence 
by a DNA-binding protein is not the only way that proteins recognize specific nucleo- 
tide sequences. Shape readout, also known as indirect readout, is the term for the abil- 
ity of a protein to indirectly recognize a particular nucleotide sequence by recognizing 
local conformational variations resulting from the effects that base sequence has on 
DNA structure. Superficially, the B-form structure of DNA appears to be a uniform 
cylinder. Nevertheless, the conformation of DNA over a short distance along its cir- 
cumference varies subtly according to local base sequence. That is, base sequences gen- 
erate unique contours that proteins can recognize. Because these contours arise from 
the base sequence, the DNA-binding protein indirectly reads out the base sequence 
through interactions with the DNA backbone. In the E. coli Trp repressor: trp oper- 
ator DNA complex, the Trp repressor engages in 30 specific hydrogen bonds to the 
DNA: 28 involve phosphate groups in the backbone; only 2 are to bases. Thus, some 
sequence-specific DNA-binding proteins are able to recognize an overall DNA confor- 
mation caused by the specific DNA sequence. However, it is likely that most proteins 
use both base readout and shape readout to recognize the specific nucleotide sequences 
to which they bind. 


29.4c Some Proteins Bind to DNA via Zn-Finger Motifs 


There are many classes of Zn-finger motifs. The prototype Zn-finger is a structural 
feature formed by a pair of Cys residues separated by two residues, then a run of 
12 amino acids, and finally a pair of His residues separated by three residues (Cys-x>- 
Cys-x »-His-x3-His). This motif may be repeated as many as 13 times over the primary 
structure of a Zn-finger protein. Each repeat coordinates a zinc ion via its two Cys 
and two His residues (Figure 29.32). The 12 or so residues separating the Cys and His 
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Figure 29.32 The Zn-finger motif of the C,H) type 
showing (a) the coordination of Cys and His residues to 
Zn and (b) the secondary structure. (c) Structure of a 
classic CH) zinc finger protein (Zif268) with three zinc 
fingers bound to DNA (PDB id = 1ZAA). 


coordination sites are looped out and form a distinct DNA interaction module, the so- 
called zinc finger. When Zn-finger proteins associate with DNA, each Zn-finger binds in 
the major groove and interacts with about five nucleotides, adjacent fingers interacting 
with contiguous stretches of DNA. Many DNA-binding proteins with this motif have 
been identified. In all cases, the finger motif is repeated at least two times, with at least a 
seven- to eight-amino acid linker between Cys/Cys and His/His sites. Proteins with this 
general pattern are assigned to the C,H; class of Zn-finger proteins to distinguish them 
from proteins bearing another kind of Zn-finger, the C, type, which includes the Cy and 
Cs Zn-finger proteins. The C, proteins have a variable number of Cys residues available 
for Zn chelation. For example, the vertebrate steroid receptors have two sets of Cys resi- 
dues, one with four conserved cysteines (C4) and the other with five (Cs). 


29.4d Some DNA-Binding Proteins Use a Basic 
Region-Leucine Zipper (bZIP) Motif 


bZIP is a structural motif characterizing the third major class of sequence-specific, 
DNA-binding proteins. This motif was first recognized by Steve McKnight in 
CIEBP (for CCAAT and enhancer-binding protein). The DNA-binding domain of 
C/EBP in the C-terminal region of the protein shows a notable absence of Pro resi- 
dues, suggesting it might be arrayed in an a-helix. Within this region are two clusters 
of basic residues: A and B. Further along is a 28-residue sequence. When this latter 
region is displayed end-to-end down the axis of a hypothetical a-helix, beginning at 
Leu*!5, an amphipathic cylinder is generated, similar to the one shown in Figure 6.22. 
One side of this amphipathic helix consists principally of hydrophobic residues (par- 
ticularly leucines), whereas the other side has an array of negatively and positively 
charged side chains (Asp, Glu, Arg, and Lys), as well as many uncharged polar side 
chains (glutamines, threonines, and serines). 


The Zipper Motif of bZ/P Proteins Operates Through Intersubunit Interaction of Leucine 
Side Chains The leucine zipper motif arises from the periodic repetition of leucine 
residues within this helical region. The periodicity causes the Leu side chains to pro- 
trude from the same side of the helical cylinder, where they can enter into hydrophobic 
interactions with a similar set of Leu side chains extending from a matching helix in 
a second polypeptide. These hydrophobic interactions establish a stable noncovalent 
linkage, fostering dimerization of the two polypeptides (as shown in Figure 29.33). The 
leucine zipper is not a DNA-binding domain. Instead, it functions in protein dimeriza- 
tion. Leucine zippers have been found in other mammalian transcriptional regulatory 
proteins, including Myc, Fos, and Jun. 
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Chelation is from the Greek word chele, meaning 
“claw”; it refers to the binding of a metal ion to two or 
more nonmetallic atoms in the same molecule. 


Basic region 
(DNA contact 
surface) 

Leucine zipper 
(dimerization motif) 


(0; C 


Figure 29.33 Model for a dimeric bZIP protein. 
Two bZIP polypeptides dimerize to form a Y-shaped 
molecule. The stem of the Y is the Leu zipper, and it 
holds the two polypeptides together. Each arm of 
the Y is the basic region from one polypeptide. Each 
arm is composed of two a-helical segments: BR-A 
and BR-B (basic regions A and B). 
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The Basic Region of bZIP Proteins Provides the DNA-Binding Motif The actual DNA 
contact surface of bZIP proteins is contributed by a 16-residue segment that ends 
exactly 7 residues before the first Leu residue of the Leu zipper. This DNA contact 
region is rich in basic residues and, hence, is referred to as the basic region. Two bZIP 
polypeptides join via a Leu zipper to form a Y-shaped molecule in which the stem of 
the Y corresponds to a coiled pair of a-helices held by the leucine zipper. The arms of 
the Y are the basic regions of each polypeptide; they act as a linked set of DNA con- 
tact surfaces (Figure 29.33). The dimer interacts with a DNA target site by situating 
the fork of the Y at the center of the dyad-symmetric DNA sequence. The two arms 
of the Y can then track along the major groove of the DNA in opposite directions, 
reading the specific recognition sequence (Figure 29.34). An interesting aspect of bZIP 
proteins is that the two polypeptides need not be identical (Figure 29.34). Heterodimers 
can form, provided both polypeptides possess a leucine zipper region. An important 
consequence of heterodimer formation is that the DNA target site need not be a palin- 


Figure 29.34 Model for the heterodimeric bZIP 
transcription factor c-Fos:c-Jun bound to a DNA : à - i > i 
oligomer containing the AP-1 consensus target dromic sequence. The respective basic regions of the two different bZIP polypeptides 


sequence TGACTCA (PDB id = 1FOS). (for example, Fos and Jun) can track along the major groove reading two different base 
sequences. Heterodimer formation expands enormously the DNA recognition and reg- 
ulatory possibilities of this set of proteins. 


How Are Eukaryotic Transcripts Processed 


29.5 
and Delivered to the Ribosomes for Translation? 


Transcription and translation are concomitant processes in prokaryotes, but in 
eukaryotes the two processes are spatially separated (see Chapter 10). Transcription 
occurs on DNA in the nucleus, and translation occurs on ribosomes in the cytoplasm. 
Consequently, transcripts must be transported from the nucleus to the cytosol to be 
translated. Before exiting the nucleus, these transcripts undergo processing: alterations 
that convert the newly synthesized RNAs, or primary transcripts, into mature mes- 
senger RNAs. Indeed, processing is underway even before transcription of the gene is 
terminated. Also, unlike prokaryotes, in which many mRNAs encode more than one 
polypeptide (that is, they are polycistronic), eukaryotic mRNAs encode only one poly- 
peptide (that is, they are exclusively monocistronic). 


29.5a Eukaryotic Genes Are Split Genes 


Most genes in higher eukaryotes are split into coding regions, called exons, and non- 
coding regions, called introns (Figure 29.35; see also Figure 10.19). Although the term 
exon is commonly used to refer to the protein-coding regions of an interrupted or split 
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Figure 29.35 The organization of a split eukaryotic gene. Processing of primary 
mRNA transcripts includes removal of all introns. Mature mRNAs lack introns. 
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gene, a more precise definition would specify exons as sequences that are represented 
in mature RNA molecules. This definition encompasses not only protein-coding genes 
but also the genes for various RNAs (such as tRNAs or rRNAs) from which interven- 
ing sequences must be excised to generate the mature gene product. Introns are the 
intervening nucleotide sequences that are removed from the primary transcript when it 
is processed into a mature RNA. Gene expression in eukaryotes entails not only tran- 
scription but also the processing of primary transcripts to yield the mature RNA mol- 
ecules we classify as mRNAs, tRNAs, rRNAs, and so forth. 


29.5b The Organization of Exons and Introns in Split 
Genes Is Both Diverse and Conserved 


Human genes contain from 0 to 300 introns, but, on average, they have about nine 
exons and eight introns. The yeast actin gene is a simple example, having only a single 
309-bp intron that separates the nucleotides encoding the first three amino acids from 
those encoding the remaining 350 or so amino acids in the protein. The chicken oval- 
bumin gene is composed of eight exons and seven introns. The two vitellogenin genes 
of the African clawed toad Xenopus laevis are both spread over more than 21 kbp of 
DNA; their primary transcripts consist of just 6 kbp of message that is punctuated by 
33 introns. The chicken pro a-2 collagen gene has a length of about 40 kbp; the coding 
regions constitute only 5 kb distributed over 51 exons within the primary transcript. 
The exons are quite small, ranging from 45 to 249 bases in size. 

Clearly, the mechanism by which introns are removed and multiple exons are 
spliced together to generate a continuous, translatable mRNA must be both precise 
and complex. If one base too many or too few is excised during splicing, the coding 
sequence in the mRNA will be disrupted. The mammalian DHFR (dihydrofolate 
reductase) gene is split into six exons spread over more than 31 kbp of DNA. The six 
exons are spliced together to give a 6-kb mRNA (Figure 29.36). Note that, in three dif- 
ferent mammalian species, the size and position of the exons are essentially the same 
but that the lengths of the corresponding introns vary considerably. Indeed, the lengths 
of introns in vertebrate genes range from a minimum of about 60 bases to more than 
10,000 bp. Many introns have nonsense codons in all three reading frames and thus 
are untranslatable. Introns are found in the genes of mitochondria and chloroplasts as 
well as in nuclear genes. Introns do occur in Eubacteria, Archaea, and even in bacterio- 
phage T4, but they are much rarer than in eukaryotes. 


29.5c Post-Transcriptional Processing of Messenger RNA Precursors 
Involves Capping, Methylation, Polyadenylylation, and Splicing 


The immediate product of eukaryotic transcription, the primary transcript, under- 
goes a series of covalent modifications collectively called processing. These modifica- 
tions include adding a GMP to the 5’-phosphate end, methylation on various bases and 
riboses at the 5’-end, addition of multiple A residues (polyadenylylation) to the 3’-end, 
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Figure 29.36 The organization of the mammalian 
DHER gene in three representative species. Note 
that the exons are much shorter than the introns. 
Note also that the exon pattern is more highly 
conserved than the intron pattern. 
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and removal of all introns (splicing). The result is the mature mRNA, which is exported 
from the nucleus for translation in the cytosol. 


Capping and Methylation of Eukaryotic mRNAs The protein-coding genes of eukaryotes 
are transcribed by RNA polymerase II to form primary transcripts or pre-mRNAs that 
serve as precursors to MRNA. Asa population, these RNA molecules are very large and 
their nucleotide sequences are very heterogeneous because they represent the transcripts 
of many different genes, hence the designation heterogeneous nuclear RNA, or hnRNA. 
Shortly after transcription of hnRNA is initiated, the 5'-end of the growing transcript 
is capped by addition of a guanylyl residue. This reaction is catalyzed by the nuclear 
enzyme guanylyl transferase using GTP as substrate (Figure 29.37). The cap structure is 
methylated at the 7-position of the G residue. Additional methylations may occur at the 
2'-O positions of the two nucleosides following the 7-methyl-G cap and at the 6-amino 
group of a first base adenine (Figure 29.38). S-Adenosylmethionine is the methyl donor 
in all these reactions. The ribose of first nucleotide after the 5-G cap is almost always 
2'-O-methylated (cap | structure), and this modification is sometimes found on the next 
(second) nucleotide as well. These modifications are important for recruitment of trans- 
lation initiation factors to the mRNA. Methylation of N-6 on A at the 1-position to 
yield N®,2'-O- makes the mRNA markedly more stable. Further, such methylations alter 
the local structure of pre-mRNA in a way that makes the RNA more accessible to pro- 
teins with single-stranded RNA-binding motifs. Such proteins are important players in 
pre-mRNA processing and maturation to yield a competent, translatable mRNA. 


Transcription Termination in Eukaryotes Transcription by RNA pol II typically continues 
past the 3’-end of the mature messenger RNA. Primary transcripts show heterogeneity 
in sequence at their 3’-ends, indicating that the precise point where termination occurs 
is nonspecific. However, termination does not normally occur until RNA polymerase IT 
has transcribed past a consensus AAUAAA sequence known as the polyadenylylation 
signal, encoded in the template DNA as TTATTT. 

RNA pol II continues to extend the transcript, but Ser>-P in its CTD is now dephos- 
phorylated. CPSF (cleavage and polyadenylation specificity factor) recognizes and 
binds to the AAUAAA polyadenylation signal in the pre-e MRNA (Figure 29.39). A 
second sequence element downstream from AAUAAA, the U/GU-rich region, binds 
CstF (cleavage stimulation factor). CstF interacts with CPSF, establishing a connec- 
tion between the AAUAAA and the U/GU-rich region. One of the 6 subunits of CPSF 
is an endoribonuclease and it cleaves the capped pre-mRNA between the AAUAAA 
signal and the U/GU-rich region, freeing it from the transcription machinery. Upon 
5'—3'- exonuclease cleavage of the transcript remnant clinging to the RNA pol IJ, the 
transcription machinery dissociates from the DNA. 


Guanylyl HN 
transferase . 


GTP 5'-end of transcript 5'-capped transcript 


Figure 29.37 The capping of eukaryotic pre-mRNAs. Guanylyl transferase catalyzes the addition of a 
guanylyl residue (G,) derived from GTP to the 5’-end of the growing transcript, which has a 
5'-triphosphate group already there. In the process, pyrophosphate (pp) is liberated from GTP 

and the terminal phosphate (p) is removed from the transcript. Gppp + pppApNpNpNp. ..—> 
GpppApNpNpNp... + pp + p (A is often the initial nucleotide in the primary transcript). 
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Figure 29.38 Methylation of several specific sites located at the 5'-end of eukaryotic pre-mRNAs is 
an essential step in mRNA maturation. A cap bearing only a single —CH; on the guanyl is termed 
cap 0. This methylation occurs in all eukaryotic mRNAs. If a methyl is also added to the 2’-O 
position of the first nucleotide after the cap, a cap 1 structure is generated. This is the predominant 
cap form in all multicellular eukaryotes. Some species add a third —CH; to the 2’-O position of the 
second nucleotide after the cap, giving a cap 2 structure. Also, if the first base after the cap is an 
adenine, it may be methylated on its 6-NH)>. In addition, approximately 0.1% of the adenine bases 
throughout the mRNA of higher eukaryotes carry methylation on their 6-NH; groups. 


3’-Polyadenylylation of Eukaryotic mRNAs Most eukaryotic mRNAs have 80 to 250 ade- 
nine residues attached at their 3’-end, the poly(A) tail. (Histone mRNAs are the only 
common mRNAs that lack poly(A) tails.) These A residues are not encoded in the DNA 
but are added post-transcriptionally by the enzyme poly(A) polymerase, using ATP as 
a substrate. The consensus AAUAAA is not itself the poly(A) addition site; instead, 
it defines the position where poly(A) addition occurs (Figure 29.39). The consensus 
AAUAAA is found 10 to 35 nucleotides upstream from where the nascent primary 
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Figure 29.39 Poly(A) addition to the 3'-ends of 
transcripts occurs 10 to 35 nucleotides downstream 
from a consensus AAUAAA sequence, defined as the 
polyadenylylation signal. CPSF (cleavage and 
polyadenylylation specificity factor) binds to this 
signal sequence and mediates looping of the 3’-end of 
the transcript through interactions with the cleavage 
stimulation factor (CstF) bound at a G/U-rich 
sequence even further downstream. The endonuclease 
subunit of CPSF then carries out cleavage of the 
transcript to create a new 3’-end 10 to 35 nucleotides 
downstream from the polyadenylylation signal. 
Poly(A) polymerase (PAP) then successively adds 

80 to 250 adenylyl residues to the new 3’-end. (RNA 
pol II is also part of the polyadenylylation complex 
at the 3’-end of the transcript, but for simplicity in 
illustration, its presence is not shown in the lower 
part of the figure.) 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1072 Chapter 29 Transcription and the Regulation of Gene Expression 


5'-Splice Site Consensus 
AG : GUAAGU 


Exon Intron 


3’-Splice Site Consensus 


Py Py Py Py Py Py Py Py-CAG : G—- 


Intron Exon 


Figure 29.40 Consensus sequences at the splice sites 
in vertebrate genes. 


pre-mRNA 5’ 


Figure 29.41 Splicing of mRNA precursors. A 
representative precursor mRNA is depicted. Exon 1 
and Exon 2 indicate two exons separated by an 
intervening sequence (or intron) with consensus 5’, 
3’, and branch sites. The fate of the phosphates at 
the 5’- and 3’-splice sites can be followed by tracing 
the fate of the respective ps. The products of the 
splicing reaction, the lariat form of the excised 
intron and the united exons, are shown at the 
bottom of the figure. 


transcript is cleaved by the CPSF endonuclease to generate a new 3’-OH end. This end 
is where the poly(A) tail is added. The processing events of mRNA capping, poly(A) 
addition, and splicing of the primary transcript create the mature mRNA. Significantly, 
both the guanylyl transferase that adds the 5’-cap structure and the enzymes that pro- 
cess the 3'-end of the transcript and add the poly (A) tail are anchored to RNA pol II 
via interactions with its CTD. 


29.5d Nuclear Pre-mRNA Splicing 


Within the nucleus, hnRNA forms ribonucleoprotein particles (RNPs) through asso- 
ciation with a characteristic set of nuclear proteins. These proteins interact with the 
nascent RNA chain as it is synthesized, maintaining the hnRNA in an untangled, 
accessible conformation. The capped primary transcript emerging from the RNA pol IT 
transcriptional apparatus is the substrate for splicing. Splicing requires both intron exci- 
sion and exon ligation. Splicing occurs exclusively in the nucleus. The mature mRNA 
that results is then exported to the cytoplasm to be translated. Splicing requires precise 
cleavage at the 5'- and 3’-ends of introns and the accurate joining of the two ends. 
Consensus sequences define the exon/intron junctions in eukaryotic mRNA precursors, 
as indicated from an analysis of the splice sites in vertebrate genes (Figure 29.40). Note 
that the sequences GU and AG are found at the 5’- and 3’-ends, respectively, of introns 
in pre-mRNAs from higher eukaryotes. In addition to the splice junctions, a conserved 
sequence within the intron, the branch site, is also essential to pre-mRNA splicing. 
The site lies 18 to 40 nucleotides upstream from the 3’-splice site and is represented in 
higher eukaryotes by the consensus sequence YNYRAY, where Y is any pyrimidine, R 
is any purine, and N is any nucleotide. 


29.5e The Splicing Reaction Proceeds via Formation of a Lariat Intermediate 


The mechanism for splicing nuclear mRNA precursors is shown in Figure 29.41. A 
covalently closed loop of RNA, the lariat, is formed by attachment of the 5'-phosphate 
group of the intron’s invariant 5'-G to the 2'-OH at the invariant branch site A to form 
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a 2'-5' phosphodiester bond. Note that lariat formation creates an unusual branched 
nucleic acid. The lariat structure is excised when the 3’-OH of the consensus G at the 
3'-end of the 5’ exon (Exon 1, Figure 29.41) covalently joins with the 5’-phosphate 
at the 5’-end of the 3’ exon (Exon 2). The reactions that occur are transesterification 
reactions where an OH group reacts with a phosphoester bond, displacing an —OH to 
form a new phosphoester link. Because the reactions lead to no net change in the num- 
ber of phosphodiester linkages, no energy input (for example, as ATP) is needed. The 
lariat product is unstable; the 2’-5’ phosphodiester branch is quickly cleaved to give a 
linear excised intron that is rapidly degraded in the nucleus. 


29.5f Splicing Depends on snRNPs 


The hnRNA (pre-mRNA) substrate is not the only RNP complex involved in the splic- 
ing process. Splicing also depends on a unique set of small nuclear ribonucleoprotein par- 
ticles called snRNPs (pronounced “snurps’’). In higher eukaryotes, each snRNP consists 
of a small RNA molecule 100 to 200 nucleotides long and about 10 different proteins. 
The RNAs found in snRNPs are uridine-rich. This quality leads to the U designation of 
the five prominent snRNPs as U1, U2, U4, 5, and U6 snRNP, as shown in Table 29.4. 
Four of the five snRNPs have a core set of seven Sm proteins (SmB/B’, SmD1, SmD2, 
SmD3, SmE, SmF, and SmG). The fifth, U1 SnRNP, has a related set of seven proteins 
that are designated LSm (for “like Sm”). Each snRNA has a single-stranded, U-rich RNA 
sequence, the Sm site, whose nucleotide sequence is AUUUUUG. The Sm proteins assem- 
ble as a very stable heteroheptameric ring around the Sm site, in the order SmE-SmG- 
SmD3-SmB-SmD1-SmD2-SmF. Thus, the snRNA passes through the central hole in the 
Sm protein heteroheptamer (Figure 29.42). Each of the snRNPs also binds its own set of 
unique proteins to complete its ribonucleoprotein structure. These major snRNP species 
are very abundant, present at greater than 100,000 copies per nucleus. In the U1 SnRNP, 
the UI snRNA folds into a secondary structure that leaves the 11 nucleotides at its 5’-end 
single-stranded. The 5’-end of U1 snRNA is complementary to the consensus sequence at 
the 5'-splice junction of the pre-mRNA (Figure 29.43), as is a region at the 5’-end of U6 
snRNA. The U2 SnRNP snRNA is complementary to the consensus branch site sequence. 


29.5g snRNPs Form the Spliceosome 


Figure 29.42 Structure of the core domain of the 
U4 SnRNP. The U4 snRNA (backbone indicated in 
orange) passes through the central hole in the 
heteroheptad Sm protein complex, SmG-SmD3- 
SmB-SmD1-SmD2-SmF. Each of the seven Sm 
proteins is a different color. (Adapted from Figure | 
in Leung, A. K. W., Nagai, K., and Li, J. 2011. 
Structure of the spliceosomal U4 snRNP core 
domain and its implication for snRNP biogenesis. 
Nature 473:536-539; PDB id = 2Y9A). 


| Table 29.4 | ye) The snRNPs Found 


in Spliceosomes 


snRNP Length (nt) Splicing Target 


Ul 165 5’ splice 
Splicing occurs when the various snRNPs interact with the pre-mRNA to form the U2 189 Branch 
spliceosome, a multi-megadalton complex the size of a ribosome. The spliceosome is y4 145 5’ splice, recruitment 
assembled de novo at splice sites. Among its many subunits are ATP-dependent RNA of branch point to 
a ser 5 115 meh 
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Figure 29.43 Mammalian U1 snRNA can be arranged in a secondary structure where its 5'-end is 
single-stranded and can base-pair with the consensus 5'-splice site of the intron. Newly transcribed 
pre-RNAs are vulnerable to premature degradation, and U1 snRNP helps to protect the pre-mRNA 
from such destruction. (Adapted from Figure | in Rosbash, M., and Seraphin, B., 1991. Who’s on 
first? The U1 snRNP-S’ splice site interaction and splicing. Trends in Biochemical Sciences 16:187.) 
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Figure 29.44 Events in spliceosome assembly. U1 
snRNP binds at the 5’-splice site, followed by the 
association of U2 snRNP with the VACUAA*C 
branch-point sequence. The triple U4/U6-U5 
snRNP complex replaces U1 at the 5’-splice site and 
directs the juxtaposition of the branch-point 
sequence with the 5’-splice site, whereupon U4 
snRNP is released. Lariat formation occurs, freeing 
the 3’-end of the 5'-exon to join with the 5’-end of 
the 3’-exon, followed by exon ligation. U2, U5, and 
U6 snRNPs dissociate from the lariat following exon 
ligation. 


Chapter 29 Transcription and the Regulation of Gene Expression 


In the course of each splicing reaction, the spliceosome cycles through eight distinct 
compositional states as various U-snRNPs associate, carry out their assigned roles, and 
then dissociate. Assembly of the spliceosome begins with the binding of Ul snRNP 
at the 5’-splice site of the pre-mRNA (Figure 29.44). Each subsequent step in spliceo- 
some assembly requires ATP-dependent RNA rearrangements catalyzed by spliceoso- 
mal DEAD-box ATPases/helicases. The branch-point sequence (UACUAAC in yeast) 
binds U2 snRNP, and then the triple snRNP complex of U4/U6-U5 replaces U1 at the 
5'-splice site. The substitution of base-pairing interactions between U1 and the pre- 
mRNA 5'-splice site by base-pairing between U6 and the 5’-splice site is just one of 
the many RNA rearrangements that accompany the splicing reaction. Base-pairings 
between U6 and U2 RNA bring the 5’-splice site and the branch point RNA sequences 
into proximity. Interactions between U2 and U6 lead to release of U4 snRNP. The spli- 
ceosome is now activated for catalysis. 

The spliceosome is a ribozyme: The transesterification reactions that achieve RNA 
splicing are catalyzed by the U6 snRNA. Mechanistically, transesterifications are 
metal-catalyzed reactions, and the U6 snRNA sugar-phosphate backbone provides a 
set of phosphoryl oxygen atoms to coordinate the Mg?* ions that mediate splicing. As 
shown in Figure 29.41, a transesterification reaction involving the 2’-O of the invariant 
A residue in the branch-point sequence displaces the 5'-exon from the intron, creating 
the lariat intermediate. The free 3'-O of the 5’-exon now triggers a second transesteri- 
fication reaction through attack on the P atom at the 3’-exon splice site. This second 
reaction joins the two exons and releases the intron as a lariat structure. In addition to 
the snRNPs, a number of proteins with RNA-annealing functions as well as proteins 
with ATP-dependent RNA-unwinding activity participate in spliceosome function. 
The spliceosome is thus a dynamic structure that uses the pre-mRNA as a template for 
assembly, carries out its transesterification reactions, and then disassembles when the 
splicing reaction is over. 


29.5h Alternative RNA Splicing Creates Protein Isoforms 


In one mode of splicing, every intron is removed and every exon is incorporated into 
the mature RNA without exception. This type of splicing, termed constitutive splic- 
ing, results in a single form of mature mRNA from the primary transcript. However, 
many eukaryotic genes can give rise to multiple forms of mature RNA transcripts. The 
mechanisms for production of multiple transcripts from a single gene include use of 
different promoters, selection of different polyadenylation sites, alternative splicing of 
the primary transcript, or even a combination of the three. 


Protein Isoforms Arise from Different Transcripts of the Same Gene Different transcripts 
from a single gene make possible a set of related polypeptides, or protein isoforms, each 
with slightly altered functional capability. Such variation serves as a useful mecha- 
nism for increasing the apparent coding capacity of the genome. Furthermore, alter- 
native splicing offers another level at which regulation of gene expression can operate. 
For example, mRNAs unique to particular cells, tissues, or developmental stages could 
be formed from a single gene by choosing different 5'- or 3’-splice sites or by omit- 
ting entire exons. Translation of these mature mRNAs produces cell-specific protein iso- 
forms that display properties tailored to the needs of the particular cell. Such regulated 
expression of distinct protein isoforms is a fundamental characteristic of eukaryotic cell 
differentiation and development. 


Fast Skeletal Muscle Troponin T Isoforms Are an Example of Alternative Splicing In addi- 
tion to many other instances, alternative splicing is a prevalent mechanism for generat- 
ing protein isoforms from the genes encoding muscle proteins (see Chapter 16), allowing 
distinctive isoforms aptly suited to the function of each muscle. An impressive manifes- 
tation of alternative splicing is seen in the expression possibilities for the rat fast skeletal 
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A Deeper Look 


Inteins—Bizarre Parasitic Genetic Elements Encoding a Protein-Splicing Activity 


Inteins are parasitic genetic elements found within protein-coding 
regions of genes. These selfish DNA elements are transcribed and 
translated along with the flanking host gene sequences. The typi- 
cal intein protein consists of two domains: One domain is capable 
of self-catalyzed protein splicing; the other is an endonuclease that 
mediates the insertion of the intein nucleotide sequence into host 
genes. After the full protein is synthesized, the intein catalyzes 
excision of itself from the host protein and ligation of adjacent 
host polypeptide regions to form the functional protein that the 
host gene encodes. These adjacent polypeptides are termed exteins 
(external proteins) to distinguish them from the intein (internal pro- 
tein). Inteins have been found across all domains of life—archaea, 
bacteria, and eukaryotes—although thus far only in unicellular 
organisms. Inteins vary in size from about 130 to 600 amino acid 
residues. The protein splicing function of inteins is found in its N- 
terminal and its C-terminal regions; the endonuclease function that 
carries out parasitic insertion of the intein sequence into host genes 
is found in the central part of the intein. Splicing of the protein is an 
intramolecular process that liberates the intein sequence and ligates 
the host protein sequences (see accompanying figure for the basic 
reaction mechanism). 

Inteins are usually found as inserts in highly conserved host 
genes that have essential functions, such as genes encoding DNA or 
RNA polymerases, proton-translocating ATPases, or other vital 
metabolic enzymes. Their location in such genes means that evolu- 
tionary loss of the intein via deletion or genetic rearrangement is 
more difficult. The endonuclease activity of the intein recognizes a 
14- to 40-bp sequence in a potential host gene and cleaves the DNA 
there. During repair of the double-stranded DNA break, the intein 
gene is copied into the cleavage site, thereby establishing the para- 
sitic genetic element in the host gene. 


Transcription and translation of the combined intein-host gene 
flanking sequences leads to synthesis of a fused intein—extein protein. 
The intein splices itself out when (1) the C-terminal residue of the 
N-extein is shifted to the O (or S) atom of a neighboring intein Ser (or 
Cys) residue, (2) the N-extein C-terminal carbonyl undergoes 
nucleophilic attack by the O (or S) atom of a Ser (or Cys) residue at 
the end of the C-extein in a transesterification reaction that creates a 
branched protein intermediate, (3) cyclization of the intein C-terminal 
asparagine residue excises the intein, and (4) the two exteins are 
properly united via a peptide bond when the N-extein C-terminus 
spontaneously shifts to the C-extein N-terminus to form an intact host 
protein. An example of a protein-coding gene containing an intein is 
the gene for Prp8, a core subunit of spliceosomes. Prp8 mediates 
rearrangements of the spliceosome catalytic core. 
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Fast skeletal troponin T gene and spliced mRNAs 
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Figure 29.45 Organization of the fast skeletal muscle troponin T gene and the 64 possible mRNAs 
that can be generated from it. Exons are constitutive (yellow), combinatorial (green), or mutually 
exclusive (blue or orange). Exon | is composed of 5’-untranslated (UT) sequences, and Exon 18 
includes the polyadenylation site (AATAAA) and 3’-UT sequences. The TATA box indicates the 
transcription start site. The amino acid residues encoded by each exon are indicated below. Many 
exon-intron junctions fall between codons. The sawtooth exon boundaries indicate that the splice site 
falls between the first and second nucleotides of a codon, the concave/convex exon boundaries indicate 
that the splice site falls between the second and third nucleotides of a codon, and flush boundaries 
between codons signify that the splice site falls between intact codons. Each mRNA includes all 
constitutive exons, | of the 32 possible combinations of Exons 4 to 8, and either Exon 16 or 17. 


muscle troponin T gene (Figure 29.45). This gene consists of 18 exons, 11 of which are 
found in all mature mRNAs (exons | through 3, 9 through 15, and 18) and thus are 
constitutive. Five exons, those numbered 4 through 8, are combinatorial in that they may 
be individually included or excluded, in any combination, in the mature mRNA. Two 
exons, 16 and 17, are mutually exclusive: One or the other is always present, but never 
both. Sixty-four different mature mRNAs can be generated from the primary transcript 
of this gene by alternative splicing. Because each exon represents a cassette of genetic 
information encoding a segment of protein, alternative splicing is a versatile way to 
introduce functional variation within a common protein theme. 


29.51 RNA Editing: Another Mechanism That Increases 
the Diversity of Genomic Information 


RNA editing is a process that changes one or more nucleotides in an RNA transcript 
by deaminating a base, either A—I (adenine to inosine, through deamination at the 
6-position in a purine ring) or C—>U (cytosine to uracil, through deamination at the 
4-position in a pyrimidine ring). These changes alter the coding possibilities in a tran- 
script. An A-to-I change leads to an interpretation of I as G during translation when I 
pairs with C. The C-to-U change replaces the previous C:G pairing with U:A pairing at 
that position. RNA editing has the potential to increase protein diversity by (1) altering 
amino acid coding possibilities, (2) introducing premature stop codons, or (3) changing 
splice sites in a transcript. If RNA splicing is cutting-and-pasting, then these single- 
base changes are aptly termed RNA editing. 

A-to-I editing is carried out by adenosine deaminases that act on RNA (the ADAR 
family of RNA-editing enzymes). ADARs act only on double-stranded regions of 
RNA. Typically, such regions form when an exon region containing an A to be edited 
base-pairs with a complementary base sequence in an intron known as the editing site 
complementary sequence, or ECS. ADARs are abundant in the nervous system of 
animals. A prominent example of RNA editing occurs in transcripts encoding mam- 
malian glutamate receptors (GluRs). Deamination of the GluR-B gene transcript 
changes a glutamine codon CAG to CIG, which is read by the translational machinery 
as an arginine codon (CGG), dramatically altering the conductance properties of the 
membrane receptor produced from the edited transcript, as compared to the receptor 
produced from the unedited transcript. 

C-to-U editing is carried out on single-stranded regions of transcripts by an edito- 
some core structure consisting of a cytosine deaminase and an adapter protein that 
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brings the deaminase and the transcript together. A prominent example of C-to-U edit- 
ing targets a single C residue in a 14-kb transcript encoding the 4536-residue apolipo- 
protein B100 protein (see Chapter 24). ApoB RNA editing changes codon 2153 (a CAA 
[glutamine] codon) to a UAA stop codon, which leads to a shortened protein product, 
ApoB48, consisting of the N-terminal 48% of apoB100. In humans, apoB100 is made 
in the liver and found in liver-derived VLDL serum lipoprotein complexes. In contrast, 
apoB48 is made in intestinal cells and found in intestinal-derived lipid complexes. 


29.6 | Can Gene Expression Be Regulated Once 
the Transcript Has Been Synthesized? 


Much emphasis has been placed on the regulation of gene expression through mecha- 
nisms at the gene promoter that determine whether or not a gene will be transcribed. 
In reality, many protein-coding genes are transcribed into RNA but their transcripts 
are never translated to form proteins. Indeed, post-transcriptional regulation of gene 
expression is a pervasive form of regulation, affecting virtually every cellular process 
from metabolism to development and cancer. MicroRNAs (miRNAs; see Chapter 10) 
are important players in post-transcriptional gene regulation. 


29.6a miRNAs Are Key Regulators in Post-Transcriptional Gene Regulation 


miRNAs are a large family of small (~21 nucleotides), noncoding RNAs found in ani- 
mals, plants, and protists. Humans have about 2300 miRNA genes and they are pre- 
dicted to target the expression of about 60% of all protein-coding genes. miRNAs are 
synthesized mainly by RNA pol II transcription. Their coding sequences are found in 
intergenic regions of DNA, in the introns of protein-coding genes, and even in exons. 


miRNA Synthesis and Processing miRNAs are synthesized as pri-miRNAs (primary miR- 
NAs), primary transcripts hundreds or thousands of nucleotides long that are processed 
in the nucleus by the microprocessor protein complex. Microprocessor is composed of 
Drosha (an RNase III family member), a dimer of the dsR NA-binding protein DGCR8, 
and other factors. Microprocessor converts the pri-miRNAs into 70-nucleotide stem- 
loop structures called pre-miRNAs (precursor miRNAs). Pre-miRNAs are exported to 
the cytosol, where Dicer (Chapter 12) processes the stem of the pre-miRNA to yield an 
approximately 20-nucleotide-long miR NA/miRNA* duplex (the * denotes the strand that 
is discarded). The miR NA strand represents the mature miRNA. 


miRNA Mechanism of Action The mature miRNA is incorporated into a miRNA- 
induced silencing complex (miRNA-RISC) through interaction with AGO2 
(Figure 29.46). AGO2 contains three domains: PAZ, MID (which recognizes the 
5'-end of the miRNA), and PIWI, the site of an RNase H-like enzyme activity. The 
5'-terminal end of the miRNA binds in a deep pocket of AGO2 at the MID-PIWI 
junction. The PIWI domain also interacts with GW182, a 182-kDa protein having 
a glycine- and tryptophan-rich (GW) N-terminal domain and an RNA-recognition 
motif (RRM) at its C-terminus. AGO2 and GW182 are the primary proteins in the 
miRNA-RISC, but they recruit additional proteins, such as PABP, poly(A)-bind- 
ing protein (see Figure 30.29), a protein that binds to the poly(A) tail of mRNAs. 
miRNA-RISC mediates the interaction of the miRNA with its mRNA targets. 

In most cases, miRNAs target the 3'-untranslated region (3'-UTR) of the mRNAs 
they regulate. As the key site for post-transcriptional gene regulation, the 3’-UTR can 
be viewed as the post-transcriptional equivalent of the gene promoter. It is here that 
the macromolecular interactions underlying post-transcriptional regulatory decisions 
take place. In almost every case, the miRNA base-pairs imperfectly with its target 
sequence in the 3’-UTR of the mRNAs it regulates. Effective mRNA targeting requires 
base-pairing of nucleotides 2 to 8 of the miRNA with the mRNA. Nucleotides 2 to 8 
are termed the seed region. Because the RNase function of AGO2 cleaves only perfectly 
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Figure 29.46 Domain organization of human Argonaute 2 (AGO2), GW182, and poly(A)-binding 
protein (PABP). The AGO2 PAZ (Piwi-Argonaute-Zwilli) domain is found in other proteins involved in 
post-transcriptional gene silencing. The GW-repeat domain of GW182 interacts with AGO2. The 
C-terminal GW182 domain has an RNA recognition motif (RRM). PABP has four RRM domains and 
a conserved C-terminal domain, PABC. Mammalian PABP binds directly to the silencing region of 
human GW182 proteins via PABC. (Adapted from Figure 1 in Fabian, M. R., Sonenberg, N., and 
Filipowicz, W., 2010. Regulation of mRNA stability and translation by miRNAs. Annual Review of 
Biochemistry 79:351-379.) 


paired RNA duplexes, miRNA-RISC interaction with the mRNA does not result 
directly in RNA degradation. Further, because the miRNA-mRNA base-pairing is 
not perfect, miRNAs are non-specific. They can interact with the 3’-UTRs of multiple 
mRNA targets; keep in mind that these 3’-UTRs are very extensive, 700 nucleotides or 
more in human mRNAs, affording many opportunities for miRNA interactions. 


miRNA Achieves Post-Transcriptional Gene Silencing in Two Ways miRNA interaction 
with a transcript prevents its translation into protein, a phenomenon termed post- 
transcriptional gene silencing because the gene will not be expressed. A key aspect of 
eukaryotic mRNA translation is closing the loop: the formation of an mRNA circle 
maintained through RNA:protein interactions. Closing the loop is achieved through 
translation factors binding at the capped 5’-end of the mR NA and then interacting with 
PABP bound to the poly(A) tail of the mRNA. Because of the proximity of the 5'- and 
3’-ends of the mRNA when the loop is closed, miR NA-RISC binding at the 3’-UTR 
can interfere with recruitment of ribosomes to translate the mRNA, a mode of miRNA 
action referred to as translation repression. In addition, miR NA-RISC complexes desta- 
bilize mRNAs through deadenylylation at their 3’-ends, decapping at their 5’-ends, and 
subsequent 5’—>3’ exonucleolytic degradation. The miRNA-RISC complex disrupts the 
closed loop through competing interactions with PABP, exposing the mRNA to decap- 
ping enzymes and exonucleases. Thus, the second mode of miRNA action is promoting 
mRNA decay. 


IncRNAs Long noncoding RNAs (IncRNAs) are a class of RNA molecules defined 
by their lack of any protein-coding potential and a length greater than 200 nucleo- 
tides. IncRNAs are very abundant, numbering in the tens of thousands in mammals. 
IncRNAs resemble mRNA transcripts and indeed arise from RNA pol II transcrip- 
tion of DNA, either via divergent transcription or transcription of DNA regions 
between genes (so-called intergenic regions). Potentially, IncR NAs can base-pair with 
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complementary regions in DNA and/or interact with RNA-binding proteins. Never- 
theless, despite much research, it remains an open question whether or not IncRNAs 
have any biological function. 


29.6b Targeted RNA Degradation by Exosomes 


The amount of specific mRNAs or proteins present in a cell at any time represents 
a balance between the rates of macromolecular synthesis and degradation. Regulated 
degradation of mRNAs (discussed here) and proteins is a rapid and effective way to 
control the cellular levels of these macromolecules. Because indiscriminate degradation 
of RNAs and proteins could have detrimental consequences within the cell, such deg- 
radation typically is compartmentalized. Targeted degradation of RNAs and proteins 
is enclosed within ringlike or cylindrical macromolecular complexes—the exosome’ for 
RNA and the proteasome for proteins (see Chapter 31). The catalytically active compo- 
nent of an exosome is an RNase PH family member that processively degrades RNA 
in the 3'—5' direction. In addition to this exonucleolytic activity, RNA endonuclease 
activity has been found in exosomes. 

Exosomes have a fundamental structural pattern: a ring of six subunits surrounding 
a central cavity, with one or more of the subunits having RNase PH activity. Superim- 
posed on top of the hexameric RNase PH ring are three different proteins with S1 RNA- 
binding domains (RBD proteins), as shown in Figure 29.47. Collectively, these nine proteins 
constitute the exosome complex. Depending on the cellular compartment—nucleus, 
nucleolus, or cytosol—other proteins associate with the exosome complex. These pro- 
teins determine the substrates degraded—rRNA, mRNA, tRNA, or any of the variety 
of small RNAs. RNAs to be degraded are threaded into the central cavity. The architec- 
ture of the exosome restricts substrate access and compartmentalizes the R Nase activity 
so that indiscriminate degradation of cellular RNase is avoided. Only RNAs targeted 
to the exosome are destroyed. The exosome is a key player in RNA quality control, 
ensuring that misprocessed RNAs and ribonucleoprotein complexes are degraded. 


Cell biologists use the word “exosome” to refer to cell-derived vesicles found in biological fluids, so 
some confusion surrounds the use of this term. 


Figure 29.47 Structure of the human exosome 
complex (PDB id = 2NN6). The exosome complex is 
composed of nine different polypeptide chains. A 
hexameric ring of polypeptides (each a different 
color in this image) surrounds a central cavity. This 
cavity is capped by a set of three S1 RBD proteins 
(all colored hot pink here). The human exosome core 
is catalytic inactive, but it serves as a platform for the 
association of additional subunits that have 
3’-exonuclease activity. 
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29.7 | Can We Propose a Unified Theory of Gene Expression? 


The stages of eukaryotic gene expression—from transcriptional activation, transcrip- 
tion, transcript processing, nuclear export of mRNA, to translation—have tradi- 
tionally been presented as a linear series of events, that is, as a pathway of discrete, 
independent steps. However, it now is clear that each stage is part of a continuous pro- 
cess, with physical and functional connections between the various transcriptional and 
processing machineries. This realization led George Orphanides and Danny Reinberg 
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Figure 29.48 A unified theory of gene expression. Each step in gene expression, from transcription to 
translation, is but a stage within a continuous process. RNAP stands for RNA polymerase II. Each 
stage is physically and functionally connected to the next, ensuring that all steps proceed in an 
appropriate fashion and overall regulation of gene expression is tightly integrated. (Adapted from 
Figure 2 in Orphanides, G., and Reinberg, D., 2002. A unified theory of gene expression. Cell 
108:439-451.) 


NUCLEUS 
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to propose a unified theory of gene expression. The principal tenet of this theory is 
that eukaryotic gene expression is a continuous process, from transcription through 
processing and protein synthesis: DNA—RNA->protein (Figure 29.48). Furthermore, 
regulation occurs at multiple levels in this continuous process, in a coordinated fashion. 
Tom Maniatis and Robin Reed provide additional support for this theory by pointing 
out that eukaryotic gene expression depends on an interacting network of multicom- 
ponent protein machines—nucleosomes, HATs, and the remodeling apparatus; RNA 
polymerase II and its associated factors, which include capping, splicing, and polyad- 
enylylation enzymes; and the proteins involved in mRNA export to the cytoplasm for 
translation on ribosomes, the topic of the next chapter. Translation is not inevitable. If 
a noncoding RNA base-pairs with the mRNA, translation will be thwarted. For exam- 
ple, base pairing with a microRNA will result in gene silencing (see Chapter 10 and 
Section 29.6a). Base pairing with a small interfering RNA (siRNA, Chapter 10) leads 
to gene knockdown by RNAi through mRNA destruction by the RISC protein com- 
plex (see Figure 12.21). Recall that gene silencing is a post-transcriptional regulatory 
mechanism that prevents translation of an mRNA, whereas RNAi carries out post- 
transcriptional destruction of mRNAs. 

It should be mentioned that eukaryotic cells have elaborate systems for mRNA sur- 
veillance; these systems destroy any messages containing errors, such as the nonsense 
mediated decay (NMD) system. NMD degrades any message that has premature non- 
sense, or stop, codons. Furthermore, the regulation of mRNA levels through selective 
destruction provides another mechanism for the post-transcriptional regulation of 
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gene expression. 


Foundational Biochemistry Things You Should Know after Reading Chapter 29. 


Transcription of operons is controlled through induction and repression. 


Operons are units of gene expression that consist of structural genes 


lac repressor is a negative regulator of the lac operon. 
CAP is a positive regulator of the lac operon. 
Negative and positive control systems are fundamentally different. 


The araBAD operon is both positively and negatively controlled by 


The trp operon is repressed if the Trp repressor protein binds 


Attenuation, a prokaryotic mechanism for post-transcriptional gene 
regulation, is achieved through formation of alternative secondary 


DNA:protein interactions and protein:protein interactions are essential 


Proteins that activate transcription exert their effects through 
protein:protein contacts with RNA polymerase. 


DNA looping allows multiple DNA-binding proteins to convene at 


RNA polymerase I transcribes rRNA genes. 
RNA polymerase II transcribes protein-coding genes. 


RNA polymerase III transcribes tRNA genes and other small 
RNA genes, including 5S rRNA. 


RNA polymerase II is composed of 12 different subunits. 
The C-terminal domain (CTD) of the RPB1 subunit of RNA poly- 


e All RNAs are synthesized from DNA templates by DNA-directed © 
RNA polymerases. é 
e The synthesis of RNA from DNA templates is called transcription. and regulatory genes. 
e Prokaryotes have a single version of RNA polymerase. ° 
e RNA is synthesized in the 5'—>3' direction, as RNA polymerase © 
moves along the DNA in a 3'—>5' direction. ë 
e The strand of DNA that is transcribed by RNA polymerase is called a 
the template strand. AtaC. 
e The nucleotide sequence of the RNA transcript is identical to the e 
nucleotide sequence of the nontemplate DNA strand (except with U tryptophan. 
instead of T). JP Spaan 
e 
e Bacterial RNA polymerase has an œ ßß'o subunit composition. 
e The o-subunit recognizes gene promoters, which identify the tran- structures in the trp operon transcript. 
scription start site for prokaryotic genes. š 
e Transcription has four stages: in transcription regulation. 
RNA polymerase holoenzyme binding to DNA at promoter sites. e 
Initiation of polymerization. 
Elongation by the œB’ RNA polymerase core. e 
Termination of polymerization. the promoter. 
e Bacterial promoters contain two consensus sequence elements: the ° Eukaryotes have three RNA polymerases: 
Pribnow box and the —35 region. 
e Initiation of polymerization begins formation of an open promoter 
complex. 
e Two transcription termination mechanisms operate in bacteria: Rho 
(p)-mediated termination and intrinsic termination at specific termi- e 
nation sites. 
e 
e 


Bacterial genes are often grouped together in operons. 


merase II contains multiple repeats of the amino acid sequence 
YSPTSPS. 
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e The CTD multiple YSPTSPS repeats provide multiple phosphoryla- 
tion sites. 


e A phosphorylation code controls RNA polymerase II, regulating 
mRNA synthesis and processing. 


e The regulation of gene expression is more complex in eukaryotes 
compared to prokaryotes. 


e Efficient gene transcription by RNA pol II depends on interactions 
with proteins bound at two cis-regulatory elements (CREs): promot- 
ers near the transcription start site (TSS) and more distantly located 
enhancers. 


e About 70% of the human genome is transcribed, even though pro- 
tein-encoding genes represent only 2% of the genome. 


e — Inr is the site of transcription initiation. 


e Typical eukaryotic promoters contain modules of short, conserved 
sequences, such as the TATA box and DPEs. 


® = The location of enhancers (also called upstream activating sequences) 
is not fixed relative to the promoter. 


e Many genes are subject to multiple regulatory influences achieved 
through an array of different CREs. 


e Transcription initiation by RNA pol II requires TBP and the GTFs. 


e Formation of the pre-initiation complex (PIC) and promoter opening 
establishes an initiation complex. 


e Transcription initiation also depends on interactions between RNA 
pol II and Mediator. 


e Mediator also functions in repression of transcription. 


e Chromatin-remodeling complexes and histone-modifying enzymes 
alleviate repression due to nucleosomes. 


e Chromatin-remodeling complexes are huge, ATP-dependent enzymes 
that loosen DNA-protein interactions in nucleosomes. 


e The various covalent modifications of histones can modulate histone- 
DNA interactions and create binding sites for recruitment of addi- 
tional proteins. 


e Methylation and phosphorylation act as a binary switch that regu- 
lates chromatin dynamics. 


e Chromatin deacetylation leads to gene repression. 


© Gene regulatory proteins recognize specific DNA sequences through 
DNA:protein interactions that may or may not involve direct readout 
of base sequence. 


The three prominent motifs in DNA-binding proteins include the 
HTH (helix-turn-helix), the Zn finger, and the bZ/P. 


Eukaryotic genes are split into coding regions (exons) and noncoding 
regions (introns). 


Post-transcriptional processing of eukaryotic genes includes capping, 
methylation, 3’-polyadenylylation, and splicing. 


Intron excision by the splicing reaction proceeds through formation 
of a lariat intermediate. 


Splicing depends on a set of snRNPs, each of which contains in its 
core a specific 100- to 200-nucleotide U-rich RNA chain and a het- 
eroheptamer of Sm proteins. 


The spliceosome is a metal ion—dependent ribozyme. 


Spliceosomes also contain DEAD-box ATPases/helicases that rear- 
range transcript secondary structure so that the splicing reaction can 
occur. 


Alternative splicing dramatically increases the number of protein iso- 
forms possible from a single transcript. 


RNA editing through AI or C—U deamination reactions also 
increases the diversity of proteins encoded by a transcript. 


Post-transcriptional regulation of gene expression affects virtually all 
cellular processes. 


miRNAs are key regulators in post-transcriptional gene expression. 


miRNAs are transcribed from intergenic regions of DNA, introns, 
and even exons. 


miRNAs act as part of miRNA-RISC complexes. 


The miRNAs in miRNA-RISC complexes base-pair imperfectly with 
sequences in the 3’-UTR of mRNA. 


miRNAs lead to either transcription repression or mRNA decay. 


IncRNAs are an abundant class of RNA transcripts whose biological 
significance is unclear. 


A unified theory of gene expression describes the events through the 
sequential participation of multicomponent molecular machines: 
chromatin decompaction, regulated transcription through RNA 
polymerase II-Mediator interactions, capping, methylation, poly- 
adenylylation, splicing, mRNA packaging and export from the 
nucleus, and either translation by ribosomes, destruction by RNAI, or 
silencing by miRNAs in the cytosol. 


The RNA exosome degrades RNAs. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Template-Transcript Relationships The 5’-end of an mRNA has the 
sequence 


.. AGAUCCGUAUGGCGAUCUCGACGAAGACUC- 
CUAGGGAAUCC... 


What is the nucleotide sequence of the DNA template strand from 
which it was transcribed? If this mRNA is translated beginning with the 
first AUG codon in its sequence, what is the N-terminal amino acid 
sequence of the protein it encodes? (See Table 30.1 for the genetic code.) 


2. The Events in Transcription Initiation Describe the sequence of events 
involved in the initiation of transcription by E. coli RNA polymerase. 
Include in your description all the features a gene must have for 
proper recognition and transcription by RNA polymerase. 


3. 


Substrate Binding by RNA Polymerase RNA polymerase has two 
binding sites for ribonucleoside triphosphates: the initiation site and 
the elongation site. The initiation site has a greater K, for NTPs than 
the elongation site. Suggest what possible significance this fact might 
have for the control of transcription in cells. 


Comparison of Prokaryotic and Eukaryotic Transcription Make a list of 
the ways that transcription in eukaryotes differs from transcription in 
prokaryotes. 


Transcription Components For each of the following items, give their 
function in transcription and associated processes. 

a. a,BB'o 

b. TATAAT 

c. RPBI1 through 12 

d. (YSPTSPS)o7 or 52 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


10. 


T. 


12. 


13. 


14. 


15. 


16. 


17. 


CREs 

. MREs 
g. GREs 
h. TBP 
KMTs 
j. HTH 
k. Cys-x2-Cys-x}2-His-x3-His 
l. AAUAAA 


pas 


. DNA Recognition by DNA-Binding Proteins DNA-binding proteins 


may recognize specific DNA regions either by reading the base 
sequence or by shape readout. How do these two modes of 
protein : DNA recognition differ? 


. The Size of Gene Promoters (Integrates with Chapter 11.) The 


metallothionein promoter is illustrated in Figure 29.26. How long is 
this promoter, in nm? How many turns of B-DNA are found in this 
length of DNA? How many nucleosomes (approximately) would be 
bound to this much DNA? (Consult Chapter 11 to review the 
properties of nucleosomes.) 


. Heterodimer Formation by bZIP Proteins Describe why the ability of 


bZIP proteins to form heterodimers increases the repertoire of genes 
whose transcription might be responsive to regulation by these proteins. 


. Alternative Splicing Possibilities Suppose exon 17 were deleted from 


the fast skeletal muscle troponin T gene (Figure 29.45). How many 
different mRNAs could now be generated by alternative splicing? 
Suppose that exon 7 in a wild-type troponin T gene were duplicated. 
How many different mRNAs might be generated from a transcript of 
this new gene by alternative splicing? 


Alternative Splicing Possibilities Determine the number of possible 
transcripts if a gene contains five, six, or seven introns. 


Histone Modifications and Histone:DNA Interactions Figure 29.29 
illustrates some of the various covalent modifications that occur on 
histone tails. How might each of these modifications influence DNA: 
histone interactions? 


Histone Modification Lysine residues in proteins are typically 
positively charged. Methylation of the amino group on the side chain 
of lysine does not abolish this positive charge. How then does 
methylation affect the interaction possibilities of these lysine residues? 


Inducer Binding to lac Repressor (Integrates with Chapter 15.) Predict 
from Figure 29.12 whether the interaction of /ac repressor with 
inducer might be cooperative. Would it be advantageous for inducer 
to show cooperative binding to /ac repressor? Why? 


Post-transcriptional Modification of Eukaryotic mRNAs What are the 
advantages of capping, methylation, and polyadenylylation of 
eukaryotic mRNAs? 


The RNA Polymerase Mechanism of Action (Integrates with Chapter 
28.) Figure 29.24 shows only one Mg?t ion in the RNA polymerase 
II active site. However, more recent studies reveal the presence of two. 


Why is the presence of two Mg?" ions significant? 


Chromatin Remodeling Exposes DNA (Integrates with Chapter 11.) 
The SWI/SNF chromatin-remodeling complex peels about 50 bp from 
the nucleosome. Assuming B-form DNA, how long is this DNA 
segment? In forming nucleosomes, DNA is wrapped in turns about 
the histone core octamer. What fraction of a DNA turn around the 
core octamer does 50 bp of DNA comprise? How does 50 bp of DNA 
compare to the typical size of eukaryotic promoter modules and 
response elements? 


RNA Splicing Branch Sites Section 29.5d discusses the conserved 
sequences at branch sites in RNA molecules that undergo splicing. 
Consider the sequences shown below and indicate which ones are 
branch sites. 


a CUUGAC 
b CGUAGU 


18. 


19. 


20. 


N 
—_ 


22. 


23. 


24. 


25. 


26. 
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Problems 


ce UUUAAU 
d. TCCAAC 
e GGUAAU 
£ UACGAG 
g. UGCGAC 


The Lariat Intermediate in RNA Splicing Draw the structures that 
comprise the lariat branch point formed during mRNA splicing: 
the invariant A, its 5'-R neighbor, its 3’-Y neighbor, and its 2’-G 
neighbor. 


HTH DNA-Binding Proteins and B-DNA Dimensions (Integrates with 
Chapters 6 and 11.) The a-helices in HTH (helix-turn-helix motif) 
DNA-binding proteins are formed from seven- or eight—amino acid 
residues. What is the overall length of these a-helices? How does their 
length compare with the diameter of B-form DNA? 


Separating RNA from DNA (Integrates with Chapter 4.) During the 
transcription process, the emerging RNA molecule is separated from 
the DNA template by RPB1 (see Section 29.3b). Residue Phe?” of 
RPB1 is responsible for this separation. Explain why Phe is ideal for 
this purpose. Which other amino acids might replace Phe in carrying 
out this function? 


. Exploring the Structure of RNA Polymerase Bacteriophage T7 RNA 


polymerase bound to two DNA strands and an RNA strand, as 
shown in pdb file MSW, provides a glimpse of transcription. View 
this structure at https://www.rcsb.org to visualize how the template 
DNA strand is separated from the nontemplate strand and tran- 
scribed into an RNA strand. Which Phe residue of the enzyme plays a 
significant role in DNA strand separation? In which domain of the 
polymerase is this Phe located? (You might wish to consult Yin, Y. W., 
and Steitz, T. A., 2002. Structural basis for the transition from 
initiation to elongation transcription in T7 RNA polymerase. Science 
298:1387-1395 to confirm your answer.) 


Structure of a-Amanitin (Integrates with Chapter 4.) Figure 29.23 
shows the structure of the cyclic polypeptide a-amanitin. Which 
five amino acids or derivatives of amino acids are present in this 
structure? 


a-Amanitin Inhibition of RNA Polymerase II RNA polymerase II is 
inhibited by a-amanitin. This mushroom-derived toxin has no effect 
on the enzyme’s affinity for NTP substrates, but it dramatically slows 
polymerase translocation along the DNA. Go to Attps://www.resb.org 
to view pdb file 1K 83, which is the structure of RNA polymerase II 
with bound a-amanitin. Locate a-amanitin within this structure, and 
discuss why its position is consistent with its mode of inhibition. 


Exploring the Structure of a DNA-Bound bZIP Transcription Factor 
C/EBPS is a bZIP transcription factor in neuronal differentiation, 
learning and memory process, and other neuronal and glial functions. 
The structure of the bZIP domain of C/EBPB bound to DNA is 
shown in pdb file 1GU4. Explore this structure to discover the leucine 
zipper dimerization domain and the DNA-binding basic regions. On 
the left side of the https://www.rcsb.org 1GU4 page under “Display 
Files,” click “pdb file” to see the atom-by-atom coordinates in the 
three-dimensional structure (scroll down past “Remarks” to find this 
information). Toward the end of this series, find the amino acid 
sequence of the C/EBP8 domain used in this study. Within this amino 
acid sequence, find the leucine residues of the leucine zipper and the 
basic residues in the DNA-binding basic region. 


Protein:DNA Interactions with Catabolite Activator Protein Figure 
29.15 highlights in red the DNA phosphate oxygen atoms. Some of 
them interact with catabolite activator protein (CAP). What kind of 
interactions do you suppose predominate, and what kinds of CAP 
amino acid side chains might be involved in these interactions? 


Chromatin Decompaction and Gene Expression Chromatin decom- 
paction is a preliminary step in gene expression (Figure 29.48). How 
is chromatin decompacted? 
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Think-Pair-Share Question 
Splicing of RNA Consider the RNA sequence shown. Using the informa- 
tion discussed in Sections 29.5d and 29.5e: 
a. Identify possible 5’ splicing sites, branch sites, and 3’ splicing 
sites, 
b. Suggest possible splicing products from this RNA. 


S'ACUUGACAGUAGGUAAGUUAGGCCU 
AUCAUGAUCGAGGUAAGUCGAGUCUC 
UUCUACAACCUCAGGAGCGAUGUCCUC 
CAGGCGGACUACGACCUGCGAGUGCA 
GCCCUUCCAGGAUGACGUAGCGAUGA 
C3: 
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Georg Gerster/Science Source 


Protein Synthesis 


Essential Question 


Ribosomes synthesize proteins by reading the nucleotide sequence of mRNAs and polymer- 
izing amino acids in an N—>C direction. How is the nucleotide sequence of an mRNA 
molecule translated into the amino acid sequence of a protein molecule? 


W: turn now to the problem of how the sequence of nucleotides in an mRNA mol- 
ecule is translated into the specific amino acid sequence of a protein. The ques- 
tion is: “What is the genetic code that allows the information specified in the sequence 
of bases to be translated into the amino acid sequence of a polypeptide?” That is, how 
is the 4-letter language of nucleic acids translated into the 20-letter language of pro- 
teins? It is easy to see how base-pairing establishes a one-to-one correspondence that 
allows the template-directed synthesis of polynucleotide chains in the processes of rep- 
lication and transcription. However, there is no obvious chemical affinity between the 
purine or pyrimidine bases and the 20 different amino acids. Nor is there any obvious 
structural or stereochemical connection between polynucleotides and amino acids 
that might guide the translation of information. 

Francis Crick reasoned that adapter molecules must bridge this information gap. 
These adapter molecules must interact specifically with both nucleic acids (mRNAs) and 
amino acids. At least 20 different adapter molecules would be needed, at least one for 
each amino acid. The various adapter molecules would be able to read the genetic code 
in an mRNA template and align the amino acids according to the template’s directions 
so that they could be polymerized into a unique polypeptide. Transfer RNAs (tRNAs; 
Figure 30.1) are these adapter molecules (see Chapter 10). Amino acids are attached to 
the 3’-OH at the 3’-CCA end of tRNAs as aminoacyl esters. The formation of these 
aminoacyl-tRNAs, so-called charged tRNAs, is catalyzed by specific aminoacyl-tRNA 
synthetases. There is one of these enzymes for each of the 20 amino acids and each 
aminoacyl-tRNA synthetase loads its amino acid only onto tRNAs designed to carry it. 
In turn, these tRNAs specifically recognize unique sequences of bases in the mRNA 
through complementary base pairing. 


30 


We are a spectacular, splendid manifestation of 

life. We have language and can build metaphors as 
skillfully and precisely as ribosomes make proteins. 
We have affection. We have genes for usefulness, and 
usefulness is about as close to a “common goal” of 
nature as I can guess at. And finally, and perhaps 
best of all, we have music. 


Lewis Thomas (1913-1994) 
“The Youngest and Brightest Thing Around” 
in The Medusa and the Snail (1979) 


The Maya encoded their history in hieroglyphs carved 
on stelae and temples like these ruins in Yucatan 
Peninsula, Mexico. They were finally translated when it 
became apparent that the glyphs are pictorial signs 
representing words or syllables in the Mayan language. 
The 800 or so Mayan glyphs can be arranged to form 
sentences capable of expressing any fact or concept. 


Key Questions 


30.1 What Is the Genetic Code? 


30.2 How Is an Amino Acid Matched 
with Its Proper tRNA? 

30.3 What Are the Rules in Codon— 
Anticodon Pairing? 

30.4 What Is the Structure of Ribosomes, 
and How Are They Assembled? 

30.5 What Are the Mechanics 
of mRNA Translation? 

30.6 How Are Proteins Synthesized 
in Eukaryotic Cells? 


30.7 Do Protein Synthesis Inhibitors Have 
Clinical Applications? 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1090 


Chapter 30 Protein Synthesis 


30 oaei * Variable loop 
© 


g 


(2 
g Q 
Anticodon loo 
OW 
35 


U9 
Anticodon 


Figure 30.1 Generalized secondary structure of 
tRNA molecules. Circles represent nucleotides in the 
tRNA sequence. The numbers given indicate the 
standardized numbering system for tRNAs (which 
differ in total number of nucleotides). Dots indicate 
places where the number of nucleotides may vary 

in different tRNA species. All tRNAs have the 
invariant 3-base sequence CCA at their 3’-ends. The 
amino acid is attached here to the ribose of the 
terminal A. Recall from Chapter 10 that tRNA 
molecules often have modified or unusual bases. 
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Figure 30.2 (a) An overlapping versus a 
nonoverlapping code. (b) A continuous versus a 
punctuated code. 


30.1 | What Is the Genetic Code? 


Once it was realized that the sequence of bases in a gene specified the sequence of 
amino acids in a protein, various possibilities for such a genetic code were consid- 
ered. How many bases are necessary to specify each amino acid? Is the code overlap- 
ping or nonoverlapping (Figure 30.2)? Is the code punctuated or continuous? 
Mathematical considerations favored a triplet of bases as the minimal code word, or 
codon, for each amino acid: A doublet code based on pairs of the four possible bases, 
A, C, G, and U, has 47 = 16 unique arrangements, an insufficient number to encode 
the 20 amino acids. A triplet code of four bases has £ = 64 possible code words, 
more than enough for the task. 


30.la The Genetic Code Is a Triplet Code 


The genetic code is a triplet code read continuously from a fixed starting point in each 
mRNA. Typically, the fixed starting point is the first nucleotide in the codon that 
specifies the first amino acid in the protein being synthesized. The genetic code it is 
defined by the following: 


1. A group of three bases codes for one amino acid. 

2. The code is not overlapping. It begins at a particular nucleotide and is read in 
groups of three from that first nucleotide. Alternative readings starting from 
the second or third nucleotide are not permitted by the translation machinery 
(Figure 30.2a). 

3. The base sequence is read continuously from the fixed starting point without punctua- 
tion (Figure 30.2b). That is, the mRNA sequences contain no “commas” signifying 
appropriate groupings of triplets. Successive triplets define the reading frame. If the 
reading frame is displaced by one base, it remains shifted throughout the subsequent 
message; no “commas” are present to restore the “correct” frame. 


30.1b Codons Specify Amino Acids 


The complete translation of the genetic code is presented in Table 30.1. This code is 
often referred to as the Universal Genetic Code because, with few exceptions, it is 
invariant across all forms of life. Codons, like other nucleotide sequences, are read 
5'—3'. Codons represent triplets of bases in mRNA or, replacing U with T, triplets 
along the nontranscribed (nontemplate) strand of DNA. 

Several noteworthy features characterize the genetic code: 


1. All the codons have meaning. Of the 64 codons, 61 specify particular amino acids. 
The remaining 3—UAA, UAG, and UGA—specify no amino acid and, thus, they 
are nonsense codons. Nonsense codons serve as termination codons; they are stop 
signals indicating that the end of the protein has been reached. 

2. The genetic code is unambiguous. Each of the 61 sense codons encodes only one 
amino acid. 

3. The genetic code is degenerate. Recall that a triple code yields 64 codons for 20 
amino acids. Most codons (61 of 64) code for some amino acid. With the exception 
of Met and Trp, every amino acid is coded by more than one codon. Several—Arg, 
Leu, and Ser—are represented by six different codons. Codons coding for the same 
amino acid are called synonymous codons. 

4. Codons representing the same amino acid or chemically similar amino acids tend to 
be similar in sequence. Often the third base in a codon is irrelevant, so, for example, 
all four codons in the GGX family specify Gly, and the UCX family specifies Ser 
(Table 30.1). This feature is known as third-base degeneracy. Note also that codons 
with a pyrimidine as second base likely encode amino acids with hydrophobic side 
chains, and codons with a purine in the second-base position typically specify polar 
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Table 30.1 The Genetic Code 
First Third 
Position Second Position 
(5’-end) Position (3’-end) 
U C A G 
UUU Phe UCU Ser UAU Tyr UGU Cys U 
U UUC Phe UCC Ser UAC Tyr UGC Cys C 
UUA Leu UCA Ser UAA Stop UGA Stop 5 
UUG Leu UCG Ser UAG Stop = UGG Trp © 
CUU Leu CCU Pro CAU His CGU Arg U 
é CUC Leu CCC Pro CAC His CGC Arg C 
CUA Leu CCA Pro CAA Gin CGA Arg : 
CUG Leu CCG Pro CAGGIn CGG Arg 
AUU Ile ACU Thr AAU Asn AGU Ser U 
N AUC Ile ACC Thr AAC Asn AGC Ser C 
AUA Ile ACAThr  AAALys AGA Arg > 
‘AUG Met* ACG Thr AAG Lys AGG Arg 
GUU Val GCU Ala GAU Asp GGU Gly U 
é GUC Val GCC Ala GAC Asp GGC Gly 
GUA Val GCA Ala GAA Glu GGA Gly G 
GUG Val GCG Ala GAG Glu GGG Gly 


*AUG signals translation initiation as well as coding for Met residues. 
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Third-Base Degeneracy Is Color-Coded 


Third Bases 

Third-Base with Same 

Relationship Meaning Number of Codons 
Third base U, C, A, G 32 (8 families) 
irrelevant 
Purines AorG 12 (6 pairs) 
Pyrimidines U or C 14 (7 pairs) 
Three outof U, C, A 3 (AUX = Ile) 
four 

o] Unique G only 2 (AUG = Met) 
definitions (UGG = Trp) 
Unique A only 1 (UGA = Stop) 
definition 


or charged amino acids. The two negatively charged amino acids, Asp and Glu, are 
encoded by GAX codons; GA-pyrimidine gives Asp and GA-purine specifies Glu. 
The consequence of these similarities is that mutations are less likely to be harm- 
ful because single base changes in a codon will result either in no change or in a 
substitution with an amino acid similar to the original amino acid. The degeneracy 


of the code is evolution’s buffer against mutational disruption. 


5. The genetic code is universal. Although certain minor exceptions in codon usage 
occur (see A Deeper Look box “Natural and Unnatural Variations in the Stan- 
dard Genetic Code,” Section 30.1), the more striking feature of the code is its 
universality: Codon assignments are virtually the same throughout all organisms— 
Archaea, bacteria, and eukaryotes. This conformity means that all extant organ- 
isms use the same genetic code, providing strong evidence that they all evolved from 


a common primordial ancestor. 


Careers in Chemistry 


Dr. Alison Brown 


Dr. Alison Brown currently serves as the Program 
Director for the National Institutes of Health’s 
National Heart, Blood, and Lung Institute. Her work 
focuses on the social determinants of health, social 
health disparities, and obesity prevention. 

Dr. Brown’s educational training includes a 
Bachelor’s in Chemistry from Spelman, a Master’s in 
Nutrition and Applied Physiology from Columbia, 
and a Ph.D. from the Friedman School of Nutrition 


ou 
karys 


Science and Policy at Tufts University. Her hard work 
provided the foundation needed to do the important 
work she does today at the crossroads between 
health and social research. At the NIH, Dr. Brown 
manages grants, heads committees, and translates 
scientific research into policy changes. Making use 
of her extensive qualifications, Dr. Brown ensures 
that the scientific research she oversees adheres to 
policies that benefit the general public. 
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A Deeper Look 


Natural and Unnatural Variations in the Standard Genetic Code 


The genomes of some lower eukaryotes, prokaryotes, and mito- 
chondria show some exceptions to the standard genetic code 
(Table 30.1) in codon assignments. The phenomenon is more com- 
mon in mitochondria. For example, the termination codon UGA 
codes for tryptophan in mitochondria from various animals, proto- 
zoans, and fungi. AUA, normally an Ile codon, codes for methio- 
nine in some animal and fungal mitochondrial genomes, and AGA 
(an Arg codon) is a termination codon in vertebrate mitochondria 
but is a Ser codon in fruit fly mitochondria. Mitochondria in several 
species of yeast use the CUX codons to specify Thr instead of Leu. 
Some yeast and algal mitochondria use CGG, normally an Arg 
codon, as a stop codon. 

Less common are genomic codon variations within the genomes 
of prokaryotic and eukaryotic cells. Among the lower eukaryotes, 
certain ciliated protozoans (Tetrahymena and Paramecium) use 
UAA and UGA as glutamine codons rather than stop codons. 
Instances in prokaryotes include use of the stop codon UGA to 
specify Trp by Mycoplasma. Perhaps most interesting is the use of 
some UGA codons by both prokaryotes and eukaryotes (including 
humans) to specify selenocysteine (Sec), an analog of cysteine in 
which the sulfur atom is replaced by a selenium atom. Indeed, the 
identification of Sec residues in proteins from bacteria, archaea, 
and eukaryotes has led some people to nominate Sec as the twenty- 
first amino acid! (Pyrrolysine, encoded by the stop codon AUG in 
certain archaea and bacteria, has been touted as the twenty-second 
amino acid; it is not present in humans.) Sec formation requires a 
novel Sec-specific tRNA known as tRNA. This tRNA is loaded 
with a Ser residue by seryl-tRNA synthetase, the aminoacyl-tRNA 
synthetase for serine. Then, the enzyme selenocysteine synthase 
catalyzes a reaction wherein the Ser-O is replaced by Se. Transla- 
tion of UGA codons by selenocysteinyl-tRNA** depends on the 
presence of specific stem-loop secondary structures in the mRNA 
called SECIS elements that recode the UGA codon from stop 
to Sec. SECIS elements are recognized by specific proteins that 
recruit selenocysteinyl-tR NA‘® to the UGA codon during protein 
synthesis. Most selenoproteins (proteins containing selenocysteine) 
are involved in oxidation—reduction reactions, and Sec participates 


directly in the catalytic mechanism. Sec provides a more reactive 
functionality than Cys. 


H 
H—Se— CH, —C— COO- 
| 
NH3” 


Selenocysteine (Sec) 


NH3 


O 
TO 
N 
H 


Oo N~ 
Pyrrolysine (Pyl) 


CH, 


It is also possible now to manipulate the genetic code, tRNAs, and 
aminoacyl-tRNA synthetases to introduce amino acids never before 
imagined into proteins of interest. Bacterial, yeast, and mammalian 
cells, as well as cell-free systems for synthetic biology, have been 
engineered so that genetically encoded unnatural amino acids with 
desired physical, chemical, or biological properties can be placed at 
any position in a protein. Hundreds of such unnatural amino acids, 
each with a chemically unique side chain, can be incorporated into 
proteins using virtually any codon and an appropriately engineered 
tRNA-—aminoacyl-tRNA synthetase pair. 
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30.2 


How Is an Amino Acid Matched 


with Its Proper tRNA? 


Two codes lie at the heart of translation. The more familiar one is codon recognition, 
achieved by aminoacyl-tRNAs. In order for accurate translation to occur, the appro- 
priate aminoacyl-tRNA must read the codon through base pairing via its anticodon 
loop (see Figure 30.1). Once an aminoacyl-tRNA has been synthesized, the amino acid 
part makes no contribution to accurate translation of the mRNA. That is, the amino 
acid is passively chauffeured by its tRNA and becomes inserted into a growing pep- 
tide chain following codon—anticodon recognition between the mRNA and tRNA. 
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30.2 How Is an Amino Acid Matched with Its Proper tRNA? 


While base-pairing allows a tRNA to recognize a codon, how is an amino acid loaded 
onto its cognate tRNAs (those tRNAs having anticodons that can read one or more of 
the codons specifying that particular amino acid)? 


30.2a Aminoacyl-tRNA Synthetases Interpret 
the Second Genetic Code 


A second genetic code must exist, the code by which each aminoacyl-tRNA synthetase 
matches up its amino acid with tRNAs that base-pair with those codons for its amino 
acid. To interpret this second genetic code, an aminoacyl-tRNA synthetase discrimi- 
nates between the 20 amino acids and the many tRNAs and uniquely picks out its proper 
substrates—one specific amino acid and the tRNA(s) appropriate to it—from among 
the more than 400 possible combinations. The appropriate tRNAs are those having 
anticodons that recognize the codons specifying the particular amino acid. It is impera- 
tive that the proper amino acids be loaded onto the various tRNAs so that the mRNA 
can be translated with fidelity. Although the primary genetic code is key to understand- 
ing the central dogma of molecular biology that dictates how DNA encodes proteins, 
the second genetic code is just as crucial to the fidelity of information transfer. 

Eukaryotic cells and some bacteria have 20 different aminoacyl-tRNA synthetases, 
one for each amino acid. Each of these enzymes catalyzes ATP-dependent attachment of 
its specific amino acid to the 3’-end of its cognate tRNA molecules (Figure 30.3). The 
products of the reaction are an aminoacyl-tRNA, AMP, and PP;. Ever-present pyro- 
phosphatases quickly hydrolyze the pyrophosphate product to give two P;. This highly 
exergonic reaction provides the overall thermodynamic driving force for aminoacyl-tRNA 
synthesis. The aminoacyl-tR NA synthetase reaction serves two purposes: 


1. It activates the amino acid so that it will readily react to form a peptide bond. 
2. It bridges the information gap between amino acids and codons. 


The underlying mechanisms of molecular recognition used by each aminoacyl-tRNA 
synthetase to bring the proper amino acid to its cognate tRNA are the embodiment 
of the second genetic code. An exception to these rules is the fact that most bacteria 
and archaea do not have specific aminoacyl-tRNA synthetases for Asn and/or Gln. 
Instead, the Asp- and Glu-tRNA synthetases are used to load Asp and Glu onto 
tRNA4"" and tRNA®!", respectively. Then these aminoacyl groups are amidated by 
tRNA-dependent amidotransferases to yield Asn-tRNA“* and Gln-tRNAS" for 
protein synthesis. 

Most aminoacylated-tRNAs are destined to deliver their amino acids to the transla- 
tional apparatus for use in protein synthesis. However, aminoacylated-tR NAs also serve 
as aminoacyl donors for a number of other purposes. For example, aminoacyl-tRNAs 
are involved in tagging the N-termini of proteins for degradation (see Figure 31.11), 
in the delivery of amino acids for peptidoglycan synthesis (see Figure 7.29), as sub- 
strates for amino acid addition to membrane lipids (as in the formation of lysyl- 
phosphatidylglycerol, thereby dramatically altering the charge of this membrane 
lipid), and as substrates in antibiotic biosynthesis (for example, the synthesis of 
valinomycin). 


30.2b Evolution Has Provided Two Distinct Classes 
of Aminoacyl-tRNA Synthetases 
Despite their common enzymatic function, aminoacyl-tRNA synthetases are a 


diverse group of proteins in terms of size, amino acid sequence, and oligomeric struc- 
ture. In higher eukaryotes, some aminoacyl-tRNA synthetases are assembled into 
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Figure 30.3 The aminoacyl-tRNA synthetase reaction. (a) The overall reaction. (b) Aminoacyl-tRNA +NH. 


formation proceeds in two steps: (i) formation of an aminoacyl-adenylate and (ii) transfer of the activated 
amino acid moiety of the mixed anhydride to either the 2'-OH (class I aminoacyl-tRNA synthetases) or 3'-OH 
(class II aminoacyl-tRNA synthetases) of the ribose on the terminal adenylic acid at the 3’-CCA terminus 
common to all tRNAs. Those aminoacyl-tRNAs formed as 2’-aminoacy]l esters undergo a transesterification 
that moves the aminoacyl function to the 3’-O of tRNA. Only the 3’-esters are substrates for protein synthesis. 


3'-O aminoacyl-tRNA 


large multiprotein complexes. The aminoacyl-tRNA synthetases fall into two funda- 
mental classes on the basis of similar amino acid sequence motifs, oligomeric state, 
and acylation function (Table 30.2): class I and class II. Class I aminoacyl-tRNA 
synthetases first add the amino acid to the 2'-OH of the terminal adenylate residue of 
tRNA before shifting it to the 3’-OH; class II enzymes add it directly to the 3’-OH 
(Figure 30.3). The catalytic domains of these enzymes evolved from two different 
ancestral predecessors. Aminoacyl-tRNA synthetases are ranked among the oldest 
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proteins because the different classes of these enzymes were present very early in 
evolution. Class I and class II aminoacyl-tRNA synthetases interact with the tRNA 
3’-terminal CCA and acceptor stem in a mirror-symmetric fashion with respect to 
each other (Figure 30.4). Class I enzymes bind to the tRNA acceptor stem helix from 
the minor-groove side, whereas class II enzymes bind it from the major-groove side. 

Both class I and class II aminoacyl-tRNA synthetases can be represented as two- 
domain structures, as can their L-shaped tRNA substrates, which have the acceptor 
stem/CCA-3'-OH at one end and the anticodon stem-loop at the other (see 
Figures 11.35 and 30.5). This L-shaped tertiary structure of tRNAs separates the 
3'-CCA acceptor end from the anticodon loop by a distance of 7.6 nm. The two domains 
of tRNAs have distinct functions: The 3’-CCA end is the site of aminoacylation, and 
the anticodon-containing domain interacts with the mRNA template. The two domains 
of tRNAs interact with the separate domains in the synthetases. One of the two major 
aminoacyl-tRNA synthetase domains is the catalytic domain (which defines the differ- 
ence between class I and class II enzymes); this domain interacts with the tRNA 
3'-CCA end. The other major domain in aminoacyl-tRNA synthetases is highly vari- 
able and interacts with parts of the tR NA beyond the acceptor-TWC stem-loop domain, 
including, in some cases, the anticodon. 


30.2c Aminoacyl-tRNA Synthetases Can Discriminate 
Between the Various tRNAs 


Aside from the need to uniquely recognize their cognate amino acids, aminoacyl- 
tRNA synthetases must be able to discriminate between the various tRNAs. The 
structural features that permit the synthetases to recognize and aminoacylate their 
cognate tRNA(s) are not universal. That is, a common set of rules does not govern 
tRNA recognition by these enzymes. Most surprising is the fact that the recognition 
features are not limited to the anticodon and, in some instances, do not even include 
the anticodon. For most tRNAs, a set of sequence elements is recognized by its spe- 
cific aminoacyl-tRNA synthetase, rather than a single distinctive nucleotide or base 
pair. These elements include one or more of the following: (1) at least one base in the 
anticodon; (2) one or more of the three base pairs in the acceptor stem; and (3) the 
base at canonical position 73 (the unpaired base preceding the CCA end), referred 
to as the discriminator base because this base is invariant in the tRNAs for a particu- 
lar amino acid. Figure 30.5 presents a ribbon diagram of a tRNA molecule showing 
the location of nucleotides that contribute to specific recognition by the respective 
aminoacyl-tRNA synthetases for each of the 20 amino acids. Interestingly, the same 
set of tRNA features that serves as positive determinants for binding and aminoac- 
ylation of the tRNA by its cognate aminoacyl-tRNA synthetase may act as negative 


(a) (b) 
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| Table 30.2 | eyl i The Two Classes of 


Aminoacyl-tRNA Synthetases 


Class | Class II 
Arg Ala 
Cys Asn 
Gln Asp 
Glu Gly 
Ile His 
Leu Lys 
Met Phe 
Trp Pro 
Tyr Ser 
Val Thr 


Figure 30.4 Mirror-symmetric interactions of 

class I versus class II aminoacyl-tRNA synthetases 
with their tRNA substrates. The two different classes 
of aminoacyl-tRNA synthetases bind to opposite 
faces of tRNA molecules. (a) The structure of the 
class I glutaminyl-tRNA®™ synthetase: tR NAS” 
complex with a bound active-site inhibitor (orange; 
PDB id = 1EUQ). Note that the enzyme is on the 
left side of the tRNA in this view. (b) The structure 
of the class II threonyl-tR NA™ synthetase: tR NA™* 
complex with AMP (red) at the active site (PDB id = 
1QF6). Note that most of the aminoacyl-tRNA 
synthetase structure is on the right side of the tRNA. 
The relative orientation of the tRNA is the same in 
both structures, with the 3’-CCA end of the tRNA 
pointed away from the viewer. 
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Figure 30.6 Major identity elements in four tRNA 
species. Each base in the tRNA is represented by a 
circle. Numbered filled circles indicate positions of 
identity elements within the tRNA that are recognized 
by its specific aminoacyl-tRNA synthetase. (Adapted 
from Schulman, L. H., and Abelson, J., 1988. Recent 
excitement in understanding transfer RNA identity. 
Science 240:1591-1592.) 


Position 73 
A, C, D,E, F G 
H, IL, K, L, M, N, P 


Acceptor stem 


A, D, G, H,M 
Q, S, T, V, W 


C, D, E, F, G 
H, I, K, M, N, P 


Q, R, T, V, W, Y 


Figure 30.5 Ribbon diagram of tRNA tertiary structure. Numbers represent the consensus nucleotide 
sequence (see Figure 30.1). The locations of nucleotides recognized by the various aminoacyl-tRNA 
synthetases are indicated; shown within the boxes are one-letter designations of the amino acids 
whose respective aminoacyl-tRNA synthetases interact at the discriminator base (position 73), 
acceptor stem, variable pocket and/or loop, or anticodon. The inset shows additional recognition sites 
in those tRNAs having a variable loop that forms a stem-loop structure. (Adapted from Figure 2 in 
Saks, M. E., Sampson, J. R., and Abelson, J. N., 1994. The transfer RNA problem: a search for rules. 
Science 263:191-197.) 


determinants that prohibit binding and aminoacylation by other (noncognate) 
aminoacyl-tRNA synthetases. Because no common set of rules exists, the second 
genetic code is an operational code based on aminoacyl-tRNA synthetase recogni- 
tion of varying sequence and structural features in the different tRNA molecules 
during the operation of aminoacyl-tRNA synthesis. Some examples of this code are 
given in Figure 30.6. 


30.2d Escherichia coli Glutaminyl-tRNA°" Synthetase Recognizes Specific 
Sites on tRNAS" 


E. coli glutaminyl-tR NAS" synthetase, a class I enzyme, provides a good illustration of 
aminoacyl-tRNA synthetase:cognate tRNA interactions. This glutaminyl-tR NAS" 
synthetase shares a continuous interaction with its cognate tRNA that extends from the 
anticodon to the acceptor stem along the entire inside of the L-shaped tRNA 
(Figure 30.7). Specific recognition elements include enzyme contacts with the discrimi- 
nator base, acceptor stem, and anticodon, particularly the central U in the CUG antico- 
don. The carboxylate group of Asp™* makes sequence-specific H bonds in the tRNA 
minor groove with the guanine 2-NH) group in the base pair G3:C70 of the acceptor 
stem. A mutant glutaminyl-tR NAC" synthetase with Asn substituted for Asp at position 
235 shows relaxed specificity; that is, it now will acylate noncognate tRNAs with Gln. 
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30.2e The Identity Elements Recognized by Some Aminoacyl-tRNA 
Synthetases Reside in the Anticodon 


Alteration of the anticodons of either tR NA™? or tR NAV! to CAU, the anticodon for 
the methionine codon AUG, transforms each of the tRNAs into a substrate for 
methionyl-tRNA synthetase, and they are loaded with methionine. Similarly, revers- 
ing the methionine CAU anticodon of tRNA™" to UAC transforms it into a substrate 
for valyl-tRNA™! synthetase. Clearly, methionyl-tRNA synthetase and valyl-tRNA 
synthetase rely on the anticodon in selecting tRNAs for loading. 


30.2f A Single G:U Base Pair Defines tRNA‘!s 


The noncanonical base pair, G3:U70, is the singular feature by which alanyl-tRNA“” 
synthetase recognizes tRNAs as its substrates. All tRNA“! representatives, from 
archaea to eukaryotes, possess this G3:U70 acceptor stem base pair. Altering this 
unusual G3:U70 base pair of tRNA“ to G:C, A:U, or even U:G abolishes its ability 
to be aminoacylated with alanine. On the other hand, provided the G3:U70 base pair 
is present, alanyl-tR NA“ synthetase aminoacylates a 24-nucleotide stem-loop ana- 
log of tRNA“® (Figure 30.8). The key feature of the G3:U70 base pair is the 2-NH> 
group of G3. In the RNA A-form double-helical structure adopted by the tRNA 
acceptor stem, the G3 2-NH) group is exposed in the minor groove of the helix, and 
if the G3 pairing partner is a U, this 2-NH> group lacks an H-bonding partner 


Figure 30.7 Structure of E. coli glutaminyl-tRNAS" 
. ‘ i ; : synthetase complexed with tR NAS" and ATP 
(Figure 30.8). Thus, an unpaired G 2-amino group at the right place ina tRNA (PDB id = 1GSG). The protein:tRNA contact region 


acceptor stem marks a tRNA for aminoacylation by alanyl-tRNA“" synthetase. extends along one side of the entire length of this 
Interactions between alanyl-tRNA‘" synthetase and the G3:U70 base pair cause extended protein. The acceptor stem of the tRNA 


and the ATP (green) fit into a cleft at the top of the 
protein in this view. The enzyme also interacts 
extensively with the anticodon (lower tip of tRNA9™). 


conformational changes along the tRNA acceptor stem. These changes direct the 
entry of the 3’-CCA acceptor end into the enzyme’s active site. The 3’-CCA acceptor 
ends of tRNAs lacking a G3:U70 base pair are not reoriented to the alanyl-tRNA“" 
synthetase active site and thus are not aminoacylated. 
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Figure 30.8 Identification of the key feature that determines whether or not a tRNA is aminoacylated 
by alanyl-tRNA“" synthetase. (a) Secondary structure of the tRNA“! having the GGC anticodon 
(tRNA4 SSC) (Jeff) and its stem microhelix^! analog (right). The microhelix“" consists of nucleotides 
1 through 13 of tRNA4’S°C connected directly to 66 through 76 to recreate the tRNA“! 7 bp 
acceptor stem. Because the microhelix analog has the characteristic G3:U70 base pair found in 
tRNA", it is aminoacylated by alanyl-tR NA“ synthetase. (b) The key determinant, the unbonded 
2-NH,;, group of the G3 in the G3:U70 base pair, would lie within the minor groove of an A-form 
RNA double helix. [(a) Adapted from Schimmel, P., 1989. Parameters for molecular recognition of 
transfer RNAs. Biochemistry 28:2747-2759.] 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1098 


Chapter 30 Protein Synthesis 


Anticodon 
d 
5' _ eee 3’ 
Codon 


mRNA 


Figure 30.9 Codon—anticodon pairing. 
Complementary trinucleotide sequence elements 
align in antiparallel fashion. 


| Table 30.3 ER Base-Pairing Possibilities 


at the Third Position of the Codon 


Bases Recognized 


Base on the Anticodon on the Codon 
U A,G 
C G 
A U 
G U, C 
I U,C, A 


Adapted from Crick, F. H. C., 1966. Codon—anticodon 


pairing: The wobble hypothesis. Journal of Molecular Biology 


19:548-555. 


30.3 | What Are the Rules in Codon—Anticodon Pairing? 


Protein synthesis depends on the codon-directed binding of the proper aminoacyl- 
tRNAs so that the right amino acids are sequentially aligned according to the speci- 
fications of the mRNA undergoing translation. This alignment is achieved via 
codon—anticodon pairing in antiparallel orientation (Figure 30.9). However, consid- 
erable degeneracy exists in the genetic code at the third position. Conceivably, this 
degeneracy could be handled in either of two ways: (1) Codon—anticodon recognition 
could be highly specific so that a complementary anticodon is required for each of the 
61 sense codons, or (2) fewer than 61 anticodons could be used for the sense codons if 
certain allowances were made in the base-pairing rules. Then, some anticodons could 
recognize more than one codon. As early as 1965, it was known that poly(U) bound 
all Phe-tR NA?"* molecules even though UUC is also a Phe codon. The phenylalanine- 
specific tRNAs could recognize either UUU or UUC. Also, one particular yeast 
tRNA4® was able to bind to three codons: GCU, GCC, and GCA. 

Interestingly, the tRNA anticodon is the site of the most diverse and chemically 
complex post-transcriptional modifications found in RNA. These modifications play a 
key role in codon-anticodon recognition. 


30.3a Francis Crick Proposed the Wobble Hypothesis 
for Codon—Anticodon Pairing 


Francis Crick considered these results and tested alternative base-pairing possibili- 
ties by model building. He hypothesized that the first two bases of the codon and the 
last two bases of the anticodon form canonical Watson—Crick A:U or G:C base pairs, 
but pairing between the third base of the codon and the first base of the anticodon 
follows less stringent rules. That is, a certain amount of play, or wobble, might be 
allowed in base pairing at this position. The third base of the codon is sometimes 
referred to as the wobble position. 

Crick’s investigations suggested a set of rules for pairing between the third base of 
the codon and the first base of the anticodon (Table 30.3). The wobble rules indicate 
that a first-base anticodon U could recognize either an A or G in the codon third-base 
position; first-base anticodon G might recognize either U or C in the third-base posi- 
tion of the codon; and first-base anticodon I (inosine (6-OH purine)) might interact 
with U, C, or A in the codon third position (Figure 30.10). Thus, the first base of the 
anticodon indicates whether the tRNA can read one, two, or three different codons. 
Anticodons beginning with A or C read only one codon, those beginning with G or U 
read two, and anticodons beginning with I can read three codons. 

The wobble rules also predict that four-codon families (like Pro or Thr), where any 
of the four bases may be in the third position, require at least two different tRNAs. 
However, all members of the set of tRNAs specific for a particular amino acid—termed 
isoacceptor tRNAs—are served by one aminoacyl-tRNA synthetase. 


30.3b Some Codons Are Used More Than Others 


Because more than one codon exists for most amino acids, the possibility for varia- 
tion in codon usage arises. Indeed, variation in codon usage accommodates the fact 
that the DNA of different organisms varies in relative A:T/G:C content. Neverthe- 
less, even in organisms of average base composition, codon usage may be biased. 
Table 30.4 gives some examples from Æ. coli and humans reflecting the nonrandom 
usage of codons. Of more than 109,000 Leu codons tabulated in a set of human 
genes, CUG was used in excess of 48,000 times, CUC more than 23,000 times, but 
UUA just 6000 times. 

The occurrence of codons in E. coli mRNAs correlates well with the relative abun- 
dance of the tRNAs that read them. Preferred codons are represented by the most abun- 
dant isoacceptor tRNAs. Furthermore, mRNAs for proteins that are synthesized in 
abundance tend to employ preferred codons. Rare tRNAs correspond to rarely used 
codons, and messages containing such codons might experience delays in translation. 
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Ribose 


Figure 30.10 Pairing of anticodon inosine (1, left) with C, U, or A as the codon third base. 
Note that I is in the keto tautomeric form. 


Table 3 
E. coli Gene Human Gene 
Amino Acid Codon Frequency/1000 Frequency/1000 
Leu CUA 3.2 6.1 
CUC 9.9 20.1 
CUG 54.6 42.1 
CUU 10.2 10.8 
UUA 10.9 5.4 
UUG 11.5 11.1 
Pro CCA 8.2 15.4 
CCC 4.3 20.6 
CCG 23.8 6.8 
CCU 6.6 16.1 
Ala GCA 15.6 14.4 
GCC 34.4 29.7 
GCG 32.9 Tea 
GCU 13.4 18.9 
Lys AAA 36.5 21.9 
AAG 12.0 35.2 
Glu GAA 43.5 26.4 
GAG 19.2 41.6 


The results are expressed as the frequency of occurrence of a codon per 1000 codons tabulated 
in 1562 E. coli genes and 2681 human genes. (Because E. coli and human proteins differ 
somewhat in amino acid composition, the frequencies for a particular amino acid do not 
correspond exactly between the two species.) 


Adapted from Wada, K., et al., 1992. Codon usage tabulated from Genbank genetic sequence data. Nucleic Acids 
Research 20:2111-2118. 
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30.3c Nonsense Suppression Occurs When Suppressor tRNAs 
Read Nonsense Codons 


Mutations that alter a sense codon to one of the three nonsense codons—UAA, UAG, 
or UGA—result in premature termination of protein synthesis and the release of 
truncated (incomplete) polypeptides. The nonsense codons are named ochre (UAA), 
amber (UAG), and opal (UGA). Geneticists found that second mutations elsewhere in 
the genome were able to suppress the effects of nonsense mutations so that the organ- 
ism survived, a phenomenon termed nonsense suppression. The molecular basis for 
nonsense suppression was a mystery until it was realized that suppressors were muta- 
tions in tRNA genes that altered the anticodon so that the mutant tRNA could now 
read a particular stop codon and insert an amino acid. For example, alteration of the 
anticodon of a tRNA" from GUA to CUA allows this tRNA to read the amber stop 
codon, UAG, and insert Tyr. Suppressor tRNAs are typically generated from minor 
tRNA species within a set of isoacceptor tRNAs, so their recruitment to a new role 
via mutation does not involve loss of an essential tRNA; that is, the mutation is not 
particularly deleterious to the organism. A suppressor tRNA, as a mutant tRNA, 
may even carry and introduce an amino acid different from the one borne by the wild- 
type tRNA. 


30.4 | What Is the Structure of Ribosomes, 
| and How Are They Assembled? 


Protein biosynthesis is achieved by the process of translation. Translation converts the 
language of genetic information embodied in the base sequence of a messenger RNA 
molecule into the amino acid sequence of a polypeptide chain. During translation, 
proteins are synthesized on ribosomes by linking amino acids together in the specific 
linear order stipulated by the sequence of codons in an mRNA. Ribosomes are the 
agents of protein synthesis. 

Ribosomes are compact ribonucleoprotein particles found in the cytosol of all cells, 
as well as in the matrix of mitochondria and the stroma of chloroplasts. The general 
structure of ribosomes is described in Chapter 10; here we consider their structure in 
light of their function in synthesizing proteins. Ribosomes are mechanochemical sys- 
tems that move along mRNA templates, orchestrating the interactions between succes- 
sive codons and the corresponding anticodons presented by aminoacyl-tRNAs. As 
they align successive amino acids via codon—anticodon recognition, ribosomes also 
catalyze the formation of peptide bonds between the growing peptide chain and incom- 
ing amino acids. 


30.4a Prokaryotic Ribosomes Are Composed of 30S and 50S Subunits 


E. coli ribosomes are representative of the structural organization of the prokaryotic 
versions of these supramolecular protein-synthesizing machines (Table 30.5, see also 
Figure 10.21). The E. coli ribosome is a roughly globular particle with a diameter of 
25 nm, a sedimentation coefficient of 70S, and a mass of about 2520 kDa. It consists 
of two unequal subunits that dissociate from each other at Mg?* concentrations below 
1 mM. The smaller, or 30S, subunit is composed of 21 different proteins and a single 
rRNA, 16S ribosomal RNA (rRNA). The larger 50S subunit consists of 31 different 
proteins and two rRNAs: 23S rRNA and 5S rRNA. Ribosomes are roughly two-thirds 
RNA and one-third protein by mass. An E. coli cell contains around 20,000 ribo- 
somes, constituting about 20% of the dry cell mass. 


Ribosomal Proteins There is one copy of each ribosomal protein per 70S ribosome, 
excepting protein L7/L12 (L7 and L12 have identical amino acid sequences and differ 
only in the degree of N-terminal acetylation). Only one protein is common to both the 
small and large subunit: S20 and L26 are identical. The largest ribosomal protein is S1 
(557 residues, 61.2 kDa); the smallest is L34 (46 residues, 5.4 kDa). The sequences of 
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Table 30.5 | 

Ribosome Small Subunit Large Subunit 
Sedimentation coefficient 70S 30S 50S 
Mass (kDa) 2520 930 1590 
Major RNAs 16S = 1542 bases 23S = 2904 bases 
Minor RNAs 5S = 120 bases 
RNA mass (kDa) 1664 560 1104 
RNA proportion 66% 60% 70% 
Protein number 21 polypeptides* 31 polypeptides* 
Protein mass (kDa) 857 370 487 
Protein proportion 34% 40% 30% 


*The S proteins 
iThe L proteins 


ribosomal proteins share little similarity. These proteins are typically rich in the 
cationic amino acids Lys and Arg and have few aromatic amino acid residues, proper- 
ties appropriate to proteins intended to interact strongly with polyanionic RNAs. 


rRNAs The rRNAs of E. coli are encoded by a set of seven operons (Figure 30.11). 
Each of these operons is transcribed into a 30S rRNA precursor that includes several 
tRNAs. RNase III and other nucleases cleave these precursors to generate 23S, 16S, 
and 5S rRNA, as well as several tRNAs that are unique to each operon. Transcription 
of rRNA genes accounts for 80% to 90% of total cellular RNA synthesis. Ribosomal 
RNAs show extensive potential for intrachain hydrogen bonding and assume second- 
ary structures reminiscent of tRNAs, although substantially more complex (see 
Figures 11.38 and 11.39). About two-thirds of rRNA is double-helical. Double-helical 
regions are punctuated by short, single-stranded stretches, generating hairpin con- 
formations that dominate the molecule; four distinct domains can be discerned in the 
secondary structure of 16S rRNA and six in 23S rRNA. The three-dimensional 


RNase II 


eines 


Operon Spacer tRNA Trailer tRNA 
rmA Tle, Ala ee 
rrnB Gu ee 
rrnC Glu Asp, Trp 
rrnD Tle, Ala eee 
rmE Gu ee 
rrnG Glu Asp 
rmH Ile, Ala 


Figure 30.11 The seven ribosomal RNA operons in E. coli. Numerals to the right of the brackets 
indicate the number of species of tRNA encoded by each transcript. 
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Figure 30.12 Tertiary structure of the 16S rRNA 
within the Thermus thermophilus 30S ribosomal 
subunit (PDB id = 2J02). This view is of the face 
that interacts with the 50S subunit (see Figures 30.13 
and 30.15). H = head; Be = beak; N = neck; P 
platform; Sh = shoulder; Sp = spur; Bo = body; 
these labels describe structural features of the 16S 
rRNA. Like E. coli, T. thermophilus is Gram- 
negative bacterium. 


structures of both the 30S and 50S ribosomal subunits show that the general shapes 
of the ribosomal subunits are determined by the conformation of the rRNA mole- 
cules within them. Figure 30.12 illustrates the three-dimensional structure of 16S 
rRNA within the 30S subunit. The overall form of the 30S structure is essentially that 
of the rRNA (compare Figures 30.12 and 30.13a). The same relationship is true for the 
23S plus 5S rRNA and the large ribosomal subunit (compare Figures 11.39 and 
30.13b). Ribosomal proteins serve a largely structural role in ribosomes; their pri- 
mary function is to brace and stabilize the rRNA conformations within the ribo- 
somal subunits. 


30.4b Ribosomes Spontaneously Self-Assemble In Vitro 


Ribosomal subunit self-assembly is one of the paradigms for the spontaneous for- 
mation of supramolecular complexes from their macromolecular components. If the 
individual proteins and rRNAs composing ribosomal subunits are mixed together 
in vitro under appropriate conditions of pH and ionic strength, spontaneous self- 
assembly into functionally competent subunits takes place without the intervention 
of any additional factors or chaperones. The rRNA acts as a scaffold upon which 
the various ribosomal proteins convene. Ribosomal proteins bind in a specified 
order. 


30.4c Ribosomes Have a Characteristic Anatomy 


Ribosomal subunits have a characteristic three-dimensional architecture that has 
been revealed by image reconstructions from cryoelectron microscopy, X-ray crys- 
tallography, and X-ray and neutron solution scattering. Such analyses provide 
images as depicted in Figure 30.13. The 30S, or small, subunit features a head and a 
base, or body, from which a platform projects. A cleft is defined by the spatial rela- 
tionship between the head, base, and platform (Figure 30.13a). The mRNA passes 
across this cleft. The platform represents the central domain of the 30S subunit; it 
contains one-third of the 16S rRNA. This central domain binds mRNA and the anti- 
codon stem-loop end of aminoacyl-tRNAs, providing the framework for decoding 
the genetic information in mRNA by mediating codon—anticodon recognition. As 
such, this central domain of the 30S subunit serves as the decoding center. This center 
is composed only of 16S rRNA; no ribosomal proteins are involved in decoding the 
message. 

The 50S, or large, subunit is a mitt-like globular structure with three distinctive pro- 
jections: a central protuberance, the stalk containing protein L1, and a winglike ridge 
known as the L7/L12 region (Figure 30.13b). The large subunit binds the aminoacyl- 
acceptor ends of the tRNAs and is responsible for catalyzing formation of the peptide 
bond between successive amino acids in the polypeptide chain. This catalytic center, the 
peptidyl transferase, is located at the bottom of a deep cleft. From it, a 10-nm-long tun- 
nel passes outward through the back of the large subunit. The process of protein folding 
begins within the tunnel, which can accommodate a protein of 30 or so amino acids, 
depending on its conformation. 

The small and large subunits associate with each other in the manner shown in 
Figure 30.13c and d. The contacts between the 30S and 50S subunits are rather limited, 
and the subunit interface contains mostly rRNA, with relatively little contribution from 
ribosomal proteins. The decoding center in the 30S subunit is aligned somewhat with 
the peptidyl transferase and the tunnel in the large subunit, and the growing peptidyl 
chain is threaded through this tunnel as protein synthesis proceeds. Even though the 
ribosomal proteins are arranged peripherally around the rRNAs in ribosomes, rRNA 
occupies 30% to 40% of the ribosomal subunit surface areas. 

Both subunits are involved in translocation, the process by which the mRNA moves 
through the ribosome, one codon at a time. (Although it is physically more likely for the 
mRNA to move through the ribosome, the descriptions of the events in protein synthe- 
sis that follow imply that the ribosome moves along the mRNA.) 
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Figure 30.13 Structure of the T. thermophilus ribosomal subunits and 70S ribosome, as deduced by X-ray 
crystallography. Prominent structural features are labeled. (a) 30S (PDB id = 2J02) and (b) 50S (PDB id = 
2J03) subunits. These views show the sides of these two that form the interface between them when they 
(c) come together to form a 70S subunit. (d) A side view of the 70S ribosome; the white area represents the 
region where mRNA and tRNAs are bound and peptide bond formation occurs. The tunnel through the 
50S subunit that the growing peptide chain transits is shown as a dashed line. The approximate dimensions 
of the 30S subunit are 5.5 X 22 X 22 nm; the 50S subunit dimensions are 15 X 20 X 20 nm. 


30.4d The Cytosolic Ribosomes of Eukaryotes Are Larger 
Than Prokaryotic Ribosomes 


Eukaryotic cells have ribosomes in their mitochondria (and chloroplasts) as well as in 
the cytosol. The mitochondrial and chloroplastic ribosomes resemble prokaryotic 
ribosomes in size, overall organization, structure, and function, a fact reflecting the 
prokaryotic origins of these organelles. Although eukaryotic cytosolic ribosomes are 
larger and considerably more complex, they retain the core structural and functional 
properties of their prokaryotic counterparts, confirming that the fundamental ribo- 
some organization and operation has been conserved across evolutionary time. The 
rRNA genes of eukaryotes are present in the form of several hundred tandem clusters; 
these clusters define the nucleolus, a distinct region of the nucleus where these clusters 
are located and where transcription of rRNA genes occurs. As in prokaryotes, 80% to 
90% of eukaryotic transcription is rRNA synthesis. Higher eukaryotes have more 
complex ribosomes than lower eukaryotes. For example, the yeast cytosolic ribo- 
somes have major rR NAs of 3392 (large subunit) and 1799 nucleotides (small subunit); 
the major rRNAs of human cytosolic ribosomes are 5034 and 1870 nucleotides, 
respectively. Table 30.6 lists the properties of human cytosolic ribosomes. Compari- 
son of base sequences and secondary structures of rRNAs from different organisms 
(Section 11.7c) suggests that evolution has worked to conserve the secondary structure 
of these molecules, although not necessarily the nucleotide sequences creating such 
structure. That is, the retention of a base pair at a particular location seems more 
important than whether the base pair is G:C or A:U. 
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Figure 30.14 The basic steps in protein synthesis. 


The ribosome has three distinct binding sites for 


tRNA: the A, or acceptor, site; the P, or peptidyl, 


site; and the E, or exit, site. The figure shows 
peptidyl-RNA occupying the P site. 


3! 


| Table 30.6 | KF Structural Organization of Human Cytosolic Ribosomes 


Ribosome Small Subunit Large Subunit 

Sedimentation coefficient 80S 40S 60S 

Mass (megaDa) 4.3 1.4 2.9 

Major RNAs 18S = 1870 bases 28S = 5034 bases 

Minor RNAs 5.88 = 156 bases 
5S = 121 bases 

RNA mass (megaDa) 2.5 0.7 1.8 

RNA proportion 60% 50% 65% 

Protein number 33 polypeptides 47 polypeptides 

Protein mass (megaDa) L7 0.7 1.0 

Protein proportion 40% 50% 35% 


30.5 | What Are the Mechanics of mRNA Translation? 


In translating an mRNA, a ribosome must move along it in the 5'—>3' direction, 
recruiting aminoacyl-tRNAs whose anticodons match up with successive codons and 
joining amino acids in peptide bonds in a polymerization process that forms a par- 
ticular protein. Like chemical polymerization processes, protein biosynthesis in all 
cells is characterized by three distinct phases: initiation, elongation, and termination. 
At each stage, the energy driving the assembly process is provided by GTP hydrolysis, 
and specific soluble protein factors participate in the events. These soluble proteins 
are often G-protein family members that use the energy released upon hydrolysis of 
bound GTP to fuel switchlike conformational changes. Such conformational changes 
are at the heart of the mechanical steps necessary to move a ribosome along an mRNA 
and to deliver an aminoacyl-tRNA into appropriate register with a codon. 


30.5a Translation Can be Organized into Three Steps: Initiation, 
Elongation, and Termination 


Initiation involves binding of mRNA by the small ribosomal subunit, followed by 
association of a particular initiator aminoacyl-tRNA that recognizes the first codon. 
This codon often lies within the first 30 nucleotides or so of mRNA spanned by the 
small subunit. The large ribosomal subunit then joins the initiation complex, prepar- 
ing it for the elongation stage. 

Elongation includes the synthesis of all peptide bonds from the first to the last, as the 
ribosome moves along the mRNA, recruiting the cognate aminoacyl-tR NAs that deliver 
successively each of the multitude of amino acids in a polypeptide. The ribosome 
remains associated with the mRNA throughout elongation, moving along it and trans- 
lating its message into an amino acid sequence. This is accomplished via a repetitive 
cycle of events in which successive aminoacyl-tR NAs are added to the ribosome: mRNA 
complex as directed by codon binding, the 50S subunit catalyzes peptide bond forma- 
tion, and the polypeptide chain grows by one amino acid at a time. 

Three tRNA molecules may be associated with the ribosome: mRNA complex at any 
moment. Each lies in a distinct site (Figure 30.14). The A, or acceptor, site is the binding 
site for an incoming aminoacyl-tRNA. The P, or peptidyl, site is occupied by peptidyl- 
tRNA, the tRNA carrying the growing polypeptide chain. The elongation reaction 
transfers the peptide chain from the peptidyl-tR NA in the P site to the aminoacyl-tRNA 
in the A site. This transfer occurs through covalent attachment of the a-amino group of 
the aminoacyl-tRNA to the a-carboxyl group of the peptidyl-tRNA, forming a new 
peptide bond. The ribosome now moves (translocates) precisely one codon further along 
the mRNA. Accompanying this translocation is the movement of the new, longer pepti- 
dyl-tRNA from the A site into the P site. The A site, left vacant by this translocation, 
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can accept the next incoming aminoacyl-tRNA. The E, or exit, site, is transiently occu- 
pied by the unloaded, or deacylated, tRNA, which has lost its peptidyl chain through 
the peptidyl transferase reaction. These events are summarized in Figure 30.14. The 
contributions made to each of the three tRNA-binding sites by each ribosomal subunit 
are shown in Figure 30.15. 

Termination is triggered when the ribosome reaches a stop codon on the mRNA. At 
this point, the polypeptide chain is released and the ribosomal subunits dissociate from 
the mRNA. 

Protein synthesis proceeds rapidly. In vigorously growing bacteria, about 20 amino 
acid residues are added to a growing polypeptide chain each second. So an average 
protein molecule of about 300 amino acid residues is synthesized in only 15 seconds. 
Eukaryotic protein synthesis is only about 10% as fast. Protein synthesis is also highly 
accurate: An inappropriate amino acid is incorporated only once in every 10* codons. 
We focus first on protein synthesis in E. coli, the system for which we know the most. 


30.5b Peptide Chain Initiation in Bacteria Requires 
a G-Protein Family Member 


The components required for peptide chain initiation include (1) mRNA; (2) 30S and 
50S ribosomal subunits; (3) a set of proteins known as initiation factors; (4) GTP; and 
(5) a specific charged tRNA, f-Met-tRNA,™*, 


Initiator tRNA tRNA;™* is a particular tRNA for reading an AUG (or GUG, or even 
UUG) codon that signals the start site, or N-terminus, of a polypeptide chain; the 
subscript ; signifies initiation. This tR NA#™* does not read internal AUG codons, so 
it does not participate in chain elongation. Instead, that role is filled by another me- 
thionine-specific tRNA, referred to as tRNA™*, which cannot replace tRNA;™* in 
peptide chain initiation. (However, both of these tRNAs are loaded with Met by the 
same methionyl-tRNA synthetase.) The structure of E. coli tRNA}™* has several 
distinguishing features (Figure 30.16). Collectively, these features identify this tRNA 
as essential to initiation and inappropriate for chain elongation. 

The synthesis of all E. coli polypeptides begins with the incorporation of a modified 
methionine residue, N-formyl-Met, as N-terminal amino acid. However, in about half of 
the E. coli proteins, this Met residue is removed once the growing polypeptide is ten or so 
residues long; as a consequence, many mature proteins in E. coli lack N-terminal Met. 

The methionine contributed in peptide chain initiation by tRNA;™* is unique in that 
its amino group has been formylated. This reaction is catalyzed by a specific enzyme, 
methionyl-tRNA;™* formyl transferase (Figure 30.17). Note that the addition of the 
formyl group to the a-amino group of Met creates an N-terminal block resembling a 
peptidyl grouping. That is, the initiating Met is transformed into a minimal analog of a 
peptidyl chain. 


transferase 


Figure 30.15 The three tRNA-binding sites 

on ribosomes. The view shows the ribosomal 
surfaces that form the interface between the (a) 
30S and (b) 50S subunits in a 70S ribosome (as if 
a 70S ribosome has been opened like a book to 
expose facing pages). The A (green), P (blue), and 
E (yellow) sites are occupied by tRNAs. The 
decoding center on the 30S subunit (a) lies behind 
the top of the tRNAs in the A and P sites (which 
is where the anticodon ends of the tRNAs are 
located). The peptidyl transferase center on the 
50S subunit (b) lies at the lower tips (acceptor 
ends) of the A- and P-site tRNAs. (a: PDB id = 
2J02; b: PDB id = 2J03.) 
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Figure 30.16 The secondary structure of E. coli 
N-formyl-methionyl-tRNA;™*. The features 
distinguishing this initiator tRNA from noninitiator 
tRNAs are highlighted. 
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Figure 30.17 Methionyl-tRNA;™* formyl 
transferase catalyzes the transformylation of 
methionyl-tR NA;™* using N!°-formyl-THF as 
formyl donor. The tRNA for reading Met codons 
within a protein ((RNA™") is not a substrate for 
this transformylase. 


Figure 30.18 Various Shine-Dalgarno sequences 
recognized by E. coli ribosomes. These sequences lie 
about ten nucleotides upstream from their respective 
AUG initiation codon and are complementary to the 
UCCU core sequence element of E. coli 16S rRNA. 
G:U as well as canonical G:C and A: U base pairs 
are involved here. 


| Table 30.7 | =A) Properties of E. coli 


Initiation Factors 


Mass  Molecules/ 

Factor (kDa)  Ribosome Function 

IF-1 9 0.15 Binds to 30S A site 
and prevents tRNA 
binding 

IF-2 97 G-protein that 
binds fMet- 
tRNA;™«; interacts 
with IF-1 

IF-3 23 0.25 Binds to 30S E site; 


prevents 50S 
binding 
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satu gee | 
Note similarity 


to peptide grouping H 
Met-tRNA,™* N-formyl-Met-tRNA,™* 

Initiation 

codon 
araB - UUUGGAUGGAGUGAAACGA >GCGAUU- 
galE -AGCCUAAUGGAGCGAAUU:, ;AGAGUU- 
lacl - CAAUUCAGGGUGGUGAUU ' AAACCA— 
lacZ -UUCACACAGGAAACAGCU:, ;ACCAUG- 
Q B phage replicase - UAACUAAGGAUGAAAUGC r;-UCUAAG— 
X174 phage A protein - AAUCUUGGAGGCUUUUUU. GUUCGU- 
R17 phage coat protein - UCAACCGGGGUUUGAAGC 7CUUCU- 
ribosomal protein S12 -AAAACCAGGAGCUAUUUDUA,. GCAACA-— 
ribosomal protein L10 - CUACCAGGAGCAAAGCUA, 7CUUUA-— 
trpE - CAAAAUUAGAGAAUAACA, CAAACA-— 
irpL leader - GUAAAAAGGGUAUCGACAAUGAAAGCA-— 


’ ’ 
3! end of 16S rRNA 3 yoAUUCCUCCACUAG-— 5 


mRNA Recognition and Alignment In order for the mRNA to be translated accurately, 
its sequence of codons must be brought into proper register with the translational 
apparatus. Recognition of translation initiation sequences on mRNAs involves the 
16S rRNA component of the 30S ribosomal subunit. Base pairing between a pyrimi- 
dine-rich sequence at the 3’-end of 16S rRNA and complementary purine-rich tracts 
at the 5’-end of prokaryotic mRNAs positions the 30S ribosomal subunit in proper 
alignment with an initiation codon on the mRNA. The purine-rich mRNA sequence, 
the ribosome-binding site, is often called the Shine-Dalgarno sequence in honor of its 
discoverers. Figure 30.18 shows various Shine-Dalgarno sequences found in pro- 
karyotic mRNAs, along with the complementary 3’-tract on E. coli 16S rRNA. 
The 3’-end of 16S rRNA resides in the head region of the 30S small subunit, and 
the double-helical duplex formed by the Shine-Dalgarno sequence and the 16S 
rRNA 3’-end fits snugly in a chamber between the 30S subunit head and platform 
domains. 


Initiation Factors Initiation involves interaction of the initiation factors (IFs) with GTP, 
N-formyl-Met-tR NA/™*, mRNA, and the 30S subunit to give a 30S initiation complex 
to which the 50S subunit then adds to form a 70S initiation complex. The initiation 
factors are soluble proteins required for assembly of proper initiation complexes. 
Their properties are summarized in Table 30.7. 


Events in Initiation Initiation begins when a 30S subunit:(IF-3:IF-1) complex binds 
mRNA and a complex of IF-2, GTP, and f-Met-tRNA;'™*., The sequence of events is 
summarized in Figure 30.19. Although IF-3 is absolutely essential for mRNA binding 
by the 30S subunit, it is not involved in locating the proper translation initiation site 
on the message. The presence of IF-3 on 30S subunits also prevents them from reas- 
sociating with 50S subunits. IF-3 must dissociate before the 50S subunit will associate 
with the mRNA:30S subunit complex. 

IF-2 delivers the initiator f-Met-tRNA;!™ in a GTP-dependent process. The 30S 
subunit is aligned with the mRNA such that the initiation codon is situated within 
the 30S part of the P site. Upon binding, f-Met-tRNA;™* enters this 30S portion of 
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Figure 30.19 The sequence of events in peptide chain initiation. 


the P site. Codon recognition by fMet-tRNA;™M"*t changes the conformation of the 
30S complex, stabilizing binding of the fMet-tRNA;™* and destabilizing IF-3 
binding. GTP hydrolysis is necessary to form an active 70S ribosome. GTP hydro- 
lysis is triggered when the 50S subunit joins and is accompanied by IF-1 and IF-2 
release. The A site of the 70S initiation complex is ready to accept an incoming 
aminoacyl-tRNA; the 70S ribosome is poised to begin chain elongation. 


30.5c Peptide Chain Elongation Requires Two G-Protein 
Family Members 


The requirements for peptide chain elongation are (1) an mRNA:70S ribosome: peptidyl- 
tRNA complex (peptidyl-tR NA in the P site), (2) aminoacyl-tR NAs, (3) a set of proteins 
known as elongation factors, and (4) GTP. Elongation begins as soon as the second 
codon in the mRNA is read by the next aminoacyl-tRNA. Chain elongation can be 
divided into three principal steps: 


1. Codon-directed binding of the incoming aminoacyl-tRNA at the A site. Decoding 
center regions of 16S rRNA make sure the proper aminoacyl-tRNA is in the A site 
by direct surveillance of codon—anticodon base pairing geometry. 

2. Peptide bond formation: transfer of the peptidyl chain from the tRNA bearing it 
to the—NH)> group of the new amino acid. 

3. Translocation: Following peptide bond formation, the ribosome translocates 
precisely one codon further along the mRNA. Translocation also shifts the one- 
residue-longer peptidyl-tRNA to the P site to make room for the next aminoacyl- 
tRNA at the A site. 


30.5d The Elongation Cycle 


The properties of the soluble proteins essential to peptide chain elongation are sum- 
marized in Table 30.8. These proteins are present in large quantities, reflecting the 
great importance of protein synthesis to cell vitality. For example, elongation factor 
Tu (EF-Tu) is the most abundant protein in E. coli, accounting for 5% of total cellular 
protein. EF-Tu and EF-G, along with IF-2, are all members of the G-protein family of 
GTPases (see Figures 15.20, 32.21, and 32.22.) As such, the conformation of the GTP- 
bound and GDP-bound forms of these proteins are significantly different, allowing 
them to act as conformational switches that, in turn, can trigger dramatic conforma- 
tional changes in the ribosome to which they are bound. These ribosomal conforma- 
tional changes lie at the heart of the mechanical processes of protein synthesis by a 
ribosome translocating along an mRNA. 


5 


| Table 30.8 | <li) Properties of E. coli 


Elongation Factors 


Mass Molecules/ 
Factor (kDa) Cell Function 


EF-Tu 43 70,000 G protein that 
binds aminoacyl- 
tRNA and 
delivers it to the 
A site 


EF-Ts 74 10,000 Guanine- 
nucleotide 
exchange factor 
(GEF) that 
replaces GDP on 
EF-Tu with GTP 


EF-G 77 20,000 G protein that 
promotes 
translocation of 
mRNA 
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30.5e Aminoacyl-tRNA Binding and the Peptidyl Transferase Reaction 


EF-Tu binds aminoacyl-tRNA and GTP. There is only one EF-Tu species serving all 
the different aminoacyl-tRNAs, and aminoacyl-tRNAs are accessible to the A site of 
active 70S ribosomes only in the form of aminoacyl-tRNA:EF-Tu:GTP complexes. 
The ternary aa-tRNA:EF-Tu:GTP complex binds to the L7/L12 stalk region of the 
50S ribosomal subunit. Once correct base pairing between codon and anticodon has 
been established within the A site, the GTP is hydrolyzed to GDP and P; by EF-Tu. 
Because the sites of codon—anticodon recognition and GTP hydrolysis are 7.5 nm 
apart, conformational changes in the ribosome must convey the information of cog- 
nate tRNA recognition to the EF-Tu GTPase site. Indeed, conformational adjust- 
ments in the 50S ribosome driven by codon—anticodon recognition allow the phosphate 
group of residue A2662 in 23S rRNA to activate the intrinsic GTPase activity of EF-Tu 
through interactions with His** in EF-Tu. The region of the 50S ribosomal subunit 
where 23S rRNA A2662 is located is near where EF-Tu binds and is referred to as 
the GTPase-activating center, or GAC. As a result of these events, the aminoacyl end 
of the tRNA is properly oriented in the peptidyl transferase site of the 50S subunit, 
and the EF-Tu molecule is released as a EF-Tu:GDP complex (Figure 30.20). 
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Figure 30.20 (a) The cycle of events in peptide chain E~ Peptidyl-tRNA 


elongation on E. coli ribosomes. (b) The molecular 
graphic at the upper right is an EF-Tu: aminoacyl- 
tRNA complex (PDB id = 1TTT). EF-Tu is shown as 
a ribbon diagram and the tRNA as a ball-and-stick 
structure. 
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Dissociation of EF-Tu allows the aminoacyl end of the aminoacyl-tRNA to swing into 
the peptidyl transferase center, positioning its amino acid for addition to the growing 
peptide chain. 

Elongation factor Ts (EF-Ts) is a guanine-nucleotide exchange factor (GEF) that cata- 
lyzes the recycling of EF-Tu by mediating the displacement of GDP and its replacement 
by GTP. EF-Ts forms a transient complex with EF-Tu by displacing GDP, whereupon 
GTP displaces EF-Ts from EF-Tu (Figure 30.20). 


The Decoding Center: A 16S rRNA Function Analysis of the structures of the 70S 
ribosome:tR NA complexes and isolated 30S subunits has revealed the decoding center 
in the 30S subunit. This decoding center, where anticodon loops of the A- and P-site 
tRNAs and the codons of the mRNA are matched up, is primarily a property of 16S 
rRNA. Figure 30.21 reveals the location of the decoding center and highlights the 
interface between an mRNA UUU codon, the GAA anticodon of a cognate tRNA”, 
and the elements of the ribosome. 16S rRNA nucleotides A1493, A1492, G530, C518, 
and C1054 interact extensively with the minor groove of the UUU:AAG duplex; 
residues Ser% and Prof! of ribosomal protein S12 also participate (Figure 30.21b). 
A1492 and A1493 are part of 16S rRNA helix 44; G530 and C518 are from 16S helix 18. 
Triggered conformational changes within these 16S rRNA regions are key to codon— 
anticodon recognition. 


Peptidyl Transfer Peptidyl transfer, or transpeptidation, is the central reaction of 
protein synthesis, the actual peptide bond—forming step. No energy input (for 
example, in the form of ATP) is needed; the ester bond linking the peptidyl moiety 
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Figure 30.21 The decoding center of the 30S 
ribosomal subunit is composed only of 16S rRNA. 
(a) The 30S subunit, as viewed from the 50S subunit. 
The cyan circle shows the latch structure of the 30S 
subunit that encircles and encloses the mRNA. The 
mRNA enters along the path indicated by the arrow 
and follows a groove along this face of the subunit. 
The location of the decoding center is indicated by 
the red circle. (Adapted from Figure 3 in Schluenzen, 
F, et al., 2000. Structure of the functionally 
activated small ribosomal subunit at 3.3 Å 
resolution. Cell 102:615—623.) (b) 30S ribosomal 
subunit interactions with the codon-anticodon 
duplex during cognate tRNA recognition. (Adapted 
from Figure 2 in Ogle, J. M., and Ramakrishnan, V., 
2005. Structural insights into translational fidelity. 
Annual Review of Biochemistry 74:129-177.) 
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Figure 30.22 The peptidyl transferase active site: 
Base-pairing between cytosine residues of A-site 
tRNA (yellow) and P-site tRNA (orange) with 

23S rRNA bases (pale green; PDB id = 1VQN). 
The a-amino group of the aminoacyl group on the 
aminoacyl-tRNA (blue) is positioned for the attack 
on the ester carbonyl carbon of the peptidyl group 
of the P-site tRNA (green). (Adapted from Figure 2 
in Beringer, M., and Rodnina, M. V., 2007. The 
ribosomal peptidyl transferase. Molecular Cell 
26:311-321.) 


to tRNA is intrinsically reactive. As noted earlier, peptidyl transferase, the activity 
catalyzing peptide bond formation, is associated with the 50S ribosomal subunit. 
Indeed, this reaction is a property of the 23S rRNA in the 50S subunit. 


23S rRNA Is the Peptidyl Transferase Enzyme Peptide bond formation is catalyzed by 
the large rRNA in the large ribosomal subunit (e.g., the 23S rRNA in prokaryotic 
50S ribosomal subunits) through its peptidyl transferase center (PTC). No ribosomal 
proteins lie within 1.5 nm of the PTC; thus, none can participate in the reaction 
mechanism. In kinetic terms, the catalytic power of ribosomes is modest: A ribosome, 
through the rRNA comprising its PTC, accelerates the peptide-bond forming reaction 
rate 4 X 10°-fold over the uncatalyzed reaction. Peptidyl transfer requires that the 
acceptor ends (the aminoacylated ends) of both A- and P-site tRNAs interact with 
the PTC of the 50S subunit. Because the growing peptidyl chain doesn’t move during 
peptidyl transfer, the acceptor end of the A-site aminoacyl-tRNA must move into 
the P site as its aminoacyl function picks up the peptidyl chain. At the same time, 
the acceptor end of the deacylated P-site tRNA is shunted into the E site (see 
Figure 30.15). 


The Catalytic Power of the Ribosome Comes from Tight Binding of Its Peptidyl-tRNA 
and Aminoacyl-tRNA Substrates Structurally, the PTC can be described as a funnel- 
shaped active-site crater where the 3’-acceptor ends of the peptidyl-tRNA and ami- 
noacyl-tRNA meet. This crater lies directly above the entrance to the peptide exit 
tunnel. Nucleotides at the 3’-ends of the tRNAs base-pair with bases in the PTC. 
G2251 and G2252 of the PTC (Figure 30.22) base-pair with C75 and C74, respectively, 
of the P-site tRNA. PTC G2553 forms a base pair with C75 of the A-site tRNA. 
Significantly, PTC C2501 interacts with A76 of the P-site tRNA, and nearby U2506 
interacts with A76 of the A-site tRNA. These interactions and others lead to tight 
binding of the substrates by the ribosome in a manner such that the reactive groups 
(aminoacyl and peptidyl) are juxtaposed and properly oriented for reaction to occur. 
Collectively, functional groups provided by the PTC, the peptidyl-tRNA, and the 
aminoacyl-tRNA establish a network of H bonds and electrostatic interactions that 
stabilize the highly polar transition state (Figure 30.23). In effect, interactions of the 
tRNAs with the rRNA of the PTC create a highly organized environment that facili- 
tates peptide bond synthesis. Ribosome-catalyzed protein synthesis relies more on 
proximity and orientation of substrates than on chemical catalysis. The role of the 
PTC in protein synthesis is to align the tRNA-linked substrates so that the reaction 
is facilitated. 
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Translocation Three things remain to be accomplished in order to return the active 
70S ribosome:mRNA complex to the starting point in the elongation cycle, in prepa- 
ration for addition of the next amino acid: 


1. The deacylated tRNA must be removed from the P site. 

2. The peptidyl-tRNA must be moved (translocated) from the A site to the P site. 

3. The ribosome must move one codon down the mRNA so that the next codon is 
positioned in the A site. 


In the process of translocation, the tRNAs and the mRNA must move a distance of 
about 5 nm relative to the ribosome, a shift requiring large-scale conformational 
changes. Translocation is a two-step process. First, the acceptor stems of the tRNAs 
move from the A and P sites on the 50S subunit to the 50S P and E sites, respectively, 
while their anticodon stems remain fixed in the 30S A and P sites. Rachet-like rotation 
of the 30S and 50S subunits relative to one another drives this tRNA acceptor-stem 
repositioning, as transmitted through 30S interactions with the L1 stalk of the 50S sub- 
unit (see Figure 30.13). In the second step, the mRNA and the tRNAs move relative to 
the 30S subunit, bringing the one-residue-longer peptidyl-tRNA into the 30S P site and 
positioning the next codon within the 30S A site for translation. The translocation 
protein elongation factor G (EF-G) and GTP are required for this second step. EF-G, a 
G-protein family GTPase-driven conformational switch, couples the energy of GTP 
hydrolysis to these movements. As the mRNA and the anticodon ends of tRNAs move 
together with respect to the 30S subunit, the mRNA is passively dragged one codon 
further through the ribosome. This translocation of the mRNA relative to the 30S 
subunit delivers the next codon to the 30S A site. Swiveling of the head region of the 
30S subunit underlies relocation of the mRNA and tRNAs. The ribosomal components 
are now back in proper register for the another round of elongation. 

EF-G binds to the ribosome as an EF-G:GTP complex. GTP hydrolysis is essential 
not only for translocation but also for subsequent EF-G dissociation. Because EF-G 
and EF-Tu compete for a common binding site on the ribosome (the factor-binding 
center, also known as the GAC or GTPase-activating center) adjacent to the A site, 
EF-G release is a prerequisite for return of the 70S ribosome:mRNA to the beginning 
point in the elongation cycle. 

In this simple model of peptidyl transfer and translocation, the ends of both tRNAs 
move relative to the two ribosomal subunits in two discrete steps, with the acceptor ends 
moving first and then the anticodon ends. Furthermore, the readjustments needed to 
reposition the ribosomal subunits relative to the mRNA and to one another imply that 
the 30S and 50S subunits rotate relative to one another in ratchetlike fashion. This model 
provides a convincing explanation for why ribosomes are universally organized into a 
two-subunit structure: The small and large subunits must move relative to each other in a 
rachetlike fashion, as opposed to moving as a unit, in order to carry out the process of 
translation. 


Figure 30.23 The protein synthesis reaction 
proceeds via deprotonation of the a-amino 
group of the aminoacyl-tRNA, followed by 
nucleophilic attack of the a-NH), on the peptidyl- 
tRNA carbonyl carbon to form a highly polar 
tetrahedral intermediate. Proton transfer to the 
3'-O of the tRNA that contributed the peptidyl 
chain leads to release of this tRNA in the 
deacylated form and formation of a one-residue- 
longer peptidyl-tRNA. 
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A Deeper Look 


30.5f GTP Hydrolysis Fuels the Conformational Changes 
That Drive Ribosomal Functions 


Two GTPs are hydrolyzed for each amino acid residue incorporated into peptide dur- 
ing chain elongation: one upon EF-Tu—mediated binding of aa-tRNA and one more 
in translocation. The role of GTP (with EF-Tu as well as EF-G) is mechanical, in anal- 
ogy with the role of ATP in driving muscle contraction (see Chapter 16). GTP binding 
induces conformational changes in ribosomal components that actively engage these 
components in the mechanics of protein synthesis; subsequent GTP hydrolysis 
followed by GDP and Pi release relax the system back to the initial conformational 
state so that another turn in the cycle can take place. The energy expenditure for 
protein synthesis is at least four high-energy phosphoric anhydride bonds per amino 
acid. In addition to the two provided by GTP, two from ATP are expended in amino 
acid activation via aminoacyl-tRNA synthesis (see Figure 30.3). 


Molecular Mimicry—The Structures of EF-Tu: Aminoacyl-tRNA, EF-G, and RF-3 


EF-Tu and EF-G compete for binding to ribosomes. EF-Tu 
has the unique capacity to recognize and bind any aminoacy]l- 
tRNA and deliver it to the ribosome in a GTP-dependent 
reaction. EF-G catalyzes GTP-dependent translocation. The 
structure of the EF-Tu:tRNA complex is remarkably similar 
to the structure of EF-G (EF-Tu:tRNA is shown on the left 
in the figure; EF-G is in the center). The EF-Tu:tRNA struc- 
ture is Thermus aquaticus EF-Tu: Phe-tRNA?* complexed with 
GMPCP (purple); GMPCP is a nonhydrolyzable analog of GTP 
(PDB id = 1TTT). The EF-G structure has GDP bound (purple; 
PDB id = 1DAR). Note that parts of the EF-G structure mimic 
the structure of the tRNA molecule. Both EF-Tu and EF-G bind 
to the factor-binding center, which is located on the L7/L12 side 
of 50S ribosomal subunit. Part of this center has an associated 


EF-Tu:tRNA (left); EF-G 
(center); and RF-3 (right). 


GTPase function that plays an integral role in GTP hydrolysis 
and release of these factors. 

Nature has extended this mimicry with RF-3, a ribosome-binding 
protein essential to the termination phase of protein synthesis. RF-3 
is a GTP-binding protein that looks like a tRNA (the structure of 
RF-3 with bound GDP (purple) is shown on the right in the figure; 
PDB id = 2H5E). 

One view of early evolution suggests that RNA was the primordial 
macromolecule, fulfilling all biological functions, including those 
of catalysis and information storage that are now assumed for 
the most part by proteins and DNA. The mimicry of EF-Tu: 
tRNA by EF-G may represent a fossil of early macromolecular 
evolution when the proteins first began to take over some functions 
of RNA by mimicking shapes known to work as RNAs. 
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30.5g Peptide Chain Termination Requires Yet Another 
G-Protein Family Member 


The elongation cycle of polypeptide synthesis continues until the 70S ribosome 
encounters a stop codon. At this point, polypeptidyl-tR NA occupies the P site and the 
arrival of a stop or nonsense codon in the A site signals that the end of the polypeptide 
chain has been reached (Figure 30.24) These nonsense codons are not read by any 
terminator tRNAs but instead are recognized by specific proteins known as release 
factors (RFs), so named because they promote polypeptide release from the ribosome. 
The release factors bind at the A site. RF-1 recognizes UAA and UAG, whereas RF-2 
recognizes UAA and UGA. RF-1 and RF-2 are members of the guanine nucleotide 
exchange factor (GEF) family of proteins, which includes EF-Ts. Like EF-G, RF-1 
and RF-2 interact well with the ribosomal A-site structure. Furthermore, these release 
factors read the nonsense codons through specific tripeptide sequences that serve as 
the RF protein equivalent of the tRNA anticodon loop. There is about one molecule 
each of RF-1 and RF-2 per 50 ribosomes. Ribosomal binding of RF-1 or RF-2 is com- 
petitive with EF-G. RF-1 or RF-2 recruit a third release factor, RF-3, complexed with 
GTP; this protein is a structural mimic of tRNA. RF-3 is the fourth G-protein family 
member (the other three are IF-2, EF-Tu, and EF-G) involved in protein synthesis. All 
share the same ribosomal binding site. The state of the peptidyl-tRNA in the P site 
determines which is bound and, importantly, the progression of protein synthesis 
through initiation, elongation, and termination. 

The presence of release factors with a nonsense codon in the A site creates a 
70S ribosome:RF-1 (or RF-2):RF-3-GTP:termination signal complex that transforms 
the ribosomal peptidyl transferase into a hydrolase. That is, instead of catalyzing the 
transfer of the polypeptidyl chain from a polypeptidyl-tR NA to an acceptor aminoacyl- 
tRNA, the peptidyl transferase hydrolyzes the ester bond linking the polypeptidyl chain 
to its tRNA carrier. Both RF-1 and RF-2 have a conserved GGQ sequence located in a 
loop that enters the PTC. This GGQ sequence faces nucleotide A76 of the P-site tRNA; 
the closest 23S rRNA residues are A2451 and A2602. RF-1 and RF-2 promote confor- 
mational adjustments that expose the peptidyl-tRNA ester to attack by a water mole- 
cule. The conserved Gln in the release factor GGQ motif positions this hydrolytic water 
molecule through hydrogen-bonding interactions. The peptidy] transferase transfers the 
polypeptidyl chain to this water molecule instead of an aminoacyl-tRNA. Peptide 
release is followed by expulsion of RF-1 (or RF-2) from the ribosome (Figure 30.24). 
This leaves a ribosome: mRNA: P-site tRNA complex that must be disassembled by a 
protein ribosome recycling factor (RRF) with the help of EF-G. The structure of RRF 
resembles the letter “L”; thus, this protein is also structurally akin to a tRNA. 

We can now appreciate the role played by GTP in initiation, elongation, and termina- 
tion of protein synthesis. IF-2, EF-Tu, EF-G, and RF-3 are all GTP-binding proteins, 
and all are part of the G-protein superfamily (see Figure 15.20). IF-2, EF-Tu, EF-G, 
and RF-3 interact with the same site on the 50S subunit, the factor-binding center, in the 
50S cleft. This factor-binding center activates the GTPase activity of these factors, once 
they become bound. 


30.5h The Ribosomal Subunits Cycle Between 70S Complexes and a Pool 
of Free Subunits 


Ribosomal subunits cycle rapidly through protein synthesis. In actively growing bac- 
teria, 80% of the ribosomes are engaged in protein synthesis at any instant. Once a 
polypeptide chain is synthesized and the nascent polypeptide chain is released, the 
70S ribosome dissociates from the mRNA and separates into free 30S and 50S sub- 
units (Figure 30.25). Intact 70S ribosomes are inactive in protein synthesis because 
only free 30S subunits can interact with the initiation factors. Binding of initiation 
factor IF-3 by 30S subunits and interaction of 30S subunits with 50S subunits are 
mutually exclusive. 30S subunits with bound initiation factors associate with mRNA, 
but 50S subunit addition requires IF-3 release from the 30S subunit. 
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(a) A-site empty 
mRNA 3’ 
Nascent polypeptide 
PeptidyltRNA 


Inactive 70S ribosome 


Figure 30.24 The events in peptide chain 
termination. (a) A nonsense codon in the A site 
leads to binding of RF1 and RF3-GTP, hydrolysis 
of the peptidyl-tR NA bond, and release of the 
nascent polypeptide. RRF drives dissociation of the 
mRNA and deacylated tRNA in the P site. (b) The 
ribosome:RF-2 termination complex. The view here 
shows only the 30S subunit-tRNA complex, as if it 
were folded down and away from the 50S subunit, 
opening the ribosome as if you were opening your 
car’s glove compartment door. RF-2 is shown in 
yellow, the P-site tRNA in orange, the E-site tRNA 
in red, MRNA in dark blue, the 30S proteins in cyan, 
and the 30S ribosomal subunit 16S rRNA in light 
purple. Lower: The ribosome-bound conformation 
of RF-2, with its GGQ motif, is shown alone as the 
lower image in (b). These structures are derived from 
PDB id = 3F1E. (Adapted from Korostelev, A., 

et al., 2008. Crystal structure of a translation 
termination complex formed with release factor 
RF2. Proceedings of the National Academy of 
Sciences, U.S.A. 105:19684-19689.) 
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J 30S subunit with IF-1 and IF-3 
binds mRNA. IF3 blocks 50S 
subunit binding. 
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50S subunit can join. 


Elongation 
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Pool of 
70S ribosomes 


-/ 


Termination 


Figure 30.25 The ribosome life cycle. Note that IF-3 is released prior to 50S addition. 


A Deeper Look 


Tethered Ribosomes Open New Possibilities in Synthetic Biology 


As noted, ribosomal subunits normally cycle between free 30S and 
50S subunits and complete 70S ribosomes assembled by initiation 
factors in the presence of mRNA. Intact 70S ribosomes are inactive. 
The two-part nature of ribosomes is evident in their division of labor, 
with the decoding center located in the 30S subunit and the peptidyl 
transferase center in the 50S subunit. In addition, a precise choreog- 
raphy of substantial conformational changes takes place within and 
between the ribosomal subunits as they translate an mRNA, recruit- 
ing aminoacyl-tR NAs as dictated by codons and joining amino acids 
one after another to form polypeptide chains. For these reasons, 
ribosomes are universally composed of two subunits. Thus, the 
discovery by Alexander Mankin and his colleagues that ribosomes 
whose subunits were tethered together could still synthesize proteins 
is extraordinary. The Mankin laboratory created tethered ribosomes 
by connecting 16S and 23S rRNA gene sequences by a flexible RNA 
linker and expressing this co-joined super rRNA molecule in E. coli 
cells. The cells were able to assemble functional 70S-sized ribosomes 
on this super rRNA platform. These tethered single-particle ribo- 
somes (called Ribo-T) translated mRNA normally. In a further 


refinement, by changing the sequence within the Shine-Dalgarno 
complementary region at the 3’-end of the 16S rRNA segment of 
the super RNA, they could make Ribo-T ribosomes that would 
only translate mRNAs bearing mutant Shine-Dalgarno sequences, 
thereby controlling which messages Ribo-T would translate. Intro- 
duction of such mutations into the 16S small subunit rRNA and 
others into the 23S large subunit RNA of tethered ribosomes 
allows these ribosomes to be dedicated exclusively to translation of 
desired synthetic mRNAs and production of highly tailored protein 
products. This breakthrough has enormous potential for synthetic 
biology: The process of translation may now be engineered using 
expanded genetic codes and noncanonical amino acids to create 
polypeptide-like molecules with all sorts of novel chemistries. 
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Oscar L. Miller Jr., University of Virginia 


Figure 30.26 Electron micrograph of polysomes: 
multiple ribosomes translating the same mRNA. 
(From Francke, C., et al., 1982. Electron microscopic 
visualization of a discrete class of giant translation 
units in salivary gland cells of Chironomus tentans. 
The EMBO Journal 1:59-62. 


30.51 Polyribosomes Are the Active Structures of Protein Synthesis 


Active protein-synthesizing units consist of an mRNA with several ribosomes 
attached to it. Such structures are polyribosomes, or, simply, polysomes (Figure 30.26). 
All protein synthesis occurs on polysomes. In the polysome, each ribosome is 
traversing the mRNA and independently translating it into polypeptide. The further 
a ribosome has moved along the mRNA, the greater the length of its associated 
polypeptide product. In prokaryotes, as many as 10 ribosomes may be found in a 
polysome. Ultimately, as many as 300 ribosomes may translate an mRNA, so as many 
as 300 enzyme molecules may be produced from a single transcript. Eukaryotic 
polysomes typically contain fewer than 10 ribosomes. 


30.6 | How Are Proteins Synthesized in Eukaryotic Cells? 


Eukaryotic mRNAs are characterized by two post-transcriptional modifications: the 
5'-terminal 7methyl-GTP cap and the 3’-terminal poly(A) tail (Figure 30.27). The 
7Tmethyl-GTP cap is essential for mRNA binding by eukaryotic ribosomes and also 
enhances the stability of these mRNAs by preventing their degradation by 
5'-exonucleases. The poly(A) tail enhances both the stability and translational 
efficiency of eukaryotic mRNAs. The Shine-Dalgarno sequences found at the 5’-end 
of prokaryotic mRNAs are absent in eukaryotic mRNAs. 


30.6a Peptide Chain Initiation in Eukaryotes 


As might be expected, eukaryotic protein synthesis is considerably more complex 
than prokaryotic protein synthesis. More than 100 different RNA molecules and 200 
proteins are required just for the core translational machinery in Caenorhabditis ele- 
gans, a simple animal. Despite such complexity, the overall process is similar to pro- 
karyotic protein synthesis. The events in eukaryotic peptide chain initiation are 
summarized in Figure 30.28, and the properties of eukaryotic initiation factors, sym- 
bolized eIFs, are presented in Table 30.9. The eukaryotic initiator tRNA is a unique 
tRNA functioning only in initiation. Like the prokaryotic initiator tRNA, the eukary- 
otic version carries only Met. However, unlike prokaryotic f-Met-tRNAj™*, the Met 
on this tRNA is not formylated. The eukaryotic initiator tRNA is usually designated 
tRNA", with the “i” indicating initiation. 

In particular, initiation of protein synthesis is significantly more complicated in 
eukaryotes. It can be divided into three fundamental stages: 


Stage 1: Eukaryotic protein synthesis begins with formation of the 43S preinitiation 
complex from the 40S ribosomal subunit and the Met-tRNA;M:eIF2-GTP ternary 
complex (Stage 1, Figure 30.28). Initiation factors eIF1, eIF1A, eIF3, and eIF5 bind to 
a 40S ribosomal subunit. eIF1 and eIF1A binding induces a conformational change in 
the 40S subunit that renders it competent for Met-tRNAM* binding. Met-tRNAM* 


7-Methyl GTP “cap” at 
5'-end 


5' Untranslated Coding 3' Untranslated Poly(A) 
6 o o region region region tail 
ee (ete oo a ae 
O— P —O— P —O— P (A) 50-100 — OH 
| | | SZ 
o 9 Initiation Polyadenylation 
codon signal 


Figure 30.27 The characteristic structure of eukaryotic mRNAs. Untranslated regions ranging 
between 40 and 150 bases in length occur at both the 5'- and 3'-ends of the mature mRNA. An 
initiation codon at the 5'-end, invariably AUG, signals the translation start site. 
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Ea. GTP Figure 30.28 The three stages in the initiation of 
a Met-tRNA Met \ translation in eukaryotic cells. See Table 30.9 for a 
' \ description of the functions of the eukaryotic 


y initiation factors (eIFs). Stage 1: Formation of the 
eF MettRNA Met è lerr? e CTP 43S preinitiation complex from Met-tRNA;M*t, GTP, 
(ternary complex) eIFs 1, 2, and 3, and the 40S ribosomal subunit. 
Stage 2: Formation of 43S:mRNA complex (48S 
1 éis initiation complex) from the 43S preinitiation 
complex, the capped and polyadenylylated mRNA, 
80S > 40S > 435 ~q eIF4B, eIF4F, and PABP. (Detailed organization of 
the 48S initiation complex is shown in Figure 30.29.) 
GDP Stage 3: eIF5B-mediated joining of the 60S 
Y ribosomal subunit to form the 80S initiation 
43S preinitiation complex complex. 


7 
m‘GTP ‘A mRNA ——~ 
n 
eIF4E GTP 
eIF4F 4 eIF4G 
eIF4A 
eIF4B 
PABP 
2 
M 
43S e mRNA complex 
m’GTP ——> Z IIA (48S initiation complex) 
subunit 
elF5B * GTP BE. cpp yy 
3 
Elongation 
oa 80S initiation complex 


Table 30.9 perties of Eukaryotic Translation Initiation Factors 
Factor Subunit Size (kDa) Function 
elF1 15 Enhances initiation complex formation 
eIFIA 17 Stabilizes Met-tR NA; binding to 40S ribosomes 
elF2 125 GTP-dependent Met-tRNA; binding to 40S ribosomes 
a 36 Regulated by phosphorylation 
B 50 Binds Met-tRNA; 
y 59 Binds GTP, Met-tRNA; 
eIF2B 270 Promotes guanine nucleotide exchange on elF2 
eIF2C 94 Stabilizes ternary complex in presence of RNA 
eIF3 800 A multisubunit scaffolding complex that recruits eIFs, mRNA and Met-tRNA;, and the 40S subunit to 
form the 43S PIC 
elF4F 243 Binds to mRNA caps and poly(A) tails; consists of eIF4A, eIF4E, and eIF4G; RNA helicase activity 
unwinds mRNA 2° structure 
eIF4A 46 Binds RNA; ATP-dependent RNA helicase; promotes mRNA binding to 40S ribosomes 
eIF4E 24 Binds to 5'-terminal ’methyl-GTP cap on mRNA 
elF4G 173 A scaffolding protein that associates the mRNA with ribosome-bound eIF3, binds to PABP 
elF4B 80 Binds mRNA; promotes RNA helicase activity and mRNA binding to 40S ribosomes 
elF4H 25 Acts with eIF4B to stimulate el[F4A helicase activity 
elF5 49 Promotes GTPase of eIF2, ejection of e[F2 and eIF3 
eIF5B 175 Ribosome-dependent GTPase activity; mediates 40S and 60S joining, homologous with prokaryotic IF2 
elF6 Dissociates 80S; binds to 60S 
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(in the form of an eIF2:GTP:Met-tRNAM* ternary complex) then binds to the 
eIF1/1A/3:40S subunit complex, followed by eIF5. (Unlike in prokaryotes, binding of 
Met-tRNAM* by eukaryotic ribosomes occurs in the absence of mRNA, so Met- 
tRNA binding is not codon-directed.) In mammals, eIF3 is an 800-kDa multimer 
containing at least 13 different subunits. The eIF3 complex binds to the solvent-exposed 
side of the 40S subunit, away from the face that interacts with mRNA and the 60S sub- 
unit. A prominent role of eIF3 is to serve as a platform and scaffold for the recruitment 
of mRNA and other proteins involved in translation initiation. 


Stage 2: in eukaryotic translation initiation leads to formation of the 48S initiation 
complex (Figure 30.28). This stage involves binding of the 43S preinitiation complex to 
mRNA and migration of the 40S ribosomal subunit to the correct AUG initiation co- 
don. Binding of mRNA by the 43S preinitiation complex requires a set of proteins in- 
cluding eIF4 group and the (poly)A-binding protein (PABP). Collectively, these proteins 
recognize the 5’-terminal cap and 3’-terminal poly(A) tail of an mRNA, unwind any 
secondary structure in the mRNA, and transfer the mRNA to the 43S preinitiation 
complex. The eIF4 group includes eIF4B and eIF4F. eIF4F is a trimeric complex con- 
sisting of e[F4A (an ATP-dependent RNA helicase), e[F4E (which binds the 5’-termi- 
nal ‘methyl-GTP of mRNAs), and eIF4G. Because eIF4G interacts with PABP (Figure 
30.29), eIF4G serves as the bridge between the cap-binding eIF4E, the poly(A) tail of 
the mRNA, and the 40S subunit (through interaction with eIF3). These interactions 
between the 5'-terminal ’methyl-GTP cap and the 3’-poly(A) tail initiate scanning of 
the 40S subunit in search of an AUG codon. Both eIF1 and eIFIA are necessary for 
opening the mRNA-binding channel of the 40S subunit so that scanning in search of 
an initiation codon can take place. The scanning mechanism operates by inspecting 
every triplet of bases within the 5’-leader region of mRNA for complementarity with 
the tRNA" anticodon. 

eIF4E, the mRNA cap-binding protein of eIF4F, is a key regulatory element in 
eukaryotic translation. eIF4F binding to the cap structure is necessary for association of 
eIF4B and formation of the 48S preinitiation complex. Translation is inhibited when the 
eIF4E subunit of e[F4F binds with 4EBP (the eIF4E binding protein). Growth factors 
stimulate protein synthesis by causing the phosphorylation of 4EBP. For example, mam- 
malian target of rapamycin complex 1 (mTORC-1)-mediated phosphorylation of 4EBP 
causes it to release bound eIF4E, which then can bind to the 5’-cap of an mRNA, trig- 
gering translation initiation. Amino acid levels, glucocorticosteroid hormones, and resis- 
tance exercise all affect protein synthesis through effects on mTORC-1. 

Note that recruitment of both the 5'-cap and 3'-poly(A) tail of the mRNA to eIF4G 
(Figure 30.29) brings the two ends of the mRNA together. Thus, protein:protein inter- 
actions between poly(A)-bound PABP and the N-terminus of eI[F4G bring the 3’- and 
5'-ends of the mRNA into close proximity, creating an mRNA circle. These interactions 
ensure that only appropriately processed transcripts are translated. Further, closing the 
circle helps protect the mRNA from exonucleolytic degradation. 


Stage 3: Formation of the 80S initiation complex. When the 48S preinitiation complex 
stops at an AUG codon, GTP hydrolysis in the e[F2: Met-tRNA;M ternary complex 
causes ejection of the initiation factors bound to the 40S ribosomal subunit. eIF5B acts 
here by stimulating the GTPase activity of eIF2. Ejection of the eIFs is followed by 60S 
subunit association to form the 80S initiation complex, whereupon translation begins 
(Figure 30.28, stage 3). eI[F2:GDP is recycled to eIF2:GTP by eIF2B (eIF2B is a 
guanine nucleotide exchange factor). 


30.6b Control of Eukaryotic Peptide Chain Initiation ls One Mechanism 
for Post-Transcriptional Regulation of Gene Expression 


Regulation of gene expression can occur post-transcriptionally through control of 
mRNA translation, as indicated by the effects of mTORC-1 mentioned earlier. 
Phosphorylation/dephosphorylation of translational components is a dominant mech- 
anism for control of protein synthesis. Several initiation factors—elIF 2a, eIF2B, eIF4E, 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


30.6 How Are Proteins Synthesized in Eukaryotic Cells? 1119 


elF4G, 40S ribosomal protein S6, and two eukaryotic elongation factors, eEF1 and 
eEF2 (see following)—have been identified as targets of regulatory controls. Modifica- 
tion of some factors affects the rate of mRNA translation; modification of others 
affects which mRNAs are selected for translation. Peptide chain initiation, the initial 
phase of the synthetic process, is the optimal target for such control. Phosphorylation 
of S6 facilitates initiation of protein synthesis, resulting in a shift of the ribosomal pop- 
ulation from inactive ribosomes to actively translating polysomes. S6 phosphorylation 
is stimulated by serum growth factors. On the other hand, the phosphorylation of some 
translational components inhibits protein synthesis. For example, the a-subunit of eIF2 
can be reversibly phosphorylated at a specific Ser residue by an eIF2a kinase/phospha- 
tase system (Figure 30.30). Four different e[F2a kinases have been identified; each is 
responsive to specific metabolic signals. Phosphorylation of elF2a@ inhibits peptide 
chain initiation. Phosphorylation of eIF2 by heme-regulated inhibitor (HRI, the heme- 
inhibited eIF2a kinase) is an important control governing globin synthesis in reticulo- 
cytes. If heme for hemoglobin synthesis becomes limiting in these cells, elF2a is 
phosphorylated, so globin mRNA is not translated and chains are not synthesized. 
Availability of heme inhibits HRI, leading to resumption of protein synthesis upon 
phosphatase-mediated removal of the phosphate group from the Ser residue. 


eIF2a 

kinase 
2 » =D a, 
EG, (Boy <— BI 
eIF2 eIF2 Tight 


complex 


Phosphoprotein 
® phosphatase H,O 


Figure 30.29 The 48S initiation complex. mRNA is 
shown as a black line, its 5'-cap as a black circle, and 
its coding region as a yellow bar beginning with AUG 
and ending with UAA; downstream lies the poly(A) 
tail. The initiator Met-tR NA; is shown as a pitchfork 
bound to the AUG initiation codon. The various 
initiation factors are shown by their numerical and 
letter codes. Mnk is a stress- and mitogen-activated 
protein kinase that phosphorylates elF4E, increasing 
its affinity for the 5’-cap. 


Figure 30.30 Control of eIF2 functions through 
reversible phosphorylation of a Ser residue on its 
a-subunit. The phosphorylated form of eIF2 
(eIF2-P) enters a tight complex with eIF2B and is 
unavailable for initiation. 
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30.6c Peptide Chain Elongation in Eukaryotes 
Resembles the Prokaryotic Process 


Eukaryotic peptide elongation occurs in very similar fashion to the process in pro- 
karyotes. An incoming aminoacyl-tRNA enters the ribosomal A site while peptidyl- 
tRNA occupies the P site. Peptidyl transfer then occurs, followed by translocation of 
the ribosome one codon further along the mRNA. Two elongation factors, eEF1 and 
eEF2, mediate the elongation steps. eEF1 consists of two components: eEFIA and 
eEF1B. eEFIA is the eukaryotic counterpart of EF-Tu; it serves as the aminoacyl- 
tRNA binding factor and requires GTP. eEFIB is the eukaryotic equivalent of pro- 
karyotic EF-Ts; it catalyzes the exchange of bound GDP on eEFI:GDP for GTP so 
that active eEF1:GTP can be regenerated. EF2 is the eukaryotic translocation factor. 
eEF2 (like its prokaryotic kin, EF-G) binds GTP, and GTP hydrolysis accompanies 
translocation. The diphthamide group of eEF2 (see the Human Biochemistry box) 


Human Biochemistry 


Diphtheria Toxin ADP-Ribosylates eEF2 


Diphtheria arises from infection by Corynebacterium diphtheriae 
bacteria carrying bacteriophage corynephage B. Diphtheria toxin is 
a phage-encoded enzyme secreted by these bacteria that is capable of 
inactivating a number of GTP-dependent enzymes through covalent 
attachment of an ADP-ribosyl moiety derived from NAD*. That is, 
diphtheria toxin is an NAD*-dependent ADP-ribosylase. One tar- 
get of diphtheria toxin is the eukaryotic translocation factor, eEF2. 
This protein has a modified His residue known as diphthamide. 
Diphthamide is generated post-translationally on eEF2; the diph- 
thamide moiety of eEF2 is important for maintaining the correct 


codon reading frame during translocation. (EF-G of prokaryotes 
lacks this unusual modification and is not susceptible to diphtheria 
toxin.) Diphtheria toxin specifically ADP-ribosylates an imidazole- 
N within the diphthamide moiety of eEF2 (see accompanying 
figure). ADP-ribosylated eEF2 retains the ability to bind GTP but 
is unable to function in protein synthesis. Because diphtheria toxin 
is an enzyme and can act catalytically to modify many molecules 
of its target protein, just a few micrograms suffice to cause death. 
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Diphtheria toxin catalyzes the NAD*-dependent ADP-ribosylation of selected 
proteins. ADP-ribosylation of the diphthamide moiety of eukaryotic EF2. ADP-ribosylated 
(Diphthamide = 2-[3-carboxamido-3-(trimethylammonio)propylJhistidine. ) diphthamide residue 
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stabilizes codon-anticodon recognition by interacting directly with the mRNA 
codon. Further, not only does eEF2 catalyze the GTP-dependent translocation of the 
ribosome along the mRNA, it ensures the codon-by-codon accuracy of translocation, 
thereby maintaining the correct reading frame. Elongation continues until a stop 
codon is reached, signaling termination. 


30.6d Eukaryotic Peptide Chain Termination 
Requires eRF1, eRF3, and GTP 


Translation of an mRNA terminates when a stop codon occupies the A site on the 
ribosome. Termination in eukaryotes is mediated by a ternary complex of release fac- 
tors eRF1, eRF3, and GTP. The N-terminal domain of eRF1 is able to recognize any 
of the three stop codons; its C-terminal domain binds eRF3 and another protein 
called ABCEI. Upon reading a stop codon, eRF1 catalyzes hydrolysis of the nascent 
polypeptide chain from the P-site peptidyl-tRNA in the following manner: When 
eRFI sits on the A-site stop codon, its GGQ sequence motif is positioned adjacent to 
the peptidyl transferase center (PTC) of the large ribosomal subunit, in proximity to 
CCA-end of the peptidyl-tRNA. Residue Q'* of the eRF1 GGQ motif is close to the 
ester bond linking the polypeptide to the peptidyl-tRNA. The eRF3 partner of eRF1 
is a GTPase whose action drives conformational changes. These changes expose the 
peptidyl-tR NA ester bond to Q!'*-facilitated nucleophilic attack by water, leading to 
polypeptide release. Upon termination, the eRFl:eRF3 complex dissociates and the 
ribosomal subunits are recycled by ATP-driven conformational changes mediated by 
ABCE! (ATP-binding cassette protein 1). The action of ABCE1 splits the 80S ribo- 
some into free 40S and 60S subunits, freeing them for recruitment into new transla- 
tion initiation reactions. 


30.7 | Do Protein Synthesis Inhibitors Have 
Clinical Applications? 


Protein synthesis inhibitors serve two major, and perhaps complementary, purposes. 
First, they have been very useful scientifically in elucidating the biochemical mecha- 
nisms of protein synthesis. Second, some of these inhibitors affect prokaryotic but 
not eukaryotic protein synthesis, and thus, are medically important antibiotics. 
Table 30.10 is a partial list of these inhibitors and their mode of action. The structures 
of some of these compounds are given in Figure 30.31. 

As indicated in Table 30.10, many antibiotics inhibit protein synthesis by binding to key 
rRNA components. The two principal sites for antibiotic binding are the 16S rRNA of the 
decoding center within the small ribosomal subunit and the PTC of the 23S rRNA in the 
large subunit. More antibiotics are targeted to the PTC than to the decoding center. 

Why is rRNA the target for antibiotics? Generally speaking, it is difficult for organ- 
isms to defend themselves against agents that bind rRNA rather than protein. First of 
all, there are many rRNA genes, so mutation of one gene to antibiotic resistance (loss 
in antibiotic binding, for instance) would affect few rRNA molecules among the many. 
Further, four bases comprise RNA (as opposed to 20 amino acids in proteins), so there 
are fewer possibilities for modification of the rRNA so that the RNA retains its func- 
tion but is no longer able to bind the antibiotic. 


30.7a The Decoding Site Is a Target of Aminoglycoside Antibiotics 


Aminoglycoside antibiotics are a series of compounds with a 2-deoxy-streptamine 
(2-DOS) core structure (Figure 30.32a) that is central to their action. These antibiotics 
bind to an unpaired adenine residue (A148) within the 16S rRNA decoding site (that 
part of the 16S rRNA that decodes the mRNA through interactions with the antico- 
don of the aminoacyl-tRNA in the A site). A'4°° lies within a small internal rRNA 
loop (Figure 30.32b) that is part of the decoding center. The sugar moiety attached to 
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Figure 30.31 The structures of various antibiotics that act as protein synthesis inhibitors. Puromycin 
mimics the structure of aminoacyl-tRNA in that it resembles the 3'-terminus of a Tyr-tRNA (shaded box). 


the 4-position of the 2-DOS core forms H bonds with A™° that limit the flexibility of 
A? and A'°3 on the other side of the small RNA loop; A”? and A? interact with 
codon nucleotides (see Figure 30.21b). This flexibility loss results in errors in transla- 
tion fidelity, in which the code is misread, the wrong amino acid is inserted in the 
growing polypeptide chain, and the protein being made is nonfunctional. Aminogly- 
coside antibiotics have been useful probes for decoding center properties. 


30.7b Many Antibiotics Target the PTC and the Peptide Exit Tunnel 


A chemically diverse series of antibiotics targets the PTC and/or the adjacent peptide 
exit tunnel. Among these, the macrolide antibiotics are one of the clinically most 
important classes; erythromycin (see Figure 30.31), a broadly prescribed macrolide, is 
representative of this class. (The newer, semisynthetic ketolide drugs are analogs of 
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| Table 30.10 | AAO Some Protein Synthesis Inhibitors 


Inhibitor Cells Inhibited Mode of Action 

Initiation 

Aurintricarboxylic acid Prokaryotic Prevents IF binding to 30S subunit 
Kasugamycin Prokaryotic Inhibits fMet-tRNA;M"°t binding 
Streptomycin Prokaryotic Prevents formation of initiation complexes 


Elongation: Aminoacyl-tRNA Binding 


Tetracycline Prokaryotic Inhibits aminoacyl-tRNA binding at A site 
Streptomycin Prokaryotic Codon misreading, insertion of improper amino acid 
Kirromycin Prokaryotic Binds to EF-Tu, preventing conformational switch from EF-Tu: GTP to EF-Tu: GDP 


Elongation: Peptide Bond Formation 


Sparsomycin Prokaryotic Peptidyl transferase inhibitor 

Chloramphenicol Prokaryotic Binds to 50S subunit, blocks the A site and inhibits peptidyl transferase activity 

Clindamycin Prokaryotic Binds to 50S subunit, overlapping the A and P sites and blocking peptidyl transferase activity 
Erythromycin Prokaryotic Blocks the 50S subunit tunnel, causing premature peptidyl-tR NA dissociation 

Elongation: Translocation 

Fusidic acid Both Inhibits EF-G: GDP dissociation from ribosome 

Thiostrepton Prokaryotic Inhibits ribosome-dependent EF-Tu and EF-G GTPase activity 

Diphtheria toxin Eukaryotic Inactivates eEF-2 through ADP-ribosylation 

Cycloheximide Eukaryotic Binds to the 50S E-site and stalls peptide elongation 

Premature Termination 

Puromycin Both Aminoacyl-tRNA analog, binds at A site and acts as peptidyl acceptor, aborting peptide elongation 
Ribosome Inactivation 

Ricin Eukaryotic Catalytic inactivation of 28S rRNA via N-glycosidase action on A**¢ within the GAC 
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the macrolide class and act in a similar manner.) X-ray crystallographic analyses of 
ribosome—macrolide complexes show the macrolide bound within the peptide exit 
tunnel, | to 1.5 nm below the PTC. Macrolide binding plugs the exit tunnel, prevent- 
ing the movement of the growing peptide chain through the tunnel. Protein synthesis 
is aborted and the peptidyl-tR NA eventually dissociates from the ribosome. Substitu- 
ents bound at the 5-position of the macrolide extend outward from the peptide exit 
tunnel toward the PTC. (The 5-position in macrolides is the position where the 
N(CH3))-containing saccharide ring of erythromycin is attached; see Figure 30.31). 
All direct interactions of macrolide antibiotics with the ribosome involve 23S rRNA, 
not ribosomal proteins. Both macrolide and ketolide antibiotics adopt a conforma- 
tion within the ribosome that projects a polar face into the peptide exit tunnel and a 


Figure 30.32 The antibiotic geneticin and its site 
of action. (a) Structure of geneticin, a 
representative aminoglycoside antibiotic. Note the 
characteristic 2-deoxystreptamine (2-DOS) core 
structure, in red. (b) The base sequence of the 
small RNA loop within the 16S rRNA decoding 
center. Note that unpaired adenine residues 1408, 
1492, and 1493 constitute the internal loop 
structure. (Adapted from Figure 1 in Hermann, T., 
2005. Drugs targeting the ribosome. Current 
Opinion in Structural Biology 15:355—366.) 
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nonpolar face toward the tunnel wall. Hydrophobic interactions between the lactone- 
ring methyl groups and the tunnel wall contribute significantly to the binding energy 
of these drugs. Residue A’ of the prokaryotic 23S rRNA lies at the entrance to the 
peptide exit tunnel and favors macrolide antibiotic binding. The major large-subunit 
rRNA of ribosomes found in the cytosol and mitochondria of eukaryotic cells have G 
at the equivalent position and are unaffected by macrolides. 

The most common effect of antibiotics that interact with the PTC is to occupy space 
within this center, such that the amino acid or peptidyl chain linked at the 3’-end of a 
tRNA cannot be positioned properly for the peptide-bond forming reaction. This mode of 
inhibition is consistent with the catalytic role of PTC in precisely orienting the substrates 
so that the peptide bond-forming reaction can occur. This effect is more common for 
aminoacyl-tRNAs in the A site, although some drugs can bridge the A and P sites and 
affect both aminoacyl-tRNA and peptidyl-tR NA orientation. Ribosomes with long pepti- 
dyl chains attached to the tRNA in the P site are less susceptible to macrolide antibiotics. 


30.7c Eukaryotic-Specific Protein Synthesis Inhibitors 


Small molecule inhibitors that target the 80S ribosome have use in the treatment of 
cancer, genetic disorders, or diseases caused by infectious organisms that are eukary- 
otic. Such compounds include those that target the 60S peptidyl transferase center 
(PTC), the tRNA E site, the peptide exit tunnel, or the GTPase-activating center 
(GAC). Others target the 40S decoding center or translation factor-binding sites. 
Unfortunately, most of these compounds have limited clinical use due to their cyto- 
toxic side effects. 


Foundational Biochemistry Things You Should Know after Reading Chapter 30. 


e The genetic code allows the information expressed in a base sequence ® 


to be translated into the amino acid sequence of a protein. 


Bacterial ribosomes contain 50 different proteins and three different 
rRNAs. 


e tRNAs are adapter molecules that bridge the gap between nucleotide © Ribosomes can spontaneously assemble in vitro from a mixture of 
sequence and amino acid sequence. their component proteins and rRNAs. 
®© The genetic code is a triplet code. © The overall structures of prokaryotic and eukaryotic ribosomes are 
© Codons specify amino acids. sinan, 
e (OF the 64 codons possible from the four bases (A, C, G, U), 61 specify The central Coma of the 30S ribosomal subunit is the site of the 
. : See Oe : decoding center. 
an amino acid (the sense codons) and 3 are termination signals (stop 
codons or nonsense codons). e The defining reaction of protein synthesis is performed by peptidyl 
o : ; ; transf 7 
e The genetic code is unambiguous, degenerate, and universal. uence 
: or ee ; , e  Peptidyl transferase is a catalytic activity of the large rRNA of the 
e An amino acid is matched with its proper tRNA by its specific la - è ea osómäl subunit. ? 7 5 
aminoacyl-tRNA synthetase. 
. . © Cytosolic ribosomes of eukaryotes are considerably larger and more 
e Cells have 20 different aminoacyl-tRNA synthetases. complex than prokaryotic ribosomes. 
. The code by which each aminoacyl-tRNA matches up. its amino acid e In translation, a ribosome moves along an mRNA in the 5’-3’ direc- 
with appropriate tRNAs constitutes a second genetic code. tion, synthesizing a protein in the N=>C direction. 
e Aminoacyl-tRNA synthetases interpret the second genetic code. e Initiation of translation involves mRNA binding by the small ribo- 
. igs aie eet 
e Evolution has provided two distinct classes of aminoacyl-tRNA somal subunit, following by association of the initiator Met-tR NA; 
that recognizes the first codon, then recruitment of a large ribosomal 
synthetases. : ae ae ee 
subunit to form the 70S initiation complex, which is composed of 
© Recognition of mRNA codons by tRNA anticodons is through anti- 30 + 50S ribosomal subunits: mRNA: f-Met-tRNAM*. 
parallel base-pairing. e Elongation is the phase of translation that forms every peptide bond, 
e The wobble hypothesis points out that the third base of the codon and from the first to the last. 
the first base of the anticodon follow less stringent base-pairing rules. — ẹ Elongation is a repetitive series of: codon-directed binding of an 
e Some codons are used more than others. aminoacyl-tRNA, peptidyl transfer from the peptidyl-tRNA to the 
. amino group of the aminoacyl-tRNA, and translocation of the ribo- 
° eva suppression occurs when suppressor tR NAs read nonsense some one codon further along the mRNA. 
codons. 
; k . e Three tRNAs may be associated with the ribosome:mRNA complex 
e Ribosomes are the agents by which mRNAs are translated into proteins. at any moment: one in the A (or acceptor) site, one in the P (or pep- 
e Bacterial ribosomes are composed of 30S and 50S subunits. tidyl) site, and one in the E (or exit) site. 
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e Termination is triggered when the ribosome reaches a termination 
codon on the mRNA. 


e Peptide chain initiation requires: mRNA, 30S and 50S ribosomal 
subunits, initiation factors, GTP, and f-Met-tRNA™*. 


e In prokaryotes, recognition of the mRNA and proper alignment 
for translation depends on interaction between the Shine-Dalgarno 
sequence at the 5’-end of the mRNA and a complementary sequence 
near the 3’-end of the small ribosomal subunit 16S rRNA. 


e Three initiation factors (IF-1, IF-2, and IF-3) participate in initiation 
of translation. 


e  IF-2, a G-protein family member, delivers the initiator f-Met-tRNAM* 
to the 30S ribosomal subunit: mRNA complex. 


e GTP hydrolysis by IF-2 is triggered when a 50S ribosomal subunit 
joins the 30S: mRNA: f-Met-tRNA™* complex. 


e Three elongation factors are required: EF-Tu, EF-Ts, and EF-G. 
e The peptide chain elongation cycle has three principal steps: 


1. Codon-directed binding of an incoming aminoacyl-tRNA at the 
A site. 

2. Peptide bond formation by the 50S subunit peptidyl transferase ac- 
tivity through transfer of the peptidyl chain from the P-site tRNA 
to the amino group of the aminoacyl-tRNA in the A site. 

3. Translocation of the ribosome one codon further along the 
mRNA, accompanied by movement of the now longer peptidyl- 
tRNA from the A site to the P site. 


e The decoding center that ensures proper codon: anticodon recogni- 
tion is a 16S rRNA function. 


e EF-Tu is a G-protein family member that mediates delivery of the 
aminoacyl-tRNA to the A site. 


e  EF-Ts is a guanine nucleotide exchange factor (GEF) that exchanges 
GDP for GTP on Ef-Tu. 


e 238 rRNA is the peptidyl transferase enzyme. 


e Catalysis of peptide bond formation relies on tight binding of the 
aminoacyl-tRNA and the peptidyl-tR NA by the ribosome. 


e = The translocation phase of protein synthesis has three principal steps: 


1. The deacylated tRNA must be moved from the P site to the E site. 
2. The newly extended peptidyl-tRNA must be moved from the 
A site to the P site. 


Problems 1125 


3. Concomitant with these events, the ribosome must translocate one 
codon further along the mRNA. 


EF-G acts as a GTP-driven molecular switch that mediates ribosome 
movement along the mRNA. 


EF-Tu and EF-G compete for binding at a common site on the ribosome, 
called the factor-binding center or GTPase-activating center (GAC). 


Peptide chain termination requires yet another G-protein family mem- 
ber, RF-3. 


Molecular mimicry is seen in the structures of EF-TU :tRNA com- 
plex, EF-G, and RF-3. 


Termination occurs when release factors recognize a termination codon. 


The ribosomal subunits cycle between 70S complexes and free ribo- 
somal subunits. 


Polysomes are the active structures of protein synthesis. 
Protein synthesis in eukaryotes is more complex than in prokaryotes. 


More than a dozen eukaryotic initiation factors, symbolized as elFs, 
are required. 


Eukaryotic initiation proceeds through three stages: 


1. Formation of the 43S preinitiation complex, composed of a 
set of the eIFs, the 40S ribosomal subunit, and elF2:GTP: 
Met-tRNAM*. 

2. Formation of the 48S initiation complex, which includes the 
43S preinitiation complex, mRNA, and additional elFs. 

3. Formation of the 80S initiation complex, with the initiator 
Met-tRNA,M“ atop the initiation codon, through addition of the 
60S large ribosomal subunit. 


Control of eukaryotic peptide chain initiation is an important mecha- 
nism for post-transcriptional regulation of gene expression. 


Eukaryotic peptide chain elongation resembles the prokaryotic process. 


Eukaryotic peptide chain termination requires release factors eRF1, 
eRF3, and GTP. 


Because eukaryotic and bacterial protein synthesis is different, some 
inhibitors of protein synthesis serve as medically important antibiotics. 


The decoding center is a target of aminoglycoside antibiotics. 


Many antibiotics target the peptidyl transferase center and the pep- 
tide exit tunnel. 


Answers to all problems are at the end of this book. Detailed solutions are 
available in the Student Solutions Manual. 


1. Translating Nucleotide Sequences into Amino Acid Sequences 
(Integrates with Chapter 12.) The following sequence represents part 
of the nucleotide sequence of a cloned cDNA: 


. .CAATACGAAGCAATCCCGCGACTAGACCTTAAC... 


Can this cDNA be translated into more than one amino acid 
sequence? 


2. Nucleotide Sequences, Possible Codons, and Amino Acid Specification 
A random (AG) copolymer was synthesized using a mixture of 
five parts adenine nucleotide to one part guanine nucleotide as 
substrate. If this random copolymer is used as an MRNA ina 
cell-free protein synthesis system, which amino acids will be 
incorporated into the polypeptide product? What will be the relative 
abundances of these amino acids in the product? 


3. Nucleotide Sequences and Codons If the first nucleotide in a codon 


is altered due to a mutation, it will almost always lead to a change in 
the incorporated amino acid. Which two amino acids represent 
exceptions? 


. The Second Genetic Code Review the evidence establishing that 


aminoacyl-tRNA synthetases bridge the information gap between 
amino acids and codons. Indicate the various levels of specificity 
possessed by aminoacyl-tRNA synthetases that are essential for 
high-fidelity translation of messenger RNA molecules. 


. Codon-Anticodon Recognition: Base-Pairing Possibilities 


(Integrates with Chapter 11.) Draw base-pair structures for 

(a) a G:C base pair, (b) a C:G base pair, (c) a G: U base pair, and 
(d) a U:G base pair. Note how these various base pairs differ in the 
potential hydrogen-bonding patterns they present within the major 
groove and minor groove of a double-helical nucleic acid. 
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Consequences of the Wobble Hypothesis Explain why Crick’s wobble 
hypothesis would allow fewer than 61 anticodons to be used to translate 
the 61 sense codons. How may wobble accelerate the rate of translation? 


. Sense-to-Nonsense Codon Mutations How many codons can mutate 


to become nonsense codons through a single base change? Which 
amino acids do they encode? 


. Tryptophan Substitutions (Integrates with Chapter 4.) Tryptophan 


(W) is a unique amino acid in terms of its size, and its physical and 
chemical properties. List the codons (and the amino acids, if any, that 
they specify) that could be changed into UGG (the codon for W) by a 
single point mutation. Describe the effect of each such mutation. 


. Amino Acid Possibilities in Nonsense Suppression Nonsense 


suppression occurs when a suppressor mutant arises that reads a 
nonsense codon and inserts an amino acid, as if the nonsense codon 
were actually a sense codon. Which amino acids do you think are 
most likely to be incorporated by nonsense suppressor mutants? 


RNA Editing and Translation (Integrates with Chapter 29.5i.) Review 
the discussions about mRNA editing in Section 29.51. Discuss the 

effects that RNA editing might have in light of the fact that GUG is 
sometimes used as a start codon instead of AUG (see Section 30.5a). 


Bacterial vs. Eukaryotic Protein Synthesis Why do you suppose 
eukaryotic protein synthesis is only 10% as fast as bacterial protein 
synthesis? 


Mutations at a AUG Codon Compare prokaryotic and eukaryotic 
translation initiation and explain why a mutation in the first AUG 
codon from the 5’ end of mRNA would not lead to the synthesis 
of a protein in prokaryotes but may produce a shorter protein in 
eukaryotes. 


Capacity of the Ribosomal Peptide Exit Tunnel If the tunnel through 
the large ribosomal subunit is 10 nm long, how many amino acid 
residues might be contained within it? (Assume that the growing 
polypeptide chain is in an extended B-sheet—like conformation.) 


The Consequences of Ribosome Complexity Eukaryotic ribosomes are 
larger and more complex than prokaryotic ribosomes. What advan- 
tages and disadvantages might this greater ribosomal complexity 
bring to a eukaryotic cell? 


Ribosomes as Two-Subunit Structures What ideas can you suggest to 
explain why ribosomes invariably exist as two-subunit structures, 
instead of a larger, single-subunit entity? 


23S rRNA Why is 23S rRNA (prokaryotes) classified as a ribozyme? 


tRNA Recognition and Translation Initiation How do bacterial cells 
determine whether a particular methionyl-tR NA™" is intended to initiate 
protein synthesis or to deliver a Met residue for internal incorporation 
into a polypeptide chain? How do the Met codons for these two different 
purposes differ? How do eukaryotic cells handle these problems? 


Invariant Nucleotides in tRNA (Integrates with Chapter 11.) tRNA 
molecules always have the invariant 3’-AAC sequence of unpaired 
nucleotides (see Section 11.7a). What is the role of these three 
nucleotides in the translation process and why are they always 
unpaired? 

The Function of the Shine-Dalgarno Sequence What is the Shine- 
Dalgarno sequence? What does it do? The efficiency of protein 
synthesis initiation may vary by as much as 100-fold for different 
mRNAs. How might the Shine-Dalgarno sequence be responsible for 
this difference? 


Reconciling Peptide Bond Formation and tRNA Movements In the pro- 
tein synthesis elongation events described under the section on 
translocation, which of the following seems the most apt account of 
the peptidyl transfer reaction: (a) The peptidyl-tRNA delivers its 
peptide chain to the newly arrived aminoacyl-tRNA situated in the 

A site, or (b) the aminoacyl end of the aminoacyl-tRNA moves 
toward the P site to accept the peptidyl chain? Which of these two 
scenarios makes more sense to you? Why? 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


Why Translation Factors Are G Proteins (Integrates with Chapter 15.) 
Why might you suspect that the elongation factors EF-Tu and EF-Ts 
are evolutionarily related to the G proteins of membrane signal 
transduction pathways described in Chapter 15? 


The Energetic Cost of Peptide Elongation How many ATP equivalents 
are consumed for each amino acid added to an elongating polypep- 
tide chain during the process of protein synthesis? 


Exploring the Structure of the Prokaryotic Ribosome Go to www.rcsb. 
org and bring up PDB file 4V42, which shows the structure of the 
Thermus thermophilus 70S ribosome, complete with the three tRNAs 
and A- and P-site mRNA codons. At the bottom of the box on the 
left with the image, click on “1D-3D View.” A new window opens 
with an interactive view of the structure. By moving your cursor over 
the image, you can rotate it to view it from any perspective. Also, a 
pop-up window at the bottom of the image box reveals the compo- 
nent of the structure that the cursor is resting on. 

a. Determine which part of the structure is the 30S ribosomal 
subunit and which is the 50S ribosomal subunit. 

b. Notice the color-coded tRNAs in the structure. What color Is the 
A-site tRNA, what color is the P-site tR NA, and what color is 
the E-site tRNA in this image? 

c. Zoom in and notice the juxtaposition of the 3’-CCA ends of the 
A-site and P-site tRNAs. What function of the ribosome must lie 
at this juxtaposition? Which ribosomal subunit is the home of 
this function? 


d. Notice the green band of ribbon near the anticodon ends of the 
A-site and P-site tRNAs. It represents mRNA bearing codons 
read by the anticodons of these tRNAs. What function of the 
ribosome must lie near these codons and anticodons? Which 
ribosomal subunit is the home of this function? 


Exploring the Interaction between Elongation Factor eEF2 and the 
Translating Eukaryotic Ribosome Go to www.rcsb.org and bring up 
PDB 6GQB, which shows the structure of yeast eukaryotic ribosome in 
complex with tRNAs, mRNA, and eEF2, as described in Pellegrino, S., 
et al., the Journal of Molecular Biology 430:2677—2687. Position 699 in 
the amino acid sequence of eEF2 is the location of the modified 
histidine diphthamide moiety. At the bottom of the box on the left with 
the image, click on “1D-3D View.” A new window opens with an 
interactive view of the structure. By moving your cursor over the image, 
you can rotate it to view it from any perspective. Also, a pop-up window 
at the bottom of the image box reveals the component of the structure 
that the cursor is resting on. eEF2 (designated protein FC in the PDB 
assignment for 6GQB) shows up in a prominent red color in this image. 
a. What is the general location of eEF2 relative to the 80S 
ribosome? 
b. Zoom in to explore the neighborhood near the residue 690-710 
region of eEF2. What key RNA is nearby? 
c. Interpret the significance of your observations in a. and b. in 
light of the role of the diphthamide side chain in eEF2 function. 
(The Human Biochemistry box on page 1120 gives a clue.) 


Post-translational Events and Protein Function The transcription and 
translation of a particular gene leads to a polypeptide chain but not 
necessarily to a functional protein. Revisit previous chapters and 
discuss what post-translational events might be necessary for some 
proteins to become functional? 


Shine-Dalgarno Sequences Base Pair with the 3’-End of 16S rRNA Re- 
view the list of Shine-Dalgarno sequences in Figure 30.18 and select 
the one that will interact best with the 3’-end of E. coli 16S rRNA. 


Chloramphenicol Inhibition of Peptidyl Transferase Chloramphenicol 
(Figure 30.31) inhibits the peptidyl transferase activity of the 50S 
ribosomal subunit. The 50S peptidy! transferase active site consists 
solely of functionalities provided by the 23S rRNA. What sorts of 
interactions do you think take place when chloramphenicol binds to 
the peptidyl transferase center? Which groups on chloramphenicol 
might be involved in these interactions? 
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Think-Pair-Share Question 


Impact of Mutations on Protein Function Consider the DNA sequence 
shown. 


3'-GCATGATACGTGCTAATAGAGTCTTGGACT 
TTGCATAA-S'’ 


Further Reading 1127 


a. Transcribe this DNA sequence into the corresponding mRNA 
and translate into the corresponding amino acid sequence 
starting from the appropriate start codon. 

b. Give at least six reasons why a point mutation in a protein- 
coding gene may not necessarily lead to a dysfunctional protein. 
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Completing the Protein Life 31 
Cycle: Folding, Processing, 
and Degradation 


Life is a process of becoming, a combination of 
states we have to go through. 


Anais Nin (1903-1977) 


< Origami cranes. Origami—the Asian art of paper 
folding—arose in China almost 2000 years ago when 
paper was rare and expensive and the folded shape 
added special meaning. Protein folding, like origami, 
takes a functionless form and creates a structure with 
unique identity and purpose. 


Courtesy of Reginald H. Garrett (special thanks to Reggie and Ricky Garrett) 


Proteins are the agents of biological function. Protein turnover (synthesis and decay) is a 31.1 How Do Newly Synthesized Proteins 
fundamental aspect of each protein’s natural history. How are newly synthesized polypeptide Fold? 

chains transformed into mature, active proteins, and how are undesired proteins removed 31.2 How Are Proteins Processed 

from cells? Following Translation? 


31.3 How Do Protein Translocation 
Pathways Target Proteins to 


: x : ? 
he human genome contains approximately 20,500 genes, but some estimates sug- Sübċellular: Compartments: 


gest that the total number of proteins in the human proteome (the entire comple- 
ment of distinct functional proteins in the human body) may approach 1 million. 
What processes introduce such dramatically increased variation into the products of 
protein-encoding genes? We’ve reviewed many of these processes; a partial list 
(with examples) includes: 


31.4 How Does Protein Degradation 
Regulate Cellular Levels of Specific 
Proteins? 


Gene rearrangements (immunoglobulin G) 

Alternative splicing (fast skeletal muscle troponin T) 

RNA editing (apolipoprotein B) 

Proteolytic processing (chymotrypsinogen) 

Isozymes (lactate dehydrogenase) 

Protein sharing (the glycolytic enzyme enolase is identical to 7-crystallin in the eye) 
Protein—protein interactions at many levels (oligomerization, supramolecular com- 
plexes, assembly of signaling pathway protein complexes upon scaffold proteins) 
8. Covalent modifications of many kinds (phosphorylation or glycosylation, with 
multisite phosphorylation or variable degrees of glycosylation, to name just two of 
the hundreds of possibilities) 


SD OE eS 


As we will see in this chapter, for each protein variant, its amount and its distribution 
within the cell is determined not only by gene expression, mRNA stability, and protein 
synthesis, but also by post-translational modifications, protein—protein interactions, 
intracellular localization, and, ultimately, protein degradation (Figure 31.1). All of 
these processes are dynamic and integrated. 
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Figure 31.1 Modes of post-transcriptional regulation that control the functional protein pool of the 
cell. The proper amount of each protein is determined by tightly regulated crosstalk between all of 
these processes, in response to regulatory signals received by the cell. (Adapted from Figure 1 in 
Breker, M., and Schuldiner, M., 2014. The emergence of proteome-wide technologies: Systematic 
analysis of proteins comes of age. Nature Reviews Molecular and Cell Biology 15:453—464.) 


Thus, the nascent polypeptide emerging from a ribosome is not yet the agent of bio- 
logical function that is its destiny. First, the polypeptide must fold into its native tertiary 
structure. This complicated process is made even more so by the crowding of macro- 
molecules inside a typical cell. Even then, seldom is the newly synthesized, folded pro- 
tein in its final functional state. Proteins often undergo various proteolytic processing 
reactions and covalent modifications as steps in their maturation to functional mole- 
cules. Most proteins are eventually degraded by regulatory processes (after a brief or a 
long life-span). These regulatory processes carefully control the available amounts of 
particular proteins. Proteins may also be degraded if they are damaged or unfolded. 
Proteins that persist for long times in organisms may be untouched by these regulatory 
processes, but these proteins are at increased risk for damage accumulation that may 
contribute to cellular aging processes. Damaged or misfolded proteins are a serious haz- 
ard; accumulation of protein aggregates can be a cause of human disease, including the 
Figure 31.2 Protein folding pathways. prion diseases and diseases of amyloid accumulation, such as Alzheimer’s, Parkinson’s, 
(a) Chaperone-independent folding. The protein folds or Huntington’s disease. 


as it is synthesized on the ribosome (green) 
(or shortly thereafter). (b) Hsp70/Hsp90-assisted 


EPA ETAR Aa AP EER E E 31.1 | How Do Newly Synthesized Proteins Fold? 


polypeptide chains as they are synthesized and assists 
their folding. Binding to Hsp90 facilitates further 


folding. (e) Folding assisted by Hsp70 and The information for folding each protein into its unique three-dimensional architecture 
chaperonin complexes. The chaperonin complex in resides within its amino acid sequence or primary structure (see Chapter 6). Pro- 
E. coli is GroES-GroEL. teins begin to fold even before their synthesis by ribosomes is completed (Figure 31.2a). 
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Human Biochemistry 


Neurodegenerative Disorders are Caused by the Accumulation of Protein Deposits 


As noted in Chapter 6, protein misfolding problems can cause disease 
by a variety of mechanisms. For example, protein aggregates can 
impair cell function. Most neurodegenerative diseases are caused by 
aggregation of a particular protein (Table). 


Disease Involved Protein 


Alzheimer’s disease B-Amyloid, Tau 


Parkinson’s disease a-Synuclein 


Diabetes Type 2 Amylin 
Amyotrophic lateral sclerosis TDP-43 
(Lou Gehrig’s disease) 

Chronic traumatic encephalopathy Tau 
Huntington’s disease Huntingtin 
Creutzfeldt-Jakob disease and kuru PrP (prion) 


All of these proteins except huntingtin (Figure 1) are intrinsi- 
cally unstructured proteins (IUPs) or contain unstructured 
domains. Huntingtin in human neurons forms a complex with 
HAP40 in which both proteins are largely a-helical. Huntingtin 
plays an important role in neurons in the brain, and mutations in 
the gene for huntingtin result in disease. 

Amyloid plaques (so named because they resemble the intracellular 
starch, or amyloid, deposits found in plant cells) and neurofibrillary 
tangles (NFTs), composed of tau (7) proteins, are proteinaceous de- 
posits found in the brains of individuals suffering from any of several 
neurodegenerative diseases. In Alzheimer’s, disease is caused both by 
extracellular amyloid deposits composed of proteolytic fragments 
of the amyloid precursor protein (APP) termed amyloid-6 (AB) and 
intracellular NFTs. AB (Figure 2) is a peptide 39 to 43 amino acids 
long that polymerizes to form long, highly ordered, insoluble fibrils 


Figure 1 Human huntingtin protein in a complex with HAP40. 
Huntingtin consists of three domains — N-terminal (N-term, blue), 
Bridge (yellow) and C-terminal (C-term, green). (PDB id = 6EZ8). 


40 


Figure 2 A model for the A®\_49 structural unit in B-amyloid fibrils. Fibrils 
contain -strands perpendicular to the fibril axis, with interstrand hydrogen 
bonding parallel to the fiber axis. The top face of the B-sheet is hydrophobic 
and presumably interacts with neighboring AB molecules in fibril formation. 
(Figure adapted from Figure | in Thompson, L. K., 2003. Unraveling the 
secrets of Alzheimer’s B-amyloid fibrils. Proceedings of the National Academy 
of Sciences, U.S.A. 100:383-385.) 


consisting of a hydrogen-bonded parallel B-sheet structure in which 
identical residues on adjacent chains are aligned directly, in register 
(see accompanying figure). Why AB aggregates in some people but 
not others is not clear. In Parkinson’s, the culprits are NFTs; no amy- 
loid plaques are evident. In Huntington’s disease, the protein deposits 
occur as nuclear inclusions composed of polyglutamine (polyQ) ag- 
gregates. PolyQ aggregates arise from mutant forms of huntingtin, 
a protein that characteristically has a stretch of glutamine residues 
close to its N-terminus. 

Many amyloid-forming proteins are secreted and form deposits 
in the extracellular space of the nervous system and various organs. 
A few, including a-synuclein, tau and huntingtin are cytosolic and 
form intracellular inclusions. How do protein aggregates form, and 
how do they damage neurons in these neurodegenerative diseases? 
Misfolded and partially folded proteins expose hydrophobic amino 
acid residues and regions of unstructured polypeptide backbone to 
the solvent. Spontaneous association of these hydrophobic surfaces 
results in aggregations that form oligomers and then amyloid fibrils 
rich in B-sheet structure (Figure 3). 

These soluble oligomers are widely considered to be more potent 
neurotoxins than the fibrils. The oligomers expose sticky surfaces 
(hydrophobic amino acid residues and unpaired B-strands) that dis- 
turb phospholipid bilayers and engage in aberrant interactions with 
multiple key cellular proteins. 

Impairment of cellular function by proteinaceous deposits may 
be a general phenomenon. In vitro experiments have demonstrated 
that aggregates of proteins not associated with disease can be cyto- 
toxic, and the ability to form amyloid deposits is a general property 
of proteins. The evolution of chaperones to assist protein folding 
and proteasomes to destroy improperly folded proteins may have 
been driven by the necessity to prevent protein aggregation. 


(Continued) 
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Figure 3 A model for formation of oligomers and amyloid fibrils by proteins involved in 


neurodegenerative diseases. 


However, the cytosolic environment is a very crowded place, with effective protein 
concentrations as high as 0.3 g/mL. Macromolecular crowding enhances the likeli- 
hood of nonspecific protein association and aggregation. The primary driving force 
for protein folding is the burial of hydrophobic side chains away from the aqueous 
solvent and reduction in solvent-accessible surface area (see Chapter 6). The folded 
protein typically has a buried hydrophobic core and a hydrophilic surface. Protein 
aggregation is driven by hydrophobic interactions, so burial of hydrophobic regions 
through folding is a crucial factor in preventing aggregation. To evade aggregation, 
nascent proteins are often assisted in folding by a family of helper proteins known as 
molecular chaperones (see Chapter 6) because, like the chaperones at a prom, their 
purpose is to prevent inappropriate liaisons. Chaperones also serve to shepherd pro- 
teins to their ultimate cellular destinations. Also, mature proteins that have become 
partially unfolded may be rescued by chaperone-assisted refolding. 


31.la Chaperones Help Some Proteins Fold 


Small, single-domain proteins fold efficiently without the aid of chaperones, but 
larger proteins and multi-domain proteins often require chaperones, especially in the 
crowded milieu of a cell. A number of chaperone systems are found in all cells. Many 
of the proteins in these systems are designated by the acronym Hsp (for heat shock 
protein) and a number indicating their relative mass in kilodaltons (as in Hsp60). Hsps 
were originally observed as abundant proteins in cells given brief exposure to high 
temperature (42°C or so). The principal Hsp chaperones are Hsp70, Hsp60 (the chap- 
eronins), and Hsp90. In general, proteins whose folding is chaperone-dependent pass 
down a pathway in which Hsp70 acts first on the newly synthesized protein and then 
passes the partially folded intermediate to a chaperonin for completion of folding. 
Protein folding by chaperones is often ATP-dependent, and in many cases, the func- 
tion of these chaperones is regulated by cochaperones, which bind to the chaperone 
complex and control ATP hydrolysis activity and protein folding. 

Nascent polypeptide chains exiting the large ribosomal subunits are met by ribosome- 
associated chaperones (TF, or Trigger Factor, in Escherichia coli; NAC [nascent chain- 
associated complex] in eukaryotes). In E. coli, the 50S ribosomal protein L23, which is 
situated at the peptide exit tunnel, serves as the docking site for TF, directly linking 
protein synthesis with chaperone-assisted protein folding. TF and NAC mediate transfer 
of the emerging nascent polypeptide chain to the Hsp70 class of chaperones, although 
many proteins do not require this step for proper folding. 


31.1b Hsp70 Chaperones Bind to Hydrophobic 
Regions of Extended Polypeptides 


In Hsp70-assisted folding, proteins of the Hsp70 class bind to nascent polypeptide 
chains while they are still on ribosomes (Figure 31.2b). DnaK is the principal Hsp70 in 
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Dr. Odunuga earned a B.S. in Biochemistry 
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Careers in Chemistry 


his M.S. in Biochemistry from the University 
of Ibadan in his birthplace of Nigeria. While 
lecturing in Nigeria he was offered a predoctoral 
fellowship to pursue a Ph.D. in Biochemistry 
and Molecular Biology from Rhodes University 
in South Africa. His thriving research program 
enjoys collaborations across the United States, 
South Africa (Rhodes University), and Norway 
(the Arctic University of Norway). Dr. Odunuga 
credits his education and career for his many 
opportunities to travel the world to teach people of 
all backgrounds the joys of biochemistry. 


E. coli. It recognizes exposed, extended regions of polypeptides that are rich in 
hydrophobic residues. By interacting with these regions, this Hsp70 prevents nonpro- 
ductive associations and keeps the polypeptide (termed a client protein) in an unfolded 
(or partially folded) state until productive folding interactions can occur. Completion of 
folding requires release of the protein from Hsp70; release is energy-dependent and is 
driven by ATP hydrolysis. 

Hsp70 proteins such as DnaK consist of two domains: a 44-kDa N-terminal ATP- 
binding domain and an 18-kDa central domain that binds polypeptides with exposed 
hydrophobic regions (Figure 31.3a). The DnaK-ATP complex receives an unfolded (or 
partially folded) polypeptide chain from DnaJ (Figure 31.3b). DnaJ is an Hsp40 family 
member. Interaction of DnaK with DnaJ triggers the ATPase activity of DnaK; the 
DnaK-ADP complex forms a stable complex with the unfolded polypeptide, preventing 
its aggregation with other proteins. A third protein, GrpE, catalyzes nucleotide exchange 
on DnaKk, replacing ADP with ATP, which converts DnaK back to a conformational 
form having low affinity for its polypeptide substrate. Release of the polypeptide gives 
it the opportunity to fold. Multiple cycles of interaction with DnaK (or Hsp70) give rise 
to partially folded intermediates or, in some cases, completely folded proteins. The 
partially folded intermediates may be passed along either to an Hsp90 (Figure 31.2b) or 
to the Hsp60/chaperonin system for completion of folding (Figure 31.2c). 

The chaperones that mediate protein folding do so in an ATP-dependent manner, as 
illustrated in Figures 31.3 and 31.5. The affinity of chaperones for their unfolded or mis- 
folded protein substrates is determined by the nature of the nucleotide bound by the ATP 
binding domain of these proteins, which functions as an ATPase. If ATP is bound, the 
chaperone adopts a conformation with an open substrate-binding pocket (Figure 31.3c). 
ATP increases the rate of association of the chaperone Hsp70 (DnaK) with an unfolded 
peptide or protein client by 100-fold, but it increases the rate of dissociation of the unfolded 
protein from the chaperone even more, by a factor of 1000. Overall, the chaperone’s affin- 
ity for an unfolded protein substrate decreases 10-fold (or more) when it binds ATP. 

On the other hand, if the substrate-binding site on the peptide binding domain of 
DnaK is occupied by an unfolded protein client in conjunction with binding of the 
DnaJ cochaperone (Figure 31.3b), ATP hydrolysis by the ATPase domain is triggered. 
The presence of ADP in the ATP-binding (ATPase) domain causes a shift in the 
substrate-binding site of the peptide-binding domain to a closed conformation, high- 
affinity state (Figure 31.3c). Thus, ATP-dependent chaperones cycle between two 
stable conformational states, just like allosteric proteins. Bound ATP favors the open 
conformation for the protein substrate-binding site, and ADP favors the closed confor- 
mation. (When ADP is released and no nucleotide occupies the ATP binding site of the 
ATPase domain, the peptide-binding site remains in the closed, high-affinity conforma- 
tion; see Figure 31.3b). 
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Figure 31.3 Structure and function of DnaK. (a) Domain organization 
and structure of the Hsp70 family member, DnaK. The ribbon 
diagram on the lower left is the ATP-binding domain of the DnaK 
analog, bovine Hsc70; bound ADP is shown as a stick diagram 
(purple). The ribbon diagram on the lower right is the polypeptide- 
binding domain of DnaK. The small blue ovals highlight the position 
of the polypeptide substrate; the protein regions that bind the 
polypeptide substrate are blue-green. (b) DnaK mechanism of action: 
DnaJ binds an unfolded protein (U) or partially folded intermediate (I) 
and delivers it to the DnaK-ATP complex. The nucleotide exchange 
protein GrpE replaces ADP with ATP on DnaK and the partially 
folded intermediate ([I]) is released. I has several possible fates: It may 
fold into the native state, N; it may undergo another cycle of 
interaction with DnaJ and DnaK; or it may become a substrate for 
folding by the GroEL chaperonin system. (Adapted from Figures la 
and 2a in Frydman, J., 2001. Folding of newly translated proteins in 
vivo: The role of molecular chaperones. Annual Review of Biochemistry 
70:603-647.) (c) Open and closed structures of DnaK, an Hsp70 from 
E.coli. Left—Open, ATP-bound structure of DnaK (PDB id = 4B9Q). 
Right—Closed, ADP-bound structure of DnaK (PDB id = 2KHO). 


DnaJ in E. coli is a J-domain protein or simply a J-protein. The Hsp70s are likely 
the most versatile molecular chaperones, and they gain much of this versatility from 
their multiple J-protein partners (at least 7 in E. coli and 45 in humans). These Hsp70/ 
J-protein interactions not only direct Hsp70s to particular substrate proteins, but also 
integrate Hsp70 functions into cellular developmental and regulatory pathways and 
also chaperone-mediated protein homeostasis networks. 


31.1c Hsp90 Partners With Hsp70 in Hsp70-Hsp90 Folding Cascades 


Hsp90 constitutes 1% to 2% of the total cytosolic proteins of eukaryotes, its abundance 
reflecting its importance. Like other Hsp chaperones, its action depends on cyclic 
binding and hydrolysis of ATP. Hsp90 is a highly conserved chaperone involved in 
protein homeostasis. It functions in the remodeling of hundreds of client proteins in 
eukaryotes and participates in many cellular processes, including signal transduc- 
tion, protein trafficking, and maturation of receptor proteins. Hsp90 is a homodimer 
with each monomer consisting of three domains, including an N-terminal domain, a 
middle-domain (M-domain), and a C-terminal domain. A portion of the M-domain 
interacts with Hsp70 in E. coli and yeast and perhaps also in higher eukaryotes. 
Hsp90s are highly dynamic proteins that undergo large conformation changes during 
the chaperone cycle (Figure 31.4). 

As Hsp90 moves through its various conformations, client proteins may be bound 
and released, either in ATP-dependent transitions or in an ATP-independent manner. 
Following release, the unstructured regions of the client may fold spontaneously into 
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Figure 31.4 Hsp90 undergoes large conformation changes in the chaperone cycle. (a) The peptide 
sequence of Hsp90 from E. coli. (b) Two structures of Hsp90 from E. coli—the apo state with no 
substrate or nucleotide bound (PDB id = 2IOQ) and the ADP-bound state (PDB id = 2IOP). One 
subunit is colored in each image to match the domains indicated in (a). 


the native conformation. If the native conformation is not achieved, the client protein 
may undergo repeated cycles of binding and release by Hsp90 until it finally folds. 

Hsp70 and Hsp90 often work together as a protein folding relay team. Hsp70 does not 
actually fold proteins, but rather is a folding preventer. Its binding pocket accepts stretches 
of short hydrophobic or aromatic sequences (up to five sequential residues), which would 
ordinarily end up in the hydrophobic core of folded proteins and late folding intermedi- 
ates but which are exposed in unfolded proteins. Multiple cycles of binding and release 
of these hydrophobic segments follow, preventing aggregation but allowing the target 
protein to fold while not bound. After release, the protein either folds or binds again to 
Hsp70. Thus, Hsp70 binds early to the core-forming segments of unfolded proteins. 

On the other hand, Hsp90 recognizes proteins that are not completely folded by 
Hsp70 and binds to hydrophobic and charged residues exposed over a large surface area 
in natively unfolded proteins. An extended binding site on Hsp90 recognizes scattered 
hydrophobic and charged patches typical of late folding stages. In contrast to Hsp70, 
Hsp90 does not block folding. Instead, the client can achieve the native state while 
bound to Hsp90. Hsp70 and Hsp90 thus interact transiently and sequentially, with 
Hsp90 acting downstream of Hsp70 to improve folding. 

Conformational regulation of signal transduction molecules seems to be a major 
purpose of Hsp90. Receptor tyrosine kinases, soluble tyrosine kinases, and steroid 
hormone receptors are some of the signal transduction proteins that must associate 
with Hsp90 in order to become fully competent. CFTR (cystic fibrosis transmembrane 
regulator), telomerase, and nitric oxide synthase are also Hsp90-dependent. 

Association of nascent polypeptide chains with proteins of the various chaperone 
systems commits them to a folding pathway, redirecting them away from degradation 
pathways that would otherwise eliminate them from the cell. However, if these protein 
chains fail to fold, they are recognized as non-native and targeted for destruction. 


31.1d The GroES—GroEL Complex of E. coli Is an Hsp60 Chaperonin 


The Hsp60 class of chaperones, also known as chaperonins, assists some partially 
folded proteins (10% of the cytosolic proteome) to complete folding after their release 
from ribosomes. Chaperonins sequester partially folded molecules from one another 


E. coli 
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A Deeper Look 


Chaperone Proteins That Function by Stress-Induced Unfolding 


Certain small, ATP-independent chaperones are fully folded and 
inactive under normal cellular circumstances. These chaperones 
are activated by stressful conditions that convert them from a fully 
folded, but inactive, form into a disordered, but active, structure 
that can bind client proteins, help them to achieve their appropriate 
folded states, and then undergo a disorder-to-order transition that 
releases the folded client. For example, HdeA is an acid-activated 
chaperone in the periplasm of E. coli cells. When E. coli is ingested 
by a human subject, the pH of its periplasm rapidly equilibrates with 
the pH of the stomach (~2). This pH would cause irreversible dena- 
turation and inactivation of most proteins, were it not for HdeA. At 
pH 7, HdeA is a folded dimer (figure) with no chaperone function. 
Upon the shift to pH 2, HdeA undergoes partial unfolding and dis- 
sociates to form monomers, which are active as chaperones. HdeA 
monomers bind to a variety of periplasmic protein clients, protecting 
them against irreversible damage, and thereby shielding E. coli from 
the harmful effects of stomach acid. When the E. coli cells move into 
the small intestine and its near-neutral pH, HdeA releases its client 
proteins in folded form and reverts to the inactive dimeric state. 


Some chaperones, including HdeA and Hsp33, respond to cellular 
stress. HdeA (left, PDB id = 1DG8) and Hsp33 (right, PDB id = 


1VZY). The Zn atom of Hsp33 is green; the coordinating Cys 
residues are shown as space-filled spheres). 


In a manner similar to HdeA, Hsp33 is a compactly folded, inac- 
tive chaperone protein under nonstressful conditions. Hsp33 is a 
monomeric, redox-regulated bacterial chaperone composed of an 
N-terminal domain and a C-terminal domain (figure) with four Cys 
residues that coordinate a zinc ion. Exposure to oxidizing condi- 
tions (bleach, for example, or heat stress) induces partial unfolding 
of the zinc-binding domain and release of the zinc ion. The four Cys 
residues are now free to form two disulfide bridges. Partial unfolding 
and disulfide formation activates Hsp33’s chaperone activity. Hsp33 
protects hundreds of cytoplasmic proteins against stress-induced 
protein aggregation and allows bacteria (such as E. coli and Vibrio 
cholera) to survive the anti-microbial action of bleach. 
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(and from extraneous interactions), allowing folding to proceed in a protected envi- 
ronment. This protected environment is sometimes referred to as an Anfinsen cage 
because it provides an enclosed space where proteins fold spontaneously, free from 
the possibility of aggregation with other proteins. Chaperonins are large, cylindrical 
protein complexes formed from two stacked rings of subunits. 

The chaperonins have been organized into two groups, I and II, on the basis of 
their source and structure. Group I chaperonins are found in bacteria and group II in 
archaea and eukaryotes. The group I chaperonin in Æ. coli is the GroES—GroEL 
complex (Figure 31.2c). GroEL is made of two stacked seven-membered rings of 
60 kDa subunits that form a cylindrical ay, oligomer 15 nm high and 14 nm wide 
(Figure 31.5). Each GroEL ring has a 5-nm central cavity where folding can take 
place. This cavity can accommodate proteins up to 60 kDa in size. GroES, sometimes 
referred to as a co-chaperonin, consists of a single seven-membered ring of 10 kDa 
subunits that sits like a dome on one end of GroEL (Figure 31.5). The end of GroEL 
where GroES is sitting is referred to as the apical end. Each GroEL subunit has two 
structural domains: an equatorial domain that binds ATP and interacts with 
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Figure 31.5 Structure and function of the GroES—GroEL complex. 

(a) Structure and overall dimensions of GroES—GroEL (top view, /eft; 
side view, right; PDB id = 1AON). (b) Section through the center of the 
complex to reveal the central cavity. (c) Model of the GroEL cylinder 
(blue) in action. An unfolded (U) or partially folded (I) polypeptide 
binds to hydrophobic patches on the apical ring of a7-subunits, followed 
by ATP binding, forced protein unfolding, and GroES (red) association. 
ATP hydrolysis in one ring is required to enable subsequent ATP 
binding to the opposite ring, but hydrolysis is not required for folding to 
proceed within the chamber. (d) Depiction of protein folding in the 
cytoplasm of an E. coli cell, emphasizing macromolecular crowding. 
The cytoplasmic components are present at their known concentrations. 
Proteins shown include a new protein undergoing folding (orange), 
chaperone structures (GroEL in green, DnaK in red, and Trigger Factor 
in yellow), ribosomes (purple), and RNA (salmon). Image courtesy of 

A. Elcock and L. Gierasch. 
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Small Heat Shock Proteins: Nature’s Molecular Sponges 


The small heat-shock proteins (SHSP) are a diverse family of ubiq- 
uitous intracellular proteins, with subunit masses of 12 to 43 kDa. 
sHSPs are important in protein maintenance and cellular responses 
to stress. They prevent the denaturation and aggregation of a vari- 
ety of unrelated proteins under stress conditions, such as elevated 
temperature or reduced or oxidized conditions, at a stoichiometry as 
low as one subunit of sHSP to one target client protein. The small 
HSPs have a 100 residue core a-crystallin domain with variable 
N- and C-terminal extensions. These extensions enable the sHSPs to 
oligomerize and also mediate recognition of client proteins. 

Carol Robinson, Justin Benesch, and their colleagues have stud- 
ied the behavior of HSP18.1 from Pisum sativum (the garden pea) 
by mass spectrometry and have found that this sHSP exists as a 
dodecamer at 22°C. However, as the temperature is increased, 
dimers escape from some of the dodecamers and then add to other 
dodecamers to form higher-order oligomers of 13 to 20 subunits. In 
vitro assays conducted at 42°C (the classic heat shock temperature) 
show that monomers of a model client protein (firefly luciferase) 
add to sHSP oligomers to form heteromultimeric assemblages that 
reorganize and enlarge dynamically to yield more than 300 different 


sHSP-client combinations! Lila Gierasch has likened this process 
to a sponge becoming moist and swelling so that it accommodates 
yet more water. Robinson and Benesch conclude that dynamic equi- 
librium fluctuations of quaternary structure in such ensembles of 
sHSP structures underlie the protection of client proteins as they 
seek their native folded structures. 
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Temperature stress 


When native proteins (N) become unstable (U) 
under heat stress, SSHPs protect cells against 
deleterious formation of aggregates by dynamic 
fluctuations of quaternary structure. 


Aggregate 


Large and 
growing 
ensemble 


{sSHSP-clientm} 


sHSPS 12 


neighbors in the other a; ring and an apical domain with hydrophobic residues that 
can interact with hydrophobic regions on partially folded proteins. The hydrophobic 
patches of the apical domain face the interior of the central cavity. An unfolded (or 
partially folded) protein binds to the apical patches and is delivered to the central cav- 
ity of the upper a; ring (Figure 31.5c). ATP binding to the subunits of the upper a7 ring 
causes rapid (<100 ms), forced unfolding of the substrate protein, followed by two 
events that occur on a slower time scale (~1 s): (1) GroES is recruited to GroEL, and 
(2) the a-subunits undergo a conformational change that buries their hydrophobic 
patches. The a-subunits now present a hydrophilic surface to the central cavity. This 
change displaces the bound partially folded polypeptide into the sheltered hydro- 
philic environment of the central cavity, where it can fold, free from danger of aggre- 
gation with other proteins. GroES also promotes ATP hydrolysis (Figure 31.5c). 
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The GroEL-ADP-GroES complex dissociates when ATP binds to the subunits of the 
other (lower) a7 ring. Dissociation of GroES allows the partially folded (or folded) 
protein to escape from GroEL. If the protein has achieved its native conformation, its 
hydrophobic residues will be buried in its core and the hydrophobic patches on the a7 
rings will have no affinity for it. On the other hand, if the protein is only partially 
folded, it may be bound again, gaining access to the Anfinsen cage of the a7 ring and 
another cycle of folding. The folding of rhodanese, a 33 kDa protein, requires the 
hydrolysis of about 130 equivalents of ATP. The extreme crowding of macromolecules 
in the typical cell, which could interfere with the folding of nascent proteins, justifies 
the need for elaborate structures like GroES-GroEL, and also the substantial invest- 
ment of ATP energy in the folding process (Figure 31.5d). 

The group II chaperonin and eukaryotic analog of GroEL, CCT (also called TriC) 
is also a double-ring structure, but each ring consists of eight different subunits that 
vary in size from 50 to 60 kDa. Furthermore, group II chaperonins lack a GroES 
counterpart. Prefoldin (also known as GimC), a hexameric protein composed of six 
subunits from two related classes (two a and four B), can serve as a co-chaperone for 
CCT, much as GroES does for GroEL. However, prefoldin also acts like an Hsp70 
protein because it binds unfolded polypeptide chains emerging from ribosomes and 
delivers them to CCT. Prefoldin resembles a jellyfish, with six tentacle-like coiled 
coils extending from a barrel-shaped body. The ends of the tentacles have hydropho- 
bic patches for binding unfolded proteins. 

Prior to substrate protein binding, CCT exists in a partly open state. ATP binding 
opens the ring even more, a state in which prefoldin delivers the substrate protein. ATP 
hydrolysis closes the chamber and drives the folding process. ATP-induced conformation 
changes that promote protein folding propagate from one subunit to the next around the 
ring structure. 


31.2 | How Are Proteins Processed Following Translation? 


The release of the completed polypeptide from the ribosome is not necessarily the 
final step in the covalent construction of a protein. Many proteins must undergo 
covalent alterations before they become functional. In the course of these post- 
translational modifications, the primary structure of a protein may be altered, and/or 
novel derivations may be introduced into its amino acid side chains. Hundreds of 
different amino acid variations have been described in proteins, virtually all arising 
post-translationally. The list of such modifications is very large; some are rather 
commonplace, whereas others are peculiar to a single protein. The diphthamide moi- 
ety in elongation factor eEF-2 is one example of an amino acid modification (see the 
Human Biochemistry box “Diphtheria Toxin ADP-Ribosylates eEF2” in Chapter 30); 
the fluorescent group of green fluorescent protein (GFP; see Chapter 4) is another. In 
addition, common chemical groups such as carbohydrates and lipids may be cova- 
lently attached to a protein during its maturation. Phosphorylation, acetylation, and 
methylation of proteins are common mechanisms for regulating protein function. 
Interestingly, many proteins are modified in multiple ways, and many post- 
translational modifications act in combinations—a phenomenon termed cross-talk. 
(The majority of proteins in cells can be phosphorylated on one or more residues. 
A survey of some of the more prominent chemical groups conjugated to proteins is 
given in Chapter 5.) To put a number on the significance of post-translational modi- 
fications, we have seen that the number of human proteins is estimated to exceed the 
number of human genes (20,500 or so) by several orders of magnitude. 

Proteolytic cleavage is the most prevalent form of protein post-translational modifi- 
cation, and it merits special attention. The very occurrence of proteolysis as a process- 
ing mechanism seems strange: Why join a number of amino acids in sequence and then 
eliminate some of them? Three reasons can be cited. First, diversity can be introduced 
where none exists. For example, a simple form of proteolysis, enzymatic removal of 
N-terminal Met residues, occurs in many proteins. Met-aminopeptidase, by removing 
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the invariant Met initiating all polypeptide chains, introduces diversity at N-termini. 
Second, proteolysis serves as an activation mechanism so that expression of the biologi- 
cal activity of a protein can be delayed until appropriate. A number of metabolically 
active proteins, including digestive enzymes and hormones, are synthesized as larger 
inactive precursors termed pro-proteins that are activated through proteolysis (see zymo- 
gens, Chapter 15). The N-terminal pro-sequence on such proteins may act as an intra- 
molecular chaperone to ensure correct folding of the active site. Third, proteolysis is 
involved in the targeting of proteins to their proper destinations in the cell, a process 
known as protein translocation. 


31.3 | How Do Protein Translocation Pathways Target 
| Proteins to Subcellular Compartments? 


Approximately 30% of all proteins are translocated across or inserted into lipid 
membranes. Transporting long polymers with both hydrophilic and hydrophobic 
residues into or across a bilayer membrane is a challenge for both prokaryotic and 
eukaryotic cells. Eukaryotic cells must transport many proteins into the ER lumen 
for processing and must insert others into the ER membrane. Prokaryotic cells trans- 
port certain proteins into or across the plasma membrane. Moreover, protein trans- 
port must be selective to ensure that the right proteins end up in the right places. 
Specialized membrane protein complexes called translocons have evolved to solve 
these problems in protein translocation pathways. Translocons are selectively perme- 
able protein-conducting channels that catalyze movement of the proteins across the 
membrane, and metabolic energy in the form of ATP, GTP, or a membrane potential 
is essential. 


31.3a Proteins Are Delivered to the Proper Cellular 
Compartment by Translocation 


Protein translocation occurs in all cells. There are two pathways involved in protein 
translocation: The Sec pathway (where Sec refers to secretory functions) transports 
unfolded proteins. Sec systems are found in every organism. Translocation in the Sec 
pathway occurs through a hetero-trimeric membrane pore complex: The SecY com- 
plex in bacteria, archaea, and chloroplast thylakoid membranes, and the Sec61 com- 
plex in eukaryotes. By contrast, the twin-arginine translocation (Tat) pathway is a 
protein export system in the cytoplasmic membrane of prokaryotes that transports 
proteins that have already folded. 

Translocation through Sec channels occurs either co-translationally, by association 
with translating ribosomes, or post-translationally. The co-translational pathway is 
used in most species for membrane protein insertion, and also by eukaryotes for protein 
secretion. The co-translational complex in prokaryotes is SecY EG, composed of sub- 
units SecY, SecE, and SecG, and the analogous complex in eukaryotes is Sec61, com- 
posed of Sec61, Sec62, and Sec63 and other accessory proteins. The post-translational 
pathway is the primary pathway for protein secretion in prokaryotes and involves auxil- 
lary proteins SecD, SecF, and YajC. The Tat pathway is exclusively post-translational. 

Proteins that will be translocated carry targeting information at or near their 
N-terminus that directs the protein to eventual destinations. In both Sec pathways, ini- 
tiation of translocation depends on delivery to the SecY-Sec61 complex at the plasma 
membrane or ER membranes of a substrate with a cleavable N-terminal signal sequence 
(for secretory proteins) or the first transmembrane segment (for membrane proteins). In 
effect, signal sequences serve as zip codes for sorting and dispatching proteins to their 
proper compartments. Thus, the information specifying the correct cellular localization 
of a protein is found within its structural gene. Once the protein is routed to its destina- 
tion, the signal sequence is often, but not always, proteolytically clipped from the protein 
by a signal sequence-specific endopeptidase called a signal peptidase. 
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31.3b The SecYEG Translocon Mediates Translocation of Newly 
Synthesized Prokaryotic Proteins 


E. coli, the paradigm of gram-negative bacteria, synthesizes approximately 4400 pro- 
teins, including (approximately) 1000 inner membrane proteins, 400 periplasmic pro- 
teins, and 160 outer membrane proteins. Non-cleavable signal anchor sequences in 
inner membrane proteins and N-terminal, cleavable signal sequences in secretory 
proteins target these proteins for membrane insertion or secretion, respectively. Most 
of these membrane and periplasmic proteins are translocated into and across the 
inner membrane by the SecYEG translocon, a complex built on core proteins SecY, 
SecE, and SecG (Figure 31.6a). This heterotrimeric core is passive by itself. To recog- 
nize an enormous variety of different substrate proteins and to be active in protein 
transport, the SecYEG translocon interacts with multiple additional proteins (SecA, 
SecD, SecF, and YajC). These partner proteins recognize SecYEG substrates, provide 
the driving force for transport, mediate substrate release from SecYEG, and commu- 
nicate with the proteostasis network. 

Two protein targeting systems, operate in parallel in bacterial cells to recognize and 
direct SecYEG substrate proteins to their appropriate destinations. Pre-secretory pro- 
teins with cleavable N-terminal signal sequences are targeted by SecA, a dimeric motor 
ATPase that associates with SecYEG for post-translational secretion (Figure 31.6b). 
Pre-secretory proteins often associate with a chaperone and the pre-protein-chaperone 
complex is bound to SecA and then to SecYEG, after which the SecA dimer dissociates. 


(b) SecA-dependent transport (c) SRP-dependent transport 
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Figure 31.6 Protein targeting and translocation pathways in bacteria. (a) The structure of the SeceYEG- 
SecA complex from Tibellus maritimus. SecA is blue; SecY is dark green; SecG is orange; SecE is light 
green. The location of the inner membrane is indicated (PDB id = 3DIN.) (b) SecA-dependent transport. 
Post-translational pathway involves association with chaperones, including trigger factor and SecB, 
followed by a hand-off to SecA in the SeceYEG complex. In the co-translational pathway, SecA associates 
with the nascent polypeptide at the ribosome and delivers it to the SeceYEG complex. (c) SRP-dependent 
transport. In the co-translational pathway, SRP binds tightly to the nascent polypeptide and delivers it to 
FtsY and the SecYEG complex. In the post-translational pathway, the SR P-polypeptide complex 
dissociates from the ribosome and migrates either to the FtsY-SecYEG complex or the FtsY-YidC 
complex for insertion into the membrane. 
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Figure 31.7 Cryo-EM structure of a heterotrimer of 
Sec61 (open conformation) in complex with a signal 
peptide. Subunits al (green), B (purple), and y (blue) 
from Canis lupus familiaris are shown with a bovine 
prolactin signal peptide (yellow) (PDB id = 3JC2). 


In an alternative co-translational pathway, SecA may engage the pre-secretory protein 
at the ribosome for delivery to SecYEG. In the second targeting system, inner mem- 
brane proteins with non-cleavable signal anchor sequences emerging from the ribosome 
exit tunnel engage a ribosome-bound signal recognition particle (SRP) and interact 
with SecYEG for co-translational membrane insertion (Figure 31.6c). The SRP is a 
325 kDa nucleoprotein assembly that contains six polypeptides and a 300 nucleotide 
7S RNA. SRPs typically bind with high affinity to ribosomes and bind the signal 
anchor sequence of a membrane protein when it emerges from the peptide tunnel of the 
ribosome. This assemblage is referred to as a ribosome-associated nascent chain (RNC) 
complex. The SRP delivers the RNC complex to FtsY, the SecYEG-bound receptor for 
nascent membrane proteins. In a parallel pathway, the SRP can deliver membrane pro- 
teins to YidC (termed an insertase), which can insert membrane proteins independently 
of SecYEG but which also cooperates with the SecYEG translocon. Finally, the SRP- 
nascent protein complex may also dissociate from the ribosome and bind to the FtsY- 
YidC complex for post-translational membrane insertion. 


31.3c The Sec61 Translocon Mediates Translocation of Newly Synthesized 
Eukaryotic Proteins 


Eukaryotic cells are characterized by many membrane-bounded compartments. 
Approximately 30% of all eukaryotic proteins are routed to these membranes and 
compartments via the Sec61 translocon complexes. 


Structure of the Sec61 Translocon Sec6l translocons are dynamic entities, organized 
around an invariant core, the trimeric Sec61 protein-conducting channel (Figure 31.7). 
Full maturation of proteins in the ER involves several additional processes, including 
glycosylation (covalent attachment of sugars), signal peptide cleavage, and membrane 
protein insertion, orientation and folding. To accomplish these varied functions on a 
broad range of substrates, the Sec6l channel has different accessory proteins that 
associate with it, either transiently or in stable complexes. 


ER Translocon Pathways The ER translocon complexes use both co-translational and 
post-translational pathways for protein transport across the ER membrane or inser- 
tion into it. These eukaryotic pathways are analogous to those for prokaryotic trans- 
location as described in Figure 31.6. Moreover, the N-terminal signal sequences of 
eukaryotic secretory proteins are analogous to bacterial signal sequences. Proteins 
with highly hydrophobic signal sequences are recognized by signal recognition parti- 
cles (SRPs) as they emerge from the ribosome exit tunnel and then moved through the 
Sec61 channel co-translationally. Proteins with less hydrophobic signal sequences are 
transported through the Sec61 channel post-translationally in association with an- 
other membrane protein complex comprised of Sec62 and Sec63. 


The Co-Translational Eukaryotic Translocation Pathway In co-translational eukaryotic 
translocation, the SRP binds to the signal peptide as it emerges from the ribosome (the 
ribosome-nascent-chain [RNC] complex) and recruits the RNC to the ER membrane 
via an SRP receptor (SR) (Figure 31.8, Step 2). Ribosome binding to the Sec61 com- 
plex enables the nascent unfolded peptide to enter the Sec61 channel (Step 3). A signal 
peptidase cleaves off the signal sequence from the nascent peptide on the luminal face 
of the ER membrane (Step 4). The peptide may then be transported entirely across the 
ER membrane, producing a secretory protein (Step 5). Alternatively, the Sec61 trimer 
may facilitate insertion of hydrophobic helices of the peptide into the ER membrane, 
forming a new transmembrane protein. 

The ribosome-Sec61 complex at the ER membrane typically includes a number of 
accessory proteins, including the translocon-associated protein (TRAP) complex and 
the oligosaccharyl transferase complex (OST). TRAP supports recruitment of specific 
signal peptides and facilitates membrane insertion and orientation of some peptide 
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Figure 31.8 Synthesis of a eukaryotic secretory 
protein and its translocation into the endoplasmic 
reticulum. (1) The signal recognition particle 
(SRP, red) recognizes the signal sequence emerging 
from a translating ribosome (ribosome-nascent- 
chain complex [RNC], gray). (2) The RNC-SRP 
interacts with the SRP receptor (SR, purple) and is 
transferred to the translocon (pink). (3) Release 

of the SRP and alignment of the peptide exit 
tunnel of the RNC with the protein-conducting 
channel of the translocon stimulates the ribosome 
to resume translation. (4) The membrane- 
associated signal peptidase (purple circle) clips 

off the N-terminal signal sequence, and BiP 


(the ER lumen Hsp70 chaperone, blue) binds the 
nascent chain mediating its folding into its native 
conformation. (5) Following dissociation of the 

ribosome, BiP plugs the translocon channel. Not 


i a Signal 4 shown are subsequent secretory protein maturation 
BiP —@) Men aa & events, such as glycosylation. (Adapted from 
Clipped Figures la and 2a in Frydman, J., 2001. Folding of 
signal . newly translated proteins in vivo: The role of 
ER lumen sequence Secretory protein molecular chaperones. Annual Review of 


Biochemistry 70:603—647.) 


substrates, whereas OST is responsible for glycosylation of specific asparagine residues 
on the translocated peptide substrates. Cryo-EM studies have revealed the details of 
some of these large and complex structures (Figure 31.9a) 


The Post-Translational Eukaryotic Translocation Pathway In the post-translational mode, 


Sec61 forms a stable complex with Sec62 and Sec63 (Figure 31.9b). This complex is 
sufficient for post-translational protein translocation in higher eukaryotes (humans, 


TRAP 


(mammals) 


Figure 31.9 Cryo-EM structures of co-translational and post-translational complexes of Sec61. 

(a) Co-translational complex from Canis lupus familiaris—Sec61, TRAP, OST on ribosome. (Image 
kindly provided by Friedrich Förster and adapted from Gemmer, M. Forster, F., 2020. A clearer picture 
of the ER translocon complex. Journal of Cell Science 133:jcs231340; PDB id = 6FTG). (b) Structure 
of a post-translational complex of Sec61 complex from S. cerevisiae. Subunits Sec61 (purple), Sec63 
(red), Sec72 (yellow), Sec71 (light blue), Sss1 (brown), and Sbh1 (green) are shown. (PDB id = 6ND1). 
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Autophagy—How Cells Recycle Their Materials 


Autophagy is a tightly regulated catabolic process whereby cells 
use their lysosomes to degrade their own macromolecular and 
organellar constituents. Autophagy functions to maintain a bal- 
ance between the synthesis of macromolecules and organelles, 
their degradation, and recycling of their component materials. 
More than 30 evolutionarily conserved genes, the ATG genes, are 
involved in autophagy. Their protein products are designated Atg 
proteins. Autophagy is dependent on the hydrolytic capacities of 
acidic hydrolyases in lysosomes to achieve its ends. Autophagy 
begins when a volume of cytoplasm or an organelle (such as a mito- 
chondrion) becomes separated from the rest of the cell by a cup- 
shaped membranous structure called the phagophore (Figure a). The 
phagophore derives from a pre-autophagosomal structure (PAS), a 
dynamic assembly of Atg proteins with associated lipids that func- 
tions as a membrane-forming apparatus. Closure of the phagophore 
around this sequestered material gives rise to the double-membrane- 
bounded autophagosome. Fusion of the autophagosome membrane 
with the membrane of a lysosome creates a hybrid vesicle, the 
autolysosome (sometimes called an autophagolysosome). Within the 
autolysosome, the contents delivered by the autophagosome are 
broken down so that their constituent amino acids, nucleotides, and 
lipids can be recycled to meet the cell’s need for building blocks to 
create new macromolecules and organelles. 

Three forms of autophagy are recognized: macroautophagy (just 
described), microautophagy (wherein small blebs of cytoplasm are 
engulfed directly by a lysosome), and chaperone-mediated autophagy 
(CMA), in which proteins targeted by Hsc (heat shock cognate)- 


(a) 


Autophagosome 


a 
eo @g ® 


Q8/ Phagophore 


Cytosolic proteins 
and organelles 


Hydrolases 


Lysosome 


Autolysosome 


Adapted from Xie, Z., and Klionsky, D. J., 2007. Autophagosome 
formation: Core machinery and adaptations. Nature Cell Biology 
9:1102-1109. 


containing chaperone/cochaperone complexes are taken up directly 
by lysosomes. The focus here is on macroautophagy, the principal 
form. 

Although autophagy is an ongoing process of homeostasis in 
normal cells, it becomes crucial when extracellular nutrients be- 
come limiting (Figure b). Recycling cellular components through 
autophagy allows the cell to maintain energy production and 
macromolecular synthesis despite a paucity of nutrients. As noted 
in Chapter 27, when mTORCI1 (a protein complex that serves as a 
nutrient and energy sensor) is active, it promotes protein synthesis, 
cell growth, and cell proliferation. mTORCI is a negative regulator 
of autophagy, acting to limit its expression when nutrients are in 
abundance. When mTORCI is inactivated upon nutrient depletion 
(starvation), a particular Atg protein, Atg13, is dephosphorylated, 
a triggering event that culminates in activation of autophagosome 
formation. 

Autophagy as a source of cellular nutrients can be only a 
short-term solution. Interestingly, starvation-induced autophagy 
in cultured mammalian AgRP neurons derived from the hypo- 
thalamus causes an intracellular increase in free fatty acids (FFA). 
In turn, these FFA promote a rise in AgRP levels. Expression of 
AgRP along with NPY is known to promote eating behavior (see 
Figure 27.12). Thus, upon starvation, autophagy provides a short- 
term fix and a long-term solution to nutrient limitation. Autophagy 
is also implicated in obesity, diabetes, cardiovascular disease, neuro- 
degenerative diseases, and cancer. 
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Adapted from Levine, B., 2005. Eating oneself and uninvited guests: 
Autophagey-related pathways in cellular defense. Cell 120:159-162. 
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for example). Sec63 interacts with transmembrane segments in the N-terminal half of 
the Sec61 channel and also with cytosolic loops in the C-terminal half of Sec61. Sec63 
causes Sec6l to expand to an open-channel conformation that allows peptide sub- 
strates to be transported through the channel. Sec63 also contains a domain that ac- 
tivates the BiP ATPase to bind to the incoming nascent protein. BiP prevents the 
substrate from sliding back into the cytosol and also drives it through the Sec61 chan- 
nel. In lower eukaryotes (for example, fungi), the so-called Brl domain of Sec63 blocks 
ribosome binding to Sec6l and also facilitates binding of Sec71 and Sec72. These latter 
subunits capture nascent polypeptides associated with Hsp70 and other chaperones 
and enable post-translational folding. 


Retrograde Translocation Prevents Secretion of Damaged Proteins and Recycles Old ER 
Proteins To prevent secretion of inappropriate proteins, fragmented or misfolded 
secretory proteins are passed from the ER back into the cytosol via Sec61. Thus, Sec61 
also serves as a channel for aberrant secretory proteins to be returned to the cytosol so 
that they can be destroyed by the proteasome degradation apparatus (see Section 31.4). 
Among these proteins are ER membrane proteins that are damaged or no longer needed. 


31.4 | How Does Protein Degradation Regulate 
Cellular Levels of Specific Proteins? 


Cellular proteins in any organism must be present in the correct amounts, and mutant 
and damaged proteins must be degraded. As a result, cellular proteins are in a 
dynamic state of turnover, with the relative rates of protein synthesis and protein 
degradation ultimately determining the amount of protein present at any point in 
time. In many instances, transcriptional regulation determines the concentrations of 
specific proteins expressed within cells, with protein degradation playing a minor 
role. In other instances, the amounts of key enzymes and regulatory proteins, such as 
cyclins and transcription factors, are controlled via selective protein degradation. In 
addition, abnormal proteins arising from biosynthetic errors or post-synthetic dam- 
age must be destroyed to prevent the deleterious consequences of their buildup. The 
elimination of proteins typically follows first-order kinetics, with half-lives (t12) of 
different proteins ranging from several minutes to many days. A single, random pro- 
teolytic break introduced into the polypeptide backbone of a protein is believed suf- 
ficient to trigger its rapid disappearance because no partially degraded proteins are 
normally observed in cells. 

Protein degradation poses a real hazard to cellular processes. To control this hazard, 
protein degradation is compartmentalized, either in macromolecular structures known as 
proteasomes or in degradative organelles such as lysosomes. Protein degradation within 
lysosomes is largely nonselective; selection occurs during lysosomal uptake. Proteasomes 
are found in eukaryotic as well as prokaryotic cells. The proteasome is a functionally and 
structurally sophisticated counterpart to the ribosome. Regulation of protein levels via 
degradation is an essential cellular mechanism. Regulation by degradation is both rapid 
and irreversible. 


31.4a Eukaryotic Proteins Are Targeted for Proteasome 
Destruction by the Ubiquitin Pathway 


Ubiquitination is the most common mechanism to label a protein for proteasome 
degradation in eukaryotes. Ubiquitination has emerged as a major mechanism by 
which virtually all cellular events can be regulated. Ubiquitin is a highly conserved, 
76 residue (8.5 kDa) polypeptide widespread in eukaryotes. Proteins are targeted for 
degradation through ligation to ubiquitin. Three proteins in addition to ubiquitin 
are involved in the ligation process: El, E2, and E3 (Figure 31.10). The human 
genome encodes two Els, at least 40 E2s and more than 600 E3s. E1 is the ubiquitin- 
activating enzyme (105 kDa dimer). It becomes attached via a thioester bond to the 
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Figure 31.10 Enzymatic reactions in the ligation of 
ubiquitin to proteins. Step 1: ATP drives formation 
of a ubiquitinyl-acyladenylate. Step 2: El, the 
ubiquitin-activating enzyme, is linked to ubiquitin 
via a thioester bond to the C-terminal glycine. 
Step 3: Ubiquitin is transferred from E1 to an -SH 
group on E2, the ubiquitin-carrier protein. 

Step 4: Ubiquitin is attached to selected proteins 
via isopeptide bonds formed between the ubiquitin 
carboxy-terminus and free amino groups (a-NH, 
terminus, Lys e-NH; side chains) on the protein. 
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C-terminal Gly residue of ubiquitin through ATP-driven formation of an activated 
ubiquitin-acyladenylate intermediate. Ubiquitin is then transferred from El to an 
SH group on E2, the ubiquitin-carrier protein. In protein degradation, E2-S~ubiquitin 
transfers ubiquitin to free amino groups on proteins selected by E3 (180 kDa), the 
ubiquitin-protein ligase. Upon binding a protein substrate, E3 catalyzes the transfer 
of ubiquitin from E2-S~ubiquitin to free amino groups (usually Lys e-NH;) on the 
protein. Additional regulation of the protein degradation process is provided by the 
attachment of more than one ubiquitin to a protein substrate, and tandemly linked 
chains of ubiquitin also occur, either via isopeptide bonds between the C-terminal 
glycine residue of one ubiquitin and the e-amino of Lys residues in another, or via a 
conventional peptide bond with the N-terminal Met of another ubiquitin. Ubiquitin 
has seven lysine residues, at positions 6, 11, 27, 29, 33, 48, and 63. Isopeptide linkages 
to all of these Lys residues have been found, with the K**-type being most common 
as a degradation signal. 

E3 plays a central role in recognizing and selecting proteins for degradation. E3 
selects proteins by the nature of the N-terminal amino acid. Proteins must have a free 
a-amino terminus to be susceptible. Proteins having either Met, Ser, Ala, Thr, Val, Gly, 
or Cys at the amino terminus are resistant to the ubiquitin-mediated degradation path- 
way. However, proteins having Arg, Lys, His, Phe, Tyr, Trp, Leu, Asn, Gln, Asp, or Glu 
N-termini have half-lives of only 2 to 30 minutes. 

Interestingly, proteins with acidic N-termini (Asp or Glu) show a tRNA requirement for 
degradation (Figure 31.11). Transfer of Arg from Arg-tRNA to the N-terminus of these 
proteins alters their N-terminus from acidic to basic, rendering the protein susceptible to 
E3. It is also interesting that Met is less likely to be cleaved from the N-terminus if the next 
amino acid in the chain is one particularly susceptible to ubiquitin-mediated degradation. 

Most proteins with susceptible N-terminal residues are not normal intracellular pro- 
teins but tend to be secreted proteins in which the susceptible residue has been exposed by 
action of a signal peptidase. Perhaps part of the function of the N-terminal recognition 
system is to recognize and remove from the cytosol any invading foreign or secreted 
proteins. Other proteins targeted for ubiquitin ligation and proteasome degradation 
contain PEST sequences—short, highly conserved sequence elements rich in proline (P), 
glutamate (E), serine (S), and threonine (T) residues. 

The broader role that ubiquitination plays in cells is reflected by the large numbers of 
E2 and E3 that are known. At least 40 E2 ubiquitin-conjugating proteins have been 
identified, along with more than 600 E3 ubiquitin ligases. The E3 ubiquitin ligases are 
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typically complexes of multiple subunits that together recognize a particular target 
protein and facilitate linkage to ubiquitin or to a ubiquitin-like protein (Figure 31.12). 
Ubiquitination is involved in many biological processes, beyond regulation of protein 
stability, including gene transcription, DNA repair, cell cycle progression, receptor 
transport, signal transduction, and immune responses. Given its integral involvement in 
most cellular events, the ubiquitin system is an important therapeutic target, particu- 
larly for treatments of cancer and neurodegenerative disorders. 


31.4b Proteins Targeted for Destruction Are Degraded 
by Proteasomes 


Proteasomes are large oligomeric structures enclosing a central cavity where proteoly- 
sis takes place. Ubiquitinated proteins are directed to proteasomes for degradation. 


The Structures of Proteasomes The 20S proteasome from the archaeon Thermoplasma 
acidophilum is a 700 kDa barrel-shaped structure composed of two different kinds of 
polypeptide chains, a and ß, arranged to form four stacked rings of a78787a7-subunit 
organization. The barrel is about 15 nm in height and 11 nm in diameter, and it con- 
tains a three-part central cavity (Figure 31.13a). The proteolytic sites of the 20S protea- 
some are found within this cavity. Access to the cavity is controlled through a 1.3 nm 
opening formed by the outer a, rings. These rings are believed to unfold proteins 
destined for degradation and transport them into the central cavity. The B-subunits 
possess the proteolytic activity. Proteolysis occurs when the B-subunit N-terminal 
threonine side-chain O atom makes nucleophilic attack on the carbonyl-C of a peptide 
bond in the target protein. The products of proteasome degradation are oligopeptides 
seven to nine residues long. 

Eukaryotic cells contain two forms of proteasomes: the 20S proteasome, and its 
larger counterpart, the 26S proteasome. The eukaryotic 26S proteasome is a 45 nm-long 
structure composed of a 20S proteasome plus two additional substructures known 
as 19S regulators (also called 19S caps or PA700 [for proteasome activator-700 kDa]) 
(Figure 31.13b). Overall, the 26S proteasome (approximately 2.5 megadaltons) has 
2 copies each of 32 to 34 distinct subunits, 14 in the 20S core and 18 to 20 in the cap 
structures. Unlike the archaeal 20S proteasome, the eukaryotic 20S core structure 
contains seven different kinds of a-subunits and seven different kinds of B-subunits. 
Interestingly, only three of the seven different B-subunits have protease active sites. The 
26S proteasome forms when the 19S regulators dock to the two outer a7 rings of the 
20S proteasome cylinder. 


Regulators of the Proteasome Many of the 19S regulator subunits have ATPase activ- 
ity. Replacement of certain 19S regulator subunits with others changes the speci- 
ficity of the proteasome. The 19S regulators cause the proteolytic function of the 20S 
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Figure 31.11 Proteins with acidic N-termini show 

a tRNA requirement for degradation. Arginyl- 
tRNA“"/protein transferase catalyzes the transfer 
of Arg to the free a-NH, of proteins with Asp or 
Glu N-terminal residues. Arg-tR NA“‘%/protein 
transferase serves as part of the protein degradation 
recognition system. 


Figure 31.12 Structures of representative E1, E2, and 
E3 complexes. (a) (Top) A ubiquitin-El heterodimer 
complex (PDB id = 1R4N; ubiquitin is shown in 
blue). (Middle) A ubiquitin-E2 complex (PDB id = 
1FXT; E2 is shown in orange, ubiquitin in blue). 
(Bottom) The clamp-shaped E3 heteromultimer 
(PDB id = 1LDK and 1FQV). The target protein is 
bound between the jaws of the clamp. (b) CryoEM 
structure of a cullin-RING-UBE2D ubiquitin 
ligation assembly. Proteins in the complex include 
cullin (CULI, green), S-phase kinase-associated 
protein 1 (SKP1, blue), a B-transducin repeats- 
containing protein (8-TrCP, go/d), a regulator protein 
(NEDDS, yellow), ubiquitin (Ub, purple), Ring box 

1 (RBX1, dark red), ubiquitin-conjugating enzyme 
E2 D2 (UBE2D, aqua) and the substrate protein, 
IkBa (a protein kinase, gray; PDB id = 6TTU). 
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Figure 31.14 Diagram of the ubiquitin-proteasome 
degradation pathway. Pink lollipop structures 
symbolize ubiquitin molecules. (Adapted from 
Figure | in Hilt, W., and Wolf, D. H., 1996. 
Proteasomes: Destruction as a program. Trends in 
Biochemical Sciences 21:96-102.) 


Figure 31.13 The structure of the 26S proteasome. (a) The yeast (Saccharomyces cerevisiae) 20S 
proteasome core with bortezomib bound (red; PDB id = 2F 16). Bortezomib is the first therapeutic 
proteasome inhibitor used in humans. It is approved in the United States for treatment of relapsed 
multiple myeloma and mantle cell lymphoma. (b) Composite model of the 26S proteasome. The 20S 
proteasome core is shown in yellow; the 19S regulator (19S cap) structures are in blue. (Adapted from 
Figure 5 in Voges, D., Zwickl, P., and Baumeister, W., 1999. The 26S proteasome: A molecular 
machine designed for controlled proteolysis. Annual Review of Biochemistry 68:1015—1068.) 


proteasome to become ATP-dependent and specific for ubiquitinylated proteins as 
substrates. That is, these 19S caps act as regulatory complexes for the recognition and 
selection of ubiquitinylated proteins for degradation by the 20S proteasome core 
(Figure 31.14). The 26S proteasome shows a preference for proteins having four or 
more ubiquitin molecules attached to them. The 19S regulators also carry out the 
unfolding and transport of ubiquitinylated protein substrates into the proteolytic 
central cavity. 

The 19S regulators consist of two parts: the base and the lid. The base subcomplex 
connects to the 20S proteasome and contains the six ATPase subunits that unfold pro- 
teasome substrates. These subunits are members of the AAA+ family of ATPases 
(ATPases associated with various cellular activities); AAA+ ATPases are an evolution- 
arily ancient family of proteins involved in a variety of cellular functions requiring 
energy-dependent unfolding, disassembly, and remodeling of proteins. The lid subcom- 
plex acts as a cap on the base subcomplex and one of its subunits functions in recogni- 
tion and ubiquitin-chain processing of proteasome protein substrates. 


ATPase Modules Mediate the Unfolding of Proteins in the Proteasome The base of the 
19S regulators that cap the 26S proteasome consists of a hexameric ring of AAA+ 
ATPases that mediate the ATP-dependent unfolding of ubiquitinated proteins tar- 
geted for destruction in the proteasome. Structural studies have revealed the presence 
of loops extending from these AAA+ ATPase subunits; these loops face the central 
channel of the hexameric ring. The loops are in an up position when an AAA+ 
ATPase subunit has ATP bound in its active site, but they shift to a down position 
when ADP occupies the active site. Apparently, these loops bind protein substrates 
and act like the levers of a machine, using the energy of ATP hydrolysis to tug the 
protein into an unfolded state and thread it down through the narrow central chan- 
nel. This channel leads into the cavity of the 20S proteasome, where proteolytic 
degradation takes place. The AAA+ ATPase cycle of ATP binding, protein substrate 
binding, ATP hydrolysis, and protein unfolding is reminiscent of the ATP-dependent 
action of Hsp70 chaperones. 
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Ubiquitination is a general regulatory protein modification. Protein ubiquitination is 
a signal for protein degradation, as described in Section 31.4a, but ubiquitin conjuga- 
tion to proteins is also used for other purposes in cells. Nondegradative functions for 
ubiquitination include roles in chromatin remodeling, DNA repair, transcription, sig- 
nal transduction, endocytosis, spliceosome assembly, and sorting of proteins to spe- 
cific organelles and cell structures. In addition, cells possess a variety of protein 
modifiers attached to target proteins by processes similar to the ubiquitin pathway, as 
described in the following section. 


31.4c Small Ubiquitin-like Protein Modifiers 
Are Post-transcriptional Regulators 


Small ubiquitin-like protein modifiers (SUMOs) are a highly conserved family of 
proteins found in all eukaryotic cells. Like ubiquitin, SUMO family members are 
covalently ligated to lysine residues in target proteins by a three-enzyme conjugating 
system (Figure 31.15). SUMO proteins share only limited homology to ubiquitin, and 
sumoylated proteins are not targeted for destruction. Instead, sumoylation alters the 
ability of the modified protein to interact with other proteins. This ability to change 
protein-protein interactions is believed to be the biological purpose of SUMO 
proteins. Post-translational attachment of SUMOs is a key regulatory protein 
modification in eukaryotic cells. Hundreds of proteins involved in DNA repair, 
transcription, macromolecular assembly, protein homeostasis and trafficking, and 
signal transduction are subject to reversible sumoylation. 

Sumoylation can have three general consequences for modified proteins (Figure 31.16): 
(1) it can interfere with the interactions between the target and its partner so that the 
interaction can occur only in the absence of sumoylation; (2) sumoylation can create a 
binding site for an interacting partner protein; and (3) sumoylation can induce a confor- 
mational change in the modified target protein, altering its interactions with partner 
proteins. The regulatory opportunities associated with sumoylation are significant 
for many cellular functions, including transcriptional regulation, chromosome organi- 
zation, nuclear transport, and signal transduction. Overall, sumoylation represents a 
protective response. 


Figure 31.15 The mechanism of reversible sumoylation. Before conjugation, small ubiquitin-like 
protein modifiers (SUMOs) need to be proteolytically processed, removing anywhere from 

2 to 11 amino acids to reveal the C-terminal Gly—Gly motif. SUMOs are then activated by the El 
enzyme in an ATP-dependent reaction to form a thioester bond between SUMO and E1. SUMO is 
then transferred to the catalytic Cys residue of the E2 enzyme, Ubc9. Finally, an isopeptide bond 

is formed, between the C-terminal Gly of SUMO and a Lys residue on the substrate protein, through 
the action of an E3 enzyme. The SUMO-specific protease SENP can deconjugate SUMO from 
target proteins. 
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Figure 31.16 The molecular consequences of sumoylation. The process can affect a modified 
protein in three ways: (a) Sumoylation can interfere with the interaction between a target protein 
and its binding partner. (b) Sumoylation can provide a new binding site for an interacting partner. 
(c) Sumoylation can induce a conformational change in the modified target protein. 


Many E2 enzymes participate in ubiquitination processes, but the only known 
SUMO E2 enzyme is Ubc9 (Figure 31.17a). Ubc9 recognizes a YKXD/E consensus 
sequence in proteins destined for sumoylation. In this sequence, w is an aliphatic 
branched amino acid (such as Leu), K is the lysine to which SUMO is conjugated, and 
X is any amino acid, with an acidic D or E completing the sequence. Recognition by 
Ubc9 is possible only if the consensus sequence is in a relatively unstructured part of a 
target protein or if it is part of an extended loop, as in RanGAP!1 (Figure 31.17a). In the 
Ubc9-RanGAP!I complex, Leu*** of RanGAP1 is in van der Waals contact with several 
nonpolar Ubc9 residues, whereas RanGAP1 Lys*° lies in a hydrophobic groove of 
Ubc9, juxtaposed with the catalytic Cys” (Figure 31.17b). The pK, of Lys? is lowered 
in this complex, activating the lysine amino group for nucleophilic attack to form the 
SUMO-target protein conjugate. 


(b) 


Figure 31.17 Ubc9 is a SUMO E2 enzyme. (a) The complex formed by the E2 enzyme, Ubc9 (yellow), 
and a target protein, RanGAP!1 (blue). The helices of both proteins are labeled alphabetically. (b) In 
the Ubc9-RanGAP1 complex, the exposed loop of RanGAPI lies in the binding pocket of Ubc9. The 
exposed loop contains the consensus sequence for sumoylation (WK XD/E), including Leu, which is 
surrounded by hydrophobic residues from Ubc9; Lys*°, which is coordinated by Asp!?’ and Cys” of 
Ubc9; and Glu”, which is coordinated by Ubc9 Thr*!. (PDB id = 1KPS) 
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Sumoylation is reversed by a small number of isopeptidases that cleave the target 
protein-Lys-SUMO linkage. Six proteins in mammals, known as sentrin-specific 
proteases (SENPs) and denoted SENP-1, -2, -3, -5, -6, and -7, possess varying specifici- 
ties for SUMOylated target proteins, as well as their intracellular locations. 


31.4d HtrA Proteases Also Function in Protein Quality Control 


The discussion thus far has stressed the importance of protein quality control to cellular 
health. HtrA proteases are a class of proteins involved in quality control that combine the 
dual functions of chaperones and proteasomes. (The acronym Htr comes from “high 
temperature requirement” because E. coli strains bearing mutations in HtrA genes do not 
grow at elevated temperatures.) In addition to their novel ability to be either chaperones 
or proteases, HtrA proteases are the only known protein quality control factor that is not 
ATP-dependent. Prokaryotic HtrA proteases act as chaperones at low temperatures 
(20°C) where they have negligible protease activity. However, as the temperature increases, 
these proteins switch from a chaperone function to a protease function to remove mis- 
folded or unfolded proteins from the cell. With this functional duality, HtrA proteases 
have the potential to mediate quality control through protein triage (see Section 31.4e: 
Protein Triage—A Model for Quality Control). 

Three HtrA proteases, denoted DegP, DegS, and DegQ, are expressed in E. coli. The 
human counterparts are HtrA 1-4. The E. coli HtrA protein DegP is the best characterized 
HtrA protease. DegP is localized in the E. coli periplasmic space, where it oversees qual- 
ity control of proteins involved in the cell envelope. It is a 448 residue protein containing 
a central protease domain with a classic Ser protease Asp-His-Ser catalytic triad (see 
Chapter 14) and two C-terminal PDZ domains. These domains are structural modules 
involved in protein-protein interactions, and they recognize and bind selectively to the 
C-terminal three or four residues of target proteins. The PDZ domains act as gatekeep- 
ers, determining access of protein substrates to the proteolytic centers. Protease and 
chaperone activity of all HtrA proteins is dependent on the oligomeric state. The basic 
building block is a funnel-shaped trimer consisting of protease domains which form the 
central core and outward-protruding PDZ domains (Figure 31.18a). Binding of 
unfolded protein targets prompts formation of higher oligomeric states and activates 
protease function. DegP, for example, can form hexamers (Figure 31.18b) and also 
12-, 15-, 18-, 24-, and 30-mers in the presence of substrate proteins that can be refolded 


Figure 31.18 Oligomeric states of the HtrA proteases. (a) the HtrA protomer, showing the protease 
domain and the two PDZ domains (PDB id = 1K Y9). (b) the DegP hexamer (PDB id = 1KY9). 
(c) a DegP 12-mer (PDB id = 2ZLE). (d) a DegP 24-mer (PDB id = 3CS0). 


(a) 
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Proteasome Inhibitors in Cancer Chemotherapy 


Proteasome inhibition offers a promising approach to treating can- 
cer. The counterintuitive rationale goes like this: The proteasome is 
responsible for the regulated destruction of proteins involved in cell 
cycle progression and the control of apoptosis (programmed cell 
death). Inhibition of proteasome function leads to cell cycle arrest 
and apoptosis. In clinical trials, proteasome inhibitors have retarded 
cancer progression by interfering with the programmed degradation 
of regulatory proteins, causing cancer cells to self-destruct. 
Bortezomib, a slowly-reversible proteasome inhibitor developed 
by Millenium Pharmaceuticals, Inc., has been used since 2003 for 


the treatment of multiple myeloma, a cancer of plasma cells that 
accounts for 10% of all cancers of the blood. Carfilzomib is a tet- 
rapeptide epoxyketone proteasome inhibitor that binds irreversibly 
and provides prolonged proteasome inhibition. Both bortezomib 
and carfilzomib must be administered by intravenous infusion. 
Ixazomib acts similarly to bortezomib but is taken in capsules by 
mouth. 


Reference 
Manasanch, E. E., and Orlowski, R. Z., 2017. Proteasome inhibitors in 
cancer therapy. Nature Reviews Clinical Oncology 14:417-433. 
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Proteasome inhibitors used to treat multiple myeloma. 


or degraded. Human HtrA2 is active as a trimer, but human HtrA1 assembles into an 
active 12-mer complex upon binding of unfolded polypeptides. The multiple oligomeric 
states of HtrA proteases provide a novel mechanism for substrate specificity and 
chaperone and protease activity. The different oligomeric states possess different sizes 
and geometries of central protease cavities, as well as different arrangements of the sur- 
rounding PDZ domains. For example, Figure 31.19 shows a 12-mer of E. coli DegQ 
with six lysozyme molecules in the central protease cavity. 

Human HtrA proteases are implicated in stress response pathways and also in a wide 
range of diseases and disorders, including Alzheimer’s and Parkinson’s diseases and 
cancer. Human HtrA1 is expressed at higher levels in osteoarthritis and aging and lower 
levels in ovarian cancer and melanoma. Secreted human Htr1 may be involved in deg- 
radation of extracellular matrix proteins involved in arthritis as well as tumor progres- 
sion and invasion. 
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Figure 31.19 A DegQ 12-mer with six folded lysozyme molecules (red) coordinated in the central ne 
cavity (PDB id =4A8B). / Protein bound 
to proteasome 
31.4e Protein Triage—A Model for Quality Control protem oT i U 
Triage is a medical term for the sorting of patients according to their need for (and their Degraded protein 


likelihood to benefit from) medical treatment. Sue Wickner, Michael Maurizi, and 
Susan Gottesman have pointed out that cells control the quality of their proteins 
through a system of triage based on chaperones and the ubiquitination-proteasome 
degradation pathway. These systems recognize non-native proteins (proteins that are 
partially folded, misfolded, incorrectly modified, damaged, or in an inappropriate 
compartment). Depending on the severity ofits damage, a non-native protein is directed 
to chaperones for refolding or targeted for destruction by a proteasome (Figure 31.20). 


Figure 31.20 A triage system recognizes non-native 
proteins, partially folded proteins, and misfolded 
proteins and directs them to chaperones for refolding 
or to proteasomes for degradation. (Adapted from 
Figures 2 and 3 in Wickner, S., Maurizi, M., and 
Gottesman, S., 1999. Post-translational quality 
control: Folding, refolding, and degrading proteins. 
Science 286:1888-1893.) 


Foundational Biochemistry Things You Should Know after Reading Chapter 31. 


expression, mRNA stability, and protein synthesis, but also by post- 
translational modifications, protein-protein interaction, intracellular 
localization, and protein degradation. 


Nascent proteins are often assisted in folding by a family of helper 
proteins known as molecular chaperones, whose actions are often 
ATP-dependent. 


Hsp70 chaperones bind to hydrophobic regions of extended polypep- 
tides and use the energy of ATP hydrolysis to drive protein folding. 


The GroES-GroEL complex of E. coli is an Hsp60 chaperonin. It 
forms a cylindrical structure with a central cavity where ATP hydro- 
lysis drives the folding of nascent proteins. 


The eukaryotic Hsp90 chaperone system acts on proteins of signal 
transduction pathways and facilitates the maturation of these pro- 
teins. 


Proteolytic cleavage is the most common form of post-translational 
processing of proteins. 


Proteins are targeted to their proper cellular locations by signal se- 
quences. 


e The functional protein pool of any cell is determined not only by gene * 


Protein translocation involves interactions of signal recognition par- 
ticles, signal receptors, translocons, and proteolytic processing. 


Prokaryotic proteins destined for translocation are synthesized as 
pre-proteins. 


Eukaryotic proteins are routed to their proper destinations by protein 
sorting and translocation. 


The synthesis of secretory proteins and many membrane proteins is 
coupled to translocation across the ER membrane. 


The eukaryotic translocon is a complex of proteins that transports 
nascent polypeptides with targeting signal sequences into the interior 
of the endoplasmic reticulum. 


Binding of the ribosome-nascent chain/signal recognition particle 
(RNC-SRP) complex to a signal receptor delivers the RNC to the 
membrane. 


Retrograde translocation prevents secretion of damaged proteins and 
recycles old ER proteins. 


Eukaryotic proteins are targeted for proteasome destruction by the 
ubiquitin pathway. 
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Proteasomes are large oligomeric structures enclosing a central cavity 
where proteolysis takes place. 


The 26S proteasome is a 45 nm-long structure composed of a 20S 
proteasome plus two 19S regulators. ATPase modules mediate the 
unfolding of proteins in the 26S proteasome. 


Completing the Protein Life Cycle: Folding, Processing, and Degradation 


Small ubiquitin-like protein modifiers are post-transcriptional regula- 
tors that are covalently ligated to lysine residues on target proteins. 
SUMOrylation alters the ability of the modified proteins to interact 
with other proteins. 


HtrA proteases are proteins involve in quality control that combine 
the dual functions of chaperones and proteasomes. 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. 


10. 


Assessing the ATP Cost of Protein Synthesis (Integrates with 

Chapter 30.) Human rhodanese (33 kDa) consists of 296 amino acid 
residues. Approximately how many ATP equivalents are consumed in 
the synthesis of the rhodanese polypeptide chain from its constituent 
amino acids and the folding of this chain into an active tertiary 
structure? 


. Understanding the Consequences of Proteolysis A single proteolytic 


break in a polypeptide chain of a native protein is often sufficient to 
initiate its total degradation. What does this fact suggest to you 
regarding the structural consequences of proteolytic nicks in proteins? 


. Understanding the Relevance of Chaperones in Protein Folding Protein 


molecules, like all molecules, can be characterized in terms of general 
properties such as size, shape, charge, and solubility/hydrophobicity. 
Consider the influence of each of these general features on the 
likelihood of whether folding of a particular protein will require 
chaperone assistance or not. Your analysis should distinguish between 
proteins that interact only with Hsp70 chaperones and those that 
interact with both Hsp70 chaperones and Hsp60 chaperonins. 


. Assessing the Chaperone-Assisted Folding of Multidomain 


Proteins Many multidomain proteins apparently do not require 
chaperones to attain the fully folded conformations. Suggest a 
rational scenario for chaperone-independent folding of such proteins. 


. Assessing the Dimensions and Capacity of GroEL The GroEL ring has 


a 5 nm central cavity. Calculate the maximum molecular weight for a 
spherical protein that can just fit in this cavity, assuming the density 
of the protein is 1.25 g/mL. 


. Understanding the Possible Phosphorylated States of Acetyl-CoA 


Carboxylase (Integrates with Chapter 24.) Acetyl-CoA carboxylase has 
at least seven possible phosphorylation sites (residues 23, 25, 29, 76, 77, 
95, and 1200) in its 2345 residue polypeptide (see Figure 24.4). How 
many different covalently modified forms of acetyl-CoA carboxylase 
protein are possible if there are seven phosphorylation sites? 


. Covalent Modifications of Proteins A protein’s primary structure 


contains two T, one S, and two Y residues. Assume that each residue 
could be phosphorylated. How many different forms of this protein 
could exist? 


. Comparing the Mechanisms of Action of EF-Tu/EF-Ts and DnaK/ 


GrpE (Integrates with Chapter 30.) In what ways are the mechanisms 
of action of EF-Tu/EF-Ts and DnaK/GrpE similar? What mechanis- 
tic functions do the ribosome A-site and DnaJ have in common? 


. Understanding the Structure and Function of Signal Sequences and 


Signal Peptidases The amino acid sequence deduced from the 
nucleotide sequence of a newly discovered human gene begins: 
MRSLLILVLCFLPLAALGK.... . Is this a signal sequence? If so, 
where does the signal peptidase act on it? What can you surmise 
about the intended destination of this protein? 


Understanding the Mechanism of Transport of Integral Membrane 
Proteins Not only is the Sec61 translocon complex essential for 
translocation of proteins into the ER lumen, it also mediates the 
incorporation of integral membrane proteins into the ER membrane. 
The mechanism for integration is triggered by stop-transfer signals 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


that cause a pause in translocation. The structure of Sec61 is shown 
in Figure 31.9. Suggest a mechanism for lateral transfer of an integral 
membrane protein from the protein-conducting channel of the 
translocon into the hydrophobic phase of the ER membrane. 


Assessing the Dimensions and Capacity of Sec61 The Sec61 core 
complex of the translocon has a highly dynamic pore whose internal 
diameter varies from 0.6 to 6 nm. In post-translational translocation, 
folded proteins can move across the ER membrane through this pore. 
What is the molecular weight of a spherical protein that would just 
fit through a 6 nm pore? (Assume a spherical protein with a density 
of 1.25 g/mL.) 


Assessing the Dimensions and Capacity of the Protein-Conducting 
Translocon (Integrates with Chapters 6, 9, and 30.) During co- 
translational translocation, the peptide tunnel running from the 
peptidyl transferase center of the large ribosomal subunit and the 
protein-conducting channel are aligned. If the tunnel through the 
ribosomal subunit is 10 nm and the translocon channel has the same 
length as the thickness of a phospholipid bilayer, what is the 
minimum number of amino acid residues sequestered in this 
common conduit? 


Understanding the Nature of the Ubiquitin-Ubiquitin Linkage 
Draw the structure of the isopeptide bond formed between Gly” of 
one ubiquitin molecule and Lys* of another ubiquitin molecule. 


Ubiquination of E1 and E2 Ubiquitin is attached to both El and E2 
(see Figure 31.10) and yet El and E2 are not degraded? Explain why 
this the case. 


Understanding the Nature of E3 Ubiquitin Protein Ligation 

Assign the 20 amino acids to either of two groups based on their 
susceptibility to ubiquitin ligation by E3 ubiquitin protein ligase. 
What are the common attributes among the amino acids in the less 
susceptible versus the more susceptible group? 


Proteolysis Inside the Proteasome (Integrates with Chapter 14.) 
Section 31.4b discusses the mechanism of the proteolytic action inside 
the proteasome. Draw the sequence of chemical reactions that occur 
to achieve a complete hydrolysis of the peptide bond. (Hint: Revisit 
Section 14.6.) 


Assessing the Consequences of Proteasome Inhibition Lactacystin is a 
Streptomyces natural product that acts as an irreversible inhibitor of 
26S proteasome B-subunit catalytic activity by covalent attachment to 
N-terminal threonine —OH groups. Predict the effects of lactacystin 
on cell cycle progression. 


Action of Met-Aminopeptidase and Half-Life of Proteins Explain how 
the activity of Met-aminopeptidase may affect the longevity of a 
protein. 


Understanding the Dual Functionality of the HtrA Protease 

HtrA proteases are dual-function chaperone-protease protein quality 
control systems. The protease activity of HtrA proteases depends on 
a proper spatial relationship for the Asp-His-Ser catalytic triad. 
Propose a mechanism for the temperature-induced switch of HtrA 
proteases from chaperone function to protease function. 


Ubc9 Consensus Sequence In the five peptides shown, circle the 
sequence of four amino acids recognized by Ubc9 (see Section 31.4¢c). 
(Hint: Only two contain the sequence.) 
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21. Understanding the Scope and Diversity of Post-Translational Protein 
Modification As described in this chapter, the most common 
post-translational modifications of proteins are proteolysis, phos- 
phorylation, methylation, acetylation, and linkage with ubiquitin and 
SUMO proteins. Carry out a Web search to identify at least eight 
other post-translational modifications and the amino acid residues 
involved in these modifications. 


ao 7S 


22. Post-Translational Modifications of Proteins (Integrates with Chapters 4, 
5, 7, 8, 19.) A select range of post-translation modifications of proteins 
is discussed in this chapter. Review previous chapters regarding protein 
structure and/or function, and list at least five other typical covalent 
modifications of proteins that can occur post-translation. 


23. Understanding the Nature and Functional Utility of Fluorescence 
Resonance Energy Transfer (FRET) Fluorescence resonance energy 
transfer (FRET) is a spectroscopic technique that can be used to 
provide certain details of the conformation of biomolecules. Look up 
FRET on the Web or in an introductory text on FRET uses in 
biochemistry, and explain how FRET could be used to observe 
conformational changes in proteins bound to chaperonins such as 
GroEL. A good article on FRET in protein folding and dynamics can 
be found here: Haas, E., 2005. The study of protein folding and 
dynamics by determination of intramolecular distance distributions 
and their fluctuations using ensemble and single-molecule FRET 
measurements. ChemPhysChem 6:858-870. Studies of GroEL using 
FRET analysis include the following: Sharma, S., et al., 2008. 
Monitoring protein conformation along the pathway of chaperonin- 
assisted folding. Cell 133:142—153; and Lin, Z., et al., 2008. GroEL 
stimulates protein folding through forced unfolding. Nature Structural 
and Molecular Biology 15:303-311. 
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24. Understanding the Nature and Functions of the Phosphodegron 
The cross-talk between phosphorylation and ubiquitination in protein 
degradation processes is encapsulated in the concept of the phospho- 
degron. What is a phosphodegron, and how does phosphorylation 
serve as a recognition signal for protein degradation? (A good reference 
on the phosphodegron and crosstalk between phosphorylation and 
ubiquitination is Hunter, T., 2007. The age of crosstalk: Phosphoryla- 
tion, ubiquitination, and beyond. Molecular Cell 28:730-738.) 


25. Assessing the Consequences of a Common Post-Translational 
Modification A common post-translational modification is removal 
of the universal N-terminal methionine in many proteins by 
Met-aminopeptidase. How might Met removal affect the half-life 
of the protein? 


Think-Pair-Share Question 


Proteins as the Primordial Molecules of Life The “Deeper Look” segment 
in Section 30.5f described the notion of RNAs as the primordial 
molecules of life. The assumption is that billions of years ago primordial 
RNA molecules fulfilled the basic functions of biochemistry: carrying 
information and enabling catalysis. This has led to the hypothesis that life 
began in a so-called RNA world. 


Some of the basic assumptions of any primordial world are: 

1. A primordial soup of precursors that could have formed basic 
carbohydrates, nucleotides, or amino acids; 

2. energy for chemical reactions in the forms of UV radiation and 
geothermal energy; and 

3. minerals with microscopic porous surfaces that could have 
served as catalytic entities to promote chemical reactions. 


Review everything you have learned about protein science (particularly 
chapters 4, 5, 6, and 31) and consider the possibility that protein mol- 
ecules could serve as the primordial molecules of life. In evaluating this 
possibility, focus on: information storage, ability to replicate (without 
error), and ability to function. 
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The Reception 32 
and Transmission 
of Extracellular Information 


“Ships that pass in the night, and speak each other 
in passing, Only a signal shown and a distant voice 
in the darkness.” 


by Henry Wadsworth Longfellow 


< Drawing of a human fetus in utero, by 
Leonardo da Vinci. Human sexuality 
and embryonic development represent 
two hormonally regulated processes of 
universal interest. 


Higher life forms must have molecular mechanisms for detecting environmental 32.1 What Are Hormones? 
information as well as mechanisms that allow for communication at the cell and tissue 32.2 What Is Signal Transduction? 
levels. Sensory systems detect and integrate physical and chemical information from 
the environment and pass this information along to the rest of the organism. Control 
and coordination of processes at the cell and tissue levels are achieved by chemical 
signals that are secreted by one set of cells to direct the activity of other cells. What 
are the mechanisms of this information transfer and how do excitable membranes 
transduce the signals of sensory systems? 


Windsor/SuperStock 


32.3 How Do Signal-Transducing Receptors 
Respond to the Hormonal Message? 


32.4 How Are Receptor Signals 
Transduced? 


32.5 How Do Effectors Convert the Signals 
to Actions in the Cell? 


32.6 How Are Signaling Pathways 
Organized and Integrated? 


32.7 How Do Neurotransmission Pathways 
Control the Function of Sensory 
Systems? 
nformation transfer from cell to cell and between tissues is mediated by hormones. 


Hormones are secreted by certain cells, usually located in glands, and travel, either 
by simple diffusion or circulation in the bloodstream, to specific target cells 
(Figure 32.1). As we shall see, some hormones, whether steroid or nonsteroid, bind to 
specialized receptors on the plasma membrane and induce responses within the cell 
without themselves entering the target cell (Figure 32.2). Other hormones actually 
enter the target cell and interact with specific receptors there. By these mechanisms, 
hormones: 


e Regulate the metabolic processes of various organs and tissues. 

e Facilitate and control growth, differentiation, reproductive activities, learning, and 
memory. 

e Help the organism cope with changing conditions and stresses in its environment. 


All these consequences of hormonal signals are either metabolic responses or gene 
expression responses (Figure 32.2). 
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Figure 32.1 The sites of synthesis and action of a 
few of the polypeptide and steroid hormones. 
Hormones typically circulate at low concentrations 
(1 nM or less) and bind with high affinity to their 
receptor proteins. 


Nonsteroid 
hormone 


Testosterone 

Estradiol 

(many: vascular system, 
reproductive organs, 
central nervous system, 
gastrointestinal tract, 
immune system, skin, 
kidney, and lung) 


Bradykinin 
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Prolactin 
(breast) 


hormone (ACTH) 
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bladder, pancreas) 
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Metabolic responses 


Figure 32.2 Nonsteroid hormones bind exclusively to 
plasma membrane receptors, which mediate the 
cellular responses to the hormone. Steroid hormones 
exert their effects either by binding to plasma 
membrane receptors or by diffusing to the nucleus, 
where they modulate transcriptional events. 
Intracellular responses to hormone binding include 
metabolic changes and alterations of gene expression. 


Y 


Gene expression 
Transcription responses 


Nucleus 
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32.1 | What Are Hormones? 


Many different chemical species act as hormones. Steroid hormones, all derived from 
cholesterol, regulate metabolism, salt and water balances, inflammatory processes, and 
sexual function. Several hormones are amino acid derivatives. Among these are epineph- 
rine and norepinephrine (which regulate smooth muscle contraction and relaxation, 
blood pressure, cardiac rate, and the processes of lipolysis and glycogenolysis) and the 
thyroid hormones (which stimulate metabolism). Peptide hormones are a large group of 
hormones that regulate processes in all body tissues, including the release of yet other 
hormones. 

Hormones and other signal molecules in biological systems bind with very high 
affinities to their receptors, displaying Kp values in the range of 10~!? to 1076 M. The 
hormones are produced at concentrations equivalent to or slightly above these Kp 
values. Once hormonal effects have been induced, the hormone is usually rapidly 
metabolized. 


32.la Steroid Hormones Act in the Nucleus or at the 
Plasma Membrane 


The steroid hormones include the glucocorticoids (cortisol and corticosterone), the 
mineralocorticoids (aldosterone), and the sex hormones (progesterone and testoster- 
one, for example) (Figure 32.1; see Chapter 24 for the details of their synthesis). The 
steroid hormones exert their effects in two ways. First, by entering cells and migrating 
to the nucleus, steroid hormones act as transcription regulators, modulating gene 
expression. These effects of the steroid hormones occur on time scales of hours and 
involve synthesis of new proteins. Steroids can also act at the cell membrane, directly 
regulating ligand-gated ion channels and perhaps other processes. These latter pro- 
cesses take place very rapidly, on time scales of seconds and minutes. 


32.1b Polypeptide Hormones Share Similarities 
of Synthesis and Processing 


The largest class of hormones in vertebrate organisms is the polypeptide hormones 
(Figure 32.1). One of the first polypeptide hormones to be discovered, insulin, was 
described by Banting and Best in 1921. Insulin, a secretion of the pancreas, controls 
glucose utilization and promotes the synthesis of proteins, fatty acids, and glycogen. 
Insulin, which is typical of the secreted polypeptide hormones, is discussed in detail in 
Chapters 5, 15, and 22. 

Many other polypeptide hormones are produced and processed in a manner similar 
to that of insulin. Three unifying features of their synthesis and cellular processing 
should be noted. First, all secreted polypeptide hormones are originally synthesized 
with a signal sequence, which facilitates their eventual routing to secretory granules, and 
from there to the extracellular milieu. Second, peptide hormones are usually synthesized 
from mRNA as inactive precursors, termed preprohormones, which become activated by 
proteolysis. Third, a single polypeptide precursor or preprohormone may produce sev- 
eral different peptide hormones by suitable proteolytic processing. 

An impressive example of the production of many hormone products from a single 
precursor is the case of prepro-opiomelanocortin, a 250 residue precursor peptide 
synthesized in the pituitary gland. A cascade of proteolytic steps produces, as the name 
implies, a natural opiate substance (endorphin) and several other hormones (Figure 32.3). 
Endorphins and other opiate-like hormones are produced by the body in response to 
systemic stress. These substances probably contribute to the runner’s high that mara- 
thon runners describe, reducing pain and promoting pleasure. 


32.1 


What Are Hormones? 
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Figure 32.3 The conversion of prepro-opiomelanocortin to a family of peptide hormones, including 
corticotropin, B- and y-lipotropin, a- and B-MSH, and endorphin. 
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32.2 | What Is Signal Transduction? 


Hormonal regulation depends on the transduction of the hormonal signal across the 
plasma membrane to specific intracellular sites, particularly the nucleus. This signal 
transduction consists of a stepwise progression of signaling stages: receptor—> 
transducers—> effectors. The receptor detects the signal, transducers relay the signal, 
and the effectors convert the signal into an intracellular response. 
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What are the characteristics of signal transduction systems? Regardless of the organ- 
ism or the tissue, certain features appear to be universal, or nearly so: 


° Transmembrane communication of hormonal signals by receptor proteins 

e Clustering of membrane receptors and their ligands in large aggregates called 
signalsomes 

e Reversible covalent modifications that change the function of certain proteins and 
lipids (including phosphorylation, methylation, acetylation, ubiquitylation, hydrox- 
ylation, and cleavage) 

e Protein interaction domains that selectively recognize specific structural motifs and 
bind them with high affinity and specificity 

e Second messengers that bind to specific targets, changing their activity and 
behavior 

e Intracellular signaling pathways, often involving a series of enzymes (such as protein 
kinases), that link receptors to their downstream functional targets 

e Cooperativity, in which a structural change in one subunit of a multimeric protein 
induces similar changes in the other subunits 

e Spatial and temporal control of signals and messengers 

e Integration of signals 

e Converging and diverging networks 

e Signal amplification 

e Desensitization and adaptation 


Each of these phenomena will be discussed and elaborated throughout this chapter. 


32.2a Many Signaling Pathways Involve Enzyme Cascades 


Signaling pathways must operate with speed and precision, facilitating the accurate relay 
of intracellular signals to specific targets. This happens via enzyme cascades. Enzymes 
can produce (or modify) a large number of molecules rapidly and specifically. Many 
enzymes of signaling cascades are protein kinases and protein phosphatases, and many 
steps in signaling pathways involve phosphorylation of serine, threonine, and tyrosine 
residues on target proteins. The complexity of signal transduction is thus manifested in 
the estimates that the human genome contains over five hundred protein kinase genes 
and 200 protein phosphatase genes. Enzyme cascades act like a series of amplifiers, dra- 
matically increasing the magnitude of the intracellular response available from a very 
small amount of hormone (for an example, see the discussion of the phosphorylase 
cascade in Figure 15.18 and Section 22.5). 


32.2b Signaling Pathways Connect Membrane Interactions 
with Events in the Nucleus 


The complete pathway from hormone binding at the plasma membrane to modulation 
of transcription in the nucleus has been described for many signaling pathways. 
Figure 32.4 shows a complete signal transduction pathway that connects receptor tyro- 
sine kinases, the Ras GTPase, cytoplasmic Raf, and two other protein kinases with 
transcription factors that alter gene expression in the nucleus. This pathway represents 
just one component of a complex signaling network that involves many other proteins 
and signaling factors. The existence of nearly 4000 human genes devoted to signal trans- 
duction portends a complex and interwoven network of signaling interactions in nearly 
all human cells. 


32.2c Signaling Pathways Depend on Multiple 
Molecular Interactions 
Each step in a signaling pathway depends on one or more molecular interactions. For 


example, a protein along a signaling pathway may bind to a small molecule, a peptide, 
or even another protein (Figure 32.5). In fact, many signaling proteins are constructed 


32.2 What Is Signal Transduction? 
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Figure 32.4 A complete signal transduction Nucleus 


pathway that connects a hormone receptor with 
transcription events in the nucleus. In this pathway, 


hormone binding causes phosphorylation of the Transcription 
tyrosine kinase domain of the receptor tyrosine factor 
kinase protein. Binding of protein GRB2 to the phosphorylation 


phosphorylated tyrosine of the RTK recruits the 
SOS protein which binds to the Ras protein, causing 
dissociation of GDP and binding of GTP. The 
Ras-GTP complex binds to and activates the Raf 
protein, catalyzing phosphorylation of the MAPK 
kinase, which in turn phosphorylates the MAP 
kinase (MAPK). This leads to stimulation of gene 
transcription in the nucleus, activating synthesis of 
cytosolic proteins with a variety of functions. The 
steps in this signaling cascade are elaborated in more y 


detail later in this chapter. Many similar pathways | Protein synthesis | 
have been characterized. 


Gene transcription 
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Figure 32.5 Five of the protein modules found in cell signaling proteins. The binding specificity 

of these modules is shown in Figure 32.6a. (a) SH3 domains bind to proline-rich peptides; 

(b) SH2 domains bind to phosphorylated tyrosine residues; (c) PH domains bind to phospho- 
inositides (such as IP3); (d) PDZ domains bind to the terminal four or five residues of a target 
protein; and (e) pumilio domains bind to segments of RNA. A given protein may have a number of 
these protein modules, giving it the ability to interact with multiple partners. 


in a cassette-like fashion with several distinct modules, termed protein interaction 
domains (PIDs), each of which mediates a particular interaction (Figure 32.6a). 
A protein with multiple modules can interact with several binding targets at once to 
create a complex, called a signalsome, with multiple signaling capabilities (Figure 32.6b). 
Several signaling systems described in this chapter involve events at a signalsome. 
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Figure 32.6 Many signaling proteins consist of 
combinations of protein modules, each with a specific 
binding or enzymatic function. (a) A hypothetical 
signaling protein consisting of SH3, SH2, PH, WW, 
and PDZ domains. The binding preferences for each 
of these domains are described in Figure 32.5. 

(b) Such multiply-interacting proteins can act as 
scaffolds that direct the assembly of large signaling 
complexes, termed signalsomes. 


(b) 


li, 


XXX — COO™ 


32.3 | How Do Signal-Transducing Receptors Respond 


to the Hormonal Message? 


All receptors that mediate transmembrane signaling processes fit into one of three 
receptor superfamilies (Figure 32.7): 


The G-protein—coupled receptors are integral membrane proteins with an extracel- 
lular recognition site for ligands and an intracellular recognition site for a GTP- 
binding protein (see following discussion). 


. The single-transmembrane segment (1-TMS) catalytic receptors are proteins with 


only a single transmembrane segment and substantial globular domains on both 
the extracellular and the intracellular faces of the membrane. The extracellular 
domain is the ligand recognition site, and the intracellular catalytic domain is either 
a tyrosine kinase or a guanylyl cyclase. 


. Oligomeric ion channels consist of associations of protein subunits, each of which 


contains several transmembrane segments. These oligomeric structures are ligand- 
gated ion channels. Binding of the specific ligand typically opens the ion channel. 
The ligands for these ion channels are neurotransmitters. 


The first two of these receptor superfamilies will be discussed in this section and the 
third will be discussed in Section 32.7. 


32.3a The G-Protein—Coupled Receptors Are 7-TMS Integral 


Membrane Proteins 


The G-protein—coupled receptors (GPCRs) have primary and secondary structure 
similar to that of bacteriorhodopsin (see Chapter 9), with seven transmembrane 
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a-helical segments; they are thus known as 7-transmembrane segment (7-TMS) 
proteins. Rhodopsin and the B-adrenergic receptors, for which epinephrine is a 
ligand, are good examples (Figure 32.8). 

The site for binding of cationic catecholamines to the adrenergic receptors is 
located within the hydrophobic core of the receptor. Binding of hormone to a 
GPCR induces a conformation change that activates a GTP-binding protein, also 
known as a G protein. Activated G proteins trigger a variety of cellular effects, includ- 
ing activation of adenylyl and guanylyl cyclases (which produce cAMP and cGMP from 
ATP and GTP), activation of phospholipases (which produce second messengers from 
phospholipids), and activation of Ca** and K* channels (which leads to elevation of 
intracellular [Ca**] and [K*]). (All of these effects are described in Section 32.4.) 


(a) Bo-Adrenergic receptor (b) 


> ae ae, 
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Membrane 
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Figure 32.7 The membrane receptor superfamilies. 
The G-protein—coupled receptors are named 

for the GTP-binding proteins that mediate some 
of their effects. The receptor tyrosine kinases and 
receptor guanylyl cyclases contain intracellular 
enzymatic domains that respond to extracellular 
hormone binding. The oligomeric ion channels 
(some of which were discussed in Chapter 9) 
open and close in response to ligand binding 
and/or changes of the transmembrane 
electrochemical potential. 


Figure 32.8 GPCRs are 7-transmembrane 
segment membrane proteins. (a) The arrangement 
of the B-adrenergic receptor in the membrane. 
Substitution of Asp!" in the third hydrophobic 
domain of the B-adrenergic receptor with an Asn 
or Gln by site-directed mutagenesis results in a 
dramatic decrease in affinity of the receptor for 
both agonists and antagonists. Significantly, this 
Asp residue is conserved in all GPCRs that bind 
biogenic amines but is absent in receptors whose 
ligands are not amines. Asp!” appears to be the 
counterion of the amine moiety of adrenergic 
ligands. (b) The structure of a B,-adrenergic 
receptor (PDB id = 2RH1). The flexible third 
intracellular loop and C-terminal segment are not 
shown. (From Figure 2 from Palczewski, K., et al., 
2000. Crystal structure of rhodopsin: A G-protein— 
coupled receptor. Science 289:739-745.) 
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There are 600 genes for non-olfactory GPCRs in the human genome, but alternative 
splicing produces between 1000 and 2000 active GPCRs in human cells. Louis Luttrell 
and colleagues have shown that synthesis of GPCRs and trafficking to the plasma mem- 
brane are regulated by RNA signals. This is one example of the growing awareness that 
RNA molecules can act as biological signals. 


32.3b The Single TMS Receptors Are Guanylyl Cyclases 
or Tyrosine Kinases 


Receptor proteins that span the plasma membrane with a single helical transmem- 
brane segment—either receptor tyrosine kinase (RTK) or receptor guanylyl cyclase 
(RGC)—possess an external ligand recognition site and an internal domain with 
enzyme activity. Each of these enzyme activities is manifested in two different 
cellular forms. Thus, guanylyl cyclase activity is found in both membrane-bound 
receptors and in soluble, cytoplasmic proteins. Tyrosine kinase activity, on the 
other hand, is exhibited by two different types of membrane proteins: The RTKs are 
integral transmembrane proteins, whereas the non-RTKs are peripheral, lipid- 
anchored proteins. More than 90 tyrosine kinases have been identified in the human 
genome; 58 of these are RTKs. Seven receptor guanylyl cyclases have been identified 
in the human genome. All soluble guanylyl cyclase proteins are dimeric and consist of 
alpha and beta subunits. Vertebrates possess two alpha forms (a; and a2) and two 
beta forms (8; and 62). The predominant soluble guanylyl cyclase in animals is the 
a,/B, dimer. 


32.3c RTKs and RGCs Are Membrane-Associated Allosteric Enzymes 


The binding of polypeptide hormones and growth factors to RTKs and RGCs acti- 
vates the intracellular enzyme activity of these proteins. These catalytic receptors are 
composed of three domains (Figure 32.9): an extracellular receptor-binding domain 
(which may itself include several subdomains), a transmembrane domain consisting 
of a single transmembrane a-helix, and an intracellular domain. This intracellular 
portion includes a tyrosine kinase or guanylyl cyclase domain that mediates the 
biological response to the hormone or growth factor via its catalytic activity and a 
regulatory domain that contains multiple phosphorylation sites. The 58 RTKs of the 
human genome can be grouped into about 20 families on the basis of their kinase 
domain sequences and the various extracellular subdomains. The epidermal growth 
factor (EGF) receptor and the insulin receptor are representative of this class of recep- 
tor proteins. 

Given that the extracellular and intracellular domains of RTKs and RGCs are joined 
by only a single transmembrane helical segment, how does extracellular hormone bind- 
ing activate intracellular enzyme activity? How is the signal transduced? As shown in 
Figure 32.10, signal transduction occurs by ligand-induced oligomeric association of 
receptors. Hormone binding triggers a conformational change in the extracellular 
domain, which induces oligomeric association. Oligomeric association allows adjacent 
cytoplasmic domains to interact, leading to phosphorylation of the cytoplasmic domains 
and stimulation of cytoplasmic enzyme activity. By virtue of these ligand-induced 
conformation changes and oligomeric interactions, RTKs and RGCs are considered to 
be membrane-associated allosteric enzymes. 


32.3d The EGF Receptor Is Activated by Ligand-Induced Dimerization 


Prior to EGF binding, the EGF receptors exist as inactive monomers in the plasma 
membrane, but binding of an activating ligand induces dimerization. 


The Structure of the EGF Receptor Human EGF is a 53 amino acid peptide that 
stimulates proliferation of epithelial cells (cells that cover a surface or line a cavity 
in biological tissues). The human EGF receptor is a 1186—amino acid RTK. The 
extracellular domain, where EGF binds, contains 622 amino acids and is divided 
into four domains (Figure 32.9). Domains I and III are similar B-helical barrels, 
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Figure 32.9 Four classes of single TMS receptors. The intracellular enzyme domains are tyrosine 
kinases or guanylyl cyclases. (a) The EGF receptor and (b) the insulin receptor are receptor tyrosine 
kinases. EGF receptors are activated by ligand-induced dimerization, whereas the insulin receptor is a 
glycoprotein composed of two kinds of subunits in an a2, tetramer stabilized by disulfide bonds. 
The extracellular portions of the EGF and insulin receptors consist of multiple modules or domains. 
Fn refers to a series of fibronectin domains numbered 1, 2a, 2b, and 3. (c) The atrial natriuretic 
peptide receptor is a receptor guanylyl cyclase with a large extracellular hormone-binding domain and 
two intracellular domains, and activation typically involves ligand-induced dimerization. (d) The 
growth factor receptor Ret is a receptor tyrosine kinase that mediates the effects of neuronal growth 
factors known as neurotrophins. The Ret receptor domains (yellow) requires a GPI-anchored 
coreceptor (blue) for binding of ligands (pink). 


(a) Ligand (b) Ligand (c) Ligand monomer 
or dimer 


single hormone ligand at the dimer interface. (Ligand dimers could 


of a GPI-anchored coreceptor protein. 
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presumably act in a similar manner). (d) Neurotrophin activation of Ret 
involves association of two molecules of the Ret kinase and two molecules 
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Figure 32.10 Ligand-induced oligomerization of membrane receptors can 
occur in several ways. (a) EGF receptor dimerization and activation involves 
binding of hormone to the extracellular domains of two receptor molecules. 
(b) The insulin receptor is a preexisting tetramer. Two insulin-binding sites 
are created by association of the extracellular domains. (c) RGCs bind a 
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(a) 


(b) 


Binding site for EGF 
on domain III 


Auto-inhibited 


Domain I/II 
rigid body 


rotation 


whereas domains II and IV are long, slender, cysteine-rich domains stabilized by 
disulfide bonds. Domain II is characterized by a B-hairpin structure that protrudes 
from the middle of the domain (Figure 32.1 1a). 

In the monomer state, the four domains are folded so that domains II and IV lie 
parallel, with the B-hairpin of domain II in contact with domain IV (Figure 32.1 1a). 
Binding of EGF to domain I induces a conformation change that rotates domains I and II 
so that the bound EGF is brought into contact with domain III and the B-hairpin is 
extended away from the rest of the structure. Pairs of such receptor monomers then 
dimerize by mutual association of the B-hairpin structures (Figure 32.11b). The next 
event is the critical step in transmembrane signal transduction. Dimerization of the 
extracellular domains brings the intracellular domains together, activating the tyrosine 
kinase activity. Thus, EGF-induced dimerization allows an extracellular signal (EGF) 
to exert an intracellular response. 


EGF Receptor Activation Forms an Asymmetric Tyrosine Kinase Dimer The tyrosine ki- 
nase domain of the EGF receptor consists of an N-terminal domain built around a 
twisted B-sheet and a C-terminal domain that is primarily a-helical (Figure 32.12a). 
In the inactive, monomeric EGF receptor, the active site of the tyrosine kinase 


Il z Dimerization 


Dimer 


Figure 32.11 Hormonal activation of EGF receptors. (a) In the absence of 
hormone, the extracellular domain of the EGF receptor is folded with the 
hairpin loop and dimerization interface on domain II (green) associated with 
domain IV (orange). Binding of EGF (red) induces a conformation change 
that exposes the dimerization interface and promotes formation of the 
activated dimer complex (PDB id = 1NQL). (b) Dimerization involves 
dove-tailing of the domain II hairpin loops (PDB id = IIVO). 
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(b) "a 


domain is blocked by a 30 residue loop of the protein (residues 831-860) that is ter- 
med the activation loop. Extracellular dimer formation by the EGF receptor appears 
to promote formation of an asymmetric dimer of the intracellular tyrosine kinase 
domains (Figure 32.12b), with the C-terminal lobe of one kinase domain juxtaposed 
with the N-terminal lobe of the other kinase domain. In this asymmetric dimer, one 
monomer is inactive but acts as an activator of kinase activity in the other monomer. 
Conversion of the kinase domain from its inactive state to the active conformation 
involves rotation of the activation loop out of the active site, making room for a pep- 
tide substrate to enter the site (Figure 32.12c). 

The activated tyrosine kinase domain of the EGF receptor can phosphorylate 
several tyrosine residues at its own C terminus (Figure 32.13), a process termed 
autophosphorylation. These phosphorylated tyrosines are binding sites for a variety of 
other signaling proteins that contain phosphotyrosine-binding SH2 domains (see 
Figures 32.5 and 32.6). Each of these SH2-domain—containing proteins can initiate 
several signal transduction cascades. 


32.3e The Insulin Receptor Mediates Several Signaling Pathways 


Insulin, a small heterodimeric peptide (see Figure 5.8), is the most potent anabolic hor- 
mone known. It regulates blood glucose levels, and it promotes the synthesis and stor- 
age of carbohydrates, proteins, and lipids. Abnormalities of insulin production, action, 
or both lead to diabetes, as well as other serious health issues. Insulin binding to 
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Figure 32.12 Several structures of the kinase 
domain of the epidermal growth factor (EGF) 
receptor. (a) The kinase domain of the EGF 
receptor consists of an N-terminal lobe built 
around a twisted -sheet and an a-helical 
C-terminal lobe. Bound ATP is pink, and a 

portion of the activation loop, shown blocking the 
active site, is blue. (b) Hormone-induced receptor 
dimerization promotes formation of an asymmetric 
dimer of the intracellular kinase domains (yellow 
and green). (c) Before hormonal activation (left), the 
kinase active site is blocked by a 30-residue 
activation loop (green). In the activated complex 
(right), the activation loop (green) has rotated away 
from the active site in one of the subunits, making 
room for binding of a peptide substrate (blue). 
Activation brings Lys”! close enough to form an ion 
pair with Glu”®. (a, b, c [right]: PDB id = 2GS6; 

c [left]: PDB id = 2GS7.) 
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Figure 32.13 Hormone-induced dimerization of the EGF receptor promotes autophosphorylation 
of five tyrosine residues near the C terminus of each EGF receptor subunit. Signaling proteins 
that contain phosphotyrosine-binding SH2 domains can be activated by binding to these 
phosphotyrosines. 


receptors in liver, muscle, and other tissues triggers multiple signaling pathways, and 
insulin action is responsible for a variety of cellular effects. 

The insulin receptor is an RTK that catalyzes the phosphorylation of tyrosine resi- 
dues of several intracellular substrates, including the insulin receptor substrate (IRS) 
proteins, and several other proteins known as Gab-1, She, and APS (Figure 32.14). 
Each of these phosphorylated substrates binds a particular family of proteins contain- 
ing SH2 domains (see Figures 32.5 and 32.6). These SH2 proteins interact specifically 
with phosphotyrosine-containing IRS sequences. Each of these signaling pathways can 
be confined to distinct cellular locations and can proceed with a different time course, 
thus providing spatial and temporal dimensions to insulin action in cells. 


The Insulin Receptor Adopts a Folded Dimeric Structure Unlike the majority of RTKs, 
which are single-chain receptors, the insulin receptor is an œf» tetramer. The a-chain 
contains two leucine-rich domains (L1 and L2) with a cysteine-rich domain (CR) be- 
tween them, as well as an intact fibronectin domain (Fn1) and a partial fibronectin 


Insulin 
receptor 


Lipid raft 


Figure 32.14 Substrates of the insulin receptor 


s : ; ; 3 MAP kinase 
tyrosine kinase include the insulin receptor substrate pathway < 
(IRS) proteins, as well as Gab-1, Shc, and APS. The 
phosphorylated substrates in turn bind to several 
families of SH2 domain—containing proteins, 
activating several signaling pathways. AKT pathway 
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domain (Fn2). The B-chain contains the other half of Fn2, followed by a third fibro- 
nectin domain (Fn3), a transmembrane a-helix, and (inside the cell) the tyrosine ki- 
nase domain (see Figure 32.9). The extracellular domain (the ectodomain) forms a 
folded dimer, with the L1 and L2 domains of one a-chain juxtaposed with the Fn2 
domain of the other a/6 chain pair, to create the insulin-binding site (Figure 32.15). 
Thus, each of the two insulin-binding sites of the ectodomain consists of portions of 
both a-chains. 


Autophosphorylation of the Insulin Receptor Kinase Opens the Active Site Binding of 
insulin to its receptor activates the tyrosine kinase activity of the intracellular do- 
mains. Like the EGF receptor kinase, the insulin receptor kinase contains an activa- 
tion loop that lies across the kinase active site, excluding substrate peptides and thus 
inhibiting the kinase. Phosphorylation of three tyrosine residues on the activation 
loop—another case of autophosphorylation—causes the activation loop to move out 
of and away from the active site (Figure 32.16). This opens the active site so that tar- 
get proteins are bound and phosphorylated by the kinase, triggering the appropriate 
signaling pathways (see Figure 32.14). 


32.3f Receptor Guanylyl Cyclases Mediate Effects 
of Natriuretic Hormones 


When you eat a salty meal, your body secretes hormones that protect you from the 
harmful effects of excess salt intake. When your heart senses that blood volume is 
too great, it sends signals to the kidneys to excrete NaCl and water (processes 
termed natriuresis and diuresis, respectively). These are just two examples of the 
action of natriuretic hormones, which allow tissues and organs to communicate 
with one another to regulate the volumes of blood and other body fluids and the 
osmotic effects of Na*, K*, Cl-, and other ions. 


Receptor Guanylyl Cyclases Are Membrane-Associated Allosteric Enzymes Guanylin and 
uroguanylin are produced in the intestines after ingestion of a salty meal and are 
secreted into the intestinal lumen. Binding of guanylin and uroguanylin to receptor 
guanylyl cyclases on cell membranes lining the lumen activates the intracellular 
guanylyl cyclase (Figure 32.17a). cGMP (see Figure 10.12) produced in this reaction 
is a second messenger that inhibits Na‘ uptake from the lumen and activates Cl~ 
export into the lumen. The result is a beneficial enhanced excretion of NaCl and 
water. (This effect can be carried too far, however. Heat-stable enterotoxin (ST) 
produced by E. coli—with a sequence similar to guanylin and uroguanylin—causes 
violent diarrhea.) Atrial natriuretic peptide (ANP) and brain natriuretic peptide 
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Figure 32.15 The ectodomain formed by the insulin 
receptor is constructed from the two a/B units. 
These two units are folded so that the L1 domain of 
one a-chain is juxtaposed with the Fn1 domain of 
the other a-chain. Each insulin-binding site is 
created by the L1 and L2 domains of one a-chain 
and the Fn2 domain of the other a/B chain pair 
(see Figure 32.9; PDB id = 2DTG). 


Figure 32.16 In its inactive state, the insulin receptor 
tyrosine kinase is inhibited by an activation loop 
(yellow and red), which prevents substrate access to the 
active site (/eft; PDB id = 1IRK). Autophosphorylation 
of three tyrosine residues on the activation loop induces 
a conformation change that rotates the loop out of the 
active site (right; PDB id = 1IR3), permitting access by 
insulin receptor substrates (blue) and ATP (ATP analog 
in orange). 
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Figure 32.17 Ingestion of a salty meal triggers excretion of NaCl and water in the intestines. Binding 
of guanylin and uroguanylin to RGC on cell membranes lining the lumen activates the intracellular 
guanylyl cyclase. cGMP produced by the cyclase inhibits Na* uptake from the lumen and activates 
Cl export into the lumen. Atrial natriuretic peptide (ANP) protects the heart and circulatory system 
from the deleterious effects of increased blood volume. When the heart muscle is stressed and 
stretched, the heart secretes ANP. ANP binding to RGCs in kidney tubules activates intracellular 
guanylyl cyclase, producing cGMP. Inhibition of Na* uptake in the kidneys stimulates excretion of 
salt and water, reducing blood volume. 


(BNP, so named because it was discovered first in the brain) are both produced 
primarily in the heart. When the heart muscle is stressed and stretched by increased 
blood volume, the heart secretes ANP and BNP into the blood. In the kidneys, 
ANP and BNP bind to RGCs in kidney tubules, activating intracellular guanylyl 
cyclase and producing cGMP (Figure 32.17b). As in the intestines, cGMP inhibits 
Na‘ uptake, once again stimulating excretion of salt and water and reducing blood 
volume. 


A Symmetric Dimer Binds an Asymmetric Peptide Ligand RGC monomers associate as 
dimers in the membrane, even in the absence of their hormone ligands (Figure 32.18a). 
A dimeric receptor complex is activated by the binding of a single polypeptide hor- 
mone. This raises two important questions about the mechanism of action of these 
receptors: (1) How does an asymmetric hormone ligand (for example, ANP) bind to a 
symmetric homodimeric receptor? And (2) how does hormone binding to its extracel- 
lular dimeric receptor activate the intracellular guanylyl cyclase domain? Answers to 
these provocative questions have been provided by Kunio Misono and his colleagues, 
who determined the structure of the ANP receptor-ANP complex (Figure 32.18b). 
Remarkably, there is no significant intramolecular conformational change in either of 
the ANP receptor monomers. However, upon ANP binding, the two ANP receptor 
molecules undergo a twist motion in the membrane in order to insert the ANP hor- 
mone between them (Figure 32.19). The hormone lies like a disc between the two recep- 
tor subunits. The hormone-induced twist of the extracellular domains suggests a 
mechanism for activation of guanylyl cyclase activity across the membrane. Rotation of 
the transmembrane helices upon hormone binding could reorient the two intracellular 
domains, giving rise to guanylyl cyclase activity. 
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Figure 32.18 Activation of the ANP receptor involves binding of an asymmetric ligand at the 
interface of two identical receptor domains. Comparison of the structures of the receptor domain in 
the (a) absence and (b) presence of ANP reveals no significant intramolecular conformational change 
in either of the receptor monomers. ANP binding induces a twist of the two ANP receptor molecules, 
allowing the ANP hormone to insert between them. (a, PDB id = 1DP4; b, PDB id = 1T34). 
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32.3g Nonreceptor Tyrosine Kinases Are Typified by pp60%"° 


The first tyrosine kinases to be discovered were associated with viral transforming 
proteins. These proteins, produced by oncogenic viruses, enable the virus to trans- 
Jorm animal cells, that is, to convert them to the cancerous state. A prime example is 
the tyrosine kinase expressed by the sre gene of Rous or avian sarcoma virus. The 
protein product of this gene is pp60%%": (the abbreviation refers to phosphoprotein, 
60 kDa, viral origin, sarcoma-causing). The v-sre gene was derived from the avian 
proto-oncogenic gene c-sre during the original formation of the virus. The cellular 
proto-oncogene homolog of pp60Y-™° is referred to as pp60°*"*. pp60"* is a 526 
residue peripheral membrane protein. It undergoes two post-translational modifica- 
tions: First, the amino group of the NH)-terminal glycine is modified by the cova- 
lent attachment of a myristoy/ group (this modification is required for membrane 
association of the kinase; see Figure 32.20). Then Ser!” and Tyr*" are phosphorylated. 


iid ANP 
% _ C terminus 
_ 


N terminus 


C terminus 


Figure 32.19 The rotation mechanism proposed by 
Misono and colleagues for transmembrane signaling 
by the ANP receptor. ANP binding causes a twist of 
the two extracellular domains, leading to rotation 

of the two intracellular domains and activation of 
guanylyl cyclase activity. PKLD is a protein kinase- 
like domain, and GCD is a guanylyl cyclase domain. 
(Adapted from Misono, K., Ogawa, H., Qiu, Y., and 
Ogata, C., 2005. Structural studies of the natriuretic 
peptide receptor: A novel hormone-induced rotation 
mechanism for transmembrane signal transduction. 
Peptides 26:957-968.) 
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(a) 


Figure 32.20 Activation mechanisms for the soluble 
forms of tyrosine kinases and guanylyl cyclases. 

(a) The soluble tyrosine kinase pp60"*" is activated 

by anchoring to the plasma membrane via an 
N-terminal myristoyl group, followed by phosphoryla- 
tion as shown. (b) The structure of protein tyrosine 
kinase pp60°*", showing AMP-PNP in the active site 
(blue, green, and red), Tyr*!® (orange), and Tyr’ 
(yellow). Tyr’ is phosphorylated (purple). (c) Domain 
structure of the a- and B-subunits of soluble guanylyl 
cyclase (sGC). Nitric oxide (NO) activates the cyclase 
by binding to the H-NOX domain. (d) Cryo-EM 
structure of the inactive form of sGC, showing the 
bent coiled-coil (PDB id = 6JTO). (e) Cryo-EM 
structure of the active form of sGC, showing the 
extended coiled-coil (PDB id = 6JT2). 


(b) 


(c) H-NOX PAS CC CAT 


69 258 274 405 460 660 


(d) (e) 


A Deeper Look 


Nitric Oxide, Nitroglycerin, and Alfred Nobel 


NO.: is the active agent released by nitroglycerin (see accompanying 
figure), a powerful drug that ameliorates the symptoms of heart 


attacks and angina pectoris (chest pain due to coronary artery 
disease) by causing the dilation of coronary arteries. Nitroglycerin 
is also the active agent in dynamite. Ironically, Alfred Nobel, the 
inventor of dynamite who also endowed the Nobel prizes, himself 
suffered from angina pectoris. In a letter to a friend in 1885, Nobel 


The structure of nitroglycerin, a potent vasodilator. 


wrote, “It sounds like the irony of fate that I should be ordered by 


my doctor to take nitroglycerin internally.” 
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The phosphorylation at Tyr*!®, which increases kinase activity twofold to threefold, 
appears to be an autophosphorylation. On the other hand, phosphorylation at Tyr>?’ 
is inhibitory and is catalyzed by another kinase known as CSK. The significance of 
nonreceptor tyrosine kinase activity to cell growth and transformation is only par- 
tially understood, but 1% of all cellular proteins (many of which are also kinases) are 
phosphorylated by these kinases. 


32.3h Soluble Guanylyl Cyclases Are Receptors for Nitric Oxide 


Nitric oxide, or NO- , a reactive free radical, acts as a neurotransmitter and as a second 
messenger, activating soluble guanylyl cyclase more than 400-fold. The cGMP thus pro- 
duced also acts as a second messenger, inducing relaxation of vascular smooth muscle 
and mediating penile erection. As a dissolved gas, NO- is capable of rapid diffusion 
across membranes in the absence of any apparent carrier mechanism. This property 
makes NO- a particularly attractive second messenger because NO: generated in one 
cell can exert its effects quickly in many neighboring cells. NO- has a very short cellular 
half-life (1 to 5 s) and is rapidly degraded by nonenzymatic pathways. 

The a- and B-subunits of sGCs are homologous and each contains four domains 
(Figure 32.20c): a heme nitric oxide/oxygen (H-NOX) binding domain, a Per/Arnt/Sim 
(PAS)-like domain, a coiled-coil (CC) domain, and a catalytic cyclase domain (CAT). 
Binding of NO- causes a large-scale conformation change that converts the inactive 
form to the active guanylyl cyclase (Figure 32.20d and e). The CC domain is bent in the 
inactive form but is fully extended in the active form. The H-NOX and CAT domains 
do not contact each other, but both rotate relative to the CC domain in the conversion 
to the active form. The CC domain is thus the transducer of NO- activation of sGCs. 


32.4 | How Are Receptor Signals Transduced? 
Receptor signals are transduced in one of three ways to initiate actions inside the cell: 


1. Exchange of GDP for GTP by GTP-binding proteins (G proteins) on the cytoplas- 
mic side of the plasma membrane, which leads to generation of second messengers, 
including cAMP, phospholipid breakdown products, and Ca?*. 

2. Receptor-mediated activation of phosphorylation cascades that in turn trigger 
activation of various enzymes. This is the action of the receptor tyrosine kinases 
described in Section 32.3. Protein kinases and protein phosphatases acting as 
effectors will be discussed in Section 32.5. 

3. Conformation changes that open ion channels or recruit proteins into nuclear 
transcription complexes. (Ion channels are discussed in Section 32.7, and the 
formation of nuclear transcription complexes was described in Chapter 29.) 


32.4a GPCR Signals Are Transduced by G Proteins 


The signals of G-protein—coupled receptors (GPCRs) are transduced by GTP-binding 
proteins, known more commonly as G proteins. The large G proteins are heterotrimers 
consisting of a- (45 to 47 kDa), B- (35 kDa), and y- (7 to 9 kDa) subunits. The a-subunit 
binds GDP or GTP and has an intrinsic, slow GTPase activity. The Gag, complex 
(Figure 32.21 and see Figure 15.20) in the unactivated state has GDP at the nucleotide 
site. Binding of hormone to receptor stimulates a rapid exchange of GTP for GDP on 
G,. The binding of GTP causes G, to dissociate from Gg, and to associate with an 
effector protein such as adenylyl cyclase (Figure 32.22). Binding of G,(GTP) activates 
adenylyl cyclase. The adenylyl cyclase actively synthesizes cAMP as long as G,(GTP) 
remains bound to it. However, the intrinsic GTPase activity of G, eventually hydrolyzes 
GTP to GDP, leading to dissociation of G,(GDP) from adenylyl cyclase and reassocia- 
tion with the Gg, dimer, regenerating the inactive heterotrimeric Gagy complex. 

The hormone-activated GPCR is a guanine—-nucleotide exchange factor (GEF)— 
promoting the exchange of GDP with GTP on the G protein—in a manner similar to the 
interaction of EF-Ts with EF-Tu(GDP) (see Section 30.5d). By contrast, the Gg, complex, 
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Figure 32.21 A heterotrimeric G protein (a—pink, 
B—yellow, and y—blue) docked with a B»-adrenergic 
receptor (green; PDB id = 3SN6). 
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Figure 32.22 Adenylyl cyclase activity is modulated by the interplay of stimulatory (G,) and inhibitory 
(Gi) G proteins. Binding of hormones to 6,- and B>-adrenergic receptors activates adenylyl cyclase via 
G, whereas hormone binding to a-adrenergic receptors leads to the inhibition of adenylyl cyclase. 
Inhibition may occur by direct inhibition of cyclase activity by Gia or by binding of Gigy to Gyy. 


which normally acts to inhibit the spontaneous release of GDP from G, (in the inactivated 
state of the GPCR), is termed a guanine-nucleotide dissociation inhibitor (GDI). Other 
proteins may also behave as GEFs and GDIs; their actions are discussed in Section 32.5. 

Two stages of amplification occur in the G-protein—mediated hormone response. 
First, a single hormone-receptor complex can activate many G proteins before the 
hormone dissociates from the receptor. Second, the G,-activated adenylyl cyclase syn- 
thesizes many cAMP molecules. Thus, the binding of hormone to a very small number 
of membrane receptors stimulates a large increase in concentration of cAMP within 
the cell. The hormone receptor, G protein, and cyclase constitute a complete hormone 
signal transduction unit. 

Hormone-receptor—mediated processes regulated by G proteins may be stimulatory 
or inhibitory. Each hormone receptor interacts specifically with either a stimulatory 
G protein, denoted G,, or an inhibitory G protein, denoted G; (Figure 32.22). 


32.4b Cyclic AMP Is a Second Messenger 


Cyclic AMP (cAMP) was identified in 1956 by Earl Sutherland, who termed cAMP a 
second messenger because it is the intracellular response provoked by binding of hor- 
mone (the first messenger) to its receptor. Since Sutherland’s discovery of cAMP, 
many other second messengers have been identified (Table 32.1). The concentrations 
of second messengers in cells are carefully regulated. Synthesis or release of a second 
messenger is followed quickly by degradation or removal from the cytosol. Following 
its synthesis by adenylyl cyclase, cAMP is broken down to 5-AMP by phosphodies- 
terase (Figure 32.23). 

Adenylyl cyclase (AC) is an integral membrane enzyme. Its catalytic domain, on the 
cytoplasmic face of the plasma membrane, includes two subdomains, denoted VC, and 
IIC,. Binding of the a-subunit of G, (denoted G,,) activates the AC catalytic domain. 

Alfred Gilman, Stephen Sprang, and coworkers have determined the structure of a 
complex of G,, (with bound GTP) with the cytoplasmic domains (VC, and IHC) of 
adenylyl cyclase (Figure 32.24). The G,, complex binds to a cleft at one corner of the C, 
domain, and the surface of G,,-GTP that contacts adenylyl cyclase is the same surface 
that binds the Gz, dimer. The catalytic site, where ATP is converted to cyclic AMP, is far 
removed from the bound G protein. 

Spatial and temporal cAMP concentration gradients modulate the actions of this 
second messenger. These gradients lie between the sites of cAMP synthesis (by adenylyl 
cyclase) and the sites of degradation (by specific phosphodiesterases). 
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| Table 32.1 | Intracellular Second Messengers* 


Activates protein kinases 


Activates protein kinases, regulates ion channels, regulates 
phosphodiesterases 


Activates protein kinases, activates Ca**-modulated proteins 
Activates Ca?* channels 

Activates protein kinase C 

Activates Ca?* channels, inhibits adenylyl cyclase 


Activates protein kinases 


Messenger Source Effect 
cAMP Adenylyl cyclase 

cGMP Guanylyl cyclase 

Ca?* Ion channels in ER and plasma membrane 

IP; PLC action on PI 

DAG PLC action on PI 

Phosphatidic acid Membrane component and product of PLD 

Ceramide PLC action on sphingomyelin 

Anandamide Action of phospholipases and other enzymes 


on glycerophos 


pholipids 


Activation or inhibition of neuronal circuits 
Regulation of blood vessels and heart and inflammatory 


Cryoprotective and homeostatic functions 


Activates guanylyl cyclase, relaxes smooth muscle 


Hydrogen sulfide (HS) Action of cystathionine y lyase on cysteine 
responses 
Carbon monoxide (CO) Action of heme oxygenase in heme catabolism 
Nitric oxide (NO-) NO synthase 
Cyclic ADP-ribose cADP-ribose synthase 


Activates Ca?* channels 


*|P; = inositol-1,4,5-trisphosphate; PI = phosphatidylinositol; DAG = diacylglycerol; PLC = phospholipase C; PLD = phospholipase D (see Figure 32.26). cCMP and cUMP 


(not shown) are also cyclic nucleotide second messengers. 
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Figure 32.23 Cyclic AMP is synthesized by Fyrophosphatase 


membrane-bound adenylyl cyclase and degraded by 
soluble phosphodiesterase. 


v 


cAMP Activates Protein Kinase A All second messengers exert their cellular effects by 
binding to one or more target molecules. cAMP produced by adenylyl cyclase acti- 
vates a protein kinase, which is thus known as cAMP-dependent protein kinase. 
Protein kinase A, as this enzyme is also known, activates many other cellular proteins 
by phosphorylation. The activation of protein kinase A by cAMP and regulation of 
the enzyme by intrasteric control was described in detail in Chapter 15. The structure 
of protein kinase A has served as a paradigm for understanding many related protein 
kinases (see Figure 15.9). 


32.4c Ras and Other Small GTP-Binding Proteins Are 
Proto-Oncogene Products 


GTP-binding proteins are implicated in growth control mechanisms in higher organ- 
isms. Certain tumor virus genomes contain genes encoding 21 kDa proteins that bind 
GTP and show regions of homology with other G proteins. The first of these genes to 
be identified was found in rat sarcoma virus and was dubbed the ras gene. Genes 
implicated in tumor formation are known as oncogenes; they are often mutated ver- 
sions of normal, noncancerous genes involved in growth regulation, so-called proto- 
oncogenes. The normal, cellular Ras protein is a GTP-binding protein that functions in 
a manner similar to that of other G proteins described previously, activating metabolic 
processes when GTP is bound and becoming inactive when GTP is hydrolyzed to GDP. 
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Figure 32.24 Structure of a Ga 
complex with the catalytic domain 
of adenylyl cyclase. (a) Two views 
of the complex of G,, with the 
VC IIC; catalytic domain of 
adenylyl cyclase (with 2’-deoxy- 
adenosine 3’-monophosphate 
shown in magenta) and Gyq. 
(5'-guanosine-diphosphate- 
monothiophosphate shown in 
aqua). (b) Details of the Gs« 
complex in the same orientation as 
the structures in (a). SW-I and 
SW-II are switch regions, whose 
conformations differ greatly 
depending on whether GTP or 
GDP is bound. (PDB id = 1CS4) 


(a) 


(b) 


(b) 


The GTPase activity of the normal Ras p21 is very low, as is appropriate for a G protein 
that regulates long-term effects like growth and differentiation. A specific GTPase- 
activating protein (GAP) increases the GTPase activity of the Ras protein. Mutant 
(oncogenic) Ras proteins have severely impaired GTPase activity, locking them in an 
state that activates cellular growth and metabolism in tumor cells. The conformations of 
Ras proteins (Figure 32.25) in complexes with GDP are different from the correspond- 
ing complexes with GTP analogs such as GMP-PNP (a nonhydrolyzable analog of 
GTP in which the B-P and y-P are linked by N rather than by O). Two regions of the 
Ras structure change conformation upon GTP hydrolysis. These conformation changes 
mediate the interactions of Ras with other proteins, termed effectors. 


32.4d G Proteins Are Universal Signal Transducers 


A given G protein can be activated by several different hormone-receptor complexes. 
For example, either glucagon or epinephrine, binding to their distinctive receptor 
proteins, can activate the same species of G protein in liver cells. The effects are addi- 
tive, and combined stimulation by glucagon and epinephrine leads to higher cytoplas- 
mic concentrations of cAMP than activation by either hormone alone. 

G proteins are a universal means of signal transduction in higher organisms, activat- 
ing many hormone-receptor—initiated cellular processes in addition to adenylyl cyclase. 
Such processes include, but are not limited to, activation of phospholipases C and A, 
and the opening or closing of transmembrane channels for K+, Na+, and Ca?* in brain, 
muscle, heart, and other organs (Table 32.2). G proteins are integral components of 
sensory pathways such as vision and olfaction. More than 800 different GPCRs and at 
least 21 distinct G proteins are known. At least a dozen different G-protein effectors 
have been identified, including a variety of enzymes and ion channels. 


Figure 32.25 The structure of Ras complexed with (a) GDP (PDB id = 1LF5) and (b) GMP-PNP 
(PDB id = 1LF0). The Ras p21-GMP-PNP complex is the active conformation of this protein. 
A Mg?* ion (red) is shown in both structures. 
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| Table 32.2 | G Proteins and Their Physiological Effects 


G Protein Location Stimulus Effector 
G, Liver Epinephrine, glucagon Adenylyl cyclase 
G, Adipose tissue Epinephrine, glucagon Adenylyl cyclase 
G; Kidney Antidiuretic hormone Adenylyl cyclase 
Gi Heart muscle Acetylcholine Potassium channel 
Gi/G, Brain neurons Enkephalins, Adenylyl cyclase, potassium 
endorphins, opioids channels, calcium channels 
G, Smooth muscle cells Angiotensin Phospholipase C 
in blood vessels 
Gorr Neuroepithelial cells Odorant molecules Adenylyl cyclase 


in the nose 


Transducin (G) Retinal rod and cGMP phosphodiesterase 


cone cells of the eye 


Light detection 


32.4e Specific Phospholipases Release Second Messengers 


A diverse array of second messengers are generated by breakdown of membrane 
phospholipids. Binding of certain hormones and growth factors to their respective 
receptors triggers a sequence of events that can lead to the activation of specific 
phospholipases. The action of these phospholipases on membrane lipids produces the 
second messengers shown in Figure 32.26. 


Inositol Phospholipid Breakdown Yields Inositol-1,4,5-Trisphosphate and Diacylglycerol 
Breakdown of phosphatidylinositol (PI) and its derivatives by phospholipase C pro- 
duces a family of second messengers. In the best-understood pathway, successive 
phosphorylations of PI produce phosphatidylinositol-4-P (PIP) and phosphatidylinositol- 
4,5-bisphosphate (PIP2). Four isozymes of phospholipase C (denoted a, B, y, and 6) 
hydrolyze PI, PIP, and PIP». Hydrolysis of PIP, by phospholipase C yields the second 
messenger inositol-1,4,5-trisphosphate (IP3) as well as another second messenger, 
diacylglycerol (DAG) (Figure 32.27). IP; is water soluble and diffuses to intracellular 
organelles where release of Ca”* is activated. DAG, on the other hand, is lipophilic 
and remains in the plasma membrane, where it activates a Ca**-dependent protein 
kinase known as protein kinase C. 


(a) o PLC 
| 19 
c | 
“or — P- O — Polar head group 
o | 
= = = ch TO 
| 
O PLD 
PLA, 
Action of phospholipases 
(b) 
Phospholipid Phosphatidylinositol Phosphatidylcholine Sphingomyelin 
PLA, PLC PLC SMase 
Y 
Unsaturated DAG Inositol DAG Ceramide 
fatty acid (arachidonate) phosphates 
y 
Eicosanoids 
and 
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Effect 


Glycogen breakdown 

Fat breakdown 

Conservation of water 

Decreased heart rate and pumping force 


Changes in neuron electrical activity 
Muscle contraction, blood pressure elevation 
Odorant detection 


Light detection (vision) 


Figure 32.26 Lipid second messengers are generated by 
the action of phospholipases. (a) The general action of 
phospholipase A; (PLA2), phospholipase C (PLC), and 
phospholipase D (PLD). (b) The synthesis of second 
messengers from phospholipids by the action of 
phospholipases and sphingomyelinase (SMase), which 
for example cleaves a choline sphingomyelin to yield 
phosphocholine and ceramide. 
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Figure 32.27 The family of second messengers 
produced by phosphorylation and breakdown of 
phosphatidylinositol. PLC action instigates a 
bifurcating pathway culminating in two distinct and 
independent second messengers: DAG and IP}. 


Figure 32.28 Phospholipase C-f is activated 
specifically by G,, a GTP-binding protein, and also 
by Ca?*, 


B-type 
PLC-B, | 
X Y 
ytype 
PLC-y; 
X SH2 SH2 SH3 Y 
ô-type 
PLCS, | | 
X Y 


Figure 32.30 The amino acid sequences of 
phospholipase C isozymes £, y, and ô share two 
homologous domains, denoted X and Y. The 
sequence of y-isozyme contains sre homology 
domains, denoted SH2 and SH3. SH2 domains 
(approximately 100 residues in length) interact with 
phosphotyrosine-containing proteins (such as 
RTKs), whereas SH3 domains mediate interactions 
with Pro-rich sequences. (Adapted from Dennis, E., 
Rhee, S., Gillah, M., and Hannun, E., 1991. Role of 
phospholipases in generating lipid second 
messengers in signal transduction. The FASEB 
Journal 5:2068-2077.) 


PI ———————_> PI4-P —————-> PI-4,5-P, 


PLC 


DAG 


Polypeptide 
hormone 


\ Receptor 


Phospholipase C-6 


GTP GDP PIP, IP; + DAG 


Activation of Phospholipase C Is Mediated by G Proteins or by Tyrosine Kinases Phospho- 
lipase C-B, C-y, and C-6 are all Ca** dependent, but the different phospholipase C 
isozymes are activated by different intracellular events. Phospholipase C- is stimu- 
lated by G proteins (Figure 32.28). On the other hand, phospholipase C-y is activated 
by receptor tyrosine kinases (Figure 32.29). The domain organization of phospholipase 
C-B and C-y is shown in Figure 32.30. The X and Y domains of phospholipase C-8 
and C-y are highly homologous, and both of these domains are required for phospho- 
lipase C activity. The other domains of these isozymes confer specificity for G-protein 
activation or tyrosine kinase activation. 


Polypeptide O 


hormone \ 


_|— Receptor tyrosine Phospholipase C-y 
kinase (RTK) 


Tyrosine 
kinase ® 


PIP, IP; + DAG 


Figure 32.29 Phospholipase C-y is activated upon phosphorylation by receptor tyrosine kinases and 
by Ca?*, 
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Cancer, Oncogenes, and Tumor Suppressor Genes 


The disease state known as cancer is the uncontrolled growth and 
proliferation of one or more cell types in the body. Control of cell 
growth and division is an incredibly complex process, involving the 
signal-transducing proteins (and small molecules) described in this 
chapter and many others like them. The genes that give rise to these 
growth-controlling proteins are of two distinct types: 


1. Oncogenes: These genes code for proteins that are capable of 
stimulating cell growth and division. In normal tissues and 
organisms, such growth-stimulating proteins are regulated so 
that growth is appropriately limited. However, mutations in these 
genes may result in loss of growth regulation, leading to uncon- 
trolled cell proliferation and tumor development. These mutant 
genes are known as oncogenes because they induce the oncogenic 
state—cancer. The normal versions of these genes are termed 
proto-oncogenes; proto-oncogenes are essential for normal cell 
growth and differentiation. Oncogenes are dominant because 
mutation of only one of the cell’s two copies of the gene can lead 
to tumor formation. Table A lists a few of the known oncogenes 
(more than 60 are now known). 


| Tabea e| A A Representative List of Proto-Oncogenes 


Implicated in Human Tumors 


Proto-Oncogene Neoplasm(s) 


Abl Chronic myelogenous leukemia 

ErbB-1 Squamous cell carcinoma; astrocytoma 

ErbB-2 (Neu) Adenocarcinoma of breast, ovary, and stomach 

Myc Burkitt’s lymphoma; carcinoma of lung, breast, 
and cervix 

H-Ras Carcinoma of colon, lung, and pancreas; 
melanoma 

N-Ras Carcinoma of genitourinary tract and thyroid; 
melanoma 

Ros Astrocytoma 

Sre Carcinoma of colon 

Jun 

Fos | Several 


Adapted from Bishop, J. M., 1991. Molecular themes in oncogenesis. Cell 64:235-248; 
Croce, C. M., 2008. Oncogenes and cancer. New England Journal of Medicine 
358:502-511. 


2. Tumor suppressor genes: These genes code for proteins whose 
normal function is to turn off cell growth. A mutation in one of 
these growth-limiting genes may result in a protein product that 
has lost its growth-limiting ability. Since the normal products 
suppress tumor growth, the genes are known as tumor suppressor 
genes. Because both cellular copies of a tumor suppressor gene 
must be mutated to foil its growth-limiting action, these genes 
are recessive in nature. Table B presents several recognized tumor 
suppressor genes. 


Careful molecular analysis of cancerous tissue has shown that 
tumor development may result from mutations in several proto- 
oncogenes or tumor suppressor genes. The implication is that there 
is redundancy in cellular growth regulation. Many (if not all) tumors 
are either the result of interactions of two or more oncogene prod- 
ucts or arise from simultaneous mutations in a proto-oncogene and 
both copies of a tumor suppressor gene. Cells have thus evolved 
with overlapping growth-control mechanisms. When one is compro- 
mised by mutation, others take over. 


| TableB | B Representative Tumor Suppressor Genes 


Implicated in Human Tumors 


Tumor 

Suppressor 

Gene Neoplasm(s) 

RBI Retinoblastoma; osteosarcoma; carcinoma of 
breast, bladder, and lung 

ps3 Astrocytoma; carcinoma of breast, colon, and 
lung; osteosarcoma 

WTI Wilms’ tumor 

DCC Carcinoma of colon 

NFI Neurofibromatosis type 1 

FAP Carcinoma of colon 

MEN-1 Tumors of parathyroid, pancreas, pituitary, and 


adrenal cortex 


Adapted from Bishop, J. M., 1991. Molecular themes in oncogenesis. Cell 64:235—248, 
and Sherr, C. J., 2004. Principles of tumor suppression. Cell 116:235-246. 


32.4f Lipids Such as Phosphatidylcholine, Sphingomyelin, Glycosphingolipids, 
and Arachidonic Acid Also Generate Second Messengers 


In addition to PI, other phospholipids serve as sources of second messengers. Breakdown 
of phosphatidylcholine by phospholipases yields a variety of second messengers, 
including DAG, phosphatidic acid, and prostaglandins. The action of sphingomyelin- 
ase on sphingomyelin produces ceramide, which stimulates ceramide-activated protein 
kinase. Similarly, gangliosides (such as ganglioside Gy3; see Section 8.4) and their 
break-down products modulate the activity of protein kinases and GPCRs. 
Anandamide, formed by the action of several phospholipases from PC and PE (Chapter 
8 and Table 32.1), is an intracellular messenger in many cells. Anandamide for such 
purposes is stored in lipid droplets (adiposomes) and other intracellular reservoirs. 
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Figure 32.31 Cytosolic [Ca?*] increases occur via 
the opening of Ca** channels in the membranes of 
calciosomes, the endoplasmic reticulum, and the 
plasma membrane. 


32.4g Calcium Is a Second Messenger 


Calcium ion is an important intracellular signal. Binding of certain hormones and signal 
molecules to plasma membrane receptors can cause transient increases in cytoplasmic 
Ca?* levels, which in turn can activate a wide variety of enzymatic processes, including 
smooth muscle contraction, exocytosis, and glycogen metabolism. (Most of these acti- 
vation processes depend on special Ca**-binding proteins discussed in the following 
section.) Cytoplasmic [Ca?*] can be increased in two ways (Figure 32.31). As mentioned 
briefly earlier, cAMP can activate the opening of plasma membrane Ca’* channels, 
allowing extracellular Ca** to stream in. On the other hand, cells also contain intra- 
cellular reservoirs of Ca?*, within the endoplasmic reticulum and calciosomes, small 
membrane vesicles that are similar in some ways to muscle sarcoplasmic reticulum. 
These special intracellular Ca?* stores are not released by cAMP. They respond to IP3, a 
second messenger derived from PI. 


Intracellular Calcium-Binding Proteins Mediate the Calcium Signal Given the central 
importance of Ca** as an intracellular messenger, it should not be surprising that com- 
plex mechanisms exist in cells to manage and control Ca**. When Ca?* signals are 
generated by cAMP, IP3, and other agents, these signals are translated into the desired 
intracellular responses by calcium-binding proteins, which in turn regulate many cellular 
processes (Figure 32.32). One of these, protein kinase C, is described in Section 32.5. 
The other important Ca?*-binding proteins can, for the most part, be divided into two 
groups on the basis of structure and function: (1) the calcium-modulated proteins, in- 
cluding calmodulin, parvalbumin, troponin C, and many others, all of which have in 
common a structural feature called the EF hand (Figure 32.33) and (2) the annexin 
proteins, a family of homologous proteins that interact with membranes and phospho- 
lipids in a Ca*-dependent manner. 

More than 170 calctum-modulated proteins are known. All possess a characteristic 
peptide domain consisting of a short a-helix, a loop of 12 amino acids, and a second 
a-helix (Figure 32.33). Robert Kretsinger at the University of Virginia initially discov- 
ered this pattern in parvalbumin, a protein first identified in the carp fish and later in 
neurons possessing a high firing rate and a high oxidative metabolism. Kretsinger 
lettered the six helices of parvalbumin A through F. He noticed that the E and F heli- 
ces, joined by a loop, resembled the thumb and forefinger of a right hand and named 
this structure the EF hand, a name in common use today to identify the helix-loop-helix 
motif in calcium-binding proteins. In the EF hand, Ca?" is coordinated by six carboxyl 
oxygens contributed by a glutamate and three aspartates, by a carbonyl oxygen from a 
peptide bond, and by the oxygen of a coordinated water molecule. The EF hand was 
subsequently identified in calmodulin, troponin C, and calbindin-9K. Most of the 
known EF-hand proteins possess two or more (as many as eight) EF-hand domains, 
usually arranged so that two EF-hand domains may directly contact each other. 


32.4h Calmodulin Target Proteins Possess a Basic Amphiphilic Helix 


The conformations of EF-hand proteins change dramatically upon binding of Ca** 


ions. This change promotes binding of the EF-hand protein with its target protein(s). 
For example, calmodulin (CaM), a 148 residue protein found in many cell types, 


Human Biochemistry 


PI Metabolism and the Pharmacology of Li* 


Several steps in the metabolism of phosphatidylinositol and its ion. For example, Li* is an uncompetitive inhibitor of myo-inositol 
derivatives are sensitive to lithium ion, Li*. Lithium salts have been =monophosphatase (see Section 13.4b). Li* levels similar to those 
used in the treatment of manic-depressive illnesses for more than 50- used in treatment of manic illness thus lead to the accumulation of 
years, but the mechanism of lithium’s therapeutic effects had been several key intermediates. This story is far from complete, and many 
unclear. Recently, however, several reactions in the phosphatidylino- new insights into phosphoinositide metabolism and the effects of 
sitol degradation pathway have been shown to be sensitive to Lit Li* can be anticipated. 
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Figure 32.32 IP;-mediated signal transduction pathways. Increased [Ca?°*] activates protein kinases, 
which phosphorylate target proteins. Ca?*/CaM represents calci-calmodulin (Ca?* complexed with the 
regulatory protein calmodulin). 


(a) (b) 


Figure 32.33 The structure of calmodulin changes dramatically upon binding to a target protein. 

(a) Structure of uncomplexed calmodulin (PDB id = 1LKJ). Calmodulin, with four Ca?*-binding 
domains, forms a dumbbell-shaped structure with two globular domains joined by an extended, 

central helix. Each globular domain juxtaposes two Ca?*-binding EF-hand domains. An intriguing 
feature of these EF-hand domains is their nearly identical three-dimensional structure despite a 
relatively low degree of sequence homology (only 25% in some cases). (b, c) Complex of calmodulin 
(red) with a peptide from myosin light chain kinase (blue); (b) side view; (c) top view (PDB id = 1QTX). 
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(b) 


Trp 


Val 


Met 


Figure 32.34 Helical wheel representations of (a) a model calmodulin-binding peptide, 
Ac-WKKLLKLLKKLLKL-CONH,, and (b) the calmodulin-binding domain of spectrin. Positively 
charged and polar residues are indicated in green, and hydrophobic residues are orange. (Adapted from 
O’Neil, K., and DeGrado, W., 1990. How calmodulin binds its targets: Sequence independent 
recognition of amphiphilic a-helices. Trends in Biochemical Sciences 15:59-64.) 


modulates the activities of a large number of target proteins, including Ca2*-ATPases, 
protein kinases, phosphodiesterases, and NAD* kinase. CaM binds to these and to 
many other proteins with extremely high affinities (Kp values typically in the high 
picomolar to low nanomolar range). All CaM target proteins possess a basic amphiphi- 
lic alpha helix (a Baa helix), to which CaM binds specifically and with high affinity. 
Viewed end-on, in the so-called helical wheel representation (Figure 32.34), a Baa helix 
has mostly hydrophobic residues on one face; basic residues are collected on the oppo- 
site face. However, the Baa helices of CaM target proteins, although conforming to the 
model, show extreme variability in sequence. How does CaM, itself a highly conserved 
protein, accommodate such variety of sequence and structure? Each globular domain 
consists of a large hydrophobic surface flanked by regions of highly negative electro- 
static potential—a surface suitable for interacting with a Baa helix. The long central 
helix joining the two globular regions behaves as a long, flexible tether. When the tar- 
get protein is bound, the two globular domains fold together (Figure 32.33b). The 
flexible nature of the tethering helix allows the two globular domains to adjust their 
orientation synergistically for maximal binding of the target protein or peptide. 


32.5 | How Do Effectors Convert the Signals to Actions 
in the Cell? 


Transduction of the hormonal signal leads to activation of effectors—usually protein 
kinases and protein phosphatases—that elicit a variety of actions that regulate dis- 
crete cellular functions. Of the 518 protein kinases in the human genome, 428 are 
presumed to phosphorylate serine and threonine residues, whereas the remaining 90 
are protein tyrosine kinases. On the other hand, the human genome contains approxi- 
mately 200 phosphatases. The structures and functions of a few are representative. 


32.5a Protein Kinase A Is a Paradigm of Kinases 


Most protein kinases share a common catalytic core first characterized in protein kinase 
A (PKA), the enzyme that phosphorylates phosphorylase kinase (see Figures 15.9 and 
15.18). The active site of the catalytic subunit of PKA in a ternary complex with 
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A Deeper Look 


Mitogen-Activated Protein Kinases and Phosphorelay Systems 


In multicellular organisms, many physiological processes, including 
mitosis, gene expression, metabolism, and programmed death of 
cells, are regulated by a family of mitogen-activated protein kinases 
(MAPKs). (A mitogen is any agent that induces cell division, that is, 
mitosis.) MAPKs phosphorylate specific serines and threonines of 
target protein substrates, and these phosphorylation events function 
as switches to turn on or off the activity of the substrate proteins. 
These substrates may be other protein kinases, phospholipases, 
transcription factors, and cytoskeletal proteins. Protein phosphatases 
reverse the process, removing the phosphates that were added by 
MAPKs. 

MAPKs are part of a phosphorelay system composed of three 
kinases that are activated in sequence (see accompanying figure). 
In such systems, the MAPK itself is phosphorylated by a MAPK 


MAPK phosphorelay systems. The left column is a 
general model. The four columns to the right show 
the four known MAPK phosphorelay families. 


From Johnson, G. L., and Lapadat, R., 2002. Mitogen-activated protein 


kinase pathways mediated by ERK, JNK, and p38 protein kinases. Science 


298:1911-1912. 


kinase (denoted MKK), which is itself phosphorylated by a MAPK 
kinase kinase (denoted MKKK). MKKKs have distinct domains 
and motifs that respond to different cellular stimuli, and they have 
other domains that recognize specific MKKs. The same kinds of 
specificities regulate the action of MKKs. These specificities are 
accounted for in the classification of four subfamilies of MAPKs: 
One group is that of the extracellular signal-regulated kinases, 
notably ERK1 and ERK2; another is the c-Jun-amino-terminal 
kinases, including JNK, JNK1, and JNK2; a third group depends 
on the ERKS kinase; and the fourth group involves the p38 kinases, 
including p38a, p388, p38y, and p386. There are undoubtedly other 
MAPK families yet to be discovered. As shown in the figure, these 
phosphorelay systems link a variety of stimuli to substrates that 
affect many cellular functions. 


Growth Teen Oxidative 
factor gu stress 


MnAMP-PNP and a pseudosubstrate inhibitor peptide, as shown in Figure 32.35, 
includes a glycine-rich B-strand that acts as a flap over the triphosphate moiety of the 
bound nucleotide. A conserved residue, Asp!®, is the catalytic base that deprotonates 
the Ser/Thr-OH during phosphorylation, and Lys'® stabilizes the transition state of the 
reaction. Three Glu residues on the enzyme are involved in recognition of the pseudo- 


substrate inhibitor peptide. 


32.5b Protein Kinase C Is a Family of lsozymes 


The enzymes called protein kinase C are actually a family of 28 similar enzymes— 
isozymes—that encompass three subclasses. The conventional PKCs, a, BI, BII, and 
y, are regulated by Ca?*, diacylglycerol (DAG), and phosphatidylserine (PS). Because 
Ca?* levels increase in the cell in response to IP}, the activation of conventional 
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Figure 32.35 The structure of the catalytic subunit 
of PKA in a ternary complex with MnAMP-PNP 
and a pseudosubstrate inhibitor peptide (pink). 

A glycine-rich B-strand acts as a flap over the 
triphosphate moiety of the bound nucleotide. 

The glycine-rich flap that covers the ATP-binding 
site is shown in white, AMP—PNP is bound in the 
ATP site, Asp! is shown in beige, and Lys'® is 
shown below in white (PDB id = 1ATP). 


CHOH 
12-0-Tetradecanoylphorbol-13-acetate 


Figure 32.37 The structure of a phorbol ester. 
Long-chain fatty acids predominate at the 
12-position, whereas acetate is usually found at 
the 13-position. 


Conventional a BI BIL y CIA C1B c2 c3 C4 
pseudosubstrate hinge 
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Figure 32.36 The primary structures of the PKC isozymes. Conserved domains C1-C4 are indicated. 
Variable regions are shown as simple lines. 


PKCs depends on both of the second messengers released by the hydrolysis of PIP. 
The novel PKCs, ô, €, 0, and 7, are Ca** independent but are regulated by DAG and 
PS. PKCs @, v, and A are termed atypical and are activated by PS alone. These various 
cofactor requirements are imparted by subdomains represented in the conventional 
PKC polypeptide sequence. Conventional PKCs are comprised (Figure 32.36) of four 
conserved domains (C1—C4) and five variable regions (VI-V5). Domain Cl is a pseu- 
dosubstrate sequence that regulates the kinase by intrasteric control (see Section 15.4b), 
C2 is a Ca**-binding domain, C3 is the ATP-binding domain, and C4 binds peptide 
substrates. 

PKC phosphorylates serine and threonine residues on a wide range of protein sub- 
strates. A role for protein kinase C in cellular growth and division is demonstrated 
through its strong activation by phorbol esters (Figure 32.37). These compounds, from 
the seeds of Croton tiglium, are tumor promoters—agents that do not themselves cause 
tumorigenesis but that potentiate the effects of carcinogens. The phorbol esters mimic 
DAG, bind to the regulatory pseudosubstrate domain of the enzyme, and activate pro- 
tein kinase C. 


32.5c Protein Tyrosine Kinase pp60°°"* Is Regulated 
by Phosphorylation/Dephosphorylation 


Protein tyrosine kinase pp60°*" provides a good example of how phosphorylation con- 
verts signals to actions in cells. The structure of protein tyrosine kinase pp60°*"* (see 
Figure 32.20) consists of an N-terminal unique domain, an SH2 domain, an SH3 domain, 
and a kinase domain that includes a small lobe comprised mainly of a twisted B-sheet 
and a large lobe that is predominantly a-helical (Figure 32.38; see also Section 32.2). 
Phosphorylation of Tyr*?’ in the SH2 domain inhibits tyrosine kinase activity by drawing 
an activation loop into the active site, blocking ATP, and/or substrate binding. Dephos- 
phorylation of Tyr*?’ induces a conformation change that removes the activation loop 
from the active site, permitting autophosphorylation of Tyr*!®, which stimulates tyrosine 
kinase activity. 


32.5d Protein Tyrosine Phosphatase SHP-2 Is a Nonreceptor 
Tyrosine Phosphatase 


Protein tyrosine phosphatases reverse the actions of kinases like the protein tyrosine 
kinase pp60°** described in the previous section. The human phosphatase SHP-2 is a 
cytosolic nonreceptor tyrosine phosphatase. It comprises two SH2 domains, a catalytic 
phosphatase domain, and a C-terminal domain. The SH2 domains enable the enzyme 
to bind to its target substrates, and they also regulate the phosphatase activity. The cata- 
lytic domain of SHP-2 consists of nine a-helices and a ten-stranded mixed B-sheet that 
wraps around one of the helices (Figure 32.39). The other eight helices pack together on 
the opposite side of the B-sheet. 

The N-terminal SH2 domain regulates phosphatase activity by binding to the phospha- 
tase domain and directly blocking the active site. When a target peptide containing a phos- 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


32.6 How Are Signaling Pathways Organized and Integrated? 1191 


C helix 


SH3 
G-rich loop 


Activation loop 
(A-loop helix) 


Large lobe 


Figure 32.38 A ribbon diagram showing the structure of protein tyrosine kinase pp60°" with bound 
AMP-PNP. (Image kindly provided by Stephen C. Harrison.) 


photyrosine group binds to the SH2 domain, a conformation change causes this domain 
to dissociate from the catalytic domain, exposing the active site and allowing peptide sub- 
strate to bind. Binding of phosphotyrosine-containing peptide to the second SH2 domain 
provides additional activation. Target peptides with two phosphotyrosines (one to bind to 
each SH2 domain) provide maximal activation of the phosphatase activity. 


32.6 | How Are Signaling Pathways 
Organized and Integrated? 


All signaling pathways are organized in time and space in the cell, they are carefully 

regulated, and they are integrated with one another. Remarkably, these complex fea- 

tures of signaling depend on the simple concepts already covered in this chapter: Figure 32.39 A ribbon diagram showing the 

PIDs (see Figures 32.5 and 32.6) modulate and control the association of signaling S'™¥cture of protein tyrosine phosphatase SHP-2 
5 . ; ‘ . in its autoinhibited, closed conformation. The 

molecules with one another, often in large signalsomes; signaling molecules are N- and C-terminal SH2 domains are orânge and 

switched on and off by covalent modifications such as phosphorylations; and purple, respectively. The catalytic domain is blue 

signaling often involves amplification and cooperative effects. The multifaceted (PDB id = 2SHP). 

behavior of GPCRs serves as a paradigm for the organization and integration of sig- 

naling pathways and is the focus of this section. 


32.6a GPCRs Can Signal Through G-Protein—Independent Pathways 


The classic GPCR signaling pathway involves coupling to heterotrimeric G proteins, 
but GPCRs can interact with other effector molecules as well. The cellular src kinase 
(Section 32.3k) can be activated directly by the B,-adrenergic receptor (Figure 32.40), 
leading to activation of a MAPK pathway (see A Deeper Look box, “Mitogen- 
Activated Protein Kinase and Phosphorelay Systems” in Section 32.5). Activated src 
phosphorylates Tyr” on the Shc adaptor protein, promoting binding by an SH2 
domain of Grb2. Grb2 in turn binds to and activates Sosl, which activates Ras. Ras 
then activates a kinase cascade (Figure 32.40). 
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Figure 32.40 GPCRs can initiate cellular signaling 
pathways without involvement of G proteins. Under 
certain conditions, binding to GPCRs can induce 
autophosphorylation of the src kinase. Activated 
sre can then bind She and phosphorylate Tyr*!’, 
which promotes binding of Grb2 and initiation of 

a Ras-dependent kinase cascade. 


(MAPKK) MEK1 ~<— Raf I (MAPKKK) 


| 
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The Janus protein kinase (JAK) and its associated transducers and activators of 
transcription (STAT) constitute the JAK/STAT signaling pathway (Figure 32.41). For 
example, Jana Stankova and her colleagues have shown that binding of the platelet- 
activating factor (see Figure 8.8b) to the platelet-activating factor receptor (a GPCR) 
directly stimulates phosphorylation of tyk2, a Janus kinase. The activations of src 
(described earlier) and tyk2 raise the intriguing possibility that GPCR action may 
represent an alternate means of activating these signaling pathways (Figures 32.40 
and 32.41). 


32.6b G-Protein Signaling Is Modulated by RGS/GAPs 


The signal-transducing effects of G proteins persist as long as the bound GTP is not 
hydrolyzed. However, G proteins, such as Ras p21 and G,,, are themselves weak 
GTPases. For example, Ras p21 hydrolyzes GTP with a rate constant of only 0.02 min™!. 
If Ras p21 and G,, were efficient enzymes, the GTP-bound state would be short-lived 
and G-protein—mediated signaling would be ineffective. 

How can G proteins be switched off if they are inherently poor GTPases? The answer 
is provided by regulators of G-protein signaling (RGS), which act as GTPase-activating 
proteins (GAPs). RGS/GAPs elicit dramatic increases in GTPase activity when bound 
to G proteins. For example, RGS/GAPs increase the GTPase activity of Ras p21 by 
a factor of 100,000 and accelerate G,,-catalyzed GTP hydrolysis nearly 100-fold. 
RGS/GAPs induce conformation changes in the switch domains of their G protein tar- 
gets and stabilize the transition state of the GTPase reaction (Figure 32.42). 

Interestingly, however, RGS proteins are more than GAPs. All RGS proteins contain 
several other signaling modules in addition to the 120 residue RGS module (Figure 32.43). 
These additional modules enable RGS proteins to bind to a variety of signaling pro- 
teins, to behave as effector molecules themselves, and to act as scaffolding proteins in 
the formation of signalsome complexes. RGS proteins interact directly, for example, 
with adenylyl cyclase, phospholipase C-B, cGMP phosphodiesterase, guanylyl cyclase, 
Ca?* channels, and potassium channels (Figure 32.44). These RGS-mediated interac- 
tions enable cross-talk between many signaling pathways. 


32.6c GPCR Desensitization Leads to New Signaling Pathways 


Activation of GPCRs by hormones and other extracellular signals leads to heterotri- 
meric G protein binding, which triggers a variety of intracellular signals as shown. 
Importantly, the activated GPCR also binds two other classes of molecules: a family of 
at least six protein kinases known as GPCR kinases (GRKs) and a family of adaptor and 
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Figure 32.41 Platelet-activating factor receptor (a GPCR) can trigger autophosphorylation of the 
tyk2 kinase (a Janus protein kinase). Tyk2-induced phosphorylation of STAT essentially mimics the 
activation of the JAK/STAT signaling pathway by hormones such as erythropoietin (EPO). 


scaffolding proteins known as ß-arrestins. On one level, these two protein families work 
together to desensitize the GPCRs, arresting the G-protein activation by GPCRs. On 
another level, the GRKs and f-arrestins act together as a molecular switch, directing 
GPCRs to a distinctly different role in cell signaling. GRK phosphorylation of various 
combinations of 13 separate sites on the C-terminal sequence of the GPCR promotes 
binding of B-arrestin (Figure 32.45) to form a signalsome assembly. Binding of B-arrestin 
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(b) 


Figure 32.42 G-Protein signaling is mediated by 
RGS/GAPs. (a) A fragment of an RGS/GAP (brown) 
bound to Ras p21 (blue). GAPs increase the GTPase 
activity of Ras p21 by a factor of 100,000 (PDB id = 
1WQ1). (b) An RGS/GAP (pink) bound to Gia (blue). 
GDP is shown in yellow (PDB id = 21K8). 


Subfamily 


a-Helix 


Figure 32.43 The RGS proteins (mammalian cells 
contain more than 30) are classified in subfamilies. 
Representative members of six of those subfamilies 
are shown. In addition to the RGS module of 121 
residues, each RGS protein contains other motifs 
and modules that define its functionality. Many of 
those shown are discussed elsewhere in Chapter 32. 
(Adapted from Bansal, G., Druey, K. M., and 

Xie, Z., 2007. R4 RGS proteins: Regulation of 
G-protein signaling and beyond. Pharmacology and 
Therapeutics 116:473-495.) 
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Figure 32.44 Several interactions 
between RGS proteins and G-protein 
effectors. RGS proteins interact 
directly with adenylyl cyclase, 
phospholipase C-6, cGMP 
phosphodiesterase, and retinyl 
guanylyl cyclase as well as with the 
potassium channel known as GIRK 
and certain calcium channels. Y 
(Adapted from Abramow-Newerly, M., Rhodopsin 
Roy, A. A., Nunn, C., and Chidiac, P., 
2006. RGS proteins have a signaling 
complex: Interactions between RGS 
proteins and GPCRs, effectors, and 
auxiliary proteins. Cellular Signaling N-type Ca?* = 
18:579-591.) channel GPCR 


GIRK 
channel 
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has two effects: The GPCR is no longer able to interact with and activate G proteins, and 
the GPCR is targeted for clathrin-mediated endocytosis (Figure 32.45). Following forma- 
tion of the endosome, the GPCR--arrestin signalsome enters an arrestin signaling 
mode, recruiting a variety of catalytically active proteins, such as components of the 
Raf-MEK-ERK signaling cascade. Arrestin-mediated signaling continues until the 
GPCR returns to the plasma membrane, where it waits for a new extracellular signal. It 
is important to appreciate that G-protein signaling occurs within seconds of extracellular 
GPCR activation and typically lasts for 10 min or less, whereas signaling by the GPCR- 
B-arrestin complex becomes maximal around 10 min and typically persists for 30 min. 


(a) Signal initiation 


Clathrin-coated 
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Figure 32.45 GPCRs have two signaling modes. (a) Signal initiation by ligand binding first activates 
the G-protein—mediated signaling pathways at the plasma membrane. At the same time, G-protein 
receptor kinases (GRKs) begin to phosphorylate the receptor, creating high-affinity binding sites for 
arrestin. Arrestin binding uncouples the receptor from G proteins and targets the receptor either for 
arrestin-based signaling or for endocytosis. The GPCR—arrestin complex serves as a signalsome 
scaffold, recruiting catalytically active proteins such as components of the Raf-MEK-ERK cascade. 
This initiates a distinct set of signals from the endosome-bound signalsome. Endosomes are eventually 
processed and degraded, returning the GPCRs to the plasma membrane. (Adapted from Figure | of 
Gesty-Palmer, D., and Luttrell, L. M., 2008. Heptahelical terpsichory. Who calls the tune? Journal of 
Receptors and Signal Transduction 28:39-58.) (b) Complex of human neurotensin (a GPCR), purple, 
and B-arrestinl, green. (PDB id = 6UP7). 
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A Deeper Look 


Whimsical Names for Proteins and Genes 


The study of cell signaling and the identification of hundreds of new 
signaling proteins provided an unprecedented creative opportunity 
for cell biologists and geneticists in the naming of these proteins. 
In the early days of molecular biology, such names were typically 
arcane abbreviations and acronyms. One such case is the family of 
14-3-3 proteins, named for the migration patterns of these proteins 
on DEAE-cellulose chromatography and starch-gel electrophoresis. 
In the 1970s, a few creative scientists suggested whimsical names 
for newly discovered genes, such as sevenless, named in reference 
to R7, one of the eight photoreceptor cells in the compound eye of 
Drosophila, the common fruit fly. What began as a trickle became 
a torrent of whimsical names for proteins and genes. Sevenless was 
followed by bride of sevenless (boss, a ligand of sevenless), and son 
of sevenless (sos, first isolated in a genetic screen of the sevenless 
receptor tyrosine kinase pathway in Drosophila). The hedgehog (hh) 
genes, including sonic hedgehog (Shh), play critical roles in the devel- 
opment and patterning of animal embryonic tissues but were named 
for a popular video game. 

The accompanying table lists a few notable examples of 
whimsically named genes and gene products, many of which were 
first identified in Drosophila. 


Name 
Armadillo 


Bag of marbles 


Role or Function 


Plakoglobin = B-catenin 


Novel protein involved in oogenesis and 
spermatogenesis 


Bullwinkle Oocyte protein 

Cactus Signaling protein—IkB homolog 

Cheap date Alcohol sensitivity 

Chickadee Profilin homolog—regulation of actin 
cytoskeleton 

Corkscrew A protein tyrosine phosphatase 

Dachshund Nuclear protein required for cell fate 
determination 

Dishevelled Novel cytoplasmic protein in the 
wingless pathway 

Dunce A cAMP phosphodiesterase 

Hopscotch A Janus family tyrosine kinase 

Naked A segment polarity gene 

Reaper A death-domain protein functioning in 
apoptosis 

Rutabaga A Ca?**/calmodulin-dependent protein 
kinase 

Shark A tyrosine kinase SH2-nonreceptor 

Yak Literally “yet another kinase” 


32.6d Receptor Responses Can Be Coordinated by Transactivation 


Hormone and signaling receptors and their pathways do not operate in isolation. Rather, 
one or more of the second messengers and effectors of a given pathway can often acti- 
vate (or inhibit) another pathway. For example, binding of insulin-like growth factor 1 
(IGF-1) to its own RTK signals enhanced synthesis of a peptide known as RANTES. 
RANTES is secreted into the extracellular medium, where it binds toa GPCR known as 
the CCRS receptor (Figure 32.46a). As an extracellular signal, RANTES initiates a sig- 
naling pathway that induces cell migration, a part of the cell’s response to inflammation 
caused by infection. This is an example of transactivation of a GPCR by a RTK. 

The reverse can happen as well. GPCRs can transactivate RTKs by a variety of 
mechanisms. For example, in certain neurons of the hypothalamus, stimulation of 
a-adrenergic receptors triggers a G-protein—mediated pathway that activates a matrix 
metalloproteinase. Metalloproteinase action releases heparin-binding EGF-like growth 
factor (HB-EGF) in the extracellular matrix. Binding of HB-EGF to the EGF receptor 
initiates a classic RTK-activated signaling pathway (Figure 32.46b). 


32.6e Signals from Multiple Pathways Can Be Integrated 


A cell can be exposed simultaneously to multiple, potentially contradictory signals in 
the form of soluble hormones and ligands anchored to adjacent cells or the extracel- 
lular matrix. Cells must have mechanisms for sorting these various signals into a 
defined response. The Rsk1 protein serine/threonine kinases exhibit such behavior, 
integrating several signals to achieve full activation. Rskl has two protein kinase 
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(a) Transcriptional regulation of GPCR ligand synthesis 
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Figure 32.46 Examples of transactivation of cell signaling pathways. (a) Binding of IGF-1 to RTK 

(in human breast cancer cells) initiates a signaling pathway that leads to an enhanced synthesis of 
RANTES (a peptide). RANTES is secreted into the extracellular medium, where it can bind a GPCR 
known as CCRS. CCRS activation triggers G-protein—mediated signaling pathways. (b) Transactivation 
that is the reverse of that in (a). Activation of the a;-adrenergic receptor activates phospholipase C-B, 
which triggers metalloproteinase (MMP) activation. MPP cleaves a precursor of heparin-binding 
EGF-like growth factor (HB-EGF). Binding of HB-EGF to the EGF receptor initiates an RTK 
signaling pathway. 


domains (Figure 32.47), of which the N-terminal domain phosphorylates downstream 
targets. This N-terminal kinase domain is controlled by multiple inputs, including 
from the C-terminal domain. The ERK MAPK binds to a docking site at the 
C terminus of Rsk1, phosphorylating sites in the linker region between the two kinase 
domains and in the C-terminal domain, all of which are essential for activation. 
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Figure 32.47 Integration of signaling pathways. 
Activation of a ribosomal serine/threonine kinase 
known as Rsk1 requires phosphorylation by two 
protein kinases: a phosphoinositide-dependent 
kinase (PDK1) and a mitogen-activated protein 
kinase (ERK; see the Deeper Look box “Mitogen- 
Activated Protein Kinases and Phosphorelay 
Systems” in Section 32.5). Thus, both 
phosphoinositide-mediated and Ras-mediated 
pathways must be active to activate Rsk1. 


Full activation, however, also requires phosphorylation of the N-terminal kinase 
domain by the PIP3-stimulated PDK1 protein kinase. Rsk1 activation thus requires 
inputs from both the Erk MAPK pathway and the PIP; pathway (Figure 32.47). 


32.7 | How Do Neurotransmission Pathways Control 
| the Function of Sensory Systems? 


The survival of higher organisms is predicated on the ability to respond rapidly to sen- 
sory input from physical signals (sights, sounds) and chemical cues (smells). The responses 
to such stimuli may include muscle movements and many forms of intercellular com- 
munication. Hormones (as described earlier in this chapter) can move through an organ- 
ism only at speeds determined by the circulatory system. In most higher organisms, a 
faster means of communication is crucial. Nerve impulses, which can be propagated at 
speeds up to 100 m/s, provide a means of intercellular signaling that is fast enough to 
facilitate sensory recognition, movement, and other physiological functions and behav- 
iors in higher animals. The generation and transmission of nerve impulses in vertebrates 
is mediated by an extremely complicated neural network that connects every part of the 
organism with the brain—itself an interconnected array of as many as 10!! nerve cells. 
Despite their complexity and diversity, the nervous systems of animals all possess 
common features and common mechanisms. Physical or chemical stimuli are recog- 
nized by specialized receptor proteins in the membranes of excitable cells. Conforma- 
tional changes in the receptor protein result in a change in enzyme activity or a change 
in the permeability of the membrane. These changes are then propagated throughout 
the cell or from cell to cell in specific and reversible ways to carry information through 
the organism. This section describes the characteristics of excitable cells and the mecha- 
nisms by which these cells carry information at high speeds through an organism. 


32.7a Nerve Impulses Are Carried by Neurons 


Neurons and neuroglia (or glial cells) are cell types unique to nervous systems. The recep- 
tion and transmission of nerve impulses are carried out by neurons (Figure 32.48), 
whereas glial cells serve protective and supportive functions. (Neuroglia could be trans- 
lated as “nerve glue.”) Glial cells differ from neurons in several ways. Glial cells do not 
possess axons or synapses, and they retain the ability to divide throughout their life 
spans. Glial cells outnumber neurons by at least 10 to 1 in most animals. 

Neurons are distinguished from other cell types by their long cytoplasmic extensions 
or projections, called processes. Most neurons consist of three distinct regions (see 
Figure 32.48): the cell body (called the soma), the axon, and the dendrites. The axon ends 
in small structures called synaptic terminals, synaptic knobs, or synaptic bulbs. Dendrites 
are short, highly branched structures emanating from the cell body that receive neural 
impulses and transmit them to the cell body. The space between a synaptic knob on one 
neuron and a dendrite ending of an adjacent neuron is the synapse or synaptic cleft. 

Three kinds of neurons are found in higher organisms: sensory neurons, interneu- 
rons, and motor neurons. Sensory neurons acquire sensory signals, either directly or 
from specific receptor cells, and pass this information along to either interneurons or 
motor neurons. Interneurons simply pass signals from one neuron to another, whereas 
motor neurons pass signals from other neurons to muscle cells, thereby inducing muscle 
movement (motor activity). 


32.7b lon Gradients Are the Source of Electrical Potentials in Neurons 


Propagation of information along a neuron is accomplished by electrical potentials, 
whereas communication between neurons is chemical in nature. These electrical sig- 
nals occur as transient changes in the electrical potential differences (voltages) across 
the membranes of neurons (and other cells). Such potentials are generated by ion 
gradients. Such gradients are used for a variety of purposes in all cells. In neurons, ion 
gradients are the essence of action potentials that carry the neural messages. The 
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Figure 32.48 The structure of a mammalian motor neuron. The nucleus and most other organelles 
are contained in the cell body. One long axon and many shorter dendrites project from the body. The 
dendrites receive signals from other neurons and conduct them to the cell body. The axon transmits 
signals from this cell to other cells via the synaptic knobs. Glial cells called Schwann cells envelop the 
axon in layers of an insulating myelin membrane. Although glial cells lie in proximity to neurons in 
most cases, no specific connections (such as gap junctions, for example) connect glial cells and 
neurons. However, gap junctions can exist between adjacent glial cells. 


. . P : Cell body 
cytoplasm of a neuron at rest is low in Na* and Cl and high in Kt, relative to the 


extracellular fluid (Figure 32.49). These gradients are generated by the Na*, K*-ATPase 
(see Section 9.8b). A resting neuron exhibits a potential difference of approximately 
—60 mV (that is, negative inside). 


\ 


` 


32.7c Action Potentials Carry the Neural Message 


Nerve impulses, also called action potentials, are transient changes in the membrane 
potential that move rapidly along nerve cells. Action potentials are created when the 
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Figure 32.49 Action potentials are generated by transmembrane flows of ions. (a) The concentrations 

of Na+, K+, and Cl” ions inside and outside of a typical resting mammalian axon are shown. Assuming 

relative permeabilities for K*, Na*, and Cl are 1, 0.04, and 0.45, respectively, the Goldman equation p 
yields a membrane potential of —60 mV. (See problem 19.) (b and c) The time dependence of an action 

potential, compared with the ionic permeabilities of Na* and K+. (b) The rapid rise in membrane 

potential from —60 mV to slightly more than +30 mV is referred to as a depolarization. This 

depolarization is caused (c) by a sudden increase in the permeability of Na*. As the Na* permeability 

decreases, K* permeability is increased and the membrane potential drops, eventually falling below the | 
resting potential—a state of hyperpolarization—followed by a slow return to the resting potential. é j és \ 

(Adapted from Hodgkin, A., and Huxley, A., 1952. A quantitative description of membrane current and 

its application to conduction and excitation in nerve. Journal of Physiology 117:500-544.) 
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membrane is locally depolarized by approximately 20 mV from the resting value of about 
—60 mV to a new value of approximately —40 mV. This small change is enough to have 
a dramatic effect on specific proteins in the axon membrane called voltage-gated ion 
channels. These proteins are ion channels that are specific either for Na* or K+. These 
ion channels are normally closed at the resting potential of —60 mV. When the potential 
difference rises to —40 mV, the gates of the Na* channels are opened and Na* ions begin 
to flow into the cell. As Na* enters the cell, the membrane potential continues to increase 
and additional Na* channels are opened (Figure 32.49). The potential rises to more than 
+30 mV. At this point, Na* influx slows and stops. As the Na* channels close, K* chan- 
nels begin to open and K* ions stream out of the cell, returning the membrane potential 
to negative values. The potential eventually overshoots its resting value a bit. At this 
point, K* channels close, and the resting potential is eventually restored by action of the 
Na*,K*-ATPase and the other channels. Alan Hodgkin and Andrew Huxley originally 
observed these transient increases and decreases, first in Na* permeability and then in 
K* permeability. For this and related work, Hodgkin and Huxley, along with J. C. Eccles, 
won the Nobel Prize in Physiology or Medicine in 1963. 


32.7d The Action Potential Is Mediated by the Flow of Na* and K* lons 


Changes in potential in one part of the axon are rapidly passed along the axonal mem- 
brane (Figure 32.50). The sodium ions that rush into the cell in one localized region 
actually diffuse farther along the axon, raising the Na* concentration and depolariz- 
ing the membrane, causing Na* gates to open in that adjacent region of the axon. In 
this way, the action potential moves down the axon in wavelike fashion. This simple 
process has several very dramatic properties: 


1. Action potentials propagate very rapidly—up to and sometimes exceeding 100 m/s. 
2. The action potential is not attenuated (diminished in intensity) as a function of 
distance transmitted. 


The input of energy all the way along an axon—in the form of ion gradients maintained 
by Na*, K*-ATPase—ensures that the shape and intensity of the action potential are 
maintained over long distances. The action potential has an all-or-none character. 
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Figure 32.50 The propagation of action potentials along an axon. Figure 32.49 shows the time 
dependence of an action potential at a discrete point on the axon. This figure shows how the membrane 
potential varies along the axon as an action potential is propagated. (For this reason, the shape of the 
action potential is the apparent reverse of that shown in Figure 32.49.) At the leading edge of the action 
potential, membrane depolarization causes Na* channels to open briefly. As the potential moves along 
the axon, the Na* channels close and K* channels open, leading to a drop in potential and the onset of 
hyperpolarization. When the resting potential is restored, another action potential can be initiated. 
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There are no gradations of amplitude; a given neuron is either at rest (with a polarized 
membrane) or is conducting a nerve impulse (with a reversed polarization). Because 
nerve impulses display no variation in amplitude, the size of the action potential is not 
important in processing signals in the nervous system. Instead, it is the number of 
action potential firings and the frequency of firing that carry specific information. 

The action potential is a delicately orchestrated interplay between the Na*, K*-ATPase 
and the voltage-gated Na* and K* channels that is initiated by a stimulus at the postsyn- 
aptic membrane. The density and distribution of Na* channels along the axon are differ- 
ent for myelinated and unmyelinated axons (Figure 32.51). In unmyelinated axons, Na* 
channels are uniformly distributed, although they are few in number—approximately 
20 channels per um?. On the other hand, in myelinated axons, Na* channels are clustered 
at the nodes of Ranvier. In these latter regions, they occur with a density of approximately 
10,000 per m2. (Ion channel structure and function were discussed in Section 9.7.) 


32.7e Neurons Communicate at the Synapse 


How are neuronal signals passed from one neuron to the next? Neurons are juxta- 
posed at the synapse. The space between the two neurons is called the synaptic cleft. 
The number of synapses in which any given neuron is involved varies greatly. There 
may be as few as one synapse per postsynaptic cell (in the midbrain) to many thou- 
sands per cell. Typically, 10,000 synaptic knobs may impinge on a single spinal motor 
neuron, with 8000 on the dendrites and 2000 on the soma or cell body. The ratio of 
synapses to neurons in the human forebrain is approximately 40,000 to 1! 

Synapses are actually quite specialized structures, and several different types 
exist. A minority of synapses in mammals, termed electrical synapses, are character- 
ized by a very small gap—approximately 2 nm—between the presynaptic cell (which 
delivers the signal) and the postsynaptic cell (which receives the signal). At electrical 
synapses, the arrival of an action potential on the presynaptic membrane leads 
directly to depolarization of the postsynaptic membrane, initiating a new action 
potential in the postsynaptic cell. However, most synaptic clefts are much wider—on 
the order of 20 to 50 nm. In these, an action potential in the presynaptic membrane 
causes secretion of a chemical substance—called a neurotransmitter—by the presyn- 
aptic cell. This substance binds to receptors on the postsynaptic cell, initiating a new 
action potential. Synapses of this type are thus chemical synapses. 

Different synapses utilize specific neurotransmitters. The cholinergic synapse, a para- 
digm for chemical transmission mechanisms at synapses, employs acetylcholine as a 
neurotransmitter. Other important neurotransmitters and receptors have been discov- 
ered and characterized. These all fall into one of several major classes, including amino 
acids (and their derivatives), catecholamines, peptides, and gaseous neurotransmitters. 
Table 32.3 lists some, but not all, of the known neurotransmitters. 


1201 


Figure 32.51 Some axons are covered by a myelin 
sheath—a membrane rich in lipids but with very few 
proteins. The myelin sheath acts as an electrical 
insulator and speeds the transmission of action 
potentials. Na* channels are infrequently and 
randomly distributed in unmyelinated nerve. In 
myelinated axons, Na* channels are clustered in 
large numbers in the nodes of Ranvier, between the 
regions surrounded by myelin sheath structures. 


| Table 32.3. | Families of 


Neurotransmitters 


Cholinergic Agents 
Acetylcholine 


Catecholamines 
Norepinephrine (noradrenaline) 
Epinephrine (adrenaline) 
L-Dopa 

Dopamine 

Octopamine 

Amino Acids (and Derivatives) 
y-Aminobutyric acid (GABA) 
Alanine 

Aspartate 

Cystathionine 

Glycine 

Glutamate 

Histamine 

Proline 

Serotonin 

Taurine 

Tyrosine 

Peptide Neurotransmitters 
Cholecystokinin 

Enkephalins and endorphins 
Gastrin 

Gonadotropin 

Neurotensin 

Oxytocin 

Secretin 

Somatostatin 

Substance P 

Thyrotropin releasing factor 
Vasopressin 

Vasoactive intestinal peptide (VIP) 
Gaseous Neurotransmitters 
Carbon monoxide (CO) 
Nitric oxide (NO) 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1202 Chapter 32 The Reception and Transmission of Extracellular Information 


(a) (b) 


0 0000 
00000 


o Acetylcholine 
in vesicles 


| 


\\0. 


Q 


D 


o\e S/o Acetylcholine 
ole ofo receptors 
ole elo 
Resting state Action potential causes Acetylcholine is released Opening of receptor 
Ca®* influx which causes and diffuses to receptors channels permits flow 
vesicles to fuse with of ions 
membrane 


Figure 32.52 Cell-cell communication at the (a) synapse is mediated by neurotransmitters such as 
acetylcholine, produced from choline by choline acetyltransferase. (b) The arrival of an action potential at 
the synaptic knob opens Ca?* channels in the presynaptic membrane. (©) Influx of Ca?* induces 

the fusion of acetylcholine-containing vesicles with the plasma membrane and release of acetylcholine 
into the synaptic cleft. (d) Binding of acetylcholine to receptors in the postsynaptic membrane opens 
Na* channels. The influx of Na* depolarizes the postsynaptic membrane, generating a new 

action potential. 


32.7f Communication at Cholinergic Synapses Depends upon Acetylcholine 


In cholinergic synapses, small synaptic vesicles inside the synaptic knobs contain large 
amounts of acetylcholine (approximately 10,000 molecules per vesicle; Figure 32.52). 
When the membrane of the synaptic knob is stimulated by an arriving action potential, 
special voltage-gated Ca?* channels open and Ca? ions stream into the synaptic knob, 
causing the acetylcholine-containing vesicles to attach to and fuse with the knob mem- 
brane. The vesicles open, spilling acetylcholine into the synaptic cleft. Binding of acetyl- 
choline to specific acetylcholine receptors in the postsynaptic membrane causes opening 
of ion channels and the creation of a new action potential in the postsynaptic neuron. 

A variety of toxins can alter or affect this process. The anaerobic bacterium Clos- 
tridium botulinum, which causes botulism poisoning, produces several toxic proteins 
that strongly inhibit acetylcholine release. The black widow spider, Latrodectus mactans, 
produces a venom protein, @-latrotoxin, that stimulates abnormal release of acetylcho- 
line at the neuromuscular junction. The bite of the black widow causes pain, nausea, 
and mild paralysis of the diaphragm but is rarely fatal. 


32.7g There Are Two Classes of Acetylcholine Receptors 


Two different acetylcholine receptors are found in postsynaptic membranes. They 
were originally distinguished by their responses to muscarine, a toxic alkaloid in 
toadstools, and nicotine (Figure 32.53). The nicotinic receptors are cation channels in 
postsynaptic membranes, and the muscarinic receptors are transmembrane proteins 
that interact with G proteins. The receptors in sympathetic ganglia and those in 
motor endplates of skeletal muscle are nicotinic receptors. Nicotine locks the ion 
channels of these receptors in their open conformation. Muscarinic receptors are 
found in smooth muscle and in glands. Muscarine mimics the effect of acetylcholine 
on these latter receptors. 

The nicotinic acetylcholine receptor is a 290 kDa transmembrane glycoprotein con- 
sisting of a ring of four homologous subunits (a, B, y, and ô) in the order ayaBé 
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Figure 32.53 Two types of acetylcholine receptors are known. Nicotinic acetylcholine receptors are 
locked in their open conformation by nicotine. Obtained from tobacco plants, nicotine is named for 
Jean Nicot, French ambassador to Portugal, who sent tobacco seeds to France in 1550 for cultivation. 
Muscarinic acetylcholine receptors are stimulated by muscarine, obtained from the intensely 
poisonous mushroom, Amanita muscaria. 


(a) 


Figure 32.54 The nicotinic acetylcholine 
receptor is an elongated funnel constructed from 
homologous subunits named a, B, y, and 6. The 
pentameric channel includes two copies of the 
a-subunit. The extracellular domain of each 
subunit is a B-barrel, whereas the transmembrane 
and intracellular domains are a-helical. (a) Top 
view; (b) side view (PDB id = 2BG9). 


(Figure 32.54a). The receptor is shaped like an elongated (160 A) funnel, with a large 
extracellular ligand-binding domain, a membrane-spanning pore, and a smaller intra- 
cellular domain. Acetylcholine binds to the two a-subunits at sites that lie 40 A from the 
membrane surface. 


The Nicotinic Acetylcholine Receptor Is a Ligand-Gated lon Channel The nicotinic ace- 
tylcholine receptor functions as a ligand-gated ion channel, and on the basis of its 
structure, it is also an oligomeric ion channel. When acetylcholine (the ligand) binds 
to this receptor, a conformational change opens the channel, which is equally perme- 
able to Na* and K*. Na* rushes in while K* streams out, but because the Na* gradi- 
ent across this membrane is steeper than that of K*, the Na* influx greatly exceeds 
the K* efflux. The influx of Na* depolarizes the postsynaptic membrane, initiating 
an action potential in the adjacent membrane. After a few milliseconds, the channel 
closes, even though acetylcholine remains bound to the receptor. At this point, the 
channel will remain closed until the concentration of acetylcholine in the synaptic 
cleft drops to about 10 nM. 
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Figure 32.56 Following a synaptic transmission 
event, acetylcholine is repackaged in vesicles in a 
multistep process. Synaptic vesicles are formed by 
endocytosis, and acetylcholine is synthesized by 
choline acetyltransferase. A proton gradient is 
established across the vesicle membrane by an 
H*-transport ATPase, and a proton—acetylcholine 
transport protein transports acetylcholine into the 
synaptic vesicles, exchanging acetylcholine for 
protons in an electrically neutral antiport process. 
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Figure 32.55 Acetylcholine is degraded to acetate and choline by acetylcholinesterase, a serine protease. 


Acetylcholinesterase Degrades Acetylcholine in the Synaptic Cleft Following every syn- 
aptic signal transmission, the synapse must be readied for the arrival of another 
action potential. Several things must happen very quickly. First, the acetylcholine left 
in the synaptic cleft must be rapidly degraded to resensitize the acetylcholine recep- 
tor and to restore the excitability of the postsynaptic membrane. This reaction is 
catalyzed by acetylcholinesterase (Figure 32.55). 

When [acetylcholine] has decreased to low levels, acetylcholine dissociates from the 
receptor, which thereby regains its ability to open in a ligand-dependent manner. 
Second, the synaptic vesicles must be reformed from the presynaptic membrane by 
endocytosis (Figure 32.56) and then must be restocked with acetylcholine. 

This occurs through the action of an ATP-driven H* pump and an acetylcholine 
transport protein. The H* pump in this case is a member of the family of V-type 
ATPases. It uses the free energy of ATP hydrolysis to create an H* gradient across the 
vesicle membrane. This gradient is used by the acetylcholine transport protein to drive 
acetylcholine into the vesicle, as shown in Figure 32.56. 

Antagonists of the nicotinic acetylcholine receptor are particularly potent neurotox- 
ins. These agents, which bind to the receptor and prevent opening of the ion channel, 
include d-tubocurarine, the active agent in the South American arrow poison curare, and 
several small proteins from poisonous snakes. These latter agents include cobratoxin 
from cobra venom, and a-bungarotoxin, from Bungarus multicinctus, a snake common 
in Taiwan (Figure 32.57). 


Muscarinic Receptor Function Is Mediated by G Proteins There are several different 
types of muscarinic acetylcholine receptors, with different structures and different ap- 
parent functions in synaptic transmission. However, certain structural and functional 
features are shared by this class of receptors. Muscarinic receptors are 70 kDa glyco- 
proteins and are members of the GPCR family. 

Activation of muscarinic receptors (by binding of acetylcholine) results in several 
G-protein—mediated effects, including the inhibition of adenylyl cyclase, the stimulation 
of phospholipase C, and the opening of K* channels. Many antagonists for muscarinic 
acetylcholine receptors are known, including atropine from A tropa belladonna, the deadly 
nightshade plant whose berries are sweet and tasty but highly poisonous (Figure 32.57). 

Both the nicotinic and muscarinic acetylcholine receptors are sensitive to certain 
agents that inactivate acetylcholinesterase itself. Acetylcholinesterase is a serine ester- 
ase similar to trypsin and chymotrypsin (see Section 14.6). The reactive serine at the 
active site of such enzymes is a vulnerable target for organophosphorus inhibitors 
(Figure 32.58). DIPF and related agents form stable covalent complexes with the active- 
site serine, irreversibly blocking the enzyme. Malathion is a commonly used insecti- 
cide, and sarin and tabun are nerve gases used in chemical warfare. All these agents 
effectively block nerve impulses, stop breathing, and cause death by suffocation. 


32.7h Amino Acids, Catecholamines, and Peptides Can Act as 
Neurotransmitters within Synaptic Junctions 


Synaptic junctions that use amino acids, catecholamines, and peptides (see Table 32.3) 
appear to operate much the way the cholinergic synapses do. Presynaptic vesicles 
release their contents into the synaptic cleft, where the neurotransmitter substance 
can bind to specific receptors on the postsynaptic membrane to induce a conforma- 
tional change and elicit a particular response. Some of these neurotransmitters are 
excitatory in nature and stimulate postsynaptic neurons to transmit impulses, whereas 
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Figure 32.57 Plants and reptiles are rich sources of neurotoxic agents. (a) Tubocurarine, obtained 
from the plant Chondrodendron tomentosum, is the active agent in tube curare, named for the bamboo 
tubes in which it is kept by South American indigenous hunters. (b) Atropine is produced by Atropa 
belladonna, the poisonous deadly nightshade. The species name, which means “beautiful woman,” 
derives from the use of atropine in years past by Italian women to dilate their pupils. Atropine is still 
used for pupil dilation in eye exams by ophthalmologists. (c) Cobratoxin and (d) a-bungarotoxin are 
proteins produced by the cobra (Naja naja) and the banded krait snake (Bungarus multicinctus), 
respectively. 


Figure 32.58 Covalent inhibitors of acetylcholinesterase include DIPF, (also known as 
diisopropylfluorophosphate, or DIFP), the nerve gases tabun and sarin, and the insecticide malathion. 


others are inhibitory and prevent the postsynaptic neuron from carrying other signals. 
Just as acetylcholine acts on both nicotinic and muscarinic receptors, so most of the 
known neurotransmitters act on several (and in some cases, many) different kinds of 
receptors. Biochemists are just beginning to understand the sophistication and com- 
plexity of neuronal signal transmission. 


32.71 Glutamate and Aspartate Are Excitatory Amino Acid Neurotransmitters 


The common amino acids glutamate and aspartate act as neurotransmitters. Like 
acetylcholine, glutamate and aspartate are excitatory and stimulate receptors on 
the postsynaptic membrane to transmit a nerve impulse. No enzymes that degrade 
glutamate exist in the extracellular space, so glutamate must be cleared by the high- 
affinity presynaptic and glial transporters—a process called reuptake. 
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A Deeper Look 


Tetrodotoxin and Saxitoxin Are Na* Channel Toxins 


Tetrodotoxin and saxitoxin are highly specific blockers of Na* 
channels and bind with very high affinity (Kp <1 nM). This 
unique specificity and affinity have made it possible to use ra- 
dioactive forms of these toxins to purify Na* channels and map 
their distribution on axons. Tetrodotoxin is found in the skin 
and several internal organs of puffer fish, also known as blowfish 
or swellfish, members of the family Tetraodontidae, which react 
to danger by inflating themselves with water or air to nearly 
spherical (and often comical) shapes (see accompanying figure). 


(a) 


Stephen Frink/Encyclopedia/Corbis 


(b) 


Toxins that block the Nat channel in a closed state 


Although tetrodotoxin poisoning can easily be fatal, puffer fish 
are delicacies in Japan, where they are served in a dish called fugu. 
For this purpose, the puffer fish must be cleaned and prepared by 
specially trained chefs. Saxitoxin is made by Gonyaulax catenella 
and G. tamarensis, two species of marine dinoflagellates or plank- 
ton that are responsible for red tides that cause massive fish kills. 
Saxitoxin is concentrated by certain species of mussels, scallops, 
and other shellfish that are exposed to red tides. Consumption of 
these shellfish by animals, including humans, can be fatal. 


Stephen Frink/Terra/Corbis 
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Saxitoxin Tetrodotoxin is found in puffer fish, which are prepared and served in 


Japan as fugu. (a) The puffer fish on the left is unexpanded; the one on the 
right is inflated. (b) Structures of tetrodotoxin and saxitoxin. 
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Neurexins and Neuroligins—the Scaffolding of Learning and Memory 


How do we learn, remember, and forget? And how do our brains 
change in response to new experiences? Answers to these questions 
are being revealed by new research at the crossroads of cell biology 
and biochemistry. The human brain contains about 100 billion neu- 
rons that are interconnected by more than 100 trillion synapses—a 
total of approximately 100,000 miles of biological wiring. Synapses 
are the fundamental unit of neural circuits. Each of the 20 billion 
neurons of the neocortex have, on average, 7000 synaptic connec- 
tions to other neurons, with some neurons making up to 100,000 
synaptic connections to other neurons. Remarkably, it is estimated 
that each neuron is able to contact any other neuron with no more 
than six interneuronal connections. Thus, any two neurons are no 
more than six connections from one another. 


CASK 


Learning and the acquisition of long-term memory involve 
plasticity—in other words, the ability to change the structures and 
functions of synaptic connections. Plasticity may include the remod- 
eling and strengthening of existing synapses as well as the formation 
of new synapses. In all cases, synaptic function requires that postsyn- 
aptic neurotransmitter receptors on the dendrite be precisely aligned 
opposite chemically matched presynaptic vesicles in the axon. This 
alignment is achieved with a complex scaffolding network composed 
of presynaptic and postsynaptic partner proteins that associate 
across the cleft. The best understood of such partners are presynaptic 
neurexins and postsynaptic neuroligins (figure). Neurexins were first 
identified as the receptors for a-latrotoxin, the venom protein of the 
black widow spider. The mammalian genome contains only three 


SH3 GuK 


Presynaptic 
membrane 


| AE 


Trans-synaptic complexes of scaffolding proteins anchor and stabilize the synapse and, by virtue of their 
associated intracellular functional domains, also activate a variety of signaling processes that regulate synaptic 
function in both neurons. (Adapted with permission from Südhof, T., 2008. Neuroligins and neurexins link 
synaptic function to cognitive function. Nature 455:903-911.) 


(Continued) 
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neurexin genes, but extensive alternative splicing at five positions gener- 
ates thousands of neurexin isoforms. Neurexins partner with a variety 
of postsynaptic scaffolding proteins besides the neuroligins, and some 
neurexin-partner associations involve soluble secreted bridge proteins 
such as the 35 kDa cerebellin precursor protein, Cblni. Emerging 
research also is identifying other presynapse-postsynapse scaffolding 
protein pairs, including the presynaptic LAR protein tyrosine phos- 
phatase receptor and its partner the postsynaptic netrin G ligand-3. 
The involvement of trans-synaptic scaffold protein interactions 
such as these in learning and memory is becoming increasingly 
apparent. For example, Nobelist Eric Kandel and his colleagues 
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have shown recently that expression of an autism-linked variant of 
neuroligin in Aplysia (large, shell-less gastropod molluscs commonly 
known as sea slugs) blocks both intermediate and long-term memory 
formation while having no effect on basal synaptic transmission. In 
other words, neuroligin defects can lead to sluggish learning. 
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At least five subclasses of glutamate receptors are known. The best understood of 


+l these excitatory receptors is the N-methyl-D-aspartate (NMDA) receptor, a ligand- 


H— C— CH; — COOT 
| 


COO- 
N-Methyl-p-aspartate (NMDA) 


Phencyclidine 


gated channel that, when open, allows Ca** and Na* to flow into the cell and K* to 
flow out of the cell. Phencyclidine (PCP) is a specific antagonist of the NMDA recep- 
tor (Figure 32.59). Phencyclidine was once used as an anesthetic agent, but legitimate 
human use was quickly discontinued when it was found to be responsible for bizarre 
psychotic reactions and behavior in its users. Since this time, PCP has been used ille- 
gally as a hallucinogenic drug, under the street name of angel dust. Sadly, it has 
caused many serious, long-term psychological problems in its users. 


32.7) y-Aminobutyric Acid and Glycine Are Inhibitory Neurotransmitters 


Certain neurotransmitters, acting through their conjugate postsynaptic receptors, 
inhibit the postsynaptic neuron from propagating nerve impulses from other neu- 
rons. Two such inhibitory neurotransmitters are y-aminobutyric acid (GABA) and 


Figure 32.59 NMDA receptors assemble as 
heterotetramers, with two (glycine-binding) GluN1 
and two (glutamate-binding) GluN2 subunits. 

The architecture of the receptor consists of amino- 
terminal domains (ATDs) and ligand-binding 
domains (LBDs) on the extracellular side of the 
membrane, a transmembrane domain (TMD) and 
a carboxy-terminal domain (not shown). NMDA 
receptors are Na* and Ca?* channels. They are 
stimulated by NMDA, inhibited by phencyclidine, 
and regulated by Zn** and glycine (PDB id = 
4TLM). 
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(b) 


Figure 32.60 GABA (y-aminobutyric acid) and glycine are 
inhibitory neurotransmitters that activate chloride channels. 
Influx of CI” causes a hyperpolarization of the postsynaptic 
membrane. GABA receptors are similar in many respects to 
nicotinic acetylcholine receptors, with a.By6 stoichiometry. 
A pentamer of 83 subunits is shown here (PDB id = 
4COF). (a) Side view; (b) top view; (c) the pentameric 
transmembrane domain. 


glycine. These agents make postsynaptic membranes permeable to chloride ions and 
cause a net influx of Cl, which in turn causes hyperpolarization of the postsynaptic 
membrane (making the membrane potential more negative). Hyperpolarization of a 
neuron effectively raises the threshold for the onset of action potentials in that 
neuron, making the neuron resistant to stimulation by excitatory neurotransmitters. 
These effects are mediated by the GABA and glycine receptors, which are ligand- 
gated chloride channels (Figure 32.60). GABA is derived by a decarboxylation of 
glutamate (Figure 32.61) and appears to operate mainly in the brain, whereas glycine 
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Figure 32.61 Glutamate is converted to GABA by glutamate decarboxylase. GABA is degraded 
by the action of GABA-glutamate transaminase and succinate semialdehyde dehydrogenase to 
produce succinate. 
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Human Biochemistry 


The Biochemistry of Neurological Disorders 


Defects in catecholamine processing are responsible for the 
symptoms of many neurological disorders, including clinical de- 
pression (which involves norepinephrine [NE]) and parkinsonism 
(involving dopamine [DA]). Once these neurotransmitters have 
bound to and elicited responses from postsynaptic membranes, 
they must be efficiently cleared from the synaptic cleft (Figure 1). 
Clearing can occur by several mechanisms. NE and DA transport 
or reuptake proteins exist both in the presynaptic membrane and 
in nearby glial cell membranes. On the other hand, catecholamine 
neurotransmitters can be metabolized and inactivated by two 
enzymes: catechol-O-methyl-transferase in the synaptic cleft and 
monoamine oxidase in the mitochondria (Figure 2). Catechol- 
amines transported back into the presynaptic neuron are accu- 
mulated in synaptic vesicles by the same H*-ATPase/H*-ligand 
exchange mechanism described for glutamate. Clinical depres- 
sion has been treated by two different strategies. Monoamine 
oxidase inhibitors act as antidepressants by increasing levels of 
catecholamines in the brain. Another class of antidepressants, 
the tricyclics, such as desipramine (Figure 3a), act on several 
classes of neurotransmitter reuptake transporters and facilitate 


Mitochondrion 


Monoamine oxidase 


Presynaptic 
neuron 


more prolonged stimulation of postsynaptic receptors. Prozac 
is a more specific reuptake inhibitor and acts only on serotonin 
reuptake transporters. 

Parkinsonism is characterized by degeneration of dopaminergic 
neurons, as well as consequent overproduction of postsynaptic 
dopamine receptors. In recent years, Parkinson’s patients have been 
treated with dopamine agonists such as bromocriptine (Figure 3b) to 
counter the degeneration of dopamine neurons. 

Catecholaminergic neurons are involved in many other in- 
teresting pharmacological phenomena. For example, reserpine 
(Figure 3c), an alkaloid from a climbing shrub of India, is a 
powerful sedative that depletes the level of brain monoamines 
by inhibiting the H*—monoamine exchange protein in the mem- 
branes of synaptic vesicles. 

Cocaine (Figure 3d), a highly addictive drug, binds with high 
affinity and specificity to reuptake transporters for the monoamine 
neurotransmitters in presynaptic membranes. Thus, at least one of 
the pharmacological effects of cocaine is to prolong the synaptic 
effects of these neurotransmitters. 


Postsynaptic 
neuron 


Receptor 


Tranylcypromine 


Norepinephrine @ 


Desipramine 


Norepi- 
nephrine 
receptor 


Figure 1 The sites of action of drugs used to treat neurological disorders. The pathway for 
reuptake and vesicular repackaging of the catecholamine neurotransmitters. The sites of action of 
desipramine, tranylcypromine, and reserpine are indicated. 


(Continued) 
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Figure 2 The actions of catechol-O-methyltransferase and monoamine oxidase. 
Norepinephrine can be degraded in the synaptic cleft by catechol-O-methyltransferase 
or in the mitochondria of presynaptic neurons by monoamine oxidase. 
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Figure 3 (a) The structures of tranylcypromine, desipramine, and Prozac. (b) The structure 

of bromocriptine. (c) The structure of reserpine. (d) The structure of cocaine. (e) The structure 
of Cymbalta (duloxetine), a widely prescribed antidepressant. (f) The structure of Xanax, a 
benzodiazapine used in the treatment of epilepsy, insomnia, anxiety, and panic disorder. 

(g) The structure of propofol, a widely used anesthetic. 
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Figure 32.63 The pathway for the synthesis of 


catecholamine neurotransmitters. Dopa, dopamine, 


noradrenaline, and adrenaline are synthesized 
sequentially from tyrosine. 


O 
Strychnine 


Figure 32.62 Strychnine binds strongly to glycine receptors. 


acts primarily in the spinal cord. The glycine receptor has a specific affinity for the 
convulsive alkaloid strychnine (Figure 32.62). The effects of ethanol on the brain arise 
in part from the opening of GABA receptor Cl" channels. 

GABA receptors assemble as pentamers from a family of homologous subunits 
encoded by 19 different genes: al to 6, B1 to 3, yl to 3, 6, €, 0, m, and pl to 3. The 
physiologically active heteromers usually contain two a subunits, two $ subunits, and a y 
subunit. Interestingly, 83 subunits can assemble into functional homomeric channels 
(Figure 32.60). Each subunit includes an extracellular domain of 200 to 250 residues, a 
transmembrane domain of four a-helices, and an intracellular domain of 85 to 255 resi- 
dues. Neurotransmitters bind in extracellular pockets between the subunits. GABA recep- 
tors are the targets of a wide range of drugs, including benzodiazapines (for example, 
Valium and Xanax) used in the treatment of epilepsy, insomnia, anxiety and panic disor- 
der, and the intravenous general anesthetic propofol. 


32.7k The Catecholamine Neurotransmitters Are Derived from Tyrosine 


Epinephrine, norepinephrine, dopamine, and L-dopa (3,4-dihydroxyphenylalanine) are 
collectively known as catecholamine neurotransmitters. These compounds are syn- 
thesized from tyrosine (Figure 32.63), both in sympathetic neurons and in the adrenal 
glands. They function as neurotransmitters in the brain and as hormones in the circu- 
latory system. However, these two pools operate independently, thanks to the blood- 
brain barrier, which permits only very hydrophobic species in the circulatory system 
to cross over into the brain. Hydroxylation of tyrosine (by tyrosine hydroxylase) to 
form L-dopa is the rate-limiting step in this pathway. Dopamine, a crucial catechol- 
amine involved in several neurological diseases, is synthesized from L-dopa by a pyri- 
doxal phosphate-dependent enzyme, dopa decarboxylase. Subsequent hydroxylation 
and methylation produce norepinephrine and epinephrine (Figure 32.63). The methyl 
group in the final reaction is supplied by S-adenosylmethionine. 

Each of these catecholamine neurotransmitters is known to play a unique role in 
synaptic transmission. The neurotransmitter in junctions between sympathetic nerves 
and smooth muscle is norepinephrine. On the other hand, dopamine is involved in other 
processes. Either excessive brain production of dopamine or hypersensitivity of dopa- 
mine receptors is responsible for psychotic symptoms and schizophrenia, whereas low- 
ered production of dopamine and the loss of dopamine neurons are important factors 
in Parkinson’s disease. 


32.71 Various Peptides Also Act as Neurotransmitters 


Many relatively small peptides have been shown to possess neurotransmitter activity 
(see Table 32.3). One of the challenges of this field is that the known neuropeptides 
may represent a very small subset of the neuropeptides that exist. Another challenge 
arises from the small in vivo concentrations of these agents and the small number of 
receptors that are present in neural tissue. Physiological roles for most of these pep- 
tides are complex. For example, the endorphins and enkephalins are natural opioid 
substances and potent pain relievers. The endothelins are a family of homologous 
regulatory peptides, synthesized by certain endothelial and epithelial cells that act on 
nearby smooth muscle and connective tissue cells. They induce or affect smooth mus- 
cle contraction; vasoconstriction; heart, lung, and kidney function; and mitogenesis 
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and tissue remodeling. Vasoactive intestinal peptide (VIP) produces a G protein- 
adenylyl cyclase—mediated increase in cAMP, which in turn triggers a variety of pro- 
tein phosphorylation cascades, one of which leads to conversion of phosphorylase b 
to phosphorylase a, stimulating glycogenolysis. Moreover, VIP has synergistic effects 
with other neurotransmitters, such as norepinephrine. In addition to increasing 
cAMP levels through B-adrenergic receptors, norepinephrine acting at a,-adrenergic 
receptors markedly stimulates the increases in cAMP elicited by VIP. Many other 
effects have also been observed. For example, injection of VIP increases rapid eye 
movement (REM) sleep and decreases waking time in rats. VIP receptors exist in 
regions of the central nervous system involved in sleep modulation. 


32.7m Some lon Channels Respond to Physical Stimuli Rather 
Than Chemicals 


There are other ion channels involved in neurotransmission that are not chemical 
receptors but rather are sensors of physical stimuli, including temperature, voltage, 
pH, redox status, and mechanical phenomena. These include the transient receptor 
potential (TRP) channels and the mechano-sensing Piezo channels. 

Piezo channels are mechanically activated proteins that enable a cell to sense force by 
allowing ions to pass through the cell membrane in response to mechanical stimuli. 
Piezo channels are found in many vertebrates, invertebrates, and plants. Piezo proteins 
have key roles in many physiological and developmental processes that involve mechano- 
sensitivity, including breathing, touch, and vascular development. The best-characterized 
of the mammalian Piezo proteins is Piezol (Figure 32.64). Piezol is found in the lungs, 
bladder, and skin of mammals as well as in red blood cells. A homologous protein, 
Piezo2, is found in sensory neurons. Piezol has been implicated in extrusion of epider- 
mal cells and skin homeostasis, preventing excess proliferation of skin tissue. It is also 
involved in cancer biology and plays a role in metastases. 

TRP receptors are found in mammals, amphibians, birds, and reptiles. The TRP 
superfamily in mammals is composed of 28 members divided into six subfamilies. TRP 
channels consist of four subunits, each containing six transmembrane helical segments, 
and virtually all conduct Ca**. TRPM8 (Figure 32.64) is the cold- and menthol-sensing 
receptor in animals and plays a role in migraine pain sensing. Animals feel cold when a 
decrease in temperature triggers the opening of TRPM8 channels and Ca?* conduction. 
Variation of protein sequence in TRPM6§ results in different triggering temperatures for 
the channel and enables animals such as penguins and elephants to adapt to low tem- 
peratures. Wild giant pandas in the Qinling mountains of China roll in fresh horse 
droppings when the temperature decreases. The horse manure contains caryophyllenes, 
sesquiterpenes that bind to TRPM8 and prevent channel opening, presumably contrib- 
uting to cold tolerance by the pandas. 


(a) 


Figure 32.64 Structures of ion 
channels that respond to physical 
stimuli. (a) Cryo-EM structure of 
Piezol. This protein is unusually large 
compared with most proteins, 
consisting of 2547 amino-acid residues. 
It is unrelated structurally to most 
other proteins, and forms a 
homotrimer in the membrane with 
three propellor blades around a central 
core. Remarkably, each subunit of the 
trimer possesses 38 transmembrane 
helices (PDB id = 5Z10). (b) Cryo-EM 
structure of TRPM§8, which exists as 

a homotetramer in the membrane, each 
subunit having six transmembrane 
a-helices (PDB id = 6BPQ). Images in 
parts (a) and (b) are looking through 
the central transport channel. 
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Foundational Biochemistry Things You Should Know after Reading Chapter 32. 


Hormones regulate metabolic processes of tissues and organs as well 
as facilitate and control growth and differentiation, reproductive 
activities, learning, and memory. 


Hormones help the organism cope with changing conditions and 
stresses in the environment. 


The effects of hormonal signals are either metabolic responses or gene 
expression responses. 


Steroid hormones act either at the plasma membrane or in the nucleus 
of eukaryotic cells. 


Peptide hormones are often synthesized as inactive precursors that 
are activated by proteolysis. 


Signal transduction systems typically involve receptor proteins and 
signalsomes, covalent modifications of proteins and lipids, protein 
interaction domains, second messengers, and intracellular signaling 
pathways. 


Signal transduction systems are modulated by cooperativity, spatial 
and temporal control, signal integration, converging and diverging 
networks, signal amplification, and desensitization and adaptation. 


Many signaling pathways involve enzyme cascades. 


Signaling pathways connect membrane interactions with events in 
the nucleus. 


Signaling pathways depend on multiple molecular interactions. 


All receptors that mediate transmembrane signaling belong to one of 
three receptor superfamilies—the G-protein—coupled receptors, the 
single-transmembrane segment catalytic receptors, and the oligomeric 
ion channels. 


The G-protein-coupled receptors are 7-TMS integral membrane 
proteins. 


The single TMS receptors are guanylyl cyclases or tyrosine kinases. 
RTKs and RGCs are membrane-associated allosteric enzymes. 
The EGF receptor is activated by ligand-induced dimerization. 


The insulin receptor mediates several signaling pathways that regu- 
late blood glucose levels and promote the synthesis and storage of 
carbohydrates, proteins and lipids. 


Receptor guanylyl cyclases mediate effects of natriuretic hormones. 


Soluble (non-receptor) tyrosine kinases and guanylyl cyclases, analo- 
gous to their receptor counterparts, also participate in signaling 
pathways. 


Soluble guanylyl cyclases are receptors for nitric oxide, a gaseous 
second messenger. 


GPCR signals are transduced by GTP-binding proteins (G proteins). 


Second messengers involved in signaling pathways include cyclic 
AMP, cyclic GMP, Ca?*, IP}, NO, and a variety of lipid breakdown 
products. 


Ras and other small GTP-binding proteins are proto-oncogene 
products. 


Specific phospholipases release second messengers including DAG, 
the inositol phosphates, ceramide, and unsaturated fatty acids such 
as arachidonic acid. 


Activation of phospholipase C is mediated by G proteins or by tyro- 
sine kinases. 


Intracellular calcium-binding proteins mediate the calcium signal. 


Transduction of hormonal signals activates effectors—protein ki- 
nases and protein phosphatases—that elicit a variety of intracellular 
actions that regulate cellular functions. 


Most protein kinases share a common catalytic core first characterized 
in protein kinase A. 


The protein kinase C isozymes are regulated by Ca?*, diacylglycerol, 
and phosphatidylserine. 


Protein tyrosine kinase pp60°* is regulated by phosphorylation/ 
dephosphorylation. 


G-Protein signaling is modulated by RGS/GAPs. 
GPCR desensitization by arrestin leads to new signal pathways. 


Nerve impulses are carried by action potentials—changes in ion gra- 
dients mediated by the flows of Na* and K*—that are propagated 
in neurons. 


Neurotransmitters enable neuron communication at synapses by 
binding to receptor proteins such as the nicotinic- and muscarinic- 
acetylcholine receptors. 


Glutamate and aspartate are excitatory amino acid neurotransmitters. 
y-Aminobutyric acid and glycine are inhibitory neurotransmitters. 
Catecholamine neurotransmitters are derived from tyrosine. 


Peptides such as endorphins, enkephalins, endothelins, and the 
vasoactive intestinal peptide also function as neurotransmitters. 


Answers to all problems are at the end of this book. Detailed solutions 
are available in the Student Solutions Manual. 


1. Assessing the Features and Physiology of Hormones Compare and 


contrast the features and physiological advantages of each of the 
major classes of hormones, including the steroid hormones, polypep- 
tide hormones, and the amino acid—derived hormones. 


. Understanding the Features and Physiology of the Second Messengers 


Compare and contrast the features and physiological advantages of 
each of the known classes of second messengers. 


. Are There Gaseous Second Messengers Besides NO? Nitric oxide 


may be merely the first of a new class of gaseous second messenger/ 
neurotransmitter molecules. Based on your knowledge of the 


molecular action of nitric oxide, suggest another gaseous molecule 
that might act as a second messenger, and propose a molecular 
function for it. 


Assessing the Action of an Antibiotic Herbimycin A is an antibiotic 
that inhibits tyrosine kinase activity by binding to SH groups of 
cysteine in the src gene tyrosine kinase and other similar tyrosine 
kinases. What effect might it have on normal rat kidney cells 

that have been transformed by Rous sarcoma virus? What other 
effects might this interesting antibiotic have? 


Proposing Uses for Monoclonal Antibodies Against Phosphotyrosine 
Monoclonal antibodies that recognize phosphotyrosine are commer- 
cially available. How could such an antibody be used in studies of cell 
signaling pathways and mechanisms? 
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13. 
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15. 


16. 


17. 


. Studying Phosphorylated Proteins ATP [y-°P] is a commercially 


available compound. (a) Describe the structure of this compound. 
(b) Design an experiment to measure kinase activity using this 
compound. 


. Are there Functions of Hormone Receptors Other Than Signal 


Amplification? Explain and comment on this statement: The main 
function of hormone receptors is that of signal amplification. 


. Determining the Concentration of a Neurotransmitter in a Synaptic 


Vesicle Synaptic vesicles are approximately 40 nm in outside 
diameter, and each vesicle contains about 10,000 acetylcholine 
molecules. Calculate the concentration of acetylcholine in a 
synaptic vesicle. 


. Imagining a Model for Neurotransmitter Release GTPYS is a 


nonhydrolyzable analog of GTP. Experiments with squid giant 
axon synapses reveal that injection of GTyS into the presynaptic 
end (terminal) of the neuron inhibits neurotransmitter release 
(slowly and irreversibly). The calcium signals produced by 
presynaptic action potentials and the number of synaptic vesicles 
docking on the presynaptic membrane are unchanged by GTPyS. 
Propose a model for neurotransmitter release that accounts for all 
of these observations. 


Assessing the Binding of a Hormone to Its Receptor A typical 
hormone binds to its receptor with an affinity (Kp) of approximately 
1 X 10~? M. Consider an in vitro (test-tube) system in which the total 
hormone concentration is approximately 1 nM and the total 
concentration of receptor sites is 0.1 nM. What fraction of the 
receptor sites is bound with hormone? If the concentration of 
receptors is decreased to 0.033 nM, what fraction of receptor is 
bound with the hormone? 


Epinephrine and Glucagon Glucagon and epinephrine signaling 
activates the same G protein type in liver cells. What signal is 
transmitted by glucagon? By epinephrine? How do these signals lead 
to similar effects inside a cell? 


Assessing the Effects of Statins on Steroid Hormones (Integrates 
with Chapter 24.) All steroid hormones are synthesized in the 
human body from cholesterol. What is the consequence for 
steroid hormones and their action from taking a statin drug, such 
as Zocor, which blocks the synthesis of cholesterol in the body? 
Are steroid hormone functions compromised by statin action? 


Reconciling the Apparent Preference for Helices in Transmembrane 
Proteins Given that B-strands provide a more genetically economical 
way for a polypeptide to cross a membrane, why has nature chosen 
a-helices as the membrane-spanning motif for G-protein—coupled 
receptors? That is, what other reasons can you imagine to justify the 
use of a-helices? 


a-Helices vs B-Sheets in Transmembrane Receptors What is the 
functional benefit of a-helix structural elements compared to B-sheet 
structural elements in the transmembrane portion of a transmem- 
brane receptor? 


Understanding Reaction Mechanisms for Adenylyl Cyclase and 
Phosphodiesterases Write simple reaction mechanisms for the 
formation of cAMP from ATP by adenylyl cyclase and for the 
breakdown of cAMP to 5'-AMP by phosphodiesterases. 


Receptors and ATP Kinases in signaling pathways depend on ATP. 
Which other signaling pathway components depend (indirectly) on 
the presence of ATP? 


Intervention in Cell Signaling You have been tasked with developing 
an enzyme inhibitor to decrease the production of cAMP (see Figures 
32.22 and 32.23). List at least three potential targets. For each target, 
evaluate whether the inhibition (and the subsequent reduction in 
cAMP levels) would be selective or affect multiple cell signaling 
pathways. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 
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Problems 


Determining the Equilibrium Transmembrane Potential for K* and Na* 
(Integrates with Chapter 9.) Consider the data in Figure 32.49a. 
Recast Equation 9.2 to derive a form from which you could 
calculate the equilibrium electrochemical potential at which no net 
flow of potassium would occur. This is the Nernst equation. 
Calculate the equilibrium potential for K* and also for Na‘, 
assuming T = 37°C. 


Calculating Transmembrane Potential Using the Goldman Equation 
The calculation of the actual transmembrane potential difference for 
a neuron is accomplished with the Goldman equation: 


RT LP[Coursiae Ey ert a) 
Ay = l 
y F ği G + PALA ouside 


where [C] and [A] are the cation and anion concentrations, respec- 
tively, and Pc and Px are the respective permeability coefficients of 
cations and anions. 

Assume relative permeabilities for K*, Na+, and Cl” of 1, 0.04, 
and 0.45, respectively, and use this equation to calculate the actual 
transmembrane potential difference for the neuron whose ionic 
concentrations are those given in Figure 32.49a. 


Understanding the Movement of K+, Na+, and CI- During an Action 
Potential Use the information in problems 18 and 19, together with 
Figure 32.50, to discuss the behavior of potassium, sodium, and 
chloride ions as an action potential propagates along an axon. 


Action Potential Trigger The action potential in a neuron is an 
all-or-nothing phenomenon: Once the membrane is depolarized by 
20 mV, the action potential can commence. What triggers this initial 
small depolarization? 


Assessing Signal Amplification in a Signaling Pathway Review the cell 
signaling pathway shown in Figure 32.4. With the rest of the chapter 
as context, discuss all the steps of this pathway that involve signal 
amplification. 


Understanding the Consequences of a Ras Mutation on a Signaling 
Pathway In the cell signaling pathway shown in Figure 32.4, what 
would be the effect if Ras were mutated so that it had no GTPase 
activity? 


Understanding the Action of GPCRs in G-Protein — Independent 
Pathways One of the topics discussed in this chapter is the ability 

of GPCRs to exert signaling effects without the involvement of 

G proteins. Using the pathway shown in Figure 32.40, and consider- 
ing everything you have learned in this chapter, suggest some reasons 
that would explain why this G-protein—independent signaling was 
difficult to verify experimentally. 


Pharmacology and GABA Receptors Why does it make pharmacologi- 
cal sense that agonists (activators) of GABA receptors are used to 
treat conditions like epilepsy, insomnia, and anxiety? (Hint: See 
Section 32.7j.) 


Assessing the Action of a Popular Pesticide Malathion (Figure 32.58) 
is one of the secrets behind the near-complete eradication of the boll 
weevil from cotton fields in the United States. For most of the 
twentieth century, boll weevils wreaked havoc on the economy of 
states from Texas to the Carolinas. When boll weevils attacked cotton 
fields in a farming community, the destruction of cotton plants meant 
loss of jobs for farm workers, bankruptcies for farm owners, and 
resulting hardship for the entire community. Relentless application of 
malathion to cotton crops and fields has turned the tide, however, and 
agriculture experts expect that boll weevils will be completely gone 
from cotton fields within a few years. Remarkably, malathion-resistant 
boll weevils have not emerged despite years of this pesticide’s use. 
Consider the structure and chemistry of malathion, and suggest what 
you would expect to be the ecological consequences of chronic 
malathion application to cotton fields. 
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27. Assessing the Structural Requirements for a Regulatory Protein in a 
Signaling Network Consult the excellent review article “Assembly of 
Cell Regulatory Systems Through Protein Interaction Domains” 
(Science 300:445—452, 2003, by Pawson and Nash), and discuss the 
structural requirements for a regulatory protein operating in a 
signaling network. 


PSAFE Problems 


(PSAFE models, listed by chapter or protein name, can be found at http:// 
www.psafeonline.net.) 


28. Using the PSAFE Model for Brca2 (the protein encoded by the 
BRCA2 gene associated with breast cancer susceptibility), discuss 
how a mutation in a BRC motif could cause breast cancer. 


Chapter 32 The Reception and Transmission of Extracellular Information 


29. Using the PSAFE Model for c-Abl kinase, explain how Gleevec 
functions as a therapy for chronic myelogenous leukemia patients. 


Think-Pair-Share Question 


Binding of Hormones to Receptors A cell receives two hormonal signals 
simultaneously: (1) insulin signaling to promote glycogen synthesis and 
(2) epinephrine signaling to promote glycogen breakdown. What would be 
the ultimate effect inside the cell? List at least six factors that could 
determine the overall balance between these two opposing signals. 
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Detailed answers to the end-of-chapter problems and additional problems 


to solve are available in the Student Solutions Manual, Study Guide and 
Problems Book. 


Chapter 1 


1. 


10. 


11. 


12; 


a. The amino acid side chains of proteins provide a range of 
shapes, polarity, and chemical features that allow a protein to 
be tailored to fit almost any possible molecular surface in a 
complementary way. 


b. Immunoglobulin is hundreds of times bigger than alanine. It 
would take only a very small fraction of the surface area of 
IgG to accommodate alanine. 


. Biopolymers may be informational molecules because they 


are constructed of different monomeric units (“letters”) 
joined head to tail in a particular order (“words, sentences”). 
Polysaccharides are often linear polymers composed of only 
one (or two repeating) monosaccharide unit(s) and thus 
display little information content. Polysaccharides with a 
variety of monosaccharide units may convey information 
through specific recognition by other biomolecules. Also, 
most monosaccharide units are typically capable of forming 
branched polysaccharide structures that are potentially very 
rich in information content (as in cell surface molecules that 
act as the unique labels displayed by different cell types in 
multicellular organisms). 


Abbreviated Answers to Problems 


. Informational polymers must have “sense” or direction, and they 


must be composed of more than one kind of monomer unit. 


. DNA molecules essentially have only one function: “store infor- 


mation on how to make RNA and protein molecules.” Proteins 
are the “workhorses” of our biochemistry; the multitude of bio- 
chemical events happening in our cells are mediated by a multi- 
tude of different proteins; requiring them to adopt a multitude of 
different shapes. 


. Molecular recognition is based on structural complementarity. 


If complementary interactions involved covalent bonds (strong 
forces), stable structures would be formed that would be less 
responsive to the continually changing dynamic interactions 
that characterize living processes. 


. a. Raise the temperature. 


b. The kinetic energy may overwhelm the energy of the weak 
forces and disrupt interactions between molecules. 


c. Generally speaking, the opposite of b. 


. 1,000 kg of monomer would correspond to 5,000 mol monomer. 


(5,000-1) mol linkages would have been made with release 
of (5,000-1) mol of water. 4,999 mol water corresponds to 
89,982 g water. 89,982 g water would correspond to 89.982 L 
water. 


. As this compound has both an amine and a carboxylic 


acid functional group, it can make repeated amide bonds 
(as shown in Figure 1.9) with other molecules of its kind. 


Living systems are maintained by a continuous flow of matter 
and energy through them. Despite the ongoing transformations of 
matter and energy by these highly organized, dynamic systems, no 
overt changes seem to occur in them: They are in a steady state. 


(0) O (0) O (0) 
Ho | H Hy | Hf Hy | H\ Bo | H Ho Il 
HN —CG—O— CH,— C— N— C— O— CHy— C— N-F C— O— CHy— C— NF C— OCH C— N— C— O— CH,— C— OH 
n 
. The long-chain alkanes that make up the structure of calcium 13. Because bacteria (compared with humans) have simple nutri- 
stearate do not contain any functional groups that could interact tional requirements, their cells obviously contain enzyme systems 
favorably with water. This inability to interact with water will that allow them to convert rudimentary precursors (even inor- 
lead to a decreased solubility and increased tendency to separate ganic substances such as NH,*, NO3 , N2, and CO;) into com- 
from water (the hydrophobic effect). plex biomolecules—proteins, nucleic acids, polysaccharides, and 
Two carbon atoms interacting through van der Waals forces are complex lipids. On the other hand, animals have an assortment 
0.34 nm apart; two carbon atoms joined in a covalent bond are of different cell types designed for specific physiological func- 
0.154 nm apart. tions; these cells possess a correspondingly greater repertoire of 
: : Suh : complex biomolecules to accomplish their intricate physiology. 
Slight changes in temperature, pH, ionic concentrations, and so i 
forth may be sufficient to disrupt weak forces (H bonds, ionic 14. Consult Figures 1.19 to 1.21 to confirm your answer. 
bonds, van der Waals interactions, hydrophobic interactions). 15. a. Laid end to end, 250 E. coli cells would span the head of a pin. 


b. The volume of an E. coli cell is about 10715 L. 


c. The surface area of an E. coli cell is about 6.3 X 107”? m?. Its 
surface-to-volume ratio is 6.3 X 106 m™!. 


d. 600,000 molecules. 
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A-2 Abbreviated Answers to Problems 


e. 1.7 nM. Think-Pair-Share Question: 

f. Because we can calculate the volume of one ribosome to be OH OH OH 
4.2 X 10-74 m? (or 4.2 X 107”! L), 15,000 ribosomes would 
occupy 6.3 X 1071 L, or 6.3% of the total cell volume. 

g. Because the E. coli chromosome contains 4600 kilobase pairs 
(4.6 X 10° bp) of DNA, its total length would be 1.6 mm— 
approximately 800 times the length of an £E. coli cell. 


h. This DNA would encode 4300 different proteins, each 
360 amino acids long. OH 


OH OH 


OH 


16. a. The volume of a single mitochondrion is about 4.2 x 107! L 
(about 40% the volume calculated for an Æ. coli cell in problem 3). NH, 
b. A mitochondrion would contain on average fewer than eight OH 
molecules of oxaloacetate. 


17. a. Laid end to end, 25 liver cells would span the head of a pin. 


b. The volume of a liver cell is about 8 x 107!? L (8000 times the 


volume of an E. coli cell). Chapter 2 


1. a. 3.3; b. 9.85; c. 5.7; d. 12.5; e. 4.4; f. 6.97. 
2. a. 1.26 mM; b. 0.25 uM; c. 4 X 10°? M; d. 2 x 1074 M; 


c. The surface area of a liver cell is 2.4 X 107° m?. 


d. Its surface-to-volume ratio is 3 X 10° m~!, or about 0.05 (1/20) 
that of an E. coli cell. 
Cells with lower surface-to-volume ratios are limited in their 
exchange of materials with the environment. 

e. The number of base pairs in the DNA of a liver cell is 
6.06 X 10° bp. 

f. The total length of the DNA in liver cell is 2.06 m (or more 
than 6 feet of DNA!) contained within a cell that is only 
20 um on a side. 

g. Maximal information content of liver cell DNA = 3.03 X 10° bp, 
which, expressed in proteins 400 amino acids in length, could 
encode 2.5 X 10° proteins. 


e. 3.16 X 107! M; f. 1.26 X 1077 M (0.126 uM). 


3. The scientist is incorrect. Given the values of pH and pK, and 


using the Henderson—Hasselbalch equation one can determine 
that the ratio of uncharged species to charged species is 1/100. 
Thus about 1% would be in the uncharged form and about 99% 
would be in the charged form. 


4. Structure C, containing the carboxylic acid functional group 


—COOH is the acidic compound. 


5. When acids dissociate in water, water molecules bind the released 


protons (H,0 —> H;0*). Water is acting as a base. An organic 
molecule like liquid hexane does not have basic properties and 
cannot pick up any released protons. Thus, in this type of envi- 


18. a. The fraction of the M. genitalium genes encoding proteins = ronment the pK, values of acids would be much higher (the acids 
0.925. would be much less acidic). 
b. Genes not encoding proteins encode RNA molecules. 6. a. [H*] = 2.51 X 1075 M; b. K, = 3.13 X 1078 M; 
c. (0.925)(580,074 base pairs) = 536,820 base pairs devoted to pK, = 7.5. 
protein-coding genes. 7. a. pH = 2.38; b. pH = 4.23. 
d. Since three base pairs specify an amino acid in a protein, 8. Combine 187 mL of 0.1 M acetic acid with 813 mL of 0.1 M 
369 amino acids are found in the average M. genitalium protein. sodium acetate. 
e. If each amino acid contributes on average 120 D to the mass 9. [HPO,-7)/[H2PO,] = 0.398. 
of a protein, the mass of an average M. genitalium protein is 10. Combine 555.7 mL of 0.1 M NaPO; with 444.3 mL of 0.1 M 
44,280 D. H3PO,. Final concentrations of ions will be [H,PO, ] = 
19. (0.925)(206) = 191 proteins. Assuming its genes are the same size 0.0333 M; [HPO,?-] = 0.0667 M; [Na+] = 0.1667 M; [H+] = 
as M. genitalium, the minimal genome would be 228,480 base 3.16 X 1078 M. 
pairs. 11. a. Fraction of H;PO,: @pH 0 = 0.993; @pH 2 = 0.58; @pH 4 = 
20. Given 1109 nucleotides (or base pairs) per gene, the minimal 0.01; negligible @pH 6. 
virus, with a 3500-nucleotide genome, would have only three b. Fraction of H,PO,~: @pH 0 = 0.007; @pH 2 = 0.41; 
genes; the maximal virus, with a 280,000-bp genome, would have @pH 4 = 0.986; @pH 6 = 0.94; @pH 8 = 0.14; negligible 
252 genes. @pH 10. 
21. Fate of proteins synthesized by the rough ER: c. Fraction of HPO,2~: negligible @pH 0, 2, and 4; @pH 6 = 


a. Membrane proteins would enter the ER membrane, and, as 
part of the membrane, be passed to the Golgi, from which 
vesicles depart and fuse with the plasma membrane. 


b. A secreted protein would enter the ER lumen and be trans- 
ferred as a luminal protein to the Golgi, from which vesicles 
depart. When the vesicle fuses with the plasma membrane, 
the protein would be deposited outside the cell. 


0.06; @pH 8 = 0.86; @pH 10 ~ 1.0; @pH 12 = 0.72. 
d. Fraction of PO,3~: negligible at any pH < 10; @pH 12 = 0.28. 


12. At pH 5.2, [H;A] = 4.33 x 10-5 M; [HA7] = 0.0051 M; 


[HA?-] = 0.014 M; [A?-] = 0.0009 M. 


13. a. pH = 7.02; [H,PO,"] = 0.0200 M; [HPO,2-] = 0.0133 M. 


b. pH = 7.38; [H»PO,7] = 0.0133 M; [HPO] = 0.0200 M. 


14. The increase in CO, levels in the circulation creates acidosis. 


The kidneys try to counteract this situation by reabsorbing 
bicarbonate as a base to keep the pH balanced. 
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15. [H,CO3] = 2.2 uM; [COxa] = 0.75 mM. When [HCO;"] = 


15 mM and [COxa] = 3 mM, pH = 6.8. 

Titration of the fully protonated form of anserine will require 
the addition of three equivalents of OH~. The pK, values lie 
at 2.64 (COOH); 7.04 (imidazole-N*H); and 9.49 (NH;*). 

Its isoelectric point lies midway between pK, and pK3, so 

pH, = 8.265. To prepare 1 L of 0.04 M anserine buffer, add 
164 mL of 0.1 M HCI to 400 mL of 0.1 M anserine at its 
isoelectric point and make up to 1 L final volume. 

Eis 

18. 166.7 g/mole. 

19. Add 193 mL of water and 307 mL of 0.1 M HCl to 500 mL of 
0.1 M triethanolamine. 

Combine 200 mL of 0.1 M Tris-H* with 732 mL water and 
68 mL of 0.1 M NaOH. 


2l.a. 2 


16. 


20. 


10 


1.0 
Equivalents OH~ 


2.0 


b. CHCH OH 
| 
:NCH,COO- 


CHCH OH 


c. The relevant pK, for this calculation is 8.3. Combine 400 mL 
of 0.1 M bicine at its pH; (pH 5.3) with 55 mL of 0.1 M NaOH 
and 545 mL of water. 


d. The relevant pK, for this calculation is 2.3. The concentration 
of fully protonated form of bicine at pH 7.5 is 2.18 X 1077 M. 


22. 0.063 M 


| 
0.6 0.8 
Equivalents OHT 


25. 1.53 nmol/mL - sec 


Abbreviated Answers to Problems A-3 


26. a. 3.55 uM 


b. Theoretically, 4 uM OH™ would neutralize all of the H»CO;, 
swamping the buffer system, and leading to a dangerous rise in 
pH. 

1.2 mM. 

. K sveraii = 8.07 X 1077. 

$ PKoveratt = 6.1. 

. As [CO,] drops due to excessive exhalation, [HCOs] in the blood 
plasma falls, followed by a decline in [H+] and [HCO  ] in the 
blood plasma. Blood pH rises within 20 sec of the onset of 
hyperventilation, becoming maximal within 15 minutes pH can 
change from its normal value of 7.4 to pH = 7.74. This rise in 
plasma pH (increase in alkalinity) is termed respiratory alkalosis. 


b. [H*] drops from its normal value of 40 nM (pH = 7.4) to 
18 nM (pH = 7.74). 


Hypoventilation results in respiratory acidosis, as CO, 

accumulates, giving rise to HxCO3;, which dissociates to form 

H* and HCO;-. 

Think-Pair-Share Question: 

a. Two inflection points can be observed in the titration curve: after 
the addition of 10 and 20 mL 0.2 M NaOH. This implies that, 
0.01 L x 0.2 mol/L = 0.002 mol, corresponds to the molar equiva- 
lents of acid that was present. Thus, the concentration of acid was 
0.002 mol / 0.02 L = 0.1 M. 

b. Given that 10 mL 0.2 M NaOH corresponds to the inflection 

points, 5 mL and 15 mL 0.2 M NaOH added, corresponds to the 

half-way points. Thus, one can estimate that the pK, values are 

about 2.0 and 7.0. 


voae 


27. 


28. 


Chapter 3 

1. In the human body, the temperature is 37°C (310 K). Assuming 
the values of AH and AS are the same in the human body as in 
the laboratory, and applying AG = AH — TAS one calculates 
AG = —20 kJ/mol; making the reaction spontaneous. 

2. Keg = 613 M; AG? = —15.9 kJ/mol. 

3. AG? = 1.69 kJ/mol at 20°C; AG? = —5.80 kJ/mol at 30°C. 

AS® = 0.75 kJ/mol - K. 

4. AG = —24.8 kJ/mol. 

5. State functions are quantities that depend on the state of the 
system and not on the path or process taken to reach that state. 
Volume, pressure, and temperature are state functions. Heat and 
all forms of work, such as mechanical work and electrical work, 
are not state functions. 


6. The dispersion of a set of molecules confined to a small 
volume (concentrated solution) into a much larger volume 
(diluted solution) significantly increases the entropy of the 
system; making it a spontaneous process. 


7. AG” = AG — 39.5 n (in kJ/mol), where n is the number of 
H* produced in any process. So AG? = AG” + 39.5n = 
—30.5 kJ/mol + 39.5(1) kJ/mol. 


AG° = 9.0 kJ/mol at 1 M [H*]. 
8. a. K.g(AC) = (0.02 X 1000) = 20. 
b. AG°(AB) = 10.1 kJ/mol. 
AG‘(BC) = —17.8 kJ/mol. 
AG*(AC) = —7.7 kJ/mol. 
Ka = 20. 
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A-4 


9; 


10. 


11. 


12. 


13. 
14. 


15; 


16. 


17. 


18. 


19. 


Abbreviated Answers to Problems 


a. The AG of the first reaction would not change. 

b. The addition of the second enzyme catalyzing the hydrolysis of 
ATP (typically with AG < 0) would make this an example of 
“coupled reactions” and the AG of the combined reactions 
would be <0. 

See The Student Solutions Manual, Study Guide and Problems 

Book for resonance structures. 

Keg = [Cr][Pi/[CrP][H20]. 

Keg = 3.89 X 10’. 

CrP in the amount of 135.3 moles would be required per day 

to provide 5860 kJ energy. This corresponds to 28,548 g of 

CrP per day. With a body content of 20 g CrP, each molecule 

would recycle 1,427 times per day. 

Similarly, 637 moles of glycerol-3-P, or 108,300 g of 

glycerol-3-P, would be required. Each molecule would 

recycle 5410 times/day. 

AG = —46.1 kJ/mol. 

The hexokinase reaction is a sum of the reactions for hydrolysis 

of ATP and phosphorylation of glucose: 


ATP + H,O == ADP + P; 
Glucose + P; —— G-6-P + H,O 
Glucose + ATP == G-6-P + ADP 


The free energy change for the hexokinase reaction thus can be 
obtained by summing the free energy changes for the first two 
reactions listed here. 


AG” for hexokinase = —30.5 kJ/mol + 13.9 kJ/mol = 
—16.6 kJ/mol 


The oxidation reaction may require the input of activation energy 
to initiate the oxidation reactions. Enzymes typically lower the 
required activation energy and as such promote reactivity. An 
excellent related article: Borden, W. T. et al., 2020. Dioxygen: 
What Makes This Triplet Diradical Kinetically Persistent? 
Journal of the American Chemical Society 139:9010-9018. 


Comparing the acetyl group of acetoacetyl-CoA and the methyl 
group of acetyl-CoA, it is reasonable to suggest that the acetyl 
group is more electron-withdrawing in nature. For this reason, it 
tends to destabilize the thiol ester of acetoacetyl-CoA, and the 
free energy of hydrolysis of acetoacetyl-CoA should be somewhat 
larger than that of acetyl-CoA. In fact, AG” = —43.9 kJ/mol, 
compared with —31.5 kJ/mol for acetyl-CoA. 


Carbamoyl phosphate should have a somewhat larger free 
energy of hydrolysis than acetyl phosphate, at least in part 
because of greater opportunities for resonance stabilization in 
the products. In fact, the free energy of hydrolysis of carbamoyl 
phosphate is —51.5 kJ/mol, compared with —43.3 kJ/mol for 
acetyl phosphate. 


a. Keg, 298 = 0.0204. Keg, 320 = 9, 
b. AH = 219.44 kJ/mol. AS = 704 J/mol. A G° = 9.642 kJ/mole. 


The value of A G° is determined at standard state, which includes 
1M [H*]. Using Equation 3.17, a value for A G° of -3.36 kJ/mol 
can be calculated. This value applies at pH 2, 7, and 12 because, 
of course, A G° is pH independent. However, Equation 3.17 can 
also be used to determine that AG” is — 14.77 kJ/mol at pH 2 
and -71.82 kJ/mol at pH 12. Finally, AG°’ for enolase is not pH- 
dependent because the enolase reaction (see Figure 3.8) neither 
consumes nor produces protons. AG? = AG” = 1.8 kJ/mol. 


20 


21. 


22; 


23; 
24. 


25; 


. The magnitude of A G” for ATP hydrolysis is sufficiently great 


that it provides ample energy to drive the conversion of A to B, 
even though the reaction is unfavorable. The result is that the 
equilibrium ratio of B to A is more than 108 greater when the 
reaction is coupled to ATP hydrolysis. If A —> B were coupled 
instead to 1,3-bisphosphoglycerate hydrolysis, the ratio of B 

to A would be even greater, since the A G” for hydrolysis of 
1,3-bisphosphoglycerate is substantially greater than that of ATP. 
Using the concentrations for 1,3-BPG and 3-PG in Table 18.2, 
and repeating the calculations in the A Deeper Look box “ATP 
Changes the Keq by a Factor of 108 in Section 3.8” yields a 
ratio of [Beql/[Acq] = 4.14 X 10°, an even greater ratio than that 
calculated in the A Deeper Look box “ATP Changes the K., by 
a Factor of 108” in Section 3.8. 


a. AG” = —31.5 kJ/mol 

b. Keg = 1.38 

c. The concentration of fatty-acyl CoA is not specified in this 
problem, since it varies in a typical cell. Assuming [fatty-acyl 
CoA] = 1 mM, AG = 15.63 kJ/mol 

a. Temperature is not specified. 
Assuming T = 298K, AG = —58.8 kJ/mol 

b. Keg = 1.24 X 107 

c. AG” = —9.97 kJ/mol 

d. Keg = 55.7 

This exercise is left to the student. Use Figure 3.5 as a guide. 

Without pyrophosphate cleavage, the acyl-CoA synthetase 

reaction is only slightly favorable, with a A G°’ of 0.8 kJ/mol. 

With pyrophosphate cleavage included, the net AG*®’ for the 

reaction is —33.6 kJ/mol, a far more favorable value. 

Use data in Table 3.1 to determine A G” for creatine kinase reaction: 

Cr + ATP — CrP + ADP 

AG” = +12.8 kJ/mol. 


Use AG in muscle of 8.04 kJ/mol with Equation 3.8 to calculate 
[Cr] = 0.012M = 12 mM. 


Think-Pair-Share Question: 


a. 


The various reagents should be organized with increasing values 
of o. Beaker B should contain FMN, beaker C should contain 
cytochrome b; (Fe**), beaker D should contain UQ and beaker E 
should contain cytochrome c (Fe**). 


. Beakers A and B: 
NADH + H* + FMN == NAD* + FMNH, 
AGo' = -0.219V — (-0.32V) = 0.101V 
Beakers B and C: 


FMNH; + 2 cyto by, (Fe**) —= FMN + 
2 cyto by (Fe**) + 2H* 
Ay = -0.100V — (-0.219V) = 0.119V 
Beakers C and D: 
2 cyto by (Fe**) + 2H* + UQ == 
2 cyto by (Fet) + UQH) 
AE’ = 0.060V — (-0.100V) = 0.16V 
Beakers D and E: 
UQH), + 2 cyto c (Fe**) == UQ + 
2 cyto c (Fe?) + 2H* 
AE = 0.254V — (0.060V) = 0.194V 
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Beakers E and F: 
2 cyto c (Fe**) + 2H* + %O; == 2 cyto c (Fe**) + H,O 
AC = 0.816V — (0.254V) = 0.562V 
c. Overall reaction 
NADH + H* + 40, == NAD* + H,0 
AC = 0.816V — (-0.32V) = 1.136V 


Chapter 4 


1. Structures for glycine, aspartate, leucine, isoleucine, methionine, 
and threonine are presented in Figure 4.3. 


2. Asparagine = Asn = N. 
Arginine = Arg = R. 
Cysteine = Cys = C. 
Lysine = Lys = K. 
Proline = Pro = P. 
Tyrosine = Tyr = Y. 
Tryptophan = Trp = W. 

3. LDPTY and MHQAT 


4. a. The structure is not a zwitterion as it has a surplus of positive 
charges. 


b. Increasing the pH could remove a proton from one of the 
N-containing functional groups and balance the positive and 
negative charges within the structure. 


5. Alanine dissociation: 


COOH COO- (000 
H;N*—C—H = H;N*—C-—H == HNC 
CH3 CH; bn, 
Glutamate dissociation: 
COOH (00103 coo coo 
H;N*—C—H = H3N*—C—H > H;N* i H =~ HN i H 
CH3 CH ba, TA 
CHo CHo Gi ts, 
COOH COOH tage boo- 
Histidine dissociation: 
COOH coo coo COO- 
H,N*—C—H == a == HN ai = ee 
CH, i” Gi ma 
HC=CH Hodi Heed mea 
HN Nu HNS Xu N, NH N Nu 
cH CH cH 
Lysine dissociation: 
COOH CoO” coo” COO” 
H;Nt—C—H == H3N*—C—H == H.N-C—H == ee 
(CH) 4 (CH) 4 (CH) 4 (ct), 
NH,* NH,* NH,* i 
Phenylalanine dissociation: 
COOH COO” coo 
H;N*—C—H == H;N*—C—H =~ H,N—C—H 
CHo CHo CHo 
A a ZA 
SS | WJ A 


6. The proximity of the a-carboxyl group lowers the pK, of the 
a-amino group. 


9. 


10. 


11. 
12; 


13. 


A-5 


Abbreviated Answers to Problems 


OH (equivalents) 


b. Ratio = 25.1 
ae Equivalence point 
12 
1l} 
10 H la — aNH;* pK,= 9.8 


pH 


R group pK, = 3.9 


a-COOH p&K,= 2.1 


=. N Oo fF ODT © OO 


© 
olg 


Equivalents 

Denoting the four histidine species as His?*, His*, His?, and 
His , the concentrations are: 
pH 2: [His?*] = 0.097 M, [His*] = 0.153 M, 
[His?] = 1.53 x 1075 M, [His-] = 9.6 x 10-8 M. 
pH 6.4: [His?*] = 1.78 X 10-4 M, [His*] = 0.071 M, 
[His] = 0.179 M, [His~] = 2.8 x 10-4 M. 
pH 9.3: [His?*] = 1.75 X 107!" M, [His*] = 5.5 x 1075 M, 
[His’] = 0.111 M, [His~] = 0.139 M. 
pH = pK, + log (2/1) = 4.3 + 0.3 = 4.6. 
The y-carboxyl group of glutamic acid is two-thirds dissociated 
at pH = 4.6. 
pH = pK, + log (1/4) = 10.5 + (—0.6) = 9.9. 
a. The pH of a 0.3 M leucine hydrochloride solution is 

approximately 1.46. 


b. The pH of a 0.3 M sodium leucinate solution is approxi- 
mately 11.5. 

c. The pH of a 0.3 M solution of isoelectric leucine is approxi- 
mately 6.05. 


The sequence of reactions shown would demonstrate that 
L(—)-serine is related stereochemically to L(—)-glyceraldehyde: 
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A-6 


Abbreviated Answers to Problems 


COOH COOH COOH 
HN H > HN | a Br 
CH,OH CH; CH; 


L-(—)-Serine i-(—)-Alanine 2-Bromopropanoic acid 


CHO COOH COOH 
HO H > HO H > HO H 
CH,OH CH,OH CH; 


L-(—)-Glyceraldehyde 


15. 
16. 


17. 
18. 


19. 


20. 


21. 


L-Glyceric acid 2-Hydroxypropanoic acid 


14. 


Straight arrows indicate reactions that occur with retention of 
configuration. Looped arrows indicate inversion of configura- 
tion. (From Kopple, K. D., 1966. Peptides and Amino Acids. 
New York: Benjamin Co.) 


Cystine (disulfide-linked cysteine) has two chiral carbons, the two 
a-carbons of the cysteine moieties. Each chiral center can exist in 
two forms, so there are four stereoisomers of cystine. However, it 
is impossible to distinguish the difference between L-cysteine/ 
D-cysteine and D-cysteine/L-cysteine conjugates. So three distinct 
isomers are formed: 


coo” Coo” coo COO” Coo% ! coo" 
| ‘an [3 | ai T 
H,N—C—H HN-C-H NH; H ¢ NH, H,N—C—H  1H—C—NHs; 


I 
I 
I 
1H ¢ 
I 
CH,—S—S—CH, | 
I 


Mirror plane 
L-Cystine 


43 


|l - 
I—CH,—C—NH, l 


m S 


| > 
CH, 


HsN—C—COO™ 
H 
There are eight Tyr residues in the protein. 


A water is removed from each amino acid when it is incorporated 
into a protein, so the “molecular weight” of a residue is lower 

by 18 units. Also, most proteins have relatively more small side 
chains (Gly, Ala) and fewer Trp side chains than a statistical 
average would predict. 


Aspartame is composed of aspartic acid and phenylalanine, with 
a carboxymethyl cap. These amino acids are linked by a peptide 
(amide) bond in aspartame. Heating can cleave amide linkages. 
For this reason, drinks such as coffee and hot chocolate must be 
consumed relatively quickly after preparation. Aspartame kept 
hot for several hours is quite bitter tasting (based on the experi- 
ence of one of the authors). 


Phenylketonuria is an autosomal-recessive genetic disease caused 
by a deficiency or absence of the enzyme phenylalanine hydroxy- 
lase (PAH), an essential enzyme that converts phenylalanine to 
tyrosine. Without sufficient PAH activity, phenylalanine accumu- 
lates and is converted to phenylpyruvate (which can be detected in 
the urine). Without treatment, phenylketonurics eventually experi- 
ence progressive mental retardation and seizures. Phenylketonuria 
can be controlled by eliminating phenylalanine from the diet, and 
phenylketonurics should be advised not to use aspartame. 


The process for distinguishing R- and S-configurations of chiral 
molecules is described in the Critical Developments in Biochem- 
istry box “Rules for Description of Chiral Centers in the (R,S) 
System” in Section 4.4. Enzymes discriminate between isomers of 
chiral molecules thanks to the asymmetric arrangement of amino 
acid residues in the enzyme active site. 


CH, —S—S—CHy 


b-Cystine 


22 


23; 


24. 


25. 


26. 


27. 


28. 


Mirror plane 


meso-Cystine 


. Appropriate ranges for buffering: 
Alanine—1.4-3.4, 8.7-10.7 


Histidine—0.8-2.8, 5.0-7.0, 8.2-10.2 
Aspartic acid—1.1—4.9, 8.8—10.8 
Lysine—1.2-3.2, 8-11.5 


This exercise is left to the student and will presumably be 
different for every student. 


M. A. Rosanoff, an instructor at New York University, proposed 
the D,L-nomenclature system for chiral molecules. 

See “A Deeper Look - Why Nature Chose Selenium“ on page 82 
for a full explanation. 


This exercise is left to the student and will presumably be 
different for every student. 


With pK, of 8.3, cysteine would make a useful buffer except that 
cystine, the disulfide of cysteine, can form readily in this pH range. 
For every cystine formed, two cysteine-SH groups are eliminated, 
making the buffering capacity of cysteine of limited usefulness. 
Also, the cysteine sulfhydryl group is the most potent nucleophile 
among the side chains of the 20 common amino acids. 
L-threonine is (2S, 3R)-threonine. 

D-threonine is (2R, 3.S)-threonine. 

L-allothreonine is (2S, 3S)-threonine. 


D-allothreonine is (2R, 3R)-threonine. 


Think-Pair-Share Question: 


PHE 3.82 3.83 
ASP 1.08 1.11 
PRO 2.78 2.79 
MET 3.46 3.47 
LYS 4.46 4.47 
SER 2.29 2.31 
ALA 2.89 2.91 
VAL 3.33 3:35 
LEU 3.67 3.68 
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Chapter 5 


1. Nitrate reductase is a dimer (2 Mo/240,000 M,). 

2. Phe-Asp-Tyr-Met-Leu-Met-Lys. 

3; 

4. The ionic character of proteins A, B, and C is A>C>B at the 


Ser-Glu-Tyr-Arg-Lys-Lys-Phe-Met-Asn-Pro. 


pH used in the ion exchange column. Assuming all three proteins 
have a similar globular shape, their sizes are C>A>B. Protein C 
is a flavin-binding protein; proteins A and B are not. 


. The protein may consist of two polypeptide chains linked 


together through disulfide bridges. 


6. Gly-Arg-Lys-Trp-Met-Tyr-Arg-Phe. 
7. Actually, there are four possible sequences: NIGIRVIA, 


GINIRVIA, VIRNIGIA, and, of course, VIRGINIA. 


8. Gly-Trp-Arg-Met-Tyr-Lys-Gly-Pro. 


10. 


11. 
12; 


13; 


14. 
15. 
16. 


I7; 


18. 


Leu-Met-Cys-Val-Tyr-Arg-Cys-Gly-Pro. 


| | 

5—5 
Alanine, attached to a solid-phase matrix via its a-carboxyl 
group, is reacted with diisopropylcarbodiimide-activated lysine. 
Both the a-amino and e-amino groups of the lysine must be 
blocked with 9-fluorenyl-methoxycarbonyl (Fmoc) groups. To 
add leucine to Lys-Ala to form a linear tripeptide, precautions 
must be taken to prevent the incoming Leu a-carboxyl group 
from reacting inappropriately with the Lys e-amino group 
instead of the Lys a-amino group. 


Bovine ribonuclease; Neurospora crassa NADPH-nitrite reductase 


The mass of the myoglobin chain is calculated to be 16,947 + 

1 daltons. 

Unlike any amino acid side chain, the phosphate group (or more 
appropriately, the phosphoryl group) bears two equivalents of 
negative charge at physiological pH. Furthermore, replacing an 
H atom on an S, T, or Y side chain with a phosphoryl group 
introduces a very bulky substituent into the protein structure 
where none existed before. 

Compound Z the ligand with the strongest affinity for the protein. 
A graph of v versus [L] reveals that at v = 0.5, [L] = Kp = 2.4 mM. 


Percent Saturation of Protein with Ligands A and B 


Ligand concentration Ligand A Ligand B 
(mM) (Kp = 0.3 mM) (Kp = 1 mM) 
0.1 0.25 0.091 
0.2 0.4 0.167 
0.5 0.625 0.33 
1 0.77 0.5 
2 0.87 0.67 


More B is required to achieve the same degree of saturation as any 
given concentration of A. Kp and affinity are inversely related. 


IRS-1 has 1242 amino acids. Its average molecular mass is 
131,590.97. The amino acid sequence of the tryptic peptide of 
IRS-1 of mass of 1741.9629 is LNSEAAAVVLQLMNIR. The 
sequence of the tryptic fragment containing the SHPTP-?2 site is 
LCGAAGGLENGLNYIDLDLVK. 


Nucleophilic attack by the hydroxyl O of the active-site serine on 
the carbonyl carbon of a peptide bond. 


Abbreviated Answers to Problems 


A-7 


19. a. Amino acid changes in mutant hemoglobins that appear on the 
surface of the folded globin chains may affect quaternary structure. 


b. Amino acid substitutions on the surface on the quaternary 
hemoglobin structure that create hydrophobic patches might 
lead to polymerization. Such amino acids would include all of 
the hydrophobic amino acids. 


20. a. They refer to the sedimentation coefficients of these two subunits. 


b. The ribosomal subunits are quaternary complexes of protein 
and rRNA. The larger the S value, the larger the rRNA is and 
the more protein molecules the ribosomal subunit has. 


Think-Pair-Share Question: 
Digestion order 4: 


1. Chymotrypsin: QVDGL 
MRTSEQMKNSRV 


2. Trypsin: QVDGL MR 
TSEQMK NSR V 

3. St. Protease: QVD GL MR 
TSE QMK NSR V 


Digestion order 5: 


1. St. Protease: QVD GLMR- 
TSE QMKNSRV 


2. Trypsin: QVD GLMR TSE 
QMK NSR V 

3. Chymotrypsin: QVD GL 
MR TSE QMK NSR V 


Edman degradation : 


Edman degradation 


Edman degradation 


Edman degradation : 


Edman degradation 


Edman degradation 


QM 


: QMTNV 


: QOGMTNV 


QG 


: QGTNV 


: QOGMTNV 


The order of endopeptidase treatment did not affect the final results. 


Chapter 6 


1. The central rod domain of keratin is composed of distorted 
a-helices, with 3.6 residues per turn, but a pitch of 0.51 nm, 
compared with 0.54 nm for a true a-helix. 


(0.51 nm/turn)(312 residues)/(3.6 residues/turn) = 
44.2 nm = 442 A. 


For an a-helix, the length would be: 


(0.54 nm/turn)(312 residues)/(3.6 residues/turn) = 
46.8 nm = 468 A. 


The distance between residues is 0.347 nm for antiparallel 
B-sheets and 0.325 nm for parallel B-sheets. So 312 residues 
of antiparallel B-sheet amount to 1083 A and 312 residues of 
parallel B-sheet amount to 1014 A. 


2. The collagen helix has 3.3 residues per turn and 0.29 nm per 


residue, or 0.96 nm/turn. Then: 


(4 in/year)(2.54 cm/in)(107 nm/cm)/(0.96 nm/turn) = 
1.06 X 108 turns/year. 
(1.06 X 108 turns/year)(1 year/365 days)(1 day/24 hours) 

(1 hour/60 minutes) = 201 turns/minute. 


3. Asp: The ionizable carboxyl can participate in ionic and hydrogen 
bonds. Hydrophobic and van der Waals interactions are negligible. 


Leu: The leucine side chain does not participate in hydrogen 
bonds or ionic bonds, but it will participate in hydrophobic and 


van der Waals interactions. 


Tyr: The phenolic hydroxyl of tyrosine, with a relatively high pKa, 
will participate in ionic bonds only at high pH but can both donate 
and accept hydrogen bonds. Uncharged tyrosine is capable of 
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A-8 


10. 


11. 


12. 


13. 


14. 


15. 
16. 


Abbreviated Answers to Problems 


hydrophobic interactions. The relatively large size of the tyrosine 
side chain will permit substantial van der Waals interactions. 


His: The imidazole side chain of histidine can act as both an 
acceptor and donor of hydrogen bonds and, when protonated, can 
participate in ionic bonds. Van der Waals interactions are expected, 
but hydrophobic interactions are less likely in most cases. 


. All proteins, independent of their primary structure, are 


polyamides containing regularly spaced amide functionalities. 
As a helices and B sheets are created through hydrogen bonds 
between amide groups, a helices or B sheets can be created in 
many different proteins. 


. The presence of the hydroxyl groups on the proline or lysine 


residues provide additional hydrogen-bonding capabilities that 
help stabilize the triple helical conformation of collagen. 


. Protein A appears to have the most appropriate amino acid balance 


with 31.4% hydrophobic (A, G, I, L, F, P, V), 29.1% hydrophilic 
(R, N, D, C, E, Q, H, S, T) and 33.5% amphipathic (K, M, W, Y) 
amino acids. Protein B has excess hydrophobic amino acids (55%), 
while protein C has excess hydrophilic amino acids (48.1%). 


. There would be much more steric hindrance in the peptide 


or protein chain, limiting the possibilities to form secondary 
or tertiary structures. 


. As an imino acid, proline has a secondary nitrogen with only one 


hydrogen. In a peptide bond, this nitrogen possesses no hydro- 
gens and, thus, cannot function as a hydrogen-bond donor in 
a-helices. More importantly, b, the angle of rotation around the 
C,-N bond, is fixed in Pro. On the other hand, proline stabilizes 
the cis-configuration of a peptide bond and thus, is well suited to 
B-turns, which require the cis-configuration. 


. For a right-handed cross-over, moving in the N-terminal to 


C-terminal direction, the cross-over moves in a clockwise 
direction when viewed from the C-terminal side toward the 
N-terminal side. The reverse is true for a left-handed cross-over; 
that is, movement from N-terminus to C-terminus is accompa- 
nied by counterclockwise rotation. 


The Ramachandran plot reveals allowable values of ¢ and w for 
a-helix and B-sheet formation. The plots consider steric hindrance 
and will be somewhat specific for individual amino acids. For exam- 
ple, peptide bonds containing glycine can adopt a much wider range 
of ¢ and y angles than can peptide bonds containing tryptophan. 


The protein appears to be a tetramer of four 60-kD subunits. 
Each of the 60-kD subunits in turn is a heterodimer of two 
peptides, one of 34 kD and one of 26 kD, joined by at least 
one disulfide bond. 


Hydrophobic interactions frequently play a major role in 
subunit—subunit interactions. The surfaces that participate 
in subunit—subunit interactions in the B, tetramer are likely 
to possess larger numbers of hydrophobic residues than the 
corresponding surfaces of protein A. 


The length is given by (53 residues) X (0.15 nm run/residue) = 
7.95 nm. The number of turns in the helix is given by 

(53 residues)/(3.6 residues/turn) = 14.7 turns. There are 

49 hydrogen bonds in this helix. 


Glycines are essential components of tight turns (8-turns) and, 
thus, are often essential for maintenance of protein structure. 
Asp, Glu, Ser, Thr, His, and perhaps also Asn, Gln, Cys, Arg, Lys. 
The ability of poly-Glu to form a-helices requires that the gluta- 
mate carboxyls be protonated. Deprotonation produces a poly- 
anionic peptide that is not amenable to helix formation. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
24. 


25. 


A coiled-coil formed from a-helices with 3.5 residues per turn would 
form a symmetrical seven-residue-repeating structure that would 
place the first and fourth residues of the seven-residue repeat at the 
same positions about the helix axis in every seven-residue repeat. 
This would allow the two helices of a coiled-coil structure to lie side 
by side with no twist about the coiled-coil axis. Such a structure 
would probably not be as stable as the twisted structure of coiled 
coils formed from a-helices with 3.6 residues per turn. 


Linear motifs are peptide sequences that are short and conserved. 
These blocks occur within a larger protein and act as recognition 
modules. A sequence containing a phosphorylatable tyrosine and 
a proline-rich sequence are two examples. Linear motifs are dif- 
ferent from domains in the sense that the linear sequence rather 
than a folded domain is the recognizable unit. Consult the refer- 
ences provided in this question for additional information. 


The student should consult the references provided to answer this 
question. 


The work of Dorothee Kern has provided evidence that con- 
formational transitions in proteins may be assisted by transient 
interactions resulting from internal protein motions. Consult 
the reference provided for more information on this interesting 
phenomenon. 


Born in Pocahontas, Virginia, Herman Branson received a B.S. 
from Virginia State University in 1936 and a Ph.D. in Physics 
from the University of Cincinnati in 1939. He became a profes- 
sor of physics and chemistry at Howard University in 1941 and 
spent 27 years on that faculty. In 1948, he took a sabbatical leave 
in the laboratory of Linus Pauling at the California Institute of 
Technology. Pauling asked him to do a mathematical analysis of 
all helical structures for peptide chains that would fit the struc- 
tural constraints determined previously by Pauling and Robert 
Corey. The first (and most revolutionary) paper on the a-helix 
was submitted to Proceedings of the National Academy of Sci- 
ences on Pauling’s 50th birthday (2/28/1951) by Pauling, Corey, 
and Branson. An excellent account of this story is provided by 
David Eisenberg (Proceedings of the National Academy of Sci- 
ences 100:11207—11210 [2003]). 


a. The third sequence would place hydrophobic residues on both 
sides of a B-strand and thus could be found in a parallel B-sheet. 


b. The second sequence would place hydrophobic residues on 
just one side of a B-strand and thus could be found in an anti- 
parallel B-sheet. 


c. The sixth sequence consists of GPX repeats (where X is any 
amino acid) and thus could be part of a tropocollagen molecule. 


d. The first sequence consists of seven-residue repeats, with first 
and fourth residues hydrophobic, and thus could be part of a 
coiled coil structure. 


The solution to this exercise is to be completed by the student. 


AG” = —34.23 kJ/mol, a number that corresponds to one to two 
H bonds. 
For the folded state (F) to unfolded state (U) reaction, 

F=—U 
the equilibrium constant is given by Keq = u/f = u/(1 — u), 
where u and f are the unfolded and folded fractions; u + f = 1. 
Given AG” = —RTInK.,, if u = 0.8, Keg = 0.8/0.2 = 4 and 
AG” = 3.4 kJ/mole. If u = 0.2, then Keq = 0.2/0.8 = 0.25 and 
AG” = —3.4 kJ/mole. So the transition from 80% unfolded to 
80% folded is given by AG = 3.4 kJ/mol — (—3.4 kJ/mole) = 
6.8 kJ/mol = 7 kJ/mol. 
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26. 


2T: 


Abbreviated Answers to Problems A-9 


a. 28 3. CHO CHO CHO CHO 
b. about 8 
c. Assuming an antiparallel pair of B-strands, about 12. HOS Oo OG H H a H 
d. 2 X 12 (B-strands) + 28 (a-helix) + 2 X 4(each B-bend) = H—— OH HO H H On HĦH—— OH 
approximately 60. H—+—OH H OH H OH HO—~——H 
e. 2 for the B-bends, n — 4 = 24 for a-helix, and about 12 for the | 
pair of B-strands = approximately 38 total. CHOH CHON CHON CHOH 
D-Arabinose D-Lyxose D-Ribose L-Xylose 


Charge will be approximately —1, although the tyrosine will 
begin to deprotonate at pH 9.8. 4. See Figure 7.2. All of the structures in the same row as D-xylose 
are D-diastereomers of it. 


28. Charge will be approximately +2.5. (The histidine will be half- 
protonated at pH 6.) 5. L-Gulose 
29. This exercise is left to the student and will presumably be 6. a. Converting D-galactose to D-talose would require breaking of 
different for every student. bonds, so they represent different configurations. 
Think-Pair-Share Question: b. Chair structures of B-D-allose differ by conformation. 
7. D-2-Deoxyarabinose 
eo N 8. D-Fructose can tautomerize to D-glucose or D-mannose, both of 
which are reducing sugars. 
9. The systematic name for trehalose is a-D-glucopyranosyl- 
Sy (1 — 1)-a-pD-glucopyranoside. 
~ aan “If Trehalose is not a reducing sugar. Both anomeric carbons are 
occupied in the disaccharide linkage. 
Chapter 7 10. See structures in The Student Solutions Manual, Study Guide and 
Problems Book. 
1. See structures in The Student Solutions Manual, Study Guide and il 1.4 
. a-l, 


13; 


14. 


15. 


16. 


17. 


18. 


19. 


. See structures and titration curve in The Student Solutions 


Problems Book. 
12. 


Manual, Study Guide and Problems Book. 


(0) CHOH HOH,C (0) CHOH HOHC O OH 
OH OH OH 
HOH,C OH OH CH,OH 
OH OH OH 
enantiomer anomer C3-epimer 
Structure A: Hexose, L, aldose and protein are at approximately the same oxidation level, and 


Structure B: Hexose, L, aldose, œ, pyranose both of these are significantly less than that of fat. 


Structure C: Hexose, D, ketose, B, furanose 20. This mechanism could involve either an Syl or Sy2 mechanism. 
In the former, protonation of the bridging oxygen would result 
in dissociation to produce a carbo-cation intermediate, which 
could be attacked by the phosphate nucleophile. The observation 


A sample that is 0.69 g a-D-glucose/mL and 0.31 g 
B-bD-glucose/mL will produce a specific rotation of 83°. 


See structures in The Student Solutions Manual, Study Guide and of retention of configuration at the anomeric carbon favors this 
Problems Book. mechanism. An Sy2 mechanism would presumably involve water 
A 0.2 g sample of amylopectin corresponds to 0.2 g/162 g/mole attack at the anomeric carbon, with dissociation of the oxygen of 
or 1.23 X 1073 mole glucose residues; 50 umole is 0.04 of the the carbohydrate chain. This would be followed by Sy2 attack by 
total sample or 4% of the residues. Methylation of such a sample phosphate. Two Sy2 attacks would result in retention of configu- 
should yield 1,2,3,6-tetramethylglucose for the glucose residues ration at the anomeric carbon atom. 

on the reducing ends of the sample. The amylopectin sample 21. Laetrile contains a cyanide group. Breakdown of laetrile and 
contains 1.2 X 10'* reducing ends. release of this cyanide function in the body would be highly toxic. 
There are several target sites for trypsin and chymotrypsin 22. Chondroitin and glucosamine are amino sugar components of carti- 
in the extracellular sequence of glycophorin, and it would lage and connective tissue. Dietary supplement with these substances 
be reasonable to expect that access to these sites by trypsin could help replenish the cartilage matrix proteoglycan, relieving pain 
and chymotrypsin would be restricted by the presence of and restoring the proper function of connective tissue structures. 
oligosaceharides mi Rae Sxi acelhilar doman of glycophorin; 23. Two of the sugar units in stachyose are glucose and fructose, 
Energy yield upon combustion (whether by metabolic pathways and the bond joining them is easily cleaved by stomach enzymes. 
or other reactions) depends on the oxidation level. Carbohydrate However, the other two sugar units in stachyose are galactoses 
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A-10 


24. 


25; 


26. 


27. 


28. 


29, 


30. 


31. 


32. 


Abbreviated Answers to Problems 


in a(1 —> 6)-linkages, which are not broken down by human 
enzymes. The result is that stachyose loses a fructose in the 
stomach but the resulting trisaccharide passes into the intestines, 
where bacterial enzymes degrade it, producing intestinal gas 

in the process. Beano contains an enzyme that hydrolyzes 

a(l —> 6)-galactose linkages. Taking several Beano tablets before 
a meal of beans or legumes facilitates complete breakdown of 
stachyose and other related oligosaccharides in the stomach, 
avoiding the production of intestinal gas. The active enzyme in 
Beano is referred to as a a(1 —> 6)-galactosidase. 
B-D-Fructofuranosyl-O-a-D-galactopyranosyl-(1 —> 6)-O-a-D- 
galactopyranosyl-(1 —> 6)-a-D-glucopyranoside 

Starch phosphorylase cleaves glucose units, one at a time, from 
starch chains until a (1 —> 6)-branch is encountered. Thus, 

limit dextrins are amylopectin fragments with a glucose in 

(1 — 6)-linkage at each nonreducing end. The mechanism of the 
reaction that cleaves these glucose units is essentially the same as 
that for starch phosphorylase, except that it occurs at a glucose 
unit that is (1 —> 6)-linked. 


In the beer-making process, the mash fermented by yeast con- 
tains starch, which is partially broken down by the amylase from 
the malt to produce limit dextrins (see problem 16). These limit 
dextrins add significant caloric content to regular beers. Joseph 
Owades used the enzyme amyloglucosidase, which hydrolyzes 
both (1 —> 4) and (1 —> 6) linkages in starch, thus breaking 
down the limit dextrins to simple glucose, which is fermented 
normally. (This raises the alcohol content of the beer above what 
normally would be produced, and water typically is added to 
adjust the alcohol content.) 


As described in problem 14, Beano is a a(1 —> 6)-galactosidase. 
Thus, Beano can slowly convert the limit dextrins produced in 
the beer-brewing process into simple glucose units, much like 
the enzyme used by Joseph Owades (see problem 17). Amateur 
brewers have learned to use Beano to make their own light beer, 
often referred to as Beano beer. 


Assuming each glucose unit contributes 0.55 nm to the 
growing cellulose polymer (and the plant length), a growth rate 
of 1 foot per day corresponds to a rate for cellulose synthase 
of 6400 glucose units per second. 


Lactaid contains a natural lactase enzyme derived from Aspergil- 
lus oryzae. It converts lactose into glucose and galactose. 


Lactose-free milk is prepared by treating regular milk with a 
lactase enzyme, which converts lactose to glucose and galactose. 
One could make lactose-free milk at home by treating regular 
milk with Lactaid tablets. Suitable references would reveal the 
lactose concentration in regular milk and the activity of the 
lactase enzyme in Lactaid tablets. One could then calculate how 
many Lactaid tablets and how much time (at a given tempera- 
ture) would be required to consume all of the lactose in a given 
quantity of regular milk. 


This exercise is left to the student and will presumably be 
different for every student. 


This exercise is left to the student and should be based on the 
reference provided. 


34. 


3D: 


36. 


3T, 


38. 


Lewis-a trisaccharide 


This exercise is left to the student and should be based on the 
reference provided. 


Basic amino acid side chains (Arg, His, Lys) in antithrombin III 
present positively charged side chains for ionic interactions with 
sulfate functions on heparin; H-bond donating amino acid side 
chains could form H bonds with O atoms in —OH groups on 
the heparin carbohydrate residues; H-bond accepting amino acid 
side chains could form H bonds with H atoms in —OH groups 
of heparin. 


Because these glycosaminoglycans are rich in hydroxyl groups, 
amine groups, and anionic functions (carboxylates, sulfates), they 
interact strongly with water. The heavily hydrated proteoglycans 
formed from these glycosaminoglycans are reversibly dehydrated 
in response to the pressure imposed on a joint during normal 
body movements. This dehydration has a cushioning effect on the 
joint; when the pressure is relieved, the glycosaminoglycans are 
spontaneously rehydrated due to their affinity for water. 


This exercise is left to the student and will presumably be 
different for every student. 


This exercise is left to the student and will presumably be 
different for every student. 


Chapter 8 


l. 


Because the question specifically asks for triacylglycerols that 
contain stearic acid and arachidonic acid, we can discount the 
triacylglycerols that contain only stearic or only arachidonic acid. 
In this case, there are six possibilities: 


| | l 
S S A S A S A S S 


[ | | | I 
S A A A S A A A S 


. See The Student Solutions Manual, Study Guide and Problems 


Book for a discussion. 


. a. Phosphatidylethanolamine and phosphatidylserine have a net 


positive charge at low pH. 


b. Phosphatidic acid, phosphatidylglycerol, phosphatidylinositol, 
phosphatidylserine, and diphosphatidylglycerol normally carry 
a net negative charge. 


c. Phosphatidylethanolamine and phosphatidylcholine carry a 
net zero charge at neutral pH. 


. Unlike the phosphate group, the sulfate group in a diester bond 


is uncharged. This lack of the negative charge would give the 
“glycerosulfoplipids” different physicochemical properties. 
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10. 


11. 


12. 
13. 


14. 


15. 


16. 


17, 
18. 


19. 


. Diets high in cholesterol contribute to heart disease and stroke. 


On the other hand, plant sterols bind to cholesterol receptors in 
the intestines but are not taken up by the cells containing these 

receptors. There is substantial evidence that a diet that includes 
plant sterols can reduce serum cholesterol levels significantly. 


. Former Interior Secretary James Watt was well known during the 


Reagan administration for his comments on several occasions 
that trees caused and produced air pollution. As noted in this 
chapter, it is, of course, true that trees emit isoprenes that are 
the cause of the blue-gray haze that is common in still air in 
mid-to late-summer in the eastern United States. However, these 
isoprene compounds are not significantly toxic to living things, 
and it would be misleading to call them pollutants. 


. Louis L Amour clearly knew his biochemistry. His protagonist 


knew that fat carries a higher energy content than protein or car- 
bohydrate. If meals are going to be scarce (as for a person living 
in the wild and on the run), it is wise to consume fat rather than 
protein or carbohydrate. The same reasoning applies for migra- 
tory birds in the weeks preceding their long flights. 


. Phospholipase A, from snake bites operates without regulation 


or control, progressively breaking down cell membranes. Phos- 
pholipase A, action is under precise control and is carefully regu- 
lated to produce just the right amounts of required cell signals. 


. The lethal dose for 50% of animals is referred to as the LD5ọ. The 


LD» for dogs is approximately 3 mg/kg of body weight. Thus, 
for a 40-lb dog (18.2 kg), consumption of approximately 55 mg 
of warfarin would be lethal for 50% of animals. 


See the Student Solution Guide and www.cengage.com/login for 
the solutions to this problem. 


Humans require approximately 2000 kcal per day. Seal blubber is 
predominantly triglycerides, which yield approximately 9 kcal per 
gram. This means that a typical human would need to consume 
222 grams of seal blubber per day (about half a pound) in order 
to obtain all of his or her calories from this energy source. 


a-linolenic acid 


Results for this problem will depend on the particular cookies 
chosen by the student. 


The only structural differences between cholesterol and stigmas- 
terol are the double bond of stigmasterol at Cy,—C3;, and the 
ethyl group at Cx. 

Androgens mediate the development of sexual characteristics 
and sexual function in animals. Glucocorticoids participate in the 
control of carbohydrate, protein, and lipid metabolism. Miner- 
alocorticoids regulate salt (Na*, K*, and C17) balances in tissues. 


The answers to this question will depend on the household prod- 
ucts chosen and the isoprene substances identified. 


6 


Hydroxide ions (in lye) catalyze the breakdown of triglycerides to 
produce fatty acid salts such as sodium stearate and sodium palmi- 
tate and leaving glycerol as a by-product. Micelle formation by these 
“soaps” leads to emulsification of fats and other nonpolar substances. 


Amphipathic phosphatidylcholine from egg yolk exerts a deter- 
gentlike action on mixtures of vegetable oil and water. Micelles 
made primarily of egg phosphatidylcholine emulsify the veg- 
etable oil, forming a stable suspension that persists indefinitely. 
(Thus, mayonnaise does not need to be mixed or shaken before 
use, whereas oil and vinegar mixtures do.) The micellar par- 
ticles of mayonnaise are large enough to scatter visible light, so 


20. 


21. 


22. 


23. 


24. 


25. 
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mayonnaise appears milky white, whereas the separated layers of 
oil and water in containers of oil and vinegar appear clear. 


In general, esters are more readily hydrolyzed compared to 
ethers. Phospholipases specifically act upon ester bonds; not 
ether bonds. This makes ether linkages more stable from a 
chemical or biochemical point of view. 


Stanol esters function in cholesterol reduction by binding to 
cholesterol receptors in the intestines. However, each serving 
of stanol esters consumed only blocks a fraction of all intesti- 
nal receptors. Regular consumption of stanol esters eventually 
blocks all or most available receptors. Binding of stanol esters 
is tight and long lasting, but not indefinite, so stanol ester con- 
sumption must be continued to maintain the beneficial results. 
The graph in the Human Biochemistry box “Plant Sterols and 
Stanols—Natural Cholesterol Fighters” in Section 8.7 shows 
that reductions of serum cholesterol levels can approach 15%; 
given that the typical diet accounts for about 15% of total serum 
cholesterol, it may be concluded that stanol esters are highly 
effective in preventing uptake of dietary cholesterol. 


Serum cholesterol is partly derived from diet and partly from 
synthesis in the liver. Stanol esters prevent uptake of dietary 
cholesterol but do not affect synthesis in the liver. Cholesterol- 
lowering drugs, on the other hand, block cholesterol synthesis 
but have no effect on uptake from diet. Thus, the effects of these 
two classes of cholesterol-lowering agents are additive. 


Research has shown that consumption of substantial quantities 
of plant fats can lead to significant reduction of serum choles- 
terol. The content of so-called phytosterols varies depending on 
the source. However, a sterol- or stanol-fortified spread like Bene- 
col probably provides the highest concentration of dietary agents 
for cholesterol lowering. 


Tetrahydrogestrinone (THG) is synthesized by the catalytic 
hydrogenation of gestrinone (which has an acetylene group in 
place of the ethyl group on the D-ring of tetrahydrogestrinone). 
Patrick Arnold, known as the “father of prohormones,” is cred- 
ited with the synthesis of THG from gestrinone and the promo- 
tion of THG as an anabolic steroid in preparations such as The 
Clear and The Cream. These substances were marketed aggres- 
sively to top athletes in several sports by Arnold and Victor 
Conte of the Bay Area Laboratory Co-Operative. THG (at that 
time) was undetectable by existing laboratory analyses, and its 
use was not specifically prohibited in athletic competitions by the 
World Anti-Doping Code. Use of THG has since been prohib- 
ited by nearly all sports regulatory agencies, and effective meth- 
ods for its detection have been developed. Numerous athletes 
have by now admitted to use of THG or have been convicted for 
perjury in denial of its use, and many have been stripped of med- 
als and awards for athletic accomplishments aided by THG. 


Most obviously, a diet of triglycerides (from the blubber of seals, 
the polar bears’ favorite food) provides high energy and mate- 
rial that can be reprocessed into other triglycerides and used as 
insulation under the skin. Less obvious is the need of the polar 
bear to stay warm and conserve water. (Polar bears cannot afford 
to eat snow or ice [they are too cold], and they cannot drink 
seawater [it is too salty].) The polar bear is adapted to conserve 
body water and stay warm, and thus it does not urinate for 
months at a time. (Urination would give up both water and heat.) 
To achieve this, it must consume little or no nitrogen because 

a diet rich in nitrogen would require urination and defecation. 
Triglycerides contain no nitrogen and are thus ideal food for the 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


A-12 


26. 


27. 


28. 


29. 


30. 


31. 


Abbreviated Answers to Problems 


adult polar bear. Juvenile polar bears, which have not yet reached 
their full adult body size, must consume protein in order to make 
their own proteins. Once the bear reaches its full size, it changes 
its diet and consumes much less protein! 


Snake venom phospholipase A; cleaves fatty acids from phos- 
pholipids at the C-2 position. The fatty acids behave as deter- 
gents and form micelles (see Chapter 9) that can remove lipids 
and proteins from the membrane and disrupt membrane struc- 
ture, causing pores and eventually rupturing the cell itself. 
This exercise is left to the student and will presumably be 
different for every student. 

This exercise is left to the student and will presumably be 
different for every student. 

a. Phospholipase D 

b. Phospholipase D 

c. Phospholipase Ay 

d. A lyso-phosphatidic acid. 

The starting phospholipid is 
1-linoleoyl-2-a-linolenoyl-phosphatidylcholine. 

The first enzyme is phospholipase A;, which produces linoleic 
acid and a lysophosphatidylcholine. The second enzyme is 
phospholipase D, which cleaves the latter molecule to produce 
a lysophosphatidic acid and choline. 


This exercise is left to the student and will presumably be 
different for every student. 


Think-Pair-Share Question: 
energy bar 1: 1.19 ATP; energy bar 2: 1.19 ATP 


Chapter 9 


l. 


Glycerophospholipids with an unsaturated chain at the C-1 
position and a saturated chain at the C-2 position are rare to 
nonexistent. Glycerophospholipids with two unsaturated chains, 
or with a saturated chain at C-1 and an unsaturated chain at 
C-2, are commonly found in biomembranes. 


. The phospholipid/protein molar ratio in purple patches of 


H. halobium is 10.8. 


. Apart from the basic nutritional value provided by the fat and 


protein content, eggs are rich in cholesterol. During embryonic 
growth and development, the process of cell replication requires 
the making of new membranes which creates an increased require- 
ment for cholesterol in pregnant women. 


. See The Student Solutions Manual, Study Guide and Problems Book 


for plots of sucrose solution density versus percent by weight and 
by volume. The plot in terms of percent by weight exhibits a greater 
curve because less water is required to form a solution that is, for 
example, 10% by weight (10 g sucrose/100 g total) than to form a 
solution that is 10% by volume (10 g sucrose/100 mL solution). 


r= (4D1!?. 


According to this equation, a phospholipid with D = 1 X 
1078 cm? will move approximately 200 nm in 10 milliseconds. 


. Fibronectin: For t = 10 msec, r = 1.67 X 1077 cm = 1.67 nm. 


Rhodopsin: r = 110 nm. 


All else being equal, the value of D is roughly proportional to 
(M,)-', Molecular weights of rhodopsin and fibronectin are 

40,000 and 460,000, respectively. The ratio of diffusion coeffi- 
cients is thus expected to be (40,000)~ /3/(460,000)~!3 = 2.3. 


10. 


11 


12. 


On the other hand, the values given for rhodopsin and 
fibronectin give an actual ratio of 4286.The explanation is 

that fibronectin is anchored in the membrane via interactions 
with cytoskeletal proteins, and its diffusion is severely restricted 
compared with that of rhodopsin. 


. a. Divalent cations increase Tn. 


b. Cholesterol broadens the phase transition without significantly 
changing Tm. 

c. Distearoylphosphatidylserine should increase Tm, due to 
increased chain length and also to the favorable interactions 
between the more negative PS head group and the more posi- 
tive PC head groups. 


d. Dioleoylphosphatidylcholine, with unsaturated fatty acid 
chains, will decrease Tn- 

e. Integral proteins will broaden the transition and could raise or 
lower Tm, depending on the nature of the protein. 


. AG = RT In(GVC). 


AG = +4.0 kJ/mol. 


. AG = RT In({Cou/Cin]) + ZFAW. 


AG = +4.08 kJ/mol —2.89 kJ/mol. 
AG = +1.19 kJ/mol. 


The unfavorable concentration gradient thus overcomes the 
favorable electrical potential, and the outward movement of Nat 
is not thermodynamically favored. 


The student should note the structural similarities between 
digoxin and the cardiotonic steroids shown in Figure 9.50. Based 
upon the discussions within Section 9.8b one could assume that 
digoxin is used to treat cardiovascular issues like hypertension 
and that it is an inhibitor of the Na*, K*-ATPase transport 
protein. 


. One could solve this problem by going to the trouble of plot- 


ting the data in v vs. [S], 1/v vs. 1/[S], or [S]/v vs. [S] plots, but it 

is simpler to examine the value of [S]/v at each value of [S]. The 
Hanes—Woolf plot makes clear that [S]/v should be constant for 
all [S] for the case of passive diffusion. In the present case, [S]/v is 
a constant value of 0.0588 (L/min) ~!. It is thus easy to recognize 
that this problem describes a system that permits passive diffu- 
sion of histidine. 


This is a two-part problem. First calculate the energy available 
from ATP hydrolysis under the stated conditions; then use the 
answer to calculate the maximal internal fructose concentration 
against which fructose can be transported by coupling to ATP 
hydrolysis. Using a value of —30.5 kJ/mol for the AG” of ATP 
and the indicated concentrations of ATP, ADP, and P;, one finds 
that the AG for ATP hydrolysis under these conditions (and at 
298 K) is — 52.0 kJ/mol. Putting the value of +52.0 kJ/mol into 
Equation 9.1 and solving for C, yields a value for the maximum 
possible internal fructose concentration of 1300 M! Thus, ATP 
hydrolysis could (theoretically) drive fructose transport against 
internal fructose concentrations up to this value. (In fact, this 
value is vastly in excess of the limit of fructose solubility.) 


. Each of the transport systems described can be inhibited (with 


varying degrees of specificity). Inhibition of the rhamnose 
transport system by one or more of these agents would be con- 
sistent with involvement of one of these transport systems with 
rhamnose transport. Thus, nonhydrolyzable ATP analogs should 
inhibit ATP-dependent transport systems, ouabain should spe- 
cifically block Na* (and K*) transport, uncouplers should inhibit 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


14. 
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21. 


22. 


23; 


24. 


proton gradient-dependent systems, and fluoride should inhibit 
the PTS system (via inhibition of enolase). 


N-myristoyl lipid anchors are found linked only to N-terminal 
Gly residues. Only the peptide in (e) of this problem contains an 
N-terminal Gly residue. 


Only the peptide in (a) possesses a CAAX sequence (where 

C = Cys, A = aliphatic (Ala, Val, Leu, Ile), and X = any amino 
acid). 

The hydropathy plot of a soluble protein should show no sub- 
stantial stretches of hydrophobic residues, except for the signal 
sequence. 


Amino acids with side chains that can be positively or negatively 
charges are most likely to be found in the interior of ion chan- 
nels. This would include amino acids like aspartic acid, glutamic 
acid, histidine, arginine or lysine. 

The hydration of ions by water molecules would result in the 
masking of the ions’ charges. Transmembrane channels that 
allow for the transport of hydrated ions would allow the trans- 
port of negatively and positively charged ions. 


Prolines are destabilizing to a-helices, and a short helix with a 
proline would not be likely to be stable. Where a proline occurs in 
an a-helix, the helix is bent or kinked. Helices with a proline kink 
tend to be longer than average, presumably because longer helices 
are more stable and more able to tolerate the loss of H bonds in 
the kink region. 


The structural consequences of a proline-induced kink in an 
a-helix are described in detail in the references provided in this 
problem. 


Porin proteins typically consist of 18-stranded B-barrels. With 

9 to 11 residues per strand, about 180 to 200 residues would be 
required to form the barrel, with a roughly equivalent number 
required to form the loops between strands. (The maltoporin 
chains from Æ. coli consists of about 420 residues.) The trans- 
membrane domain of Wza consists of eight a-helical segments 
with about 25 residues per helix, about 200 residues in all. Thus, 
the number of residues needed to create the transmembrane pore 
in these two proteins is about the same, even though the number 
of residues per membrane spanning segment is less for a B-strand 
(9 to 11) than for an a-helix (21 to 25). 


From Figure 9.30, the area within a typical “fenced” area is 
about 0.3 um X 0.3 um, or 9 X 10° A2. Dividing by 60 Å?, it 
appears that there are about 150,000 phospholipids in a 
monolayer-fenced area in the membrane (assuming a membrane 
of pure phospholipid, with no cholesterol, etc.). 


The lysine side chain N is about 5.5 A from the a-carbon (about 
1.1 A per bond). If a Lys side chain was reoriented toward the 
membrane center, the maximum change in position relative to 
the membrane center would be 10-11 A (assuming the position 
of the a-carbon did not change). Figure 9.15 indicates that the 
energy of a Lys side chain would change by 4kT over the 15-A 
distance from the membrane surface to the membrane center. A 
movement of 10 A thus would correspond to two-thirds of 4kT, 
or 8kT/3—almost twice the average translation kinetic energy of 
a molecule in the gas phase (3kT/2). 

In a hydrophobic environment, the aspartate carboxyl group will 
be more stable in the protonated (uncharged) form than in the 
deprotonated (charged) form. 


25. 
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In a hydrophobic environment, the side chains of lysine and 
arginine would be more stable in their deprotonated (uncharged) 
forms than in the protonated (charged) forms. As a result, the 
pK, values of these residues would be lowered significantly in a 
hydrophobic environment. 


Based on the discussion in problems 24 and 25 in this chapter, it 
would be reasonable to imagine that light-induced conformation 
changes could alter the pX, values of the proton-transferring 
moieties in the protein. For example, a conformation change 
that made the environment around an Asp carboxyl more polar 
would reduce the pK, of that group, promoting dissociation (1.e., 
proton release). An appropriate sequence of such changes could 
accomplish transmembrane proton transport. 


The torsion angles for transmembrane a-helices (p = —65°, Yy = 
—40°) differ from those in water-soluble helices (p = —60°, y = 
—45°). As a result, the carbonyl oxygens do not project outward 
from the helix axis as much in transmembrane helices, shielding 
the partial charge on the carbonyls and strengthening intra- 
helical H-bonds. In addition, the hydroxyls of serines and threo- 
nines in membrane proteins can form intra-helical H-bonds with 
the helix backbone, stabilizing these polar side chains during 
helix insertion in the membrane. Once in the membrane, Ser and 
Thr can form inter-helical H-bonds that stabilize helix clusters. 


Susanne Dobler and her colleagues have shown that substitution 
of an asparagine residue with histidine at position 122 (N122H) 
in the alpha subunit of Nat,K*-ATPase dramatically lowers the 
binding affinity for cardiac glycosides such as ouabain, allowing 
the monarchs to thrive on their toxic diet. Dobler and colleagues 
have identified the same amino acid substitution in certain 
species of Coleoptera (beetles), Diptera (flies), and Heteroptera 
(typical bugs), and they have cited these mutations as an example 
of convergent evolution in insect adaptation. 


ATP-dependent flippases are closely related to ion pumps like 
the plasma membrane Na*,K*-ATPase and sarcoplasmic 
reticulum Ca**t-ATPase (Section 9.8). All these transport 
enzymes have similar structures, including a transmembrane 
domain with ten transmembrane a-helices, and they all form an 
aspartyl-phosphate intermediate and undergo similar catalytic 
cycles. However, compared to the ATPases in Section 9.8 that 
transport simple monovalent and divalent cations, the flippases 
must transport comparatively very large substrates (such as 
phosphatidylserine) across the plasma membrane. This is referred 
to as the “giant substrate problem” Jens Andersen and his 
colleagues have suggested that the flippase known as ATP8A2 
creates a hydrophobic gate that separates the entry and exit sites 
of the transported lipid and directs sequential formation and 
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annihilation of water-filled cavities that drive the hydrophilic seven-subunit protein complex (LptA-G). Subunits LptF, LptG, 
phospholipid head group through the membrane, with the and LptB comprise an ABC transporter complex (Section 9.8). 
hydrocarbon chains following passively, still in the membrane Yihua Huang and Changjiang Dong and their respective coworkers 
lipid phase. have determined the structures of the LptD and LptE subunits of 
30. Structures of intermediate states of the sarcoplasmic reticulum the transporters from S. typhimurium and S. flexneri, respectively 
Ca?+-ATPase reveal some of the conformation changes that (see figure on next page. LptD in orange and pink; LptE in blue). 
convert ATP hydrolysis free energy into transport of Ca** ions The LptD-LptE dimer in the outer membrane functions to col- 
from the cytosol to the SR lumen. A structure of the E; state lect LPS molecules as they cross the intermembrane space and 
(figure) shows two bound Ca?* ions coordinated by side chains deposit them in the outer monolayer of the outer membrane. 
of the M4, M5, M6, and M8 transmembrane helices. LPS molecules migrate from the inner membrane through LptC 


and LptA and through the jellyroll structure and into the cavity 
of the B-barrel. The LptE subunit is also inserted into that cavity, 
where it can interact with the sugar chains of the transported 
LPS molecules, as they traverse this cavity and are inserted into 
the outer leaflet of the outer membrane. 


In the E; state, the Ca?*-binding sites are exposed to the 
cytosol, permitting Ca?* binding. The M4 helix moves like a pis- 
ton during the E; > E, enzyme transition, carrying along the 
Ca?* bound to the carboxyl group of Glu*”? (E309 shown in the 
figure). In the E, state, Glu?™® and the other Ca?” site ligands are 
exposed to the opposite side of the membrane, facilitating Ca?* 
release into the lumen. In this transition, the M4 helix moves 
toward the lumen like a piston or pump rod by 5.5 A (see figure). 
In the E,—>E, transition of the dephosphoenzyme, M4 moves a 
similar distance in the opposite direction toward the cytosol. 
Similar movements of transmembrane helices are involved in the | Think-Pair-Share Question: 
transport cycles of the other P-type ATPases. — drug concentration in reservoir 


32. Point (b)—that proteins can be anchored to the membrane via 
covalent links to lipid molecules—was not part of the Singer— 
Nicolson fluid mosaic model. 


— hydrophobicity/hydrophilicity of drug molecule 
— chemical nature of polymer layer 

— thickness of polymer layer 

— thickness of skin layer 

— integrity of skin layer (healthy or wounded) 


Chapter 10 
1. 1 O 
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. . . . H H 
31. Gram-negative bacteria have a cell wall which includes an outer OH OH 

membrane adorned with a tightly packed layer of lipopolysac- i 
charide (LPS) molecules (Section 7.4). The transport of LPS 2. See Figure 10.14. 
from inner membrane to outer membrane is accomplished by a 3. Pentose, D, aldose, 6, furanose. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


10. 


11 


O 


A, : ~ 
7 H 
LN 


N 


A-15 


Abbreviated Answers to Problems 


OH o 
H H H 
NZ N HN N HN \ HN N 
ea | j BN | a | ji A | U = 
<a 
O N N HO N N O N > O N N 
H H H 


5. fa = 0.29; fg = 0.22; fo = 0.25; fr = 0.28. 
6. The number of A and T residues = 3.14 X 10° each; the number 


of G and C residues = 2.68 X 10° each. 


. 5'-TAGTGACAGTTGCGAT-3’. 
. The structure is built from 3’-deoxyribonucleotides; not 


2'-deoxyribonucleotides. 


. 5'-ATCGCAACTGTCACTA-3’. 
5’-TACGGTCTAAGCTGA-3’. 
. There are two possibilities, a and b. (E = EcoRI site; B = BamHI 
site.) Note that b is the reverse of a. 
a. 
B E B 
t | | 
1 3 0.5 5.5 
b. 
BE B 
5.5 0.5 5 1 


12; 


13. 


14. 


a. GGATCCCGGGTCGACTGCAG; 
b. GICGACCCGGGATCCTGCAG. 
a. GGATCCC and GGGTCGACTGCAG. 
b. GTCGACCC and GGGATCCTGCAG. 


Synthesis of a polynucleotide 100 residues long requires forma- 
tion of 99 phosphodiester bonds. A G” for phosphodiester syn- 
thesis (assuming it is the same magnitude but opposite sign as 
that for phosphoric anhydride cleavage of an NTP to give NMP + 
PP;) is +32.3 kJ/mol. AG aeran = (99)(+32.3) = 3198 kJ/mol. 


a. Hydrogen bonding and van der Waals interactions between amino 
acid side chains and DNA, and ionic interactions of amino acid 
side chains with the nucleic acid backbone phosphate groups. 
Double-helical DNA does not present hydrophobic regions for 
interaction with proteins because of base-pair stacking. 


Smal products: 


b. Proteins can recognize specific base sequences if they can fit 
within the major or minor groove of DNA and “read” the 


15. 


16. 


17. 
18. 


19. 


H-bonding pattern presented by the edges of the bases in the 
groove. The dimensions of an a-helix are such that it fits snugly 
within the major groove of B-DNA; then, depending on the 
amino acid sequence of the protein, the side chains displayed 
on the circumference of the a-helix have the potential to form 
H bonds with H-bonding functions provided by the bases. 


a. The restriction endonuclease must be able to recognize a 
specific nucleotide sequence in the DNA that has twofold 
rotational symmetry. 


b. In order to read a base sequence within DNA, either the 
restriction endonuclease must interact with the bases by direct 
access, for example, by binding in the major groove, or the 
restriction endonuclease must be able to “read” the base 
sequence by indirect means, for example, if the base sequence 
imparts some local variation in the cylindrical surface of the 
DNA that the enzyme might recognize. 


c. The restriction endonuclease must be able to cleave both DNA 
strands, often in a staggered fashion. 


d. An obvious solution to the requirements listed in (a) and (c) 
would be a homodimeric subunit organization for restriction 
endonucleases. 

a. The ribose group of nucleosides greatly increases the water 
solubility of the base. 

b. Ribose has fewer hydroxyl groups and thus less likelihood to 
undergo unwanted side reactions. 

c. The absence of the 2-OH in 2-deoxyribose leads to a polynu- 
cleotide sugar—-phosphate backbone that is more stable because 
it is not susceptible to alkaline hydrolysis. 

963,520,000 base pairs. 


a. Phosphate groups bear a negative electrical charge at 
neutral pH. 

b. Cleavage of phosphoric anhydride bonds is strongly exergonic 
and can provide the thermodynamic driving force for diverse 
metabolic reactions. 

c. Cleavage of phosphoric anhydride bonds is strongly exergonic 
and can provide the thermodynamic driving force for 
phosphodiester bond formation in polynucleotide synthesis. 


Once in every 4.4 X 10!? nucleotides. 
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20. 
21. 


22. 
23. 


24. 


A DNA sequence 16 nucleotides long. 


The strategy for protein sequencing by Edman degradation, 

as described in Figure 4.8a and Section 5.4, could be adapted, 
replacing Edman’s reagent with snake venom phosphodiesterase 
acting on an immobilized nucleotide sequence. 


3.59 X 10" D. 


The use of bases as “information symbols” in metabolism allows the 
cell to allocate portions of its phosphorylation potential (as total 
NTP) to dedicated tasks, as in GTP for protein synthesis, CTP for 
phospholipid synthesis, UTP for carbohydrate synthesis, with ATP 
serving the central role. Enzymes in these pathways selectively bind 
the proper nucleotide through recognition of the particular base. 


The most prominent structural feature of the DNA double helix 
in terms of structural complementarity is found in the canonical 
base-pairing of A with T and G with C. Base pairing is essential 
to DNA replication (preservation of the genetic information) and 
transcription (expression of genetic information). 


Think-Pair-Share Question: 


If one considers every A/T combination as “0” and every C/G combi- 
nation as “1,” then this dsDNA code would read: 


From left to right: 
010010110011000011111100111100000101011110100010 
From right to left: 
010001011110101000001111001111110000110011010010 


If one considers every A/T combination as “1” and every C/G combi- 
nation as “0,” then this dsDNA code would read: 


From left to right: 
101101001100111100000011000011111010100001011101 
From right to left: 
101110100001010111110000110000001111001100101101 


Chapter 11 
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2. Sanger’s method of DNA sequencing. 
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ee HHHuHHHHHUHUHHUHUHUNHHAH 
> PPP Pr PrP Pr er Pr > > > > > PP > PD 
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ee 


ee 9 
ee 


rr PrP rr Pr > bP > > PP > PP PPP 
agaanaananananannnannannanaana 
ao Ao fo 48 ff. Of Aa Of A fo oO eo 4 Oo of Aa A 
ee | 


3. Original nucleotide: 5S’°-GTTTAAATTTGTCATTAACC. 
4. a. 10.5 base pairs per turn; b. Ad = 34.3°; c (true repeat) = 


6.72 nm. 


5. 27.3 nm; 122 base pairs. 


6. 4.35 um. 


7. Lo = 160. If W = 
12/160 = 


o = AL/Ly = 


12,L=T+ W= 160 + (—12) = 148. 


0.075. 
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8. 


WY, 


0 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


For | turn of B-DNA (10 base pairs): Lg = 1.0 + Wp. 

For Z-DNA, 10 base pairs can form only 10/12 turn (0.833 turn), 
and Lz = 0.833 + Wz. 

For the transition B-DNA to Z-DNA, strands are not broken, 

SO Lpg = Lz; that 1S, 1.0 + Ws, = 0.833 = Wz, or Wz = Wg = 
+0.167. 

(In going from B-DNA to Z-DNA, the change in W, the number 
of supercoils, is positive. This result means that, if B-DNA 
contains negative supercoils, their number will be reduced in 


Anticodon 


10. 


Abbreviated Answers to Problems A-17 


Z-DNA. Thus, all else being equal, negative supercoils favor the 
B — Z transition.) 


. 6 X 10° bp/200 bp = 3 X 107 nucleosomes. 


The length of B-DNA 6 X 10° bp long = (0.34 nm) 

(6 X 10°) = 2.04 X 10° nm (more than 2 meters!). The height 
of 3 x 107 nucleosomes = (6 nm)(3 x 10’) = 18 X 107nm 
(0.18 meter). 


From Figure 11.35, similarly shaded regions indicate comple- 
mentary sequences joined via intrastrand hydrogen bonds: 


CCA end 


1 


ae! 


LA 
60 7 


0 


Nucleotide no. 


Increasing order of Ta: yeast < human < salmon < wheat < 
E. coli. 


In 0.2 M Na+, Ta(°C) = 69.3 + 0.41(%G + C): 
Rats (%G + C) = 40%, Tm = 69.3 + 0.41(40) = 85.7°C 
Mice (%G + C) = 44%, Tm = 69.3 + 0.41(44) = 87.3°C 


Because mouse DNA differs in GC content from rat DNA, they 
could be separated by isopycnic centrifugation in a CsCl gradient. 


GC content = 0.714 (from Table 10.1 and equations used in 
problem 3, Chapter 10). p = 1.660 + 0.098(GC) = 1.730 g/mL. 


See Figure 11.37 and compare your structure with the base pair 
formed between G18 and 55 in yeast phenylalanine tRNA, 
which has a single H bond between the 2-amino group of G 
(2-NH) --- O=) and the O atom at position 4 in w. 


Assuming the plasmid is in the B-DNA conformation, where 
each pair contributes 0.34 nm to the length of the molecule, the 
circumference of a perfect circle formed from pBR322 would be 
(0.34 nm/bp)(4363 bp) = 1483 nm = 1.48 um. 


The E. coli K12 chromosome laid out as a perfect circle would 
have a circumference of (0.34 nm/bp)(4,639,000 bp) = 
1,577,260 nm = 1.58 mm. 


The diameter of a B-DNA molecule is about 2.4 nm (see 

Table 11.2). For pBR322, the length/diameter ratio would be 

1480/2.4 = 616.7. For the E. coli K12 chromosome, the length/ 

diameter ratio would be 1,577,260/2.4 = 657,192. 

(a) Z-DNA; (b) cruciform; (c) triplex DNA; (d) tRNA; 

(e) type-II restriction endonuclease site. 

a. Serine, threonine, asparagine, glutamine: these amino acids 
contain functional groups with dipoles that are capable of 
forming hydrogen bonds water molecules or other functional 
groups with dipoles. 

b. Lysine, arginine, histidine: these amino acids contain positively 
charged amino acids capable of interaction with the negatively 
charged polyphosphate backbone of DNA. 

c. Tryptophan: the structure of this amino acid contains fused 
rings with one a heterocyclic ring suitable for intercalating with 
the base structures of DNA. 

Rely on the features in Figure 11.35 to draw a tRNA cloverleaf 

with the anticodon positioned at nucleotides 34 to 36: CUG. 


19. 


20. 
21. 


FOHQAQNFHAOQ 
HOAPFPOQHFPOA 


5'— CGAT 
3'— GCTA 


TAATGC —3' 
ATTACG—5' 


QPFPAQOPOH 
QArPaAAHOS 


Erythromycin is within the palm of the mitten. 
a. Tumor necrosis factor 

b. 31,575,565 bp to 31,578,336 bp 

c. 4 exons 


d. 233 amino acid residues 


. The DNA in such a thermophilic organism would be subjected 


to very high temperatures that might denature the DNA. Because 
G:C base pairs are more heat stable than A:T base pairs, one 
might expect DNA from a thermophilic organism to have a high 
G + Ccontent. 


. DNA is the material of heredity; that is, genetic information. 


This information is encoded in the sequence of bases in DNA, so 
this is its most important structural feature. The double-stranded 
nature of DNA and the complementary base sequence of the two 
strands are also crucial structural features in the transmission 

of genetic information through DNA replication. Beyond these 
points, one might cite the structural features of DNA Cq that 
impart stability to the double helix and structural aspects that 
render DNA less susceptible to degradation. 
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A-18 


Abbreviated Answers to Problems 


24. a. Total weight = 6 pg; total length = 1.92 m. 


25: 


26. 


27. 


b. 1.25 X 10! bp 

c. 1.25 X 10° g 

d. 4.685 X 10” bp. 

e. 0.469 g 

Hyperchromic shift: DNA > RNA, because it is entirely double- 
stranded and RNA is only partially double-stranded. 

Colligative properties: Such properties depend on the number of 
molecules per gram. DNA molecules are much larger than RNA 
molecules, so there are fewer of them per gram, and their colliga- 
tive properties would be less. 

Inverted repeats: DNA does not depend on inverted repeats to 
form secondary structure, but RNA does. 

Tm: Tm is the temperature of transition from double-stranded to 
single-stranded. Virtually all of the bases in DNA are base paired 
and the T, depends on relative G:C content. Fewer bases are 
paired in RNA, so Tp is less definitive. 

Buoyant density: RNA is more dense than DNA (see the Buoy- 
ant Density of DNA box on page 368). 

Chemical and physical stability: DNA lacks a 2’-OH on its ribose 
rings, so it is much less susceptible to hydrolysis. DNA has been 
recovered from fossil remains over a million years old. 


CH: 3 
Seyi C Ny N4 NH 
A 
mC mC m’G 
HC 
N 
NH» NH 
CH 
y, yt i y, SN 
Aa C1 
N N N N 
mA mA 


Think-Pair-Share Question: 

From the four different results one can figure out that the overall 
sequence should read: 

TACCGTTAGGCTTGCATCC and the original DNA segment 
would have a sequence of GGATGCAAGCCTAACGGTA 


Chapter 12 


l. 


Cloning: vector, endonuclease, ligase, linkers, 

PCR: DNA polymerase, primer 

RNAi: dicer, siRNA, RISC, passenger strand, guide strand 
CRISPR-Cas9: endonuclease, gRNA, PAM 


. Protein: endonuclease, ligase, DNA polymerase, reverse 


transcriptase, dicer, RISC, Cas9 
RNA: siRNA, passenger strand, guide strand, gRNA 


DNA: vector, linkers, BAC, YAC, HAC, primer, expressed 
sequence tags, PAM 


3. 


oOo ona 


10. 


11. 
12; 


13; 


Linear and circular DNA molecules consisting of one or more 
copies of just the genomic DNA fragment; linear and circular 
DNA molecules consisting of one or more copies of just the 
vector DNA; linear and circular DNA molecules containing 
one or more copies of both the genomic DNA fragment and 
plasmid DNA. 


* -GAATTCCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGC- 


GAATTC GGATCC GTCGAC GCATGC 
EcoRI BamHI Sall Sphi 
CCCGGG TCTAGA CTGCAG 
Smal Xbal PstI 
a. AAGCTTGAGCTCGAGATCTAGATCGAT 
HindIII Xhol Xbal 
Sacl Bell Clal 
b. 
Vector: Hindi: 5'-A.. 2.2... -gap-. eee CGAT-3': Clal 
3'-TTCGA. .. -gap-.... 6.2.6. TA-5’ 


Fragment: HindIII: 5'-AGCTT(NNNN-etc-NNNN)AT-3' 
3'-A(NNNN-etc-NNNN) TAGC-5’ 


. N = 3480. 
. N = 10.4 million. 
. N = 367,665 


. 5'-ATGCCGTAGTCGATCAT and 


5'-ATGCTATCTGTCCTATG. 


PCR efficiency 
4,000,000 | 


3,500,000 
3,000,000 


95% 
2,500,000 


1,500,000 


Number of DNA copies 


1,000,000 


2,000,000 | 


85% 


75% 
0 5 10 15 20 

Cycle 
5'-GUUAUA-3' or 5'-UUAUAC-3' or 5'-UAUACA-3' 
-Thr-Met-Ile-Thr-Asn-Ser-Pro-Asp-Pro-Phe-Ile-His-Arg-Arg- 
Ala-Gly-Ile-Pro-Lys-Arg-Arg-Pro... 
The junction between B-galactosidase and the insert amino acid 
sequence is between Pro and Asp, so the first amino acid encoded 
by the insert is Asp. (The polylinker itself codes for Asp just at 
the BamHI site, but in constructing the fusion, this Asp and all 
of this downstream section of polylinker DNA is displaced to a 
position after the end of the insert.) 


5'(GJAATTCNGGNATGCAYCCNGGNAAR¢'Ty*GCNAGY 
TGG-TTYGTNGGGAATTCN- 


(Note: The underlined triplet AGY represents the middle 
Ser residue. Ser codons are either AGY or TCN [where Y = 
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14. 


15. 


16. 


17. 


18. 


19. 


pyrimidine and N = any base]; AGY was selected here so that 
the mutagenesis of this codon to a Cys codon [TGY] would 
involve only an A —> T change in the nucleotide sequence.) 


Because the middle Ser residue lies nearer to the 3’-end of this 
EcoRI fragment, the mutant primer for PCR amplification 
should encompass this end. That is, it should be the primer for 
the 3 —> 5’ strand of the EcoRI fragment: 


5'-NNNGAATTCCCN,.ACR.AACCAR,.CAN,GC-3’ 


where the mutated Ser —> Cys triplet is underlined, NNN = 
several extra bases at the 5'-end of the primer to place the EcoRI 
site internal, N, = the nucleotide complementary to the nucleo- 
tide at this position in the 5’-53’ strand, and R, = the pyrimidine 
complementary to the purine at this position in the 5’—3’ strand. 


The number of sequence possibilities for a polymer is given by x”, 
where x is the number of different monomer types and y is the 
number of monomers in the oligomers. Thus, for RNA oligomers 
15 nucleotides long: x” = 4!5 = 1,073,741,824. For pentapeptides, 
x? = 20° = 3,200,000. 

a.l6 b.64 c. N= 4*in which N is the number of possible 
oligonucleotides and x is the length of the oligonucleotide. 


1.2E+12 | 


1E+12 


(N) 


8E+11 


6E+11 


4E+11 


Number of possibilities 


2E+11 


0 5 10 15 
Oligonucleotide length (x) 


20 


See Figure 12.19. You would need a GAL4-deficient yeast strain 
expressing a fusion protein composed of the GAL4 DB domain 
fused with a cytoskeletal protein (such as actin or tubulin) to 
serve as the bait. These GAL4~ cells would be transformed with 
a cDNA library of human epithelial proteins constructed so that 
the human proteins were expressed as GAL4 TA fusion proteins; 
these fused proteins are the target. Interaction of a target protein 
with the cytoskeletal protein fused to the GAL4 DB domain will 
lead to GAL4-driven expression of the lacZ gene, whose presence 
can be revealed by testing for B-galactosidase activity. 


You would use a GALA4 negative yeast strain containing two 
plasmids. One plasmid coding for the DB domain of GALA4 
fused with protein A and one plasmid coding for the TA domain 
fused with protein B. You would grow this yeast strain in media 
containing high levels of Ca?* or in the absence or presence of 
Ca?*-channel opening agents (see Section 9.8). If proteins A and B 
interact the lacZ reporter gene will be activated. See Figure 12.19. 


See Figure 12.10 for isolation of mRNA and Figure 12.11 for prep- 
aration of a cDNA library from mRNA. The mRNA would be iso- 
lated, and the cDNA libraries would be prepared from two batches 
of yeast cells, one grown aerobically and one grown anaerobically. 
See Figure 12.12. Differently labeled single-stranded cDNA from 
separate libraries prepared from aerobically versus anaerobically 
grown yeast cells would be hybridized with a DNA microarray 
(gene chip) of yeast genes. 


20. 


2i; 


22. 


23. 
24. 


25. 


26. 


27. 


Abbreviated Answers to Problems A-19 


An antibody against hexokinase A could be used to screen the 
yeast cDNA library to identify a yeast colony expressing this pro- 
tein (see discussion in Section 12.3 on Screening cDNA Expres- 
sion Libraries with Antibodies). Once a sample of the putative 
yeast hexokinase A protein was isolated, its identity could be 
confirmed by peptide mass fingerprinting using mass spectrom- 
etry (see in Section 5.4 on “Peptide Mass Fingerprinting”). 


The experiment in problem 19 identified the cDNA clone for the 
protein, but this clone will not have the regulatory elements (pro- 
moter) necessary for the experiment at hand. Using the cDNA 
clone as a probe, the genomic clone for this nox gene could be 
identified in a yeast genomic library. Cloning and sequencing 

a genomic clone would identify putative promoter regions that 
could be fused to the coding region of green fluorescent protein 
(GFP) to create a reporter gene construct (see Figure 12.18) that 
would “light up” more and more as oxygen levels declined. 


AUGCAGGAGGGUGGCGAGAGGGGCCGAGAUUGG_ 
GUUUUACAGCUAUGCU- 


5'-AAUCGCAUUGCAGCUUC-3' 


a. 
NH, o 
A HO NHs NH 
eo one 
H H 
c U 


b. The U base is still attached to a deoxyribose. 


NH, 
N S HO NH» 
n Fy 2 NH 
A J 
< | j —— | 
N N 
H H N 


N H,O NB; N 
NH 2 2 NH 
DA 
N N 
H N NH, H N (0) 
G 


This purpose of this question is to encourage students to explore 
the extensive genome databases. Course instructors can adapt 
this question to a variety of purposes. 


Insertion of DNA at any of the following restriction sites would 
render cells harboring the recombinant plasmid ampicillin- 
sensitive: SspI, Scal, Pvul, PstI, and Ppal. 


The respective codons are Trp = UGG; Cys = UGC and UGU. 
Thus, changing only the third base G in the Trp codon to a 
pyrimidine (either C or U) would yield a Cys codon. In this case, 
there are now six differences in codon possibilities, so 2° = 64 
different oligonucleotides must be synthesized. 


Think-Pair-Share Question: 


— Using “classic” organic/medicinal chemistry, find/design an inhibi- 


tor for this enzyme. 


— Using RNAi technology with a specifically designed guide strand 


to target the RNA transcript of this gene. The guide strand should 
be complementary to the portion of the RNA transcript that con- 
tains the mutation. 


— Using any of the CRISPR-Cas9 approaches to either eliminate the 


gene or to alter/reverse the mutation back to the “normal” status. 
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A-20 


Abbreviated Answers to Problems 


Chapter 13 


1. Enzyme A has similar affinities for D-hexose carbohydrates, but 
a lesser affinity for D-ribose. Enzyme B has a higher affinity for 


D-ribose and a lesser affinity less for the D-hexoses. Enzyme B 
acts specifically on D-ribose. 


2. W/V max = 0.8. 
3. v = 91 uwmol/mL - sec. 


4. K, = 1.43 X 1075 M; Ka = 3 X 1074 M. Because k, is 20 times 
greater than k_,, the system behaves like a steady-state system. 


5. If the data are graphed as double-reciprocal Lineweaver—Burk plots: 


a. Vmax = 51 wmol/mL - sec and Km = 3.2 mM. 
b. Inhibitor (2) shows competitive inhibition with a Kj = 


2.13 mM. Inhibitor (3) shows noncompetitive inhibition with 


a Kı =4mM. 
6. For inhibitor A: K;’ = 283 mM and K; = 24.3 mM 
For inhibitor B: K;’ = 8 mM and K; = 13.1 mM 


[S] 


[S] 


[S] 


[S] 


[S] 


10. 
11. 


12; 


13. 


14. 


15, 


16. 


17. 


18. 


19. 


In (a), Vmax doubles, but Km is constant. 

In (b), Vmax halves, but K,, is constant. 

In (c), Vmax is constant, but the apparent Km increases. 
In (d), Vmax decreases, but Km is constant. 


In (e), Vmax decreases, Km decreases, but the ratio Km/ Vmax 18 
constant. 


. a. The slope is given by Ky,?/Vimax(Ks“/[A] + 1). 


b. y-intercept = ((Ky“/[A]) + 1)1/Vnax- 

c. The horizontal and vertical coordinates of the point of 
intersection are 1/[B] = — Km^/Ks^Km” and 
Lv = UVinax(1 — (Km/Ks4)). 


. Top left: (1) Competitive inhibition (I competes with S for bind- 


ing to E). (2) I binds to and forms a complex with S. 

Top right: (1) Pure noncompetitive inhibition. (2) Random, 

single-displacement bisubstrate reaction, where A doesn’t 

affect B binding, and vice versa. (Other possibilities include 

[3] Irreversible inhibition of E by I; [4] 1/v vs. 1/[S] plot at two 

different concentrations of enzyme, E.) 

Bottom left: (1) Mixed noncompetitive inhibition. (2) Ordered 

single-displacement bisubstrate mechanism. 

Bottom right: (1) Uncompetitive inhibition. (2) Double- 

displacement (ping-pong) bisubstrate mechanism. 

Clancy must drink 694 mL of wine, or about one 750-mL bottle. 

a. Ks = 5 uM, b. Km = 30 WM; c. keat = 5 X 108 sec™!; d. keal Km = 

1.67 X 108 M~'sec™!; e. Yes, because keat Km approaches the limit- 

ing value of 10? M~ !sec™!; f. Vmax = 1075 mol/mL - sec; g. [S] = 

90 uM; h. Vmax Would equal 2 X 10->mol/mL - sec, but Km = 

30 uM, as before. 

a. Vmax = 132 umol mL™! sec™!; b. Keat = 44,000 sec™!; c. keal Km = 

24.4 X 108 M`! sec™!; d. Yes! Keat/Km actually exceeds the theoreti- 

cal limit of 10° M~! sec™! in this problem; e. The rate at which E 

encounters S; the ultimate limit is the rate of diffusion of S. 

a. Vmax = 1.6 umol mL~! sec™!; b. v = 1.45 umol mL! sec7™!; 

C. Koat Km = 1.6 X 108 M~! sec™!; d. Yes. 

a. Vmax = 6 umol mL~! sec™!; b. keat = 1.2 X 106 sec™!; c. Keat/Km = 

1 X 108 M`! sec™!; d. Yes! keat Km approaches the theoretical limit 

of 10° M`! sec™!; e. The rate at which E encounters S; the ultimate 

limit is the rate of diffusion of S. 

a. Vinax = 64 umol mL-~! sec7!; b. keat = 1.28 X 104 seem}; 

C. Keat Km = 1.4 X 108 M`! sec™!; d. Yes! Keat/Km approaches the 

theoretical limit of 10° M~! sec™!. 

a. Vmax = 120 mmol mL~! sec™!; b. v = 104.7 mmol mL~! sec™!; 

C. Keat Km = 3.64 X 108 M`! sec™!; d. Yes! keat Km approaches the 

theoretical limit of 10° M~! sec™!. 

For enzyme A: catalytic efficiency = 450 (s mM)~!. 

For enzyme B: catalytic efficiency = 576 (s mM)"!. 

a. Starting from Vmax! = k[Er] and Vmax? = k_\[Ez] for the 
maximal rates of the forward and reverse reactions, respec- 
tively, and the Michaelis constants Km5=(k-1 + k2)/kı and 
Ky? = (k-11 + ko)/k-2 for S and P, respectively, 

y= (Vox [S]/Km5) T V max [PV Km? /(1 + [S Kn" + [P\/Km?) 

b. At equilibrium, v = 0 and Keq = [PI/[S] = Vmax! Kin?/Vinax' Kin’. 

a. S is the preferred substrate; its Km is smaller than the Km for T, 
so a lower [S] will give v = Vmax/2, compared with [T]. 

b. keat Km defines catalytic efficiency. Keat Km for S = 
2 X 107 M`'sec™!; Keat/Km for T = 4 X 10’ M~'sec™!, so the 
enzyme is a more efficient catalysis with T as substrate. 
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20. As lemon juice is rich in organic acids, the acidity of the lemon 
juice may denature proteins in the sliced apples, including the 
polyphenol oxidase, and may render this enzyme inactive. 


21. Maximum enzymatic activity is obtained (and detected) if [S] is 
much higher than Km. 


22. Increased temperature may denature an enzyme and render it 
inactive. 


23. a. Because the enzyme shows maximal activity at or below 
40°C, it seems more like a mammalian enzyme than a plant 
enzyme, which would be expected to have a broader tempera- 
ture optimum because plants experience a broader range of 
temperatures. 


b. An enzyme from a thermophilic bacterium growing at 80°C 
would show an activity versus temperature profile similar to 
his one but shifted much farther to the right. 


Think-Pair-Share Question: 


a. 1,200,000 
1,000,000 | 
800,000 
600,000 


400,000 


Mol mass (g/mol) 


200,000 


Reaction time (min) 


1.8E- 05 
1.6E - 05 
1.4E -05 


1.2E-05 


1.0E - 05 


Moles 


8.0E —- 06 


ee alee 


D 
6.0E — 06 


4.0E — 06 


2.0E — 06 


0.0E + 00 


Reaction time (min) 


1/mol mass (mol/g) 


0.0E + 00 


Reaction time (min) 


Abbreviated Answers to Problems A-21 


b. The number of moles can be calculated by dividing the amount 
of poly- or oligosaccharide (0.5 g) by its molecular mass. As the 
polysaccharide is degraded, the molecular mass declines, but the 
amount of poly- or oligosaccharide remains at 0.5 g. As a function of 
the reaction time one divides 0.5 g with the appropriate molecular 
mass to obtain the number of moles as a function of the reaction time. 
The velocity of the reaction can be determined as Dmoles/Dtime. One 
can plot the number of moles as a function of reaction time and 
obtain the velocity of the reaction from the slope of the best-fitting 
line through the data points (linear regression). This is shown in the 
next chart with linear regression results shown for polysaccharide D. 


c. The slopes in chart c) follow the same pattern as the slopes for chart b). 
This indicates that the slope of the plot of (1/mol mass) as a function 
of reaction time is another, relative, measure of the velocity of the 
reaction. This is useful of the amount of polysaccharide is not known. 


Chapter 14 


1. a. Nucleophilic attack by an imidazole nitrogen of His*’ on the 
—CH),— carbon of the chloromethyl group of TPCK cova- 
lently inactivates chymotrypsin. (See The Student Solutions 
Manual, Study Guide and Problems Book for structures.) 


b. TPCK is specific for chymotrypsin because the phenyl ring of 
the phenylalanine residue interacts effectively with the binding 
pocket of the chymotrypsin active site. This positions the chlo- 
romethyl group to react with His”. 


c. Replacement of the phenylalanine residue of TPCK with argi- 
nine or lysine produces reagents that are specific for trypsin. 


2. a. The structures proposed by Craik et al., 1987 (Science 
237:905—907) are shown here. (If you look up this reference, 
note that the letters A and B of the figure legend for Figure 3 
of this article actually refer to parts B and A, respectively. 
Reverse either the letters in the figure or the letters in the 
figure legend and it will make sense.) 


A \ 357 Wild type 
| P= 
56 N H 
ASH ee 
>Q 
\ 
Ne 
102 O His” 
Asp t Su ~S 
H N 
`v ‘EN 
SX 195 
H Ser 
Ser?!4 ( Si 
oS 
B \ 57 D102N mutant 
N— 
56 l id 
~ `H J 
“0 
A H His” 
Asn??? N~ `N N 
\ Uy 
p \ 
Ser! 
—H \ 
O 


Po 
Ser ( H = 


b. Asn!” of the mutant enzyme can serve only as a hydrogen- 
bond donor to His”. It is unable to act as a hydrogen-bond 
acceptor, as aspartate does in native trypsin. As a result, His?’ 
is unable to act as a general base in transferring a proton from 
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Ser!®°, This presumably accounts for the diminished activity of 
the mutant trypsin. 


3. Oxygen is more electronegative than sulfur, and serine is capable 


of making stronger hydrogen bonds. Electrostatics and mechanis- 


tic preferences may explain the preferences for serine and cysteine. 


See Gisdon, F. J., et al., 2022. Serine and cysteine peptidases: so 
similar, yet different. How the active-site electrostatics facilitates 
different reaction mechanisms. J. Phys. Chem. B 126:4035—4048. 
The presence of the methyl group in proximity to the alcohol 
within threonine may lead to steric hindrance in enzymatic reac- 
tions involving the alcohol functional group. 


Lisinopril 


T O Benzapril 


6. The enzyme-catalyzed rate is given by: 


v= k [ES] = k,'[EXt] 
[EX*] 
[ES] 
AG,! = —RTIn Kt 


t — p—-AGe}/RT 
K, e Ge 


Ki= 


[EX*] = K [ES] = e747 [ES] 
So k, [ES] = k,’e~SGe!/87 [ES] 
or k, = k'e7AGet/RT 
Similarly, for the TETT reaction: 


= —AGy{/RT 
k, k, e is 


HO NHs 
fe) 
HN 
HN 


4. a. The usual explanation for the inhibitory properties of 


pepstatin is that the central amino acid, statine, mimics the 
tetrahedral amide hydrate transition state of a good pepsin 
substrate with its unique hydroxyl group. 


b. Pepsin and other aspartic proteases prefer to cleave peptide 
chains between a pair of hydrophobic residues, whereas HIV-1 
protease preferentially cleaves a Tyr-Pro amide bond. Because 
pepstatin more closely fits the profile of a pepsin substrate, we 
would surmise that it is a better inhibitor of pepsin than of 
HIV-1 protease. In fact, pepstatin is a potent inhibitor of 
pepsin (Kı < 1 nm) but only a moderately good inhibitor of 
HIV-1 protease, with a K; of about 1 uM. 


Perindopril 


Enalapril 


ie) 


Captopril 


O 


N 


Assuming that k,’ = k,' 


Then 


| |S 


= eAGI-AG RT 


. This problem is solved best by using the equation derived in 


problem 4. 
k, 


K€ — elAGut-AGet)RT 

Using this equation, we can show that the difference in activation 
energies for the uncatalyzed and catalyzed hydrolysis reactions 
(AG, — AG,) is 92 kJ/mol. 
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8. Trypsin catalyzes the conversion of chymotrypsinogen to 
a-chymotrypsin, and chymotrypsin itself catalyzes the conver- 
sion of z-chymotrypsin to a-chymotrypsin. 

9. The mechanism suggested by Lipscomb is a general base 


pathway in which Glu?” promotes the attack of water on the 
carbonyl carbon of the substrate: 


Carboxypeptidase 
i i 
H—C— COO H—C—COoo- 
BHS | | : 18. 
Zu ~N | => Zu?" “La 
O=C— CHR’ ~“O—C— CHR’ 
aN | Sy 
O—H H O—H H 
O | a 19, 
| PH m l 
Gla?™— C— O7 Gla? — C— O—H 


| 20. 


R 
| 21 
H—C—COO- 
NH, 22. 
+ 
Ow 
NC— CHR’ 
a 
on Ñ 23. 


10. Using the equation derived in problem 4, it is possible to calcu- 
late the ratio k./k, as 1.86 X 10. 

If the concentration of free enzyme is equal to the concentra- 
tion of enzyme-—ligand complex, the concentration of ligand 
would be 1 x 1077 M. This corresponds to 1.67 X 10? liters 
per molecule. 


11. 


12. a. The dissociation constant Ks for compound B is low indi- 
cating that the compound can bind into the active site of 
the enzyme (stronger than compound A). However, as com- 
pound B is lacking the carbonyl group (C=O) it can not 
proceed with the reaction as shown for compound A. This 


leads to higher values for Ky for compound B. 


b. When A and B are mixed together with the enzyme, com- 
pound B could act as a competitive inhibitor for the 
enzyme’s action on compound A. 


13. AG = —154 kJ/mol. This value is intermediate between noncova- 
lent forces (H bonds are typically 10 to 30 kJ/mol) and covalent 


bonds (300 to 400 kJ/mol). 
14. a. The total amount of enzyme is given by [E]; = [E] + [EL]. 
b. Kp represents the concentration of L needed to achieve 50% 

saturation of the enzyme similar as for Ky. 

à = 0.63A 
As discussed in Section 6.4g organic solvents, detergents or 
agents like urea or guanidine hydrochloride can denature 
proteins and render enzymes inactive. 


15. 
16. 


17. Assuming that the rate of gluconeogenesis would be equal to 


the slowest step in the pathway, with k = 2 X 107%/sec, and 


24. 


25: 


26. 


Abbreviated Answers to Problems A-23 


assuming a cellular concentration of fructose-1,6-bisphosphatase 
of 0.031 (see Table 18.2), the rate of sugar synthesis would be 

2 X 10°-*/sec X 0.031 mM, or 6.2 X 107? mM/sec. Assuming 

a total human cell volume of 40 L, this corresponds to 2.48 X 
1073 moles glucose/sec. Assuming 30 ATP per glucose (under 
cellular conditions) and 50 kJ/mole of ATP hydrolyzed, we find 
that the rate of energy production is (2.48 X 1073) x (30 ATP/ 
glucose) X (30.5 kJ/mole), or 2.269 X 107% kJ/sec. Converting 
to kilocalories and years, we find that the time to synthesize the 
needed glucose with an uncatalyzed reaction would be 2.8 X 10! 
years, or roughly 200,000 times the lifetime of the universe so far. 


The correct answer is c. The stomach is a very acidic environ- 
ment, whereas the small intestine is slightly alkaline. The lower 
part of the figure shows that enzyme X has optimal activity near 
pH 2, whereas enzyme Y works best at a pH near 8. 


The correct answer is a. The two enzymes have nonoverlapping 
pH ranges, so it is highly unlikely that they could operate in the 
same place at the same time. 


The correct answer is b. Only enzyme A has a temperature range 
that encompasses human body temperature (37°C). 


. The correct answer is d. The activities of the two enzymes over- 


lap between 40° and 50°C. 


The correct answer is c. We have no information on the pH 
behavior of enzymes A and B, nor on the behavior of X and Y as 
a function of temperature. The only answer that is appropriate to 
the data shown is c. 


The only possible answer is b, because a “leveling off” implies 
that all the enzyme is saturated with S. 


The correct answer is c. In order to bring the substrate into the 
transition state, an enzyme must enjoy environmental conditions 
that favor catalysis. There is no activity apparent for enzyme Y 
below pH 5.5. 


Serine and aspartic proteases both exhibit general acid—base 
catalysis, but only the serine proteases employ covalent cataly- 
sis. The steps that lead to peptide bond cleavage are similar in 
the two mechanisms: Proton abstraction forms a nucleophilic 
species that attacks the carbonyl carbon of the bond to be 
cleaved, and then protonation of the amide nitrogen facilitates 
bond cleavage. 


Observation of burst kinetics with model substrates (esters) 
could provide evidence for a covalent mechanism. X-ray crystal- 
lography and NMR could provide evidence for involvement of 
low-barrier hydrogen bonds. (See Kidd, R. E., 1999. Protein 
Science 8:410-417.) 


The published structure of the PAF-acetylhydrolase reveals that 


Ser?” is the active site residue that initiates the esterase reaction: 
eer R R 
Sere \ \ 
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27. The serine-B-lactamase mechanism involves nucleophilic attack 
by a Ser-O: as shown below: 


aS. CH, 


“COOH 


sx 


er 


Metallo-B-lactamases require zinc for activity and catalysis. 

In the recently discovered New Delhi metallo-B-lactamase, a 
hydroxyl anion that is stabilized by two Zn* ions cleaves the 
B-lactam ring by nucleophilic attack. The mechanism is similar 
to the serine-B-lactamase, except that the stabilized hydroxyl 
group is the attacking nucleophile. 


H 
His | His 
ae. 
His —7Zn - Zn-Cys 
| mo” 
His Asp 


28. Dorothee Kern and her colleagues have used a combination 
of nuclear magnetic resonance measurements, pre-steady-state 
kinetics, molecular dynamics simulations, and X-ray crystallogra- 
phy of active complexes to show that an essential Mg?* ion in the 
active site plays two very different roles in catalysis of the kinase 
reaction. During the reaction, the active site of the enzyme adopts 
both open and closed conformations. The open conformation is 
solvent-accessible, and in the closed conformation the y-phosphoryl 
group is positioned for transfer. 


The single Mg”* ion at the active site plays two very different 
roles: it increases the rate of phosphoryl transfer in the active site 
by a factor of 10°, an effect that depends on limiting active site 
dynamics (motions) so that the active site geometry favors phos- 
phoryl transfer. The second role of Mg?" is essentially opposite 
in nature—it greatly accelerates the lid-opening conformation 
change (by a factor of 103), thus facilitating product release at 
the end of the reaction cycle. 


Think-Pair-Share Question: 
a. Step 1: Using Figure 14.20 as a model, show the covalent binding 


of the alcohol group of serine to the substrate creating an ester 
bond between the substrate and serine as shown for structure B. 


Structure B Ho NxN 

O 

O 
Qy [N 
À — (4 
HO O 
Phe N Phe 
His CHo His CHo 


Step 2: Using Figure 14.12 as a model show the function of the 
histidine residue to catalyze the base hydrolysis of the ester bond 
between serine and the remaining portion of the substrate; com- 
pleting the hydrolysis of the substrate. 


b. The presence of the phenylalanine residue in the active site may 
affect the substrate preference of the enzyme: preferring substrates 
with a hydrophobic portion close to the ester bond. Structure A 
may be too hydrophilic to bind in the active site, while structure C, 
though hydrophobic, may be too “bulky” to fit into the active site. 
Structure B may have the optimal hydrophobicity and size to 
interact with the phenylalanine residue of the active site and pro- 
mote optimal binding and chemical reactivity. 


Chapter 15 


1. a. As [P] rises, the rate of P formation shows an apparent decline, 
as enzyme-catalyzed conversion of P —> S becomes more likely. 


b. Availability of substrates and cofactors. 

c. Changes in [enzyme] due to enzyme synthesis and degradation. 
d. Covalent modification. 

e. Allosteric regulation. 


f. Specialized controls, such as zymogen activation, isozyme 
variability, and modulator protein influences. 
2. Induction will lead to an increase in the amount of enzyme and 
thus an increase in the metabolic activity due to the enzyme. 
Repression will have the opposite effect. 


3. If the proteases were made as active enzymes they would digest 
the proteins inside the cells of the pancreas, destroying them 
from within. 


4. By regulating the first (or an early) enzyme of a pathway the 
decision to commit the substrate to the entire pathway, and the 
synthesis of product F, can be made before intermediates B, C, 
D, and E accumulate, thus saving the cell from the buildup of 
unneeded metabolites. 


5. Monod, Wyman, Changeux allosteric system: 


Lineweaver—Burk plot 


1/v 


1/[S] 
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Hanes—Woolf plot 
[S]/v 


[S] 


6. Using the curves for negative cooperativity, as shown in 


Figure 15.8, as a guide: 


Negative 
cooperativity 


[S] 


Michaelis- 


Lineweaver—Burk 
Menten 


Negative 
cooperativity 


Vis] 


Negative 
Hanes—Woolf cooperativity 
Michaelis— 


Menten 


z 


=| 


[S] 


7. Forn = 2.8, Yiungs = 0.98 and Y capillaries = 0.77. 


Forn = 1.0, Yiungs = 0.79 and Y capillaries = 0.61. 


Thus, with an n of 2.8 and a Psp of 26 torr, hemoglobin becomes 
almost fully saturated with O, in the lungs and drops to 77% 
saturation in resting tissue, a change of 21%. If n of 1.0 anda 
Ps of 26 torr, hemoglobin would become only 79% saturated 
with O; in the lungs and would drop to 61% in resting tissue, a 
change of 18%. The difference in hemoglobin O, saturation 
conditions between the values for n = 2.8 and 1.0 (21% — 18%, 
or 3% saturation) seems small, but note that the potential for 

O, delivery (98% saturation versus 79% saturation) is large and 
becomes crucial when pO, in actively metabolizing tissue falls 
below 40 torr. 

. More glycogen phosphorylase will be in the glycogen phosphory- 
lase a (more active) form, but caffeine promotes the less active T 
conformation of glycogen phosphorylase. 


11. 


12. 


13. 


14. 


15. 


16. 


I7. 


18. 


19. 


20. 


A-25 


Abbreviated Answers to Problems 


.8.7L 
10. 


Yes. Any triphosphate nucleotide can be converted into its cyclic 
counterpart as long as the appropriate cyclase enzyme is present. 


Over time, stored erythrocytes will metabolize 2,3-BPG via the 
pathway of glycolysis. If [BPG] drops, hemoglobin may bind O, 
with such great affinity that it will not be released to the tissues 
(see Figure 15.29). The patient receiving a transfusion of [BPG]- 
depleted blood may actually suffocate. 


a. By definition, when [Pj] = Kos, v = 0.5 Vmax- 


b. In the presence of AMP, at [Pi] = Kos, v = 0.85 Vinax 
(from Figure 15.15c). 


c. In the presence of ATP, at [P:i] = Kos, v = 0.12 Vmax 
(from Figure 15.15b). 


The inhibition of phosphodiesterase would lead to more cAMP 
and greater activation of the cascade that promotes the activity 
of glycogen phosphorylase, leading to more glycogen hydrolysis. 
If G,-GTPase activity is inactivated, the interaction between 
G, and adenylyl cyclase will be persistent, adenylyl cyclase will 
be active, [CAMP] will rise, and glycogen levels will fall because 
glycogen phosphorylase will be predominantly in the active, 
phosphorylated a form. 


Inhibition of adenylyl cyclase leads to less cAMP and less acti- 
vation of glycogen phosphorylase, the enzyme that hydrolyzes 
glycogen according to the cascade shown in Figure 15.18. Inhibi- 
tion of the GTPase activity of the G-protein keeps it in a perma- 
nently active state, promoting the activity of adenylyl cyclase and 
the production of cAMP. This will lead to more activation of 
glycogen phosphorylase and less glycogen storage. 


An excess of a negatively cooperative allosteric inhibitor could 
never completely shut down the enzyme. Because the enzyme 
leads to several essential products, inhibition by one product 
might starve the cell for the others. 


If both enzymes were active in an uncontrolled fashion, every 
round of kinase/phosphatase activity would lead to a net result 
of ATP + HO —> ADP + P.. 


a. -R(R/K)X(S*/T*)- 
b. -KRKQIAVRGL- 


Ligand binding is the basis of allosteric regulation, and allosteric 
effectors are common metabolites whose concentrations reflect 
prevailing cellular conditions. Through reversible binding of such 
ligands, enzymatic activity can be adjusted to the momentary needs 
of the cell. On the other hand, allosteric regulation is inevitably 
determined by the amounts of allosteric effectors at any moment, 
which can be disadvantageous. Covalent modification, like allo- 
steric regulation, is also rapid and reversible because the converter 
enzymes act catalytically. Furthermore, covalent modification 
allows cells to escape allosteric regulation by covalently locking the 
modified enzyme in an active (or inactive) state, regardless of effec- 
tor concentrations. One disadvantage is that covalent modification 
systems are often elaborate cascades that require many participants. 


Sickle-cell anemia is the consequence of Hb S polymerization 
through hydrophobic contacts between the side chain of Val@6 and 
a pocket in the EF corner of B-subunits. Potential drugs might tar- 
get this interaction directly or indirectly. For example, a drug might 
compete with Val6 side chain for binding in the EF corner. Alter- 
natively, a useful drug might alter the conformation of Hb S such 
that the EF corner was no longer accessible or accommodating to 
the Val86 side chain. Another possibility might be to create drugs 
that deter the polymerization process in other ways through altera- 
tions in the surface properties between Hb S molecules. 
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21. Nitric oxide is covalently attached to Cys93. Thus, the interac- The wedge-shaped protein monomers (red) assemble into trimers, 
tion is not reversible binding, as in allosteric regulation. On the but the alternative conformation for the monomer (square, green) 
other hand, the reaction of NO: with this cysteine residue is forms tetramers. The substrate or allosteric regulator (yellow) 
apparently spontaneous, and no converter enzyme is needed to binds only to the square conformation, and its binding prevents 
add or remove it. Thus, the regulation of covalent modification the square from adopting the wedge conformation. Thus, if S or 
that is afforded by converter enzyme involvement is obviated. the allosteric regulator is present, equilibrium favors a greater 
The Hb: NO: interaction illustrates that nature does not always population of square tetramers among the morpheein ensemble 
neatly fit the definitions that we create. at the expense of round trimers. 

22. Pro: Lactate is a metabolic indicator of the need for oxygen; it 24. | 
binds to Mb at a distinct site (the allosteric site?), and it lowers Lo ATP or CIP 
Mb’s affinity for O. 


Con: Mb is a monomeric protein. Traditionally, allosteric phe- 
nomena have been considered the realm of oligomeric proteins. 
How then is Mb “allosteric”? Since a ligand-induced conforma- 
tional change in a monomeric protein can affect binding of 


0.8 


Glutamine 


“substrate” (i.e., O2), the definition of allostery may need to be 0.6 
broadened. v A 
23. a max 
E Do 0.4 
s/N ~S 
L4 HH s 
va E SESS 
Va 0 3 6 9 
ee 


25. Negative cooperativity in NAD* binding to glyceraldehyde-3-P 


dehydrogenase: 
Where affinity for NADH is 
0-H-¢ 
NAD* Induced 
\ _Sontormamon 
"Mange - 7 
26. Mass spectrometry is a prominent tool in proteomics. Proteomics to GDP + P;, and the affinity of G,-GDP for adenylyl cyclase is 
would reveal the presence of acetylated forms of metabolic enzymes. low, so it dissociates and adenylyl cyclase is no longer activated. 
27. Hyperventilation results in decreased blood pCO», which in Residual cAMP will be converted to 5'-AMP by phosphodiester- 
turn leads to an increase in pH. Thus, the affinity of Hb for O» ase, and the catalytic subunits of protein kinase A will be bound 
increases (less O, will be released to tissues). Hypoventilation has again by the regulatory subunits, whose cAMP ligands have dis- 
exactly the opposite effects on pCO, and pH; thus, hypoventila- sociated. When protein kinase A becomes inactive, phosphorylase 
tion leads to a diminished affinity of Hb for O, (more O; will be kinase will revert to the unphosphorylated, inactive form through 
released to the tissues). loss of phosphoryl groups. Phosphoprotein phosphatase 1 will 


act on glycogen phosphorylase a, removing the phosphoryl group 
from Ser! and thereby converting glycogen phosphorylase to the 
less active, allosterically regulated b form. 


28. When hormone disappears, the hormone: receptor complex disso- 
ciates, the GTPase activity of the G, subunit cleaves bound GTP 
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Think-Pair-Share Question: 


the metabolism of the intracellular parasite may generate elevated 
levels of lactic acid, acidifying the intracellular environment. 

O, has lower affinity for hemoglobin in acidic environments, creat- 
ing deoxyhemoglobin 

in its deoxy state Hb S is more likely to form insoluble complexes 
with other Hb S molecules leading to the deformation of the RBC 
the deformed RBC is more likely to be destroyed by the liver or 
spleen; parasite and all. 


Chapter 16 


1. The pronghorn antelope is truly a remarkable animal, with numer- 


ous specially evolved anatomical and molecular features. These 
include a large windpipe (to draw in more oxygen and exhale 

more carbon dioxide), lungs that are three times the size of those 
of comparable animals (such as goats), and lung alveoli with five 
times the surface area so that oxygen can diffuse more rapidly into 
the capillaries. The blood contains larger numbers of red blood 
cells and thus more hemoglobin. The skeletal and heart muscles are 
likewise adapted for speed and endurance. The heart is three times 
the size of that of comparable animals and pumps a proportionally 
larger volume of blood per contraction. Significantly, the muscles 
contain much larger numbers of energy-producing mitochondria, 
and the muscle fibers themselves are shorter and thus designed for 
faster contractions. All these characteristics enable the pronghorn 
antelope to run at a speed nearly twice the top speed of a thor- 
oughbred racehorse and to sustain such speed for up to 1 hour. 


. Refer to Figure 16.9. The step in which the myosin head confor- 


mation change occurs, is the step that should be blocked by 
B,y-methylene-ATP, because hydrolysis of ATP should occur in 
this step and B,y-methylene-ATP is nonhydrolyzable. 


. Phosphocreatine is synthesized from creatine (via creatine kinase) 


primarily in muscle mitochondria (where ATP is readily generated) 
and then transported to the sarcoplasm, where it can act as an 
ATP buffer. The creatine kinase reaction in the sarcoplasm yields 
the product creatine, which is transported back into the mito- 
chondria to complete the cycle. Like a number of other mitochon- 
drial proteins, the expression of mitochondrial creatine kinase is 
directed by mitochondrial DNA, whereas sarcoplasmic creatine 
kinase is derived from information encoded in nuclear DNA. 


. Note in step 3 of Figure 16.9 that it is ATP that stimulates dissocia- 
tion of myosin heads from the actin filaments—the dissociation of 
the cross-bridge complex. When ATP levels decline (as happens rap- 
idly after death), large numbers of myosin heads are unable to dis- 
sociate from actin and the muscle becomes stiff and unable to relax. 


. The skeletal muscles of the average adult male have a cross- 
sectional area of approximately 35,000 cm. The gluteus maxi- 
mus muscles represent approximately 300 cm? of this total. 
Assuming 4 kg of maximal tension per square centimeter of 
cross-sectional area, one calculates a total tension for the gluteus 
maximus of 1200 kg (as stated in the problem). The same calcu- 
lation shows that the total tension that could be developed by all 
the muscles in the body is 140,000 kg (or 154 tons)! 


. Taking 55,000 g/mol divided by 6.02 X 107/mol and by 


1.3 g/mL, one obtains a volume of 7.03 X 107% mL. Assume a 
sphere and use the volume of a sphere (V = (4/3)ar?) to obtain a 
radius of 25.6 A. The diameter of the tubulin dimer (see Figure 
16.12) is 8 nm, or 80 A, and two times the radius we calculated 
here is approximately 51 A, a reasonable value by comparison. 


15. 


16. 


I7. 


18. 


Abbreviated Answers to Problems A-27 


. The presence of the two different subunits allows for the differ- 


ential binding of the two heads of kinesin during its step-by-step 
movement over the tubulin structure. The conformational dif- 
ferences in the head groups of kinesin based upon the presence 
of ADP or ATP could lead to different affinities for either the a 
or b subunit. This could allow for the step-by-step movement of 
kinesin over tubulin that may not have happened if tubulin was 
built from one monomer or more than two monomers. 


. A liver cell is 20,000 nm long, which would correspond to 


5000 tubulin monomers if using the value of Figure 16.12, 
and about 7800 tubulin monomers using the value of 25.6 A 
calculated in problem 6. 


. 4inches (length of the giant axon) = 10.16 cm. Movement at 2 to 


5 p/sec would correspond to a time of 5.6 to 14 hours to traverse 
this distance. 


. 88,900 s or 24.7 hours; 11,112,500 units of tubulin. 
. 14 nm is 140 A, which would be approximately 93 residues of a 


coiled coil. 


. The presence of LC 1 is needed for ATPase activity, as 


mentioned in Section 16.2b. 


. Sequence (b). In sequence (a) the fourth amino acid is hydro- 


phobic but not small. In sequence (c) the first amino acid is not 
hydrophobic. In sequence (d) the sixth amino acid is not charged. 


. Using Equation 9.1, one can calculate a AG of 17,100 J/mol for 


this calcium gradient. 


Using Equation 3.8, one can calculate a cellular AG of 
—48,600 J/mol for ATP hydrolysis. 


Using equation 3.8 the chart can be constructed. 


S 
§ -15 
> 
= [9 
D -20 
< 
-25 
-30 


17,100 J are required to transport 1 mole of calcium ions. Two 
moles of calcium would require 34,200 J, and three would cost 
51,300 J. Thus, the gradient of problem 10 would provide enough 
energy to drive the transport of two calcium ions per ATP 
hydrolyzed. 


Ratio [ADP]/ [ATP] 


Energy (or work) = force X distance. If 1 ATP is hydrolyzed 
per step and the cellular value of AG” for ATP hydrolysis is 
—50 kJ/mol, then the calculation of force exerted by a motor is 
force = 50 kJ/mol/(step size). The SI unit of force is the newton, 
and | newton - meter = 1 J. 

For the kinesin-1 motor, 50 kJ/mol/(8 x 107? m) = 6.25 x 10! 
newtons. 

For the myosin-V motor, 50 kJ/mol/(36 X 107°? m) = 1.39 x 10!? 
newtons. 

For the dynein motor, 50 kJ/mol/(28 X 10~° m) = 1.78 x 10! 
newtons. 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


A-28 


Abbreviated Answers to Problems 


19. Perhaps the simplest way to view this problem is to consider 
the potential energy change for lifting a 10-kg weight 0.4 m. 
E = mgh = (10 kg) - (9.8 m/sec?) - (0.4 m) = 39.2 J. Now, if 
1 ATP is expended per myosin step along an actin filament, 
39.2 J/(50 kJ/mol) = 7.8 X 10-4 mol. 


Then, (7.8 X 1074 mol) - (6.02 x 1073) = 4.72 X 10% molecules 
of ATP expended, one per step. So there must be 4.72 X 107° 
myosin steps along actin. However, the stepping process is almost 
certainly not 100% efficient. As shown in the Critical Develop- 
ments in Biochemistry box “Molecular “Tweezers’ of Light Take 
the Measure of a Muscle Fiber’s Force” in Section 16.2, a step 
size of 11 nm against a force of 4 pN would correspond to an 
energy expended per myosin per step of 4.4 X 107% J. With 

this assumed energy expended per step, we can calculate 

39 J/(4.4 X 107% J) = 8.86 x 10” steps total. 


If we take the step size value for skeletal myosin from the Critical 
Developments in Biochemistry box “Molecular “Tweezers’ 

of Light Take the Measure of a Muscle Fiber’s Force” in 

Section 16.2, as 11 nm, one myosin head would have to take 

0.4 m/11 X 10~°m or 3.64 X 10’ steps to raise the weight a dis- 
tance of 0.4 m. Taking (8.86 X 10% steps total)/(3.64 X 107 steps 
per myosin) = 2.43 X 10'3 myosin heads involved per 11-nm step. 


20. All these are smooth muscle except for the diaphragm. (See 
Attp://chanteur.net/contribulindex. htm#http://chanteur. net! 
contribulcJ Mdiaph.htm for an explanation of the common mis- 
conception that the diaphragm is smooth muscle.) 


21. 1, a; 2, e; 3, d; 4, c; 5, b. 


22. This exercise is left to the student and will presumably be 
different for every student. 


23. The 80 A step of the kinesin motor is a 0.008 um step. To cover 
10.16 cm would require 12.7 million steps. 


24. The correct answer is d, although each answer is reason- 
able. ATP is needed for several processes involved in muscle 
relaxation. Salt imbalances can also prevent normal muscle 
function, and interrupted blood flow could prevent efficient 
delivery of oxygen needed for ATP production during cellular 
respiration. 


25. The correct answer is a. Understanding the inheritance pattern 
of sex-linked traits is essential. Males always inherit sex-linked 
traits (on the X chromosome) from their mothers. In addition, 
sex-linked traits are much more commonly expressed in males 
because they have only one X chromosome. 


26. This exercise is left to the student and will presumably be 
different for every student. 

27. This exercise is left to the student and will presumably be 
different for every student. 

28. This exercise is left to the student and will presumably be 
different for every student. 

Think-Pair-Share Question: 

Suggested answer: 

The polysaccharide should be a heteropolysaccharide to provide 

diversity in structural elements capable of secondary interactions 

with the domains of the motor proteins. 


The polysaccharide should be long enough to span enough length 
within or outside a cell. 


One could suggest the requirement the polysaccharide to be charged 
(containing uronic acids or phosphorylated or sulfate units) to 
expand its secondary interaction possibilities. Basic polysaccharides 
are polyalcohols and secondary interactions are limited to hydrogen 
bonds. Charged functional groups on a polysaccharide could provide 
more diverse and stronger interaction capabilities with motor 
proteins. 


The polysaccharide should be capable of bundling with other poly- 
saccharides to make sturdy fiber-like structures. 


The motor proteins should have hydrophilic polysaccharide-binding 
domains to interact with the hydrophilic polysaccharide structure. 


Chapter 17 
1. 6.5 X 10! (6.5 trillion) people. 
2. Consult Table 17.2. 


3. During daytime plants are considered photoautotrophic because 
of the photosynthesis process. During nighttime their metabo- 
lism is chemoautotrophic because of the aerobic oxidation of 
organic compounds. 


4. QO, HO, and CO). 
5. See Section 17.3. 


6. Catabolic pathways increase entropy and lower the enthalpy of 
biomolecules while anabolic pathways decrease entropy and store 
enthalpy in biomolecules. 


7. Carbon is more electronegative than hydrogen, but less electro- 
negative than oxygen. In chemical bonds between carbon and 
hydrogen the electrons are closer to carbon than to hydrogen. In 
chemical bonds between carbon and oxygen the valence electrons 
are closer to oxygen than to carbon. In chemical bonds between 
carbon and oxygen, carbon “loses” its electrons to oxygen. 

In CH, the oxidation number of carbon is —4 while in CO, the 
oxidation number of carbon is +4. 


8. Consult Figure 17.5. 


9. Consult Corresponding Pathways of Catabolism and Anabolism 
Differ in Important Ways, Section 17.3f. See also Figure 17.8. 


10. See The ATP Cycle, Section 17.3g; NAD* Collects Electrons 
Released in Catabolism, Section 17.3h; and NADPH Provides the 
Reducing Power for Anabolic Processes, Section 17.31. 


11. NADP* has a phosphate group bonded to the 2’ carbon of the 
ribose unit. This difference in structure allows for the differential 
use of NAD*/NADH for catabolic reactions and NADP*/ 
NADPH for anabolic reactions. 


12. In terms of quickness of response, the order is allosteric regu- 
lation > covalent modification > enzyme synthesis and deg- 
radation. See The Student Solutions Manual, Study Guide and 
Problems Book for further discussions. 


13. See “Metabolic Pathways Are Compartmentalized within Cells,” 
in Section 17.4, and discussions in The Student Solutions Manual, 
Study Guide and Problems Book. 


14. Many answers are possible here. Some examples: Large 
numbers of metabolites in tissues and fluids; great diversity 
of structure of biomolecules; need for analytical methods that 
can detect and distinguish many different metabolites, often 
at very low concentrations; need to understand relationships 
between certain metabolites; time and cost required to analyze 
and quantitate many metabolites in a tissue or fluid. 
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15. 


16. 


17. 


18. 


19; 


20. 


21. 


Mass spectrometry provides great sensitivity for detection of 
metabolites. NMR offers a variety of methods for resolving and 
discriminating metabolites in complex mixtures. Other compari- 
sons and contrasts are discussed in the references at the end of 
the chapter. 


See Figure 13.23; A Deeper Look box “The Coenzymes of the 
Pyruvate Dehydrogenase Complex,” in Section 19.2, and the 
A Deeper Look box “Activation of Vitamin Bj,” in Section 23.3. 


The genome is the entire hereditary information in an organism, 
as encoded in its DNA or (for some viruses) RNA. The tran- 
scriptome is the set of all messenger RNA molecules (transcripts) 
produced in one cell or a population of cells under a defined set 
of conditions. The proteome is the entire complement of proteins 
produced by a genome, cell, tissue, or organism under a defined 
set of conditions. The metabolome is the complete set of low 
molecular weight metabolites present in an organism or excreted 
by it under a given set of physiological conditions. The fluxome 
is all of the metabolic fluxes in a metabolic network. It represents 
systems-level information on those cellular processes defined by 
the metabolic network. 


A mechanism for liver alcohol dehydrogenase: 


H 
H,C t o g e H "a 
a a Gea 3 M 

| A So 
H 

H/T 

a N 

a | NH, 

<y 
i 


TCA cycle: in mitochondria; converts acetate units to CO; plus 
NADH and FADH. 


Glycolysis: in cytosol; converts glucose to pyruvate. 


Oxidative phosphorylation: in mitochondria; uses electrons (from 
NADH and FADH.) to produce ATP. 


Fatty acid synthesis: in cytosol; uses acetate units to synthesize 
fatty acids. 


Most discussions of ocean sequestration of CO, are devoted 
to capture of CO, by phytoplankton or injection of CO, deep 
in the ocean. However, the A Deeper Look box “Calcium 
Carbonate—A Biological Sink for CO,” in Section 17.1 offers 
another possibility—carbon sequestration in the shells of 
corals, mollusks, and crustaceans. This approach may be 
feasible. However, there are indications that these classes of 
sea organisms are already suffering from global warming and 
from pollution of ocean water. 


32P and *°S both decay via beta particle emission. A beta particle 
is merely an electron emitted by a neutron in the nucleus. Thus, 
beta decay does not affect the atomic mass, but it does convert a 
neutron to a proton. Thus, beta emission changes **P to ?S and 
converts *S to 35C]: 

32p —__, Bo F 325, tin = 14.3 days. 

35S —> B~ + Cl, tın = 87.1 days. 
The decay equation is a first-order decay equation: 


AlAo = e70- Bitin 


Abbreviated Answers to Problems A-29 


Thus, after 100 days of decay, the fraction of **P remaining would 
be 0.786% and the fraction of 35S remaining would be 45%. 


22. DNA, RNA, nucleotides, some classes of glycerophospholipids, 
hexosamines and related molecules. 

23. a. Chemoautotroph 
b. Photoautotroph 
c. Chemoheterotroph 

24. The correct answer is b. Obligate anaerobes can survive only in 
the absence of oxygen. 

25. The correct answer is c, mass spectrometry. 

26. Consult the PSAFE entry for alcohol dehydroghenase. 

Think-Pair-Share Question: 

Other alternative metabolic diagrams that could be considered: 


Oxygen-centric: focused on the increase or decrease in carbon- 
bonded oxygen atoms (this leaves out the addition of oxygen atoms 
from phosphate or sulfate functional groups). 


Nitrogen-centric: focused on the increase or decrease in carbon- 
bonded nitrogen atoms. 


a. carbon-centric: A — E — G + H — CO, 
b. electron-centric: A —> C —> D — E 


c. oxygen-centric: A —> B — C — D — E — F—> 
G + H — CO, 


d. nitrogen-centric: A —> B — G — NH; 


Chapter 18 


1. a. Phosphoglucoisomerase, fructose bisphosphate aldolase, triose 
phosphate isomerase, glyceraldehyde-3-P dehydrogenase, phos- 
phoglycerate mutase, and enolase. 


b. Hexokinase/glucokinase, phosphofructokinase, phosphoglyc- 
erate kinase, pyruvate kinase, and lactate dehydrogenase. 


c. Hexokinase and phosphofructokinase. 
d. Phosphoglycerate kinase and pyruvate kinase. 


e. According to Equation 3.8, reactions in which the number 
of reactant molecules differs from the number of product 
molecules exhibit a strong dependence on concentration. 
That is, such reactions are extremely sensitive to changes 
in concentration. Using this criterion, we can predict from 
Table 19.1 that the free energy changes of the fructose 
bisphosphate aldolase and glyceraldehyde-3-P dehydroge- 
nase reactions will be strongly influenced by changes in 
concentration. 


f. Reactions that occur with AG near zero operate at or near 
equilibrium. See Table 18.1 and Figure 18.22. 


2. The carboxyl carbon of pyruvate derives from carbons 3 and 4 
of glucose. The keto carbon of pyruvate derives from carbons 
2 and 5 of glucose. The methyl carbon of pyruvate is obtained 
from carbons 1 and 6 of glucose. 

3. Increased [ATP], [citrate], or [glucose-6-phosphate] inhibits 
glycolysis. Increased [AMP], [fructose-1,6-bisphosphate], or 
[fructose-2,6-bisphosphate] stimulates glycolysis. 

4. BDAEC 


5. See The Student Solutions Manual, Study Guide and Problems 
Book for discussion. 


6. e) a) d) b) c) 
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A-30 Abbreviated Answers to Problems 


7. The mechanisms for fructose bisphosphate aldolase and carbon | of 1,3-bisphosphoglycerate. The label will be lost in the 
glyceraldehyde-3-P dehydrogenase are shown in Figures 18.12 next reaction, and no other glycolytic intermediates will be directly 
and 18.13, respectively. labeled. (Once the label is incorporated into ATP, it will also show 

8. The relevant reactions of galactose metabolism are shown in up in glucose-6-P, fructose-6-P, and fructose-1, 6-bisphosphate.) 
Figure 18.24. See The Student Solutions Manual, Study Guide 11. Oxidation occurs in the conversion of glyceraldehyde-3-phosphate 
and Problems Book for mechanisms. to 3-phosphoglycerate and NAD* is the oxidizing agent. 

9. Iodoacetic acid would be expected to alkylate the reactive 12. The sucrose phosphorylase reaction leads to glucose-6-P without 
active-site cysteine that is vital to the glyceraldehyde-3-P the need for the hexokinase reaction. Direct production offers the 
dehydrogenase reaction. This alkylation would irreversibly obvious advantage of saving a molecule of ATP. 
inactivate the enzyme. 13. All of the kinases involved in glycolysis, as well as enolase, 

10. Ignoring the possibility that °P might be incorporated into ATP, are activated by Mg?* ion. A Mg?* deficiency could lead to 
the only reaction of glycolysis that utilizes P; is glyceraldehyde-3-P reduced activity for some or all of these enzymes. However, 
dehydrogenase, which converts glyceraldehyde-3-P to other systemic effects of a Mg?” deficiency might cause even 
1,3-bisphosphoglycerate. **P; would label the phosphate at more serious problems. 


14. Through enediol isomerism, D-glucose, D-fructose and D-mannose 
are interchangeable. D-galactose is related to D-tagatose through 
enediol isomerism. 


O HO. HO O 
\ \ X 
CH CH CHOH CH CH 
H—C—OH C — OH C=O HO—C HO—C—H 
HO—C—H x HO—C—H x HO—C—H ~ HO—C—H x HO—C—H 
NT x © Sra. =o =, 
H—C—OH H—C—OH H — C — OH H — C — OH H — C — OH 
H— C — OH H—C—OH H—C—OH H — C — OH H — C — OH 
CHOH CHOH CHOH CHOH CHOH 
D-glucose D-fructose D-mannose 
O HO, 
\ X 
CH CH CHOH 
H — C — OH C — OH C=O 
HO—C—H —— Ss HO—C—H — S HO—C—H 
> x= e 
HO—C—H HO—C—H HO—C—H 
H—C—OH H—C—OH H—C—OH 
CH,OH CHOH CHOH 
D-galactose D-tagatose 
15. [GA— 3P]/[DHAPFP] should be 0.052 or about 1/19. 19. a. -19.1 kJ/mol. 
16. +1.08 kJ/mol. b. Keg = 1651. 
17. a. —31.7 kJ/mol. c. [ATP]/[ADP] = 13.8. 
b. Keg = 194,850. 20. AG” = —33.9 kJ/mol. 
c. [Pyr]/[PEP] = 24,356. 21. An 8% increase in [ATP] changes the [AMP] concentration to 
18. a. —13.8 kJ/mol. 20 uM (from 5 uM). 


b. Ką = 211. 
c. [FBP]/[F-6-P] = 528. 
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A-31 


Abbreviated Answers to Problems 


22. CH, — OH 24. Pyruvate kinase deficiency primarily affects red blood cells, 
o x which lack mitochondria and can produce ATP only via 
ov HN — Lyg glycolysis. Absence of pyruvate kinase reduces the production of 
CH, —O—PO,2- ATP by glycolysis in red cells, which in turn reduces the activity 
of the Na,K-ATPase (see Chapter 9). This results in alterations 
Formation of Schiff base of electrolyte (Na* and K*) concentrations, which leads to 
intermediate cellular distortion, rigidity, and dehydration. Compromised red 
blood cells are destroyed by the spleen and liver. 
25. As discussed in Section 18.7a diets rich in fructose will be 
processed by the liver and the carbohydrate will be turned 
CH, — OH into fatty acids and stored as triglycerides in the adipose tissues. 
aa Lyf One can fool the customer, but not our biochemistry! 
nf | 26. The best answer is d. PFK is more active at low ATP than at high 
T Sy H ATP, and F-2,6-bisP activates the enzyme. 
—B— = 
| CH; — O —PO,? 27. Hexokinase is inhibited by high concentrations of glucose-6-P, 
H Borohydride reduction so glycolysis would probably stop at such a high level of 
of Schiff base intermediate glucose-6-P. Also, the increased concentration of glucose-6-P 
would change the cellular AG of the hexokinase reaction to 
approximately —22,000 J/mol. 
28. This exercise is left to the student and will presumably be 
different for every student. 
‘ae — OH 29. This exercise is left to the student and will presumably be 
H—C—N— REN is ey student. 
| 30. This exercise is left to the student and will presumably be 
H different for every student. 
CHo — O —PO,* Think-Pair-Share Question: 
Stable (trapped) E-S derivative 14CH;—CH,OH would be obtained if C1 or C6 in D-glucose were 
labeled. CH;—'*CH,OH would be obtained if C2 or C5 in D-glucose 
| were labeled. '*CO, would be obtained if C3 or C4 in D-glucose were 
labeled. The same results would have been obtained if D-mannose or 
Degradation of enzyme D-fructose were used as starting molecules. 
(acid hydrolysis) 
CH, — OH 
H— C —N — Lys 
H 
CH, — OH 
N°-dihydroxypropylt-lysine 
23. UDPGlc Glc-1-P Gal-1-P UDPGal 
ky || ka ks || k hy || k4 ke || k 
ko ks, 
E E e UDPGlc ———__ E-UMP ° Glc-1-P E-UMP E-UMP e Gal-l-P —————__ E-UDPGal E 
kg ks 
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A-32 


Abbreviated Answers to Problems 


Chapter 19 
1. Glutamate enters the TCA cycle via a transamination to form 


a-ketoglutarate. The y-carbon of glutamate entering the cycle 

is equivalent to the methyl carbon of an entering acetate. Thus, 
no radioactivity from such a label would be lost in the first or 
second cycle, but in each subsequent cycle, 50% of the total label 
would be lost (see Figure 19.15). 


. CFABED 


3. d) f) b) c) a) e) 


. NAD* activates pyruvate dehydrogenase and isocitrate 
dehydrogenase and thus would increase TCA cycle activity. 

If NAD* increases at the expense of NADH, the resulting 
decrease in NADH likewise would activate the cycle by stimu- 
lating citrate synthase and a-ketoglutarate dehydrogenase. 
ATP inhibits pyruvate dehydrogenase, citrate synthase, and 
isocitrate dehydrogenase; reducing the ATP concentration thus 
would activate the cycle. Isocitrate is not a regulator of the 
cycle, but increasing its concentration would mimic an increase 
in acetate flux through the cycle and increase overall cycle 
activity. 

. For most enzymes that are regulated by phosphorylation, the 
covalent binding of a phosphate group at a distant site induces 

a conformation change at the active site that either activates or 
inhibits the enzyme activity. On the other hand, X-ray crystallo- 
graphic studies reveal that the phosphorylated and unphosphory- 
lated forms of isocitrate dehydrogenase share identical structures 
with only small (and probably insignificant) conformation changes 
at Ser!!3, the locus of phosphorylation. What phosphorylation 
does do is block isocitrate binding (with no effect on the binding 
affinity of NADP*). As shown in Figure 2 of the paper cited in 
the problem (Barford, D., 1991. Molecular mechanisms for the 
control of enzymic activity by protein phosphorylation. Biochim- 
ica et Biophysica Acta 1133:55-62), the y-carboxyl group of bound 
isocitrate forms a hydrogen bond with the hydroxyl group of 
Ser!!3. Phosphorylation apparently prevents isocitrate binding by 
a combination of a loss of the crucial H bond between substrate 
and enzyme and by repulsive electrostatic and steric effects. 


. Amechanism for the first step of the a-ketoglutarate dehydroge- 
nase reaction: 


CHT CH,—COO ~ CH,—CH,—COO™ 
LY 6G. AR x co 
mo H'B = Tee OH 
| COO- C=O 
= 
TPP a-KG O 
Y 
s qa CHə—COO` s CHə—CHə—COO7 
I y C—OH <— pe C—OH 
N+ N A 
| -P 
Covalent TPP intermediate B 


s 


. Aconitase is inhibited by fluorocitrate, the product of citrate 


synthase action on fluoroacetate. In a tissue where inhibition 
has occurred, all TCA cycle metabolites should be reduced 
in concentration. Fluorocitrate would replace citrate, and the 
concentrations of isocitrate and all subsequent metabolites 
would be reduced because of aconitase inhibition. 


. FADH; is colorless, but FAD is yellow, with a maximal absor- 


bance at 450 nm. Succinate dehydrogenase could be conveniently 
assayed by measuring the decrease in absorbance in a solution of 
the flavoenzyme and succinate. 


9. The central (C-3) carbon of citrate is reduced, and an adjacent 


10. 


carbon is oxidized in the aconitase reaction. The carbon bearing 
the hydroxyl group is obviously oxidized by isocitrate dehydroge- 
nase. In the a-ketoglutarate dehydrogenase reaction, the depart- 
ing carbon atom and the carbon adjacent to it are both oxidized. 
Both of the —CH,— carbons of succinate are oxidized in the 
succinate dehydrogenase reaction. Of these four molecules, all 
but citrate undergo a net oxidation in the TCA cycle. 


As shown in Figure 19.6, a basic amino acid residue of citrate 
synthase removes a proton from acetyl-coenzyme A to initiate 
the reaction. If D were to replace the H at this position, then D 
would have become part of the acid-base equilibria inside the cell 
and could be found associated with a wide variety of compounds. 


. Several TCA metabolite analogs are known, including malonate, an 


analog of succinate, and 3-nitro-2-S-hydroxypropionate, the anion 
of which is a transition-state analog for the fumarase reaction. 


1 i 
COO H + + + 
A H 
| HÍ NS _ H a i 
CH ° (0) > (0) 
(0070 UC UG 
~ooc” X OH “ooc” \ OH 
Malonate 3-Nitro-2-S- Transition-state 
(succinate analog) hydroxypropionate analog for fumarase 
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12. A mechanism for pyruvate decarboxylase: 


Abbreviated Answers to Problems A-33 


` ® ' CH 
R/S yo B -© r rn US : 3 
{g — \ + cLo>n3-@ 2 \ ay —OH 
i > | cote R / JN 
R" R" r S 
Pyruvate 
COs 
E CH; aa 


R" 


Resonance-stabilized carbanion on substrate 


Ht. 


Hydroxyethyl-TPP 


13. Ca? can be predicted to activate both the pyruvate dehydrogenase 
complex and the TCA cycle as they lead to the production of ATP. 
Contracting muscles need ATP for contraction to occur. 


14. [isocitrate]/[citrate] = 0.1. When [isocitrate] = 0.03 mM, 
[citrate] = 0.3 mM. 


15. '4CO, incorporation into TCA via the pyruvate carboxylase reac- 
tion would label the —CH,—COOH carboxyl carbon in oxaloac- 
etate. When this entered the TCA cycle, the labeled carbon would 
survive only to the a-ketoglutarate dehydrogenase reaction, 
where it would be eliminated as 4CO3. 


18. 5 


HC — COO7 


| o=- 


Acetaldehyde 


16. '4C incorporated in a reversed TCA cycle would label the two car- 
boxyl carbons of oxaloacetate in the first pass through the cycle. 
One of these would be eliminated in its second pass through the 
cycle. The other would persist for more than two cycles and would 
be eliminated slowly as methyl carbons in acetyl-CoA in the 
reversed citrate synthase reaction. 


17. The labeling pattern would be the same as if methyl-labeled acetyl- 
CoA was fed to the conventional TCA cycle (see Figure 19.15). 


@-3 : eK = (COO OH 
Glyoxalate | 
L > 


Acetyl-CoA 


CoASH 


Malate 
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A-34 Abbreviated Answers to Problems 


19. Mitochondrion Cytosol 
Pyruvate SSS SSS SSeS Se Slee Seas SSS a SS SSS Seas Pyruvate 
A 
NADPH + @® 
a NADP* 
Malate ---------{--)S--4------------ Malate 
A 
on So 
NAD* 
Malate Malate 
dehydrogenase dehydrogenase 
NADH + @ 
NADH + @® 
y 
Oxaloacetate Oxaloacetate 
A 
y 
-C HO 
pn io ae ADP + P; Acetyl-CoA 
Citrate ATP-citrate 
synthase lyase 
A 
o ATP CoA 
y 
Citrate- ---------- P -oL -»> Citrate 


Mitochondrial 
membrane 


21. This exercise is left to the student. Review of the calculation of 
oxidation numbers should be a prerequisite for answering this 
problem. 


22. This exercise is left to the student. 


20. For the malate dehydrogenase reaction, 

Malate + NAD* == oxaloacetate + NADH + H* 

with the value of AG” being +30 kJ/mol. Then 
AG” = -RT In Ka 


(8.314 J/mol - K)(298)1 ( []x ) 23. 2R, 3R-fluorocitrate is converted by aconitase to 2-fluoro-cis- 
= —(8. mol - n 


[20][2.2 x 10-4] aconitate. This intermediate then rotates 180 degrees in the 
active site. Addition of hydroxide at the C4 position is fol- 


= 30,000 J/mol = In (x/4.4 x 10-3) lowed by double bond migration from C3-C4 to C2-C3, to form 
2478 J/mol 4-hydroxy-trans-aconitate. This product remains tightly bound at 
—12.1 = In (x/4.4 X 1073) the aconitase active site, inactivating the enzyme. Studies by 
_ Lauble et al. (The reaction of fluorocitrate with aconitase and 
x = —RT In Keg 


the crystal structure of the enzyme-inhibitor complex. Proceed- 


x = [oxaloacetate] = 0.024 uM 


Using the dimensions given in the problem, one can calculate a 
mitochondrial volume of 1.57 X 107! L. (0.024 x 1076 M) - 


(1.57 X 107" L) - (6.02 x 1073 molecules/mole) = 22.7, or about 


23 molecules of OAA in a mitochondrion (“pOAA” = 7.62). 


ings of the National Academy of Sciences U.S.A. 93:13699-13703 
(1996)) have shown that the inhibitory product can be displaced 
by a 10°-fold excess of isocitrate. 


. The post translational modification of cysteine residues by fuma- 


rate is termed a succination and proceeds via a Michael addition: 


(0) 
Ht Q (0) 
HO. N 
OH HO YY HO 
lo ji p OH OH 
L ol l 
Cys Cys Cys 
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25. 


26. 
27. 


Only eight ATPs would be generated in the succinyl-CoA 
synthetase (and nucleoside diphosphate kinase) reaction of 
the TCA cycle itself. 

d is false. Succinyl-CoA is an inhibitor of citrate synthase. 
This exercise is left to the student and will presumably be 
different for every student. 


Think-Pair-Share Question: 


C!8O, and NADD (isotopically labeled NADH where H is replaced 
with a D). 


Chapter 20 


1. 


.a.al. UQ 


The cytochrome couple is the acceptor, and the donor is the 
(bound) FAD/FADH) couple, because the cytochrome couple 
has a higher (more positive) reduction potential. 


AG,’ = 0.254 V — 0.02 V* 
*This is a typical value for enzyme-bound [FAD]. 
AG” = —-n¥A€,’ = —43.4 kJ/mol. 
a2. FAD 
b2. UQH, 


a3. cytochrome c (Fe**) 


b. bl. FMNH, b3. a-ketoglutarate 


3. AE,’ = —0.03 V; AG” = 5790 J/mol. 


4. 


The net result of the reduction of NAD* is that a proton is 
consumed. Adding an appropriate term to Equation 3.33 
yields: 


€ =€,/ — RT In [H*] + (RTInF In (fox]/[red]) 


. Cyanide acts primarily via binding to cytochrome a3, and 


the amount of cytochrome a; in the body is much lower than 
the amount of hemoglobin. Nitrite anion is an effective anti- 
dote for cyanide poisoning because of its unique ability to 
oxidize ferrohemoglobin to ferrihemoglobin, a form of hemo- 
globin that competes very effectively with cytochrome a; for 
cyanide. The amount of ferrohemoglobin needed to neutralize 
an otherwise lethal dose of cyanide is small compared with 
the total amount of hemoglobin in the body. Even though 

a small amount of hemoglobin is sacrificed by sequestering 
the cyanide in this manner, a “lethal dose” of cyanide can be 
neutralized in this manner without adversely affecting oxygen 
transport. 


. You should advise the wealthy investor that she should decline 


this request for financial backing. Uncouplers can indeed pro- 
duce dramatic weight loss, but they can also cause death. Dini- 
trophenol was actually marketed for weight loss at one time, 
but sales were discontinued when the potentially fatal nature 
of such “therapy” was fully appreciated. Weight loss is best 
achieved by simply making sure that the number of calories of 
food eaten is less than the number of calories metabolized. A 
person who metabolizes about 2000 kJ (about 500 kcal) more 
than he or she consumes every day will lose about a pound in 
about 8 days.) 


. The calculation in Section 20.4 assumes the same conditions 


as in this problem (1 pH unit gradient across the membrane 
and an electrical potential difference of 0.18 V). The transport 
of 1 mole of H* across such a membrane generates 23.3 kJ of 
energy. For three moles of H+, the energy yield is 69.9 kJ. Then, 
from Equation 3.8, we have: 


10. 


11 


12, 


13. 


14. 


15. 


16. 


I7. 


18. 


19, 


20. 


21. 


Abbreviated Answers to Problems A-35 
69,900 J/mol = 30,500 J/mol + RT In([ATP|/[ADP][P;]) 
39,400 J/mol = RT In ([ATP]/[ADP][P;]) 
[ATP]/[ADP][P;] = 4.36 xX 10° M`! at 37°C. 


In the absence of the proton gradient, this same ratio is 
7.25 X 1076 M`! at equilibrium! 


. The succinate/fumarate redox couple has the highest (that is, 


most positive) reduction potential of any of the couples in 
glycolysis and the TCA cycle. Thus, oxidation of succinate by 
NAD* would be very unfavorable in the thermodynamic sense. 
Oxidation of succinate by NAD*: 

AZ,’ ~= —0.35 V; AG” = 67,500 J/mol. 
On the other hand, oxidation of succinate is quite feasible using 
enzyme-bound FAD. 
Oxidation of succinate by [FAD]: 


AE. ~ 0, AG” ~ 0, depending on the exact reduction potential 
for bound FAD. 


. a. AG” = —48.2 kJ/mol 
b. AG” = —67.7 kJ/mol 
a. —73.3 kJ/mol. 


b. Keg = 7.1 X 102, 
c. [ATP|[ADP] = 19.7. 


. a. —151.5 kJ/mol. 


b. Keq = 3.54 X 10%. 


c. [ATP|/[ADP] = 8.8. 
a. —219 kJ/mol. 

b. Keg = 2.63 X 10°. 

c. [ATP|/[ADP] = 3340 
a. —217 kJ/mol. 

b. Keg = 1.08 X 10°. 

c. [ATP]|/[ADP] = 3340. 
a. —26 kJ/mol. 


b. Keg = 3.68 X 104. 
c. [NAD*}[NADH] = 1.48. 


a. AG” = —26 kJ/mol 
b. AG” = —6.6 kJ/mol 
a. Koa = 4.8 X 10°. 


b. +48 kJ/mol. 
c. [ATP]/[ADP] = 2.3. 
The two heme molecules are surrounded by different protein 


domains and this difference in microenvironment leads to differ- 
ent reduction potentials. 


This answer should be based on the information in Section 20.2 
and should be in the student’s own words. 


This answer should be based on the information in Section 20.2 
and should be in the student’s own words. 


Yes. As shown in Figure 17.1 H,O is oxidized to O, by photoau- 
totrophic organisms. 

Several mechanisms are possible. See, for example, the 
mechanisms in the following references: Pryor, W. A., and 
Porter, N. A., 1990. Suggested mechanisms for the production 
of 4-hydroxy-2-nonenal from the autoxidation of polyunsatu- 
rated fatty acids. Free Radical Biology and Medicine 8:541—543; 
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22: 


23. 


24. 


25. 


26. 


27. 


28. 


Abbreviated Answers to Problems 


and Niki, E., Yoshida, Y., Saito, Y., and Noguchi, N., 2005. 
Lipid peroxidation: Mechanisms, inhibition, and biological 
effects. Biochemical and Biophysical Research Communications 
338:668-676. 

A calculation similar to that in problem 20 in Chapter 19 yields 
about 15 H* in a typical mitochondrion. 

In the succinate dehydrogenase reaction, a two-electron trans- 
fer reaction (the conversion of succinate to fumarate) must 
be coupled to iron-sulfur centers, which can transfer only 

one electron at a time. FAD can carry on both one-electron 
and two-electron transfers and thus is ideally suited for this 
reaction. 

a. Fet + e7 — Fe?* 

b. S04 + 9 Ht + 8e —> HS- + 4 H,O 

c. 2 NO; + 12 H+ + 10 e7 —> N; + 6 H,O 

d.S + H+ + 2 e7 —» HS7 


The answer is a. The mitochondrial ATPase reaction proceeds 
via nucleophilic Sn2-type substitution. 


At 298 K, doubling the proton gradient would change 

AG by 1717 J/mole. The mitochondrial membrane potential 
would have to change by approximately 18 mV. The student 
should decide whether this change should be positive or 
negative. 


This exercise is left to the student and will presumably be 
different for every student. 

This exercise is left to the student and will presumably be 
different for every student. 


Think-Pair-Share Question: 


1. 


The movement of electrons through an increased number of com- 
plexes would significantly increase the probability that oxygen 
would encounter the flow of electrons with an increased probabil- 
ity for oxygen to be partially reduced and to produce harmful 
reactive oxygen species (ROS). 


. The direct reduction of oxygen by NADH 1s very exergonic (see 


the discussions in Section 20.2). This release of energy may raise 
temperatures inside the mitochondria or cells to dangerous levels. 


. The electron transport with accompanying proton translocation is 


not a discrete, step-wise process, but a continuous process. The 
number of complexes I, III, and IV are likely to be different from 
the number of ATP synthase complexes and ATP/ADP transloca- 
tors. This will ensure a balanced in and out flow of protons across 
the mitochondrial inner membrane. 


. The proton transport through the various complexes would have 


to involve amino acid residues with sufficiently different pX, val- 
ues to drive the movement of protons within and from one com- 
plex to the next and ultimately towards the final reduction 
reaction. Given the limited number of side chains capable of 
binding protons, the options to have proton transport through 
the various complexes may be limited. Proton transport through 
the various complexes could induce conformational changes 

in the various proteins involved, but could this be associated with 
movement of additional electrons across the inner membrane? 
Cytochrome c could still serve as a collector of electrons at the 
intermembrane space side of the mitochondria and still deliver 
electrons to the final complex for the reduction of oxygen. Could 


the excess amounts of electrons on cytochrome C on the inter- 
membrane space side drive ATP synthesis? It would require a 
totally different ATP synthase complex. 


Chapter 21 


1 


10. 


11. 


12, 


. AG for ATP synthesis = +50,336 J/mol. AS,’ = 
. Reduced plastoquinone = PQH;; oxidized plastoquinone = PQ; 


. A redox-sensitive ion like Fe?+ may absorb any electrons from 
y y 


photoexcited chlorophyll and short-circuit the induction of 
photosynthetic electron transport. 


. Efficiency = AG°’/(light energy input) = n¥AG,//(NhclA) = 


0.564. 


. AG” = —n#¥AE,’, so AG”'/—nF = ASE,'. n = 4 for 2 H,O —> 


4e +4H?* + Oo. ASE,’ = (ĉ (primary oxidant) — 
E G O/H20)) = +0.0648 V. Thus, €,’(primary oxidant) = 
0.881 V. 


+0.34 V. 


Reduced plastocyanin = PC(Cu‘); plastocyanin = PC(Cu**): 


PQH: + 2 H*sicoma + 2 PC(Cu2*+) —> 
PQ +2 PC(Cu*) +4 H” thylakoid lumen 


. Noncyclic photophosphorylation: 3 ATP would be synthesized 


from 8 quanta, which equates to 2.67 Av/ATP. Cyclic photophos- 
phorylation theoretically yields 2 H*/hv, so if 14 H* yield 3 ATP, 
7 hv yield 3 ATP, which equates to 2.33 hv/ATP. 


. In mitochondria, H* translocation leads to a decline in inter- 


membrane space pH and hence cytosolic pH, because the outer 
mitochondrial membrane is permeable to protons. Thus, the 
eukaryotic cytosol pH is at risk from mitochondrial proton trans- 
location. So, mitochondria rely on a greater membrane potential 
(Ay) and a smaller ApH to achieve the same proton-motive 

force. Because proton translocation in eukaryotic photosynthe- 
sis deposits H* into the thylakoid lumen, the cytosol does not 
experience any pH change and ApH is not a problem. Moreover, 
the light-induced efflux of Mg?* from the lumen diminishes Aw 
across the thylakoid membrane, so a greater contribution of ApH 
to the proton-motive force is warranted. 


. Replacement of Tyr by Phe would greatly diminish the possibility 


of e` transfer between water and P680*, limiting the ability of 
P680* to regain an e~ and return to the P680 ground state. 


. Assuming 12 c-subunits means 12 H* are needed to drive one 


turn of the c-subunit rotor and the synthesis of 3 ATP by the 
CF; part of the ATP synthase. If the R. viridis cytochrome 

bc; complex drives the translocation of 2 H*/e~, then 2 hv gives 
4 H*, and 6 hv gives 12 H” and thus 3 ATP (thus, 2 hv yield 

1 ATP). 

AG? = —n¥ATE,’ = —2(96,485 J/volt - mol) 
(€,’(1,3-BPG/Gal3P) — €,/(NADP*/NADPH)) = 

—(192,970 J/volt - mol)(—0.29 V — (—0.32 V)) = — 192,970 
(0.03) J/mol = —5,789 J/mol. 


Use the first eight reactions in Table 21.1 to show that: 


CO, + Ru-1,5-BP + 3 H,O + 2 ATP + 2 NADPH + 2 Ht —> 
glucose + 2 ADP + 4 P; + 2 NADPt 


Radioactivity will be found in C-1 of 3-phosphoglycerate; 

C-3 and C-4 of glucose; C-1 and C-2 of erythrose-4-P; C-3, C-4, 
and C-5 of sedoheptulose-1,7-bisP; and C-1, C-2, and C-3 of 
ribose-5-P. 
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13: 


14. 


15. 


16. 


17. 


18. 


It would be present in some P; and in all the reaction 
intermediates. 


Yes. The generation of oxygen and the movement of electrons 
during the light reactions create opportunities to generate ROS. 


Light induces three effects in chloroplasts: (1) pH increase in the 
stroma, (2) generation of reducing power (as ferredoxin), and 
(3) Mg% efflux from the thylakoid lumen. Key enzymes in the 
Calvin—Benson CO, fixation pathway are activated by one or 
more of these effects. In addition, rubisco activase is activated 
indirectly by light, and, in turn, activates rubisco. 


The proton-motive force would be dissipated by the production 
of heat and no ATP would be generated by photosynthesis. 


The following series of reactions accomplishes the conversion of 
2-phosphoglycolate to 3-phosphoglycerate: 


1. 2 phosphoglycolate + 2 HO —> 2 glycolate + 2 P; 

2. 2 glycolate + 2 O, —> 2 glyoxylate + 2 H,O, 

3. 2 glyoxylate + 2 serine —> 2 hydroxypyruvate + 2 glycine 

4. 2 glycine —> serine + CO, + NH; 

5. hydroxypyruvate + glutamate —> serine + a-ketoglutarate 
(an aminotransferase reaction) 

6. a-ketoglutarate + NH; + NADH* + H* —> glutamate + 
H,O* NAD* (the glutamate dehydrogenase reaction) 

7. Hydroxypyruvate + NADH* + Ht —> glycerate + NAD+ 

8. Glycerate + ATP —> 3-phosphoglycerate + ADP 


Net: 2 phosphoglycolate + 2 NADH* + 2H* + ATP + 


2 O2 + HO —> 3-phosphoglycerate + CO, + 2 P; + 
ADP + 2 H,0O, + 2 NAD* 
Considering the reactions involving water separately: 
1. ATP synthesis: 18 ADP + 18 P; —> 18 ATP + 18 H,O 
2. NADP* reduction and the photolysis of water: 
12 HO +12 NADP* —> 12 NADPH + 12 H* + 60, 
3. Overall reaction for hexose synthesis: 


6 CO, + 12 NADPH + 12 H+ + 18 ATP + 12 HO —> 
glucose + 12 NADP* + 60, + 18 ADP + 18 P; 
Net: 6 CO, + 6 H:O —> glucose + 6 O: 
Of the 12 waters consumed in O, production in reaction 2 and 
the 12 waters consumed in the reactions of the Calvin—Benson 
cycle (reaction 3), 18 are restored by H,O release in phosphoric 
anhydride bond formation in reaction 1. 


19. 


20. 


21. 


22. 
23. 
24. 


25; 


26. 


27. 


28. 


29. 


Abbreviated Answers to Problems A-37 


a. 2 NO,;- —> 2 NO, + O; and 2 SO,2- —> 2 SO;2> + O, 

b. No protons are generated that could help enhance the proton- 
motive force and the electrons lost by oxygen atoms are cap- 
tured by the nitrogen or sulfur atoms. 

a. If the number of c-subunits increases, the H*/ATP ratio will 
increase. 

b. If the number of c-subunits increases, Ap can be smaller. 
(More protons moving down a shallower Ap will yield the 
same overall AG for ATP synthesis.) 

a. 870-nm light: AG = —192 kJ/mol 

b. 700-nm light: AG = —259 kJ/mol 

c. 680-nm light: AG = —269 kJ/mol 

AG = +35.1 kJ 

AG = —950 kJ 

Phenolic radicals (tyrosine radicals) are more stable than alco- 

holic (serine or threonine) radicals. This makes phenolic radicals 

more likely to be generated and to be part of an electron trans- 
port system. 

Complex organic molecules and water serve as sources of 

electrons in the electron transport systems that ultimately 

generate ATP. 

Ideally, accessory light-harvesting pigments would absorb visible 

light of wavelengths that the chlorophylls do not absorb, that is, 

light in the 470 to 620 nm wavelength range. 


Thylakoid 
space 


——— E 
Dark Light Dark 


[Mg?*] 


Consult the Active Model for PSII on the PSAFE website and 
explore the nature of surface residues on PSH. 


Using the same PSAFE entry for PSII from question 28, 
determine from the Ramachandran plot whether the canonical 
od, y values of p = —45 degrees and y = —60 degrees are found 
in PSII a-helices. 
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Think-Pair-Share Question: 


a. 


CHOH 


C=O 


HO —CH 


HC — OH 


HC — OH 


CH,OH 


D-fructose 


CHOH 
C=0 
HO —CH 

HC — OH 


HC — OH 


CHOH 


D-fructose 


a ketose 


CHOH 


C= 


HO — CH 


HC — OH 


HC — OH 


HC — OH 


CHOH 
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CHOH 
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HC — OH 


HC — OH 


CHOH 
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a ketose 


Chapter 22 


1. The reactions that contribute to the equation discussed 
under Coupling with Hydrolysis of ATP and GTP Drives 
Gluconeogenesis in Section 22.1 are: 


2 Pyruvate + 2 H* + 2 HO —> glucose + O, 
2 NADH + 2 Ht + O, —>2 NAD* + 2 H,O 

4 ATP + 4 HO —-> 4 ADP + 4P; 

2 GTP + 2 HO —> 2 GDP + 2P; 


Summing these four reactions produces the equation discussed 
under Coupling with Hydrolysis of ATP and GTP Drives Gluco- 
neogenesis in Section 22.1. 


2. 1. it preserves NADPH levels for other anabolic reactions like 
the biosynthesis of fatty acids 


2. it depletes cytosolic NADH levels such that pyruvate is not 
reduced to lactate 


3. it depletes cytosolic NADH levels such that oxaloacetate is 
not reduced to malate 


3. This problem essentially involves consideration of the three 


unique steps of gluconeogenesis. The conversion of PEP to pyru- 


vate was shown in Chapter 18 to have a AG” of —31.7 kJ/mol. 
For the conversion of pyruvate to PEP, we need only add the 
conversion of a GTP to GDP + P; (equivalent to ATP to 
ADP + Pj) to the reverse reaction. Thus: 


Pyruvate —> PEP: AG” = +31.7 kJ/mol — 30.5 kJ/mol = 


1.2 kJ/mol 


Then, using Equation 3.8: 
AG = 1.2 kJ/mol + RT In ([PEP][ADPPIPiV[Py][ATPP) 


AG (in erythrocytes) = —31.96 kJ/mol 
In the case of the fructose-1,6-bisphosphatase reaction, A G” = 


—16.3 kJ/mol (see Equation 18.6). 
AG = —16.7 kJ/mol + RT In ({F-6-P][P;]/[F-1,6-BP]) 


AG (in erythrocytes) = —36.5 kJ/mol 


For the glucose-6-phosphatase reaction, AG” = —13.9 kJ/mol 
(see Table 3.2). AG = —13.9 kJ/mol + RT In ([Glu][P;]/ 
[G-6-P]) = —21.1 kJ/mol. From these AG values and those in 
Table 18.1, AG for gluconeogenesis = —87.7 kJ/mol. 


4. CO; is a good leaving group because of the chemical stability of 
CO, and the fact that it can react with H,O to make bicarbonate 


HCO; +H". 


5. Inhibition by 25 uM fructose-2,6-bisphosphate is approximately 
94% at 25 uM fructose-1,6-bisphosphate and approximately 44% 


at 100 uM fructose-1,6-bisphosphate. 


6. Glucagon promotes gluconeogenesis in liver cells. As an inhibitor 
of fructose-1,6-bisphosphatase, fructose-2,6-bisphosphate inhib- 
its the gluconeogenesis pathway. In this context one could predict 
glucagon would inhibit the activity of phosphofructokinase-2 to 


lower the levels of fructose-2,6-bisphophate. 


7. The hydrolysis of UDP-glucose to UDP and glucose is charac- 


terized by a AG” of —31.9 kJ/mol. This is more than sufficient 
to overcome the energetic cost of synthesizing a new glycosidic 


bond in a glycogen molecule. The net AG” for the glycogen syn- 


thase reaction is — 13.3 kJ/mol. 
8. Muscle cells do not contain glucose-6-phosphatase activity. 


9. 


10. 


11. 


12; 


13. 


14. 


15. 


16. 


A-39 


Abbreviated Answers to Problems 


According to Table 24.1, a 70-kg person possesses 1920 kJ of 
muscle glycogen. Without knowing how much of this is in fast- 
twitch muscle, we can simply use the fast-twitch data from the 

A Deeper Look box “Carbohydrate Utilization and Exercise” in 
Section 22.5 to calculate a rate of energy consumption. The plot 
in the A Deeper Look box shows that glycogen supplies are exhausted 
after 60 minutes of heavy exercise. Ignoring the curvature of the 
plot, 1920 kJ of energy consumed in 60 minutes corresponds to an 
energy consumption rate of 533 J/sec. 


Although other inhibitory processes might also occur, enzymes 
with mechanisms involving formation of Schiff base intermedi- 
ates with active-site lysine residues are likely to be inhibited by 
sodium borohydride (see Chapter 18, problem 18). The transal- 
dolase reaction of the pentose phosphate pathway involves this 
type of active-site intermediate and would be expected to be 
inhibited by sodium borohydride. 


Substrate availability determines the direction of the reversible 
reactions. 


Glycogen molecules do not have any free reducing ends, 
regardless of the size of the molecule. If branching occurs 
every 8 residues and each arm of the branch has 8 residues 
(or 16 per branch point), a glycogen molecule with 8000 
residues would have about 500 ends. If branching occurs 
every 12 residues, a glycogen molecule with 8000 residues 
would have about 334 ends. 


a. Increased fructose-1,6-bisphosphate would activate pyruvate 
kinase, stimulating glycolysis. 


b. Increased blood glucose would decrease gluconeogenesis and 
increase glycogen synthesis. 


c. Increased blood insulin inhibits gluconeogenesis and stimu- 
lates glycogen synthesis. 


d. Increased blood glucagon inhibits glycogen synthesis and stim- 
ulates glycogen breakdown. 


e. Because ATP inhibits both phosphofructokinase and pyruvate 
kinase, and because its level reflects the energy status of the 
cell, a decrease in tissue ATP would have the effect of stimulat- 
ing glycolysis. 

f. Increasing AMP would have the same effect as decreasing 
ATP—stimulation of glycolysis and inhibition of 
gluconeogenesis. 


g. Fructose-6-phosphate is not a regulatory molecule and 
decreases in its concentration would not markedly affect 
either glycolysis or gluconeogenesis (ignoring any effects due 
to decreased [G-6-P] as a consequence of decreased 
[F-6-P]). 

Yes. As discussed in Chapter 18, D-mannose, D-fructose and 

D-galactose can be converted into D-glucose-6-phosphate which 

could be a substrate for glycogen synthesis or could be used as a 

precursor for the pentose phosphate pathway. 


The glycerol backbone of the triacylglycerides could enter the 
gluconeogenesis pathway when converted into dihydroxyacetone- 
phosphate or glyceraldehyde-3-phosphate. 

At 298 K, assuming roughly equal concentrations of glycogen 


molecules of different lengths, the glucose-1-P concentration 
would be about 0.286 mM. 
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17. All four of these reactions begin with the formation of N-carboxybiotin: 


ADP T T P, p 
/ © 
ATP + HCO,” +. =0O—C—O—P—O- wi >~O Ze 
| = N NH 
O 
he Sw `R 
S R 
The carboxylation of B-methylcrotonyl-CoA occurs as follows: 
B k 
H 
HSC r T 
C=CH— C 
SCoA 
CH; o 


| | 
> -0OC—CH,—C=CH—C—SCoA 


The carboxylations of geranyl-CoA and urea are shown here: 


O 
A nohe hsc 


B gH H 
O 
xN N SCoA 
coo- 
s R 
H 
\ |l 
‘N—C—NH, 
7 
H 
i 
oO 
0-7 A > -OOC—N—C—NH, 
: H 
N* ONH 
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The mechanism of the carboxyltransferase reaction is a 
combination of the pyruvate carboxylase mechanism (shown in 
Figure 22.3) and a reverse propionyl-CoA carboxylase reaction. 


18. a. The mechanism of the acetolactate synthase reaction. 


Abbreviated Answers to Problems 


CH. i 
S 3 S | 
R. R 
s~ NI >E OH 
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Pyruvate 
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S rile s fs 
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\ I S \ Š \ 
N << N OH 
R’ | R’ | 
L R” p" 
GH, e — COOr 
|l 
O 
Condensation 


a-Acetolactate 


l 
4 


Valine, Leucine 
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b. The phosphoketolase reaction. 


CH,— OH 
s CH — OH s | 
JN 
\ ed C=O 
N 
R” 
Fructose-6-P 
Ht 
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R S R S | R S 
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Ko | :B— Ro | £0 — PO, 
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îi 
I + o,— c-e +— A JP co 
E 
i 


ri 
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19. The malate/aspartate shuttle provides oxaloacetate or NADH 21. Epalrestat and tolrestat do not resemble the transition state 

into the cytosol. for aldose reductase, and evidence from studies by Franklin 
Prendergast and others (Ehrig, T., Bohren, K., Prendergast, F., 
and Gabbay, K., 1994. Mechanism of aldose reductase inhibi- 
tion: Binding of NADP*/NADPH and alrestatin-like inhibitors. 
Biochemistry 33:7157-7165) show that these inhibitors probably 
bind to the enzyme: NADP* complex at a site other than the 
active site. 


20. Metformin both stimulates glucose uptake by peripheral tissues 
and enhances the binding of insulin to its receptors. Glipizide 
complements the actions of metformin by stimulating increased 
insulin secretion by the pancreas. 
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Abbreviated Answers to Problems A-43 


22; Glucose-6-P 


Fructose-6-P 


Fructose-1,6-bisP 


DHAP Glyceraldehyde-3-P 


TK 


Xylulose-5-P Erythrose-4-P 


\ 


Sedoheptulose-7-P Glyceraldehyde-3-P 


Ru-5-P 


KN 


Epimerase a 
VY 
Ribulose-5-P $ Xylulose-5-P Ribose-5-P 
Ru-5-P isomerase | 
VY 
Ribose-5-P 
23. Carbons 1, 2, and 5 of ribose-5-P will be labeled. 
24. 
6 NADP+} 6 NADPH 6 ee 6 NADPH 


Glucose-6-P ——— > 6-Phosphogluconate = Ribulose-5-P 


— > — > 
— > — > 
— > — > 
— > ———_ 
6 CO, 
Xylulose-5-P Ribose-5-P 
TK 


Glyceraldehyde-3-P Sedoheptulose-7-P Xylulose-5-P 
TA 


Fructose-6-P Erythrose-4-P 


TK 


Fructose-6-P Glyceraldehyde-3-P 


Gluconeogenesis Gluconeogenesis 


VY 
4 Glucose-6-P 1 Glucose-6-P 
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25. 


Abbreviated Answers to Problems 


3 NADP+ 3NADPH 3 NADP+ 3 NADPH 


Glucose-6-P —————— $ 6-Phosphogluconate —________ Ribulose-5-P 


EE 


3 COs 


—.||,_——_ 


Xylulose-5-P Ribose-5-P Xylulose-5-P 
TK 
Glyceraldehyde-3-P Sedoheptulose-7-P 
TA 
Fructose-6-P Erythrose-4-P 
TK 
- \ 
FFE Fructose-6-P Glyceraldehyde-3-P 
2 ADP | 
Fructose-1,6-bisP 
DHAP ———® Glyceraldehyde-3-P 
e ý 5 NAD+ 
Glycolysis 5 NADH 


26. Carbons 2 and 4 end up in carbons 1 and 3 of pyruvate. 


27. The reference by Hurley et al. in the “Further Reading” for this 
chapter suggests a mechanism based on structural data on gly- 
cogenin. Nucleophilic attack by Asp!® on the C-1 of glucose (in 
UDP-glucose) forms a covalent enzyme-substrate intermediate. 
Subsequent nucleophilic attack by the hydroxyl oxygen of Tyr!" 
of glycogenin produces the tyrosyl glucose that forms the foun- 
dation for synthesis of glycogen particles. 


The creatine kinase reaction 


Pyruvate 


+10 ATP 


28. During the first few seconds of exercise, existing stores of ATP 
are consumed and creatine phosphate then provides additional 
ATP via the creatine kinase reaction. For the next 90 seconds or 
so, anaerobic metabolism (conversion of glucose to lactate via 
glycolysis) provides energy. At this point, aerobic metabolism 
begins in earnest, delivering significant energy resources to sus- 
tain long-term exercise. 


29. Creatine phosphate provides ATP to muscle via the creatine 
kinase reaction: 


I -H c, CH, 
OPN d HN p? 
a Ye + ADP —> | (C—~N—CH,;—COO™ + ATP 
H.N* H,N 
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30. 


31. 


During periods of energy abundance, muscles store ATP equiva- 
lents in the form of creatine phosphate. When the muscles 
demand energy, creatine kinase runs in the reverse direction, 
converting creatine phosphate to ATP. 


This exercise is left to the student and will presumably be 
different for every student. 
This exercise is left to the student and will presumably be 
different for every student. 


Think-Pair-Share Question: 
Suggested products or intermediates: 


a. 


b. 


ATP to provide chemical energy needed for cell growth. 


Glyceraldehyde-3-phosphate/dihydroxyacetone phosphate as back- 
bones for the glycerol moieties of glycerophospholipids. Increased 
cell growth requires the making of new membranes and leads to a 
higher demand for the biosynthesis of glycerophospholipids. 


. Ribose-5-phosphate for the synthesis of new DNA. 
. NADPH for the synthesis of fatty acids needed for the 


biosynthesis of glycerophospholipids as discussed in point b. 


Chapter 23 


1 


. Assuming that all fatty acid chains in the triacylglycerol are pal- 


mitic acid, the fatty acid content of the triacylglycerol is 95% of 
the total weight. On the basis of this assumption, one can calcu- 
late that 30 1b of triacylglycerol will yield 118.7 L of water. 


. 11-cis-Heptadecenoic acid is metabolized by means of seven 


cycles of B-oxidation, leaving a propionyl-CoA as the final 
product. However, the fifth cycle bypasses the acyl-CoA dehydro- 
genase reaction because a cis-double bond is already present at 
the proper position. Thus, B-oxidation produces 7 NADH 

(= 18.9 ATP), 6 FADH; (= 9.6 ATP), and 7 acetyl-CoA 

(= 74.9 ATP), for a total of 103.4 ATP. Propionyl-CoA is 
converted to succinyl-CoA (with expenditure of 1 ATP), which 
can be converted to oxaloacetate in the TCA cycle (with produc- 
tion of 1 GTP, 1 FADH>, and 1 NADH). Oxaloacetate can be 
converted to pyruvate (with no net ATP formed or consumed), 
and pyruvate can be metabolized in the TCA cycle (producing 1 
GTP, 1 FADH;>, and 4 NADH). The net for these conversions of 
propionate is 17.7 ATP. Together with the results of B-oxidation, 
the total ATP yield for the oxidation of one molecule of 11-cis- 
heptadecenoic acid is 121.1 ATP. 


. Instead of invoking hydroxylation and B-oxidation, the best 


strategy for oxidation of phytanic acid is a-hydroxylation, which 
places a hydroxyl group at C-2. This facilitates oxidative 
a-decarboxylation, and the resulting acid can react with CoA 

to form a CoA ester. This product then undergoes six cycles of 
B-oxidation. In addition to CO», the products of this pathway 
are three molecules of acetyl-CoA, three molecules of propionyl- 
CoA, and one molecule of 2-methylpropionyl-CoA. 


. Although acetate units cannot be used for net carbohydrate 


synthesis, oxaloacetate can enter the gluconeogenesis pathway 

in the PEP carboxykinase reaction. (For this purpose, it must be 
converted to malate for transport to the cytosol.) Acetate labeled 
at the carboxyl carbon will first label (equally) the C-3 and C-4 
positions of newly formed glucose. Acetate labeled at the methyl 
carbon will label (equally) the C-1, C-2, C-5, and C-6 positions 
of newly formed glucose. 


A-45 


Abbreviated Answers to Problems 


. This exercise is left to the student, in consultation with the refer- 


ences suggested in the problem. 


6. ATP and NADH 


7. The catabolism of fatty acids in the peroxisomes leads to less 


10. 


11 


12. 


13. 


14. 


15. 


16. 


17. 


production of ATP. ATP is an allosteric regulator that in general 
inhibits the catabolism of nutrients and promotes “storage” of 
nutrients (glycogen, fatty acids). In this context, ATP could pro- 
mote weight gain and less production of ATP could result in less 
weight gain. 


. Albumin binds hydrophobic substances including drugs. The pres- 


ence of free fatty acids may compete with drugs for binding to 
albumin, changing the transportation (and thus pharmacology) 
of drugs. 


. Fat is capable of storing more energy (37 kJ/g) than carbohydrate 


(16 kJ/g). Ten pounds of fat contains 10 x 454 x 37 = 
167,980 kJ of energy. This same amount of energy would 
require 167,980/16 = 10,499 g, or 23 lb, of stored carbohydrate. 


The enzyme methylmalonyl-CoA mutase, which catalyzes the 
third step in the conversion of propionyl-CoA to succinyl-CoA, 
is B,.-dependent. If a deficiency in this vitamin occurs, and if 
large amounts of odd-carbon fatty acids were ingested in the 
diet, L-methymalonyl-CoA could accumulate. 


. a. Myristic acid: 


CH:(CH2)2:COOH + 100.7 P; + 100.7 ADP + 20 O, —> 
100.7 ATP + 14 CO, + 114.1 H,O 
b. Stearic acid: 
CH;3(CH>);6COOH + 130.7 P; + 130.7 ADP + 26 O, —> 

130.7 ATP + 18 CO, + 147.7 H,0 


c. a-Linolenic acid: 
C)7H29COOH + 123.5 P; + 123.5 ADP + 25 O, —> 
123.5 ATP + 18 CO, + 142.5 H,O 


d. Arachidonic acid: 
C)9H3;COOH + 121.9 P; + 121.9 ADP + 24.5 O, —> 
121.9 ATP + 20 CO, + 139.9 H,O 


During the hydration step, the elements of water are added 
across the double bond. Also, the proton transferred to the 
acetyl-CoA carbanion in the thiolase reaction is derived from the 
solvent, so each acetyl-CoA released by the enzyme would prob- 
ably contain two tritiums. Seven tritiated acetyl-CoAs thus would 
derive from each molecule of palmitoyl-CoA metabolized, each 
with two tritiums at C-2. 


It would be present in the carbonyl oxygen of acetyl-coenzyme A 
units. 


A carnitine deficiency would presumably result in defective or 
limited transport of fatty acids into the mitochondrial matrix 
and reduced rates of fatty acid oxidation. 


When blood sugar levels are low it indicates that the “energy” 
content of the blood is low. By activating lipase, glucagon pro- 
vides energy to the tissues in the form of free fatty acids. 


The presence of the villi significantly enhances the surface area 
of the epithelial cells of the digestive system. It allows for an effi- 
cient absorption of the digested dietary nutrients. 

1.3 grams X 37 kJ/gram = 48,000 J. This at first seems like a 
remarkably small amount of energy to sustain the hummingbird 
during a 500-mile flight at 50 mph. However, keep in mind that 
the hummingbird weighs only 3 to 4 grams, and see Problem 18. 
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A-46 Abbreviated Answers to Problems 


18. 48,000 J in 10 hours is 4800 J/hr. If the hummingbird consumes 0.9 minute. So a human could run for only less than a minute on 
250 mL per hour during migration, this means that it is consum- the energy consumed by the hummingbird (only 3 to 4 g) ona 
ing 4800/250 or 19.2 J/mL of oxygen consumed. If a human 500-mile flight. A typical person would have to run about 
consumes 12.7 kcal/min while running, this is equivalent to 40 miles to lose 1 1b of fat. 


12,700 X 4.184 J/cal = 53.1 kJ/min; 48,000 J/53,100 J/min = 19; A mechanism for dhe HMG-CoA'syrithase teaction: 


H:B 


V g 


H,C—C—CH,—C—SCoA 


O OH O O OH O 


| I 
~O— C— CH, f JH, — C— SCoA A CoA—S—C— CH, i CH, —C—SCoA 


CH; H,O CH, 
20. 
H Coo H Coo 
| 
H—C—C—H H—C—C—H 
| : 
6 c=0 C=O 
| 
SCoA SCoA 
CH, o B -CHp o B CH, o B 
OH OH OH OH OH OH 


Coo” H CoOO7 
H,C — CH, H—C—C—H 
S 
C= _— 
| | 
SCoA SCoA 
CH oF A CH oF A @cH, o A 
OH OH = ————~* OH OH = OH OH 
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Abbreviated Answers to Problems A-47 


21. The changes in oxidation state of cobalt in the course of any the bond are not ascribed to the Co** atom in the calculation of 
B,-dependent reaction arise from homolytic cleavages of the the Co oxidation state. Thus, when the Co—C bond is cleaved 
Co?t—C bonds involved. Moreover, in the Co**—C bonds homolytically, one electron reverts to the Co, changing its 
shown in a typical B;.-dependent reaction, the two electrons of oxidation state from 3+ to 2+. 


22. See the following mechanisms: 
a-Methyleneglutarate mutase 


i j 
Term aia eee 
a ie o 
By CH, CH, 
ji ji 
OE —> b 
-c00 T 
H 
CH, NS CH, 
Bio 
Diol dehydrase 
| i 
H.C gi i OH —> H,C g C—OH 
OH H OH 
fool | 
ii 
N 
no- cee oa :B 
i H 
A OH x 7 
H H,C—CH,—C 
Be A 
12 . O 
B:H +H,O 
Glycerol dehydrase 
i i 
ae Purg 
OH OH OH 1, OH OH OH 
“Dig 


yore yo 


i 
no aan eg — CH,—CH,—Co 
fe) 


| À | N 
OH ‘ H4 OH o 
B —H aar B +H,O 
Ethanolamine ammonia-lyase 
| ij 
CH, —C—H —> CH,—C: 
| | | | 
NH,* OH s NH;* OH 
‘Bio | 
i, a | 
y 
i cH, —C 6 —> CH,—C 
Bira aA | ON 
/ NH;,* 
B:H +NH,* 
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A-48 


Abbreviated Answers to Problems 


23. ~ H 
CH, 
/\ | 
HC=C — CH— CH,—C— Coo 
| 
NH,* 
Hypoglycin A 
r CoASH 
(0) 
HC II 
[Xb rT o2 C7 CH= CH C—SCoA 
HyC=C — CH— CH,— C — SCoA f 


Methylenecyclopropylacetyl-CoA 


(MCPA-CoA) 


bow 


H. 
j — 7 


CHo 
eS. = 
H,C=C— CH— CH—C— SCoA 


+ 


HC 
Reactive intermediate 


24. This exercise is left to the student and should be based on the ref- 
erence provided in the problem. 

25. It may be presumed that the oxidation of the acyl chain is accom- 
plished via a two-electron transfer, whereas the steps involved in 
reoxidation of FADH; by ETF are one-electron transfers. FAD/ 
FADH; can participate both in one-electron and two-electron 
transfers, whereas NAD*/NADH can participate only in two- 
electron transfers. 

26. The sequence of reactions involving creation of a double bond, 
then hydration across it, followed by oxidation is what happens 
to succinate in the TCA cycle. (This same sequence of reactions 
is also employed in fatty acid synthesis and in both the catabo- 
lism and anabolism of amino acids.) Clearly, this sequence of 
three reactions must represent an optimal mechanistic strategy 
for the chemistry achieved. 

27. This exercise is left to the student and will presumably be 
different for every student. 

Think-Pair-Share Question: 

a. 

* Stearic acid: 

— Eight cycles of B-oxidation generating 8 NADH, 8 FADHb, and 
9 acetyl-CoA. 

— Each acetyl-CoA yields 3 NADH, 1 FADH), and 1 GTP. 

— Grand totals: 35 NADH, 17 FADH:, 130.7 ATP. 

* Oleic acid: 

— Seven cycles of B-oxidation generating 7 NADH, 7 FADH,, and 
8 acetyl-CoA. 

— One -oxidation cycle generating only 1 NADH and 1 
acetyl-CoA. 

— Each acetyl-CoA yields 3 NADH, 1 FADH,, and 1 GTP. 

— Grand totals: 35 NADH, 16 FADH), 129.1 ATP. 


Ho 
| 
N N O 
Z ba 
XX LF 
N 
O 
oO 
Il 


~C CH- CH —C—SCoA + FAD 


* Linoleic acid: 


— Six cycles of B-oxidation generating 6 NADH, 6 FADHb, and 
7 acetyl-CoA. 


— Two cycles of B-oxidation cycle generating only 2 NADH and 
2 acetyl-CoA. 


— One dehydrogenase reaction generating 1 FADH). 

— One reductase reaction consuming 1 NADPH. 

— Each acetyl-CoA yields 3 NADH, 1 FADH,, and 1 GTP. 

— Grand totals: 35 NADH, 16 FADH,, 129.1 — 4 = 125.1 ATP. 
* q-linolenic acid: 


— Five cycles of B-oxidation generating 5 NADH, 5 FADH,, and 
6 acetyl-CoA. 


— Three cycles of B-oxidation cycle generating only 3 NADH and 
3 acetyl-CoA. 


— One dehydrogenase reaction generating 1 FADH3. 
— One reductase reaction consuming 1 NADPH. 
— Each acetyl-CoA yields 3 NADH, 1 FADH,, and 1 GTP. 
— Grand totals: 35 NADH, 15 FADH,, 127.5 — 4 = 123.5 ATP. 
* y-linolenic acid: 
— Five cycles of B-oxidation generating 5 NADH, 5 FADH), and 
6 acetyl-CoA. 
— Three cycles of B-oxidation generating only 3 NADH and 
3 acetyl-CoA. 
— One dehydrogenase reaction generating 1 FADH). 
— One reductase reaction consuming 1 NADPH. 
— Each acetyl-CoA yields 3 NADH, 1 FADH,, and 1 GTP. 
— Grand totals: 35 NADH, 15 FADH,, 127.5 — 4 = 123.5 ATP. 
b. The actual B-oxidation cycles do not produce any water; they con- 
sume water. Excess water is produced from the NADH- or FADH,- 
induced electron transport and the consequent reduction of O, to 


H,O. Thus, the differences in amount of FADH, generated from the 
different C18 fatty acids will lead to differences in water produced. 
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Abbreviated Answers to Problems A-49 


Chapter 24 the conversion to cholesterol. This process might be viewed as 
1.5 acetate groups for the purpose of completing the balanced 
equation. 


1 


10. 


. The equations needed for this problem are found under the 


. Carbons C-1 and C-6 of glucose become the methyl carbons of 


. A suitable model, based on the evidence presented in this chapter, 


. They both involve the removal of a hydrogen from the —CH; 


. The phosphopantetheine may function, at least to some extent, 


. When blood sugar levels are low or high it means that the caloric 


discussion “Reduction of the Beta-Carbonyl Group Follows a 
Now-Familiar Route” in Section 24.1. See The Student Solutions 
Manual, Study Guide and Problems Book for details. 


acetyl-CoA that is the substrate for fatty acid synthesis. Carbons 
C-2 and C-5 of glucose become the carboxyl carbon of acetyl- 
CoA for fatty acid synthesis. Only citrate that is immediately 
exported to the cytosol provides glucose carbons for fatty acid 
synthesis. Citrate that enters the TCA cycle does not immediately 
provide carbon for fatty acid synthesis. 


would be that the fundamental regulatory mechanism in ACC is 
a polymerization-dependent conformation change in the protein. 
All other effectors—palmitoyl-CoA, citrate, and phosphorylation- 
dephosphorylation—may function primarily by shifting the inac- 
tive protomer-active polymer equilibrium. Polymerization may 
bring domains of the protomer (i.e., bicarbonate-, acetyl-CoA-, 
and biotin-binding domains) closer together or may bring these 
domains on separate protomers close to each other. See The 
Student Solutions Manual, Study Guide and Problems Book for 
further details. 


portion of the acetyl group. 


as a flexible “arm” to carry acyl groups between the malonyl 
transferase and ketoacyl-ACP synthase active sites. The phos- 
phopantetheine is approximately 1.9 nm in length, setting an 
absolute upper-limit distance between these active sites of 

3.8 nm. However, on the basis of modeling considerations, it 


seems likely that the distance between these sites is smaller than 12. The numbers 1 to 4 in the cholesterol structure indicate the 
this upper-limit value. carbon positions of mevalonate as shown (note that the 

. Two electrons pass through the chain from NADH to FAD to numbering shown here is not based on the systematic numbering 
the two cytochromes of the cytochrome b; reductase and then of mevalonate): 


to the desaturase. Together with two electrons from the fatty 
acyl substrate, these electrons reduce an O, to two molecules 


of water. The hydrogen for the waters that are formed in this H á ön 
way comes from the substrate (2H) and from two protons from — CN BAe 1 

luti OOoc, , -O 5 _-CH20H 
solution. Cit, tr 


content of the blood is low or high. When caloric content is low, 
there is a need to tap into the caloric reserves rather than pro- 


moting storage of calories. This is promoted by glucagon. When Mevalotiatg Cholesterol 
caloric content is high, there is no need to tap into the caloric 13. The O-linked saccharide domain of the LDL receptor 
reserves, but a need to store calories away. This is promoted by probably functions to extend the receptor domain away 
insulin. from the cell surface and above the glycocalyx coat so that 
. Ethanolamine + glycerol + 2 fatty acyl-CoA + 2 ATP + CTP + the receptor can recognize circulating lipoproteins. 
HO —> phosphatidylethanolamine + 2 ADP + 2 CoA + 14. As shown in Figures 24.19 and 24.22, the syntheses (in eukary- 
CMP + PP; + P; otes) of phosphatidylcholine, phosphatidylethanolamine, phos- 
. The glycolysis pathway leads to the production of pyruvate from phatidylinositol, and phosphatidylglycerol are dependent upon 
carbohydrates. This pyruvate can then be converted into acetyl- CTP. A CTP deficiency would be likely to affect all these syn- 
coenzyme A for fatty acid biosynthesis. The glycolysis pathway thetic pathways. 
also provides dihydroxyacetonephosphate that can serve as the 15. It “costs” 1 ATP to form a malonyl-CoA. Each cycle of the fatty 
backbone for the glycerol portion of the triacylglycerides. acyl synthase consumes 2 NADPH molecules, each worth 
The conversion of acetyl-CoA to lanosterol can be written as: 3.5 ATP. Thus, each of the seven cycles required to form a pal- 
18 Acetyl-CoA + 13 NADPH + 13 H* + 18 ATP + 0.5 O, —> mitic acid consumes 8 ATPs. A total of 56 ATPs are consumed 
lanosterol + 18 CoA + 13 NADP* + 18 ADP + 6 P; + 6 PP; + CO, to synthesize one molecule of palmitic acid. 


The conversion of lanosterol to cholesterol is complicated; 
however, in terms of carbon counting, three carbons are lost in 
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A-50 Abbreviated Answers to Problems 


16. The mechanism of the 3-ketosphinganine synthase reaction is shown in the following figure: 


i 
~OOC—C—CH,OH 
| ? CH,OH 
NH, 
R H—C-C=0 
H a H T 
O N © 
NS NZ 


2-0,PO A 


— 
Schiff base 
N formation 


3 


2 S-3-Ketosphinganine 


17. FAD is required for the eukaryotic reaction that converts stearic FADH,/FAD is required because it can participate both in one- 
acid to oleic acid because the oxidation of a single bond to a electron and two-electron transfers. 
double bond in stearic acid involves NADH and thus requires 18. See Chapter 23, problem 13, for a mechanism for the HMG-CoA 
a two-electron transfer, whereas the reoxidation of FADH; to synthase reaction. 
ae aecommplished by electron transter to ioman bs. 19. A mechanism for the B-ketoacyl ACP synthase reaction: 
ytochromes are capable of one-electron transfers only, so 


6-Ketoacyl ACP synthase 


Sn T 


C— CH, —C— S —ACP 


/ 
T 
1 1 it 

oO oh C—S—ACP —> HC —C— CH,— C—S— ACP 

I 
H,C—C 

S— KSase Acetoacetyl — ACP 
B:H 
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20. This exercise is left to the student. 
21. This exercise is left to the student. 
22. This exercise is left to the student. 


23. Figure 23.23 details the oxidation of acetyl-CoA in peroxisomes 
in which O; is reduced to H2O, and then to H,O. Figure 24.14 
details the action of a desaturase in which O; is reduced to H30. 


24. One way to accommodate the extreme bodily changes accompa- 
nying hibernation could involve changes in insulin responsive- 
ness and sensitivity. During the preparation for hibernation, 
increased insulin sensitivity in adipose tissue could enable the 
bear to store large amounts of fat. During winter hibernation, 
greatly reduced insulin sensitivity would promote lipolysis and 
consumption of fat stores. A return to normal insulin sensitiv- 
ity would be expected when the bear emerges from hibernation 
in the spring. 

25. The phosphatidylinositol cycle described in the referenced article 
describes a cyclical metabolic pathway in which the intermediates 
of the cycle are regenerated each time the cycle goes around. 
Two of the enzymes in the cycle, diacylglycerol kinase-e and 
CDP- DAG synthase-2, prefer substrates with the selected fatty 
acids (stearic acid and arachidonic acid). As a consequence, 
preferential incorporation of these acyl chains into phosphati- 
dylinositol occurs in each turn of the cycle, and this enrichment 
is magnified over repeated cycles. The lipid intermediates of the 
cycle have to be segregated from other forms in the cell, because 
most species of these intermediates would not be enriched with 
1-stearoyl-2-arachidonoyl species. 


PI4K PI4P5K 
ATP ADP ATP ADP 
PLC 
CMP IP3 
DAG 
PIS 
ATP 
Inositol DGKe 
ADP 
PP. 


CDSS 
CDP-DAG Se PA 
CTP 


The phosphatidylinositol cycle. Abbreviations: PI, phosphatidylinositol; PI4P, 
phosphatidylinositol-4-P; PI4P5K, phosphatidylinositol-4-P 5-kinase; PIP2, 
phosphatidylinositol-4,5-bisphosphate; PLC, phospholipase C; DGKe, diac- 
ylglycerol kinase, e-isoform; CDS2, CDP-DAG synthase, 2-isoform; PIS, 
phosphatidylinositol synthase. 


26. Palmitic acid must be elongated and then unsaturated to 
form linoleic acid. The elongation process involves a thio- 
lase reaction to add two carbons to palmitoyl-CoA and then 
reduction of a carbonyl to a hydroxyl, dehydration to form 
a double bond, and then reduction of the double bond to a 
single bond. These last three reactions are the reverse of what 
happens in TCA, in the conversion of succinate to fumarate, 
then malate, then oxaloacetate. These same three reactions 


27. 


28. 


29. 


30. 


Abbreviated Answers to Problems A-51 


occur in B-oxidation, in fatty acid synthesis, and in amino acid 
synthesis and degradation. 

11 Acetyl-CoA + 10 ATP*^ + 20 NADPH + 10 Ht —> 
behenoyl-CoA + 20 NADP* + 10 CoASH + 10 ADP? + 10 P? 
This exercise is left to the student and will presumably be differ- 
ent for every student. 

This exercise is left to the student and will presumably be differ- 
ent for every student. 


This exercise is left to the student and will presumably be differ- 
ent for every student. 


Think-Pair-Share Question: 
Possible targets could be: 


a. 


e 
f: 

g. 
h 


inhibition of pancreatic lipase to lower uptake of fatty acids from 
dietary sources 


. interfere with the biosynthesis or function of bile acids to lower 


the efficiency of the digestion and absorption of dietary lipids 


. interfere with the biosynthesis/packaging into chylomicrons of 


triacylglycerides in the epithelial cells of the digestive system 


. promote the catabolism of fatty acids, either in the mitochondria 


or in the peroxisomes 


. inhibit the anabolism of fatty acids and triacylglycerides 


interfere with assembly of VLDL particles in the liver 
block LDL receptor on surface of adipocytes 


. block the actions of ChREBP (see Chapter 22) 


Chapter 25 


1 


. The oxidation number of N in nitrate is +5; in nitrite, +3; in 


NO, +2; in N,O, +1; and in No, 0. 


. a. Assume that nitrate assimilation requires four NADPH 


equivalents per NO,- reduced to NH4*. Four NADPH have a 
metabolic value of 14 ATP. 


b. Nitrogen fixation requires 8 e~ (see Equation 25.3) and 
16 ATPs (see Figure 25.6) per N, reduced. If 4 NADH provide 
the requisite 8 e~, each NADH having a metabolic value of 
3 ATPs, then 28 ATP equivalents are consumed per N, reduced 
in biological nitrogen fixation (or 14 ATP equivalents per 
NH,* formed). 


. Carbon typically makes four chemical bonds because of its four 


valence electrons in the 2s and 2p orbitals. The six bonds between 
the iron atoms and the single carbon atom must involve the 

two core electrons in the 1s orbital of carbon in addition to the 
valence electrons. 


. The transformation of the carbonyl carbon (C=O) into the 


amine (—CH—NH;") is a reduction. Oxidation number of car- 
bon changes from +2 to 0. 


. a. [ATP] increase will favor adenylylation; the value of n will be 


greater than 6 (n > 6). 


b. An increase in Py,/Pyp will favor adenylylation; the value of 
n will be greater than 6 (n > 6). 


c. An increase in the [wKG]/[Gln] ratio will favor deadenylyla- 
tion; n < 6. 


d. [P;] decrease will favor adenylylation; n > 6. 
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6. 


10. 


11. 


12; 


Abbreviated Answers to Problems 


Two ATPs are consumed in the carbamoyl-P synthetase-I reac- 
tion, and two phosphoric anhydride bonds (equal to two ATP 
equivalents) are expended in the argininosuccinate synthetase 
reaction. Thus, four ATP equivalents are consumed in the urea 
cycle, as one urea and one fumarate are formed from one CO, 
one NH, and one aspartate. 


. Protein catabolism to generate carbon skeletons for energy 


production releases the amino groups of amino acids as excess 
nitrogen, which is excreted in the urine, principally as urea. 


. The hydrolysis of the coenzyme A portion from succinyl- 


coenzyme A in this step provides the energy needed. Thioesters 
are high-energy bonds and their hydrolysis releases energy. 


. One ATP in reaction 1, one NADPH in reaction 2, one NADPH 


in reaction 11, and one succinyl-CoA in reaction 12 add up to 
10 ATP equivalents, assuming each NADPH is worth 4 ATPs. 
(The succinyl-CoA synthetase reaction of the citric acid cycle 
[see Figure 19.11] fixes the metabolic value of succinyl-CoA 
versus succinate at 1 GTP [= 1 ATP].) 


From Figure 25.36: '*C-labeled carbon atoms derived from '4C-2 
of PEP are shaded yellow. 


1. 2 aspartate —> transamination —> 2 oxaloacetate. 


2. 2 oxaloacetate + 2 GTP —> PEP carboxykinase —> 
2 PEP + 2 CO, + 2 GDP. 


3, 2 PEP +2 H,O —> enolase —> 2 2-PG. 
4. 2 2-PG — > phosphoglyceromutase —> 2 3-PG. 


5. 2 3-PG + 2 ATP —> 3-P glycerate kinase —> 
2 1,3-bisPG + 2 ADP. 


6. 2 1,3-bisPG + 2 NADH + 2 H* —> G-3-P dehydrogenase 
—> 2 G-3-P + 2 NAD* + 2P. 


7. 1 G-3-P —> triose-P isomerase — 1 DHAP. 
8. G-3-P + DHAP —> aldolase —> fructose-1,6-bisP. 
9. F-1,6-bisP + HO —> FBPase —> F-6-P + P.. 


10. F-6-P —> phosphoglucoisomerase —> G-6-P. 


11. G-6-P + HO —> glucose phosphatase —> glucose + P;. 


Net: 2 aspartate + 2 GTP + 2 ATP + 2 NADH + 6 H* + 
4 H-O —> glucose + 2 CO, + 2 GDP + 2 ADP + 4P; + 
2 NAD* 


(Note that 4 of the 6 H* are necessary to balance the charge on 
the 4 carboxylate groups of the 2 OAA and the loss of amino 
groups from Asp is ignored.) 


(As a consequence of reaction [1], 2 a-keto acids [e.g., 
a-ketoglutarate] will receive amino groups to become 
2 a-amino acids [e.g., glutamate].) 


Alanine: glutamate: pyruvate aminotransferase. 


Arginine: from glutamate via ornithine, so it’s glutamate 
dehydrogenase. 


Aspartate: glutamate:oxaloacetate aminotransferase. 


Asparagine: from aspartate, so it’s the glutamate:oxaloacetate 
aminotransferase. 


Cysteine: cysteine is formed from serine, so it’s glutamate via 
3-phosphoserine aminotransferase. 


Glutamate: glutamate dehydrogenase or alternatively, GOGAT. 
Glutamine: glutamine synthetase. 


Glycine: glycine is formed from serine, so it’s glutamate via 
3-phosphoserine aminotransferase. 


13; 


14. 


15; 


16. 


17. 


18. 


19. 


Histidine: from glutamate via L-histidinol phosphate 
aminotransferase. 


Isoleucine: glutamate:a-keto-B-methylvalerate aminotransferase. 
Leucine: glutamate:a-ketoisocaproate aminotransferase. 


Lysine: from glutamate via saccharopine formation by a 
glutamate-dependent NADPH dehydrogenase; in bacteria, lysine 
is synthesized from aspartate, so it’s glutamate: oxaloacetate 
aminotransferase. 


Methionine: from aspartate, so it’s glutamate:oxaloacetate 
aminotransferase. 


Phenylalanine: glutamate : phenylpyruvate aminotransferase 
(= phenylalanine aminotransferase). 


Proline: from glutamine, so it’s glutamate dehydrogenase. 
Serine: glutamate via 3-phosphoserine aminotransferase. 


Threonine: from aspartate, so it’s glutamate: oxaloacetate 
aminotransferase. 


Tryptophan: from serine via tryptophan synthase, so its a-amino 
group comes from serine, which gets its amino group from gluta- 
mate via 3-phosphoserine aminotransferase. 


Tyrosine: glutamate : 4-hydroxyphenylpyruvate aminotransferase 
(= tyrosine aminotransferase). 


Valine: glutamate: a-ketoisovalerate aminotransferase. 


Biosynthesis of lysine in fungi (see Figure 25.24) and biosynthe- 
sis of leucine (see Figure 25.30). 


Pyridoxal (vitamin B6), because it is the precursor to pyridoxal-P, 
the key coenzyme in aminotransferase reactions, as well as other 
aspects of amino acid metabolism. 


The conversion of homocysteine to methionine is folate- 
dependent; dietary folate absorption is dependent on vitamin B,, 
for removal of methyl groups added to folate during digestion; 
finally, the a-amino group of homocysteine formed in the 
methione biosynthetic pathway comes from aspartate via the 
pyridoxal-P—dependent glutamate: oxaloacetate aminotransferase. 


The BCKDH complex is structurally and functionally analogous 
to the pyruvate dehydrogenase complex and the a-ketoglutarate 
dehydrogenase complexes. See Figure 19.4; the reaction mecha- 
nism for the pyruvate dehydrogenase complex is essentially the 
same as that of the BCKDH complex. 

Aspartame is a N-a-L-aspartyl-L-phenylalanine-1-methyl ester 
that is broken down in the digestive tract and phenylalanine is 
released. Phenylalanine is the substance that phenylketonurics 
must avoid. 


Glyphosate inhibits 3-enolpyruvylshikimate-5-P synthase, an 
essential enzyme in the biosynthesis of chorismate. Not only is 
chorismate the precursor for synthesis of the aromatic amino 
acids Phe, Tyr, and Trp, it is also the precursor for formation of 
lignin, a major structural component in plant cell walls, as well as 
other essential substances such as folate, coenzyme Q, plastoqui- 
none, and vitamins E and K. 


Glycine is formed from serine, which is formed from 
3-phosphoglycerate. Carbons 3 and 4, 2 and 5, and 1 and 6 of 
glucose contribute carbons 1, 2, and 3 of 3-phosphoglycerate, 
respectively. The B-carbon of serine derives from the 3-C in 
3-phosphoglycerate, the C,-carbon atom in serine comes from 
3-phosphoglycerate C-2, and the carboxyl-C in serine comes 
from C-1 of 3-PG. It is these latter two carbons of 3-PG that are 
found in glycine; that is, C-1 and C-2 of 3-PG; C-1 of 3-PG came 
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from C-3 and C-4 in glucose, and C-2 of 3-PG came from C-2 
and C-5 in glucose. 


20. Degradation of leucine or lysine can yield acetyl-coenzyme A. 


There is no metabolic pathway that can achieve glucose synthesis 


from acetyl-CoA. 
21. Serine is an important precursor for glycine synthesis. 
22. Consult Attps://www.rcsb.org to view pdb id = 1LM1. 


23. The liver converts amino acids to glucose to provide a source 
of energy for other cells, such as nerve and red blood cells. 
Conversion of amino acids to glucose by gluconeogenesis 


requires energy; for example, six ATP equivalents are needed to 


form glucose from aspartic acid. 

24. GS monomers are inactive because GS active sites require ele- 
ments of protein structure contributed by adjacent subunits in 
the GS,» dodecamer. 


25. Four of the six carbons in the ring of Tyr come from erythrose-4-P 


and the remaining two from PEP; the carboxyl-C, Ca, and 


Cg of Tyr come from PEP. Degradation of Tyr to acetoacetate + 
fumarate yields acetoacetate composed from the C, and Cg plus 
2 C atoms from the ring, which are either both from E-4-P or one 


each from E-4-P and PEP, depending on the orientation of the 


ring (rotation about the Cg-ring C bond). Fumarate will be com- 
posed from 2 C atoms from PEP and 2 from E-4-P or 3 C atoms 


from E-4-P and 1 from PEP. 

26. Consult the PSAFE website entry for glutamine synthetase to 
discover the answer. 

27. Consult the PSAFE website entry for glutamate-oxaloacetate 
transaminase I to compose your answer. 

Think-Pair-Share Question: 

a. Step 2: reduction of carboxylic acid carbon (oxidation number 
+3) to aldehyde carbon (oxidation number +1). Step 4: reduc- 
tion of HC—=N— to H,C—NH (oxidation number of carbon 
changes from +1 to —1). 

b. Step 3: reduction of carboxylic acid carbon (oxidation number 

+3) to aldehyde carbon (oxidation number +1). 

c. Step 7: reduction of carboxylic acid carbon (oxidation number 

+3) to aldehyde carbon (oxidation number +1). Step 8: reduc- 


tion of aldehyde carbon (oxidation number +1) to amine carbon 


(oxidation number —1). Caution: steps 4 and 9 in this pathway 
are oxidations, not reductions. 

d. Step 2: reduction of carboxylic acid carbon (oxidation number 
+3) to aldehyde carbon (oxidation number +1). Step 3: reduc- 


tion of aldehyde carbon (oxidation number +1) to primary alco- 


hol carbon (oxidation number —1). 


e. Step 3: reduction of ketone carbon (oxidation number +2) to 
secondary alcohol carbon (oxidation number 0). 


f. Caution: step 3 is an oxidation. Step 4: reduction of ketone car- 
bon (oxidation number +2) to amine carbon (oxidation number 


0); compensated for a reverse oxidation of glutamate to 
a-ketoglutarate. 


Abbreviated Answers to Problems A-53 


Chapter 26 
1. See Figures 26.2 (purines) and 26.13 (pyrimidines). 
2. Assume ribose-5-P is available. 


Purine synthesis: 2 ATP equivalents in the ribose-5-P 
pyrophosphokinase reaction, | in the GAR synthetase reaction, 
1 in the FGAM synthetase reaction, 1 in the AIR carboxylase 
reaction, | in the CAIR synthetase reaction, and 1 in the 
SACAIR synthetase reaction yields IMP, the precursor common 
to ATP and GTP. Net: 7 ATP equivalents. 


a. ATP: 1 GTP (an ATP equivalent) is consumed in converting 
IMP to AMP; 2 more ATP equivalents are needed to convert 
AMP to ATP. Overall, ATP synthesis from ribose-5-P onward 
requires 10 ATP equivalents. 

b. GTP: 2 high-energy phosphoric anhydride bonds from ATP, 
but 1 NADH is produced in converting IMP to GMP; 2 more 
ATP equivalents are needed to convert GMP to GTP. Overall, 
GTP synthesis from ribose-5-P onward requires 8 ATP 
equivalents. 

Pyrimidine synthesis: Starting from HCO; and Gln, 2 ATP 
equivalents are consumed by CPS-I, and an NADH 
equivalent is produced in forming orotate. OMP synthesis 
from ribose-5-P plus orotate requires conversion of ribose-5-P 
to PRPP at a cost of 2 ATP equivalents. Thus, the net ATP 
investment in UMP synthesis is just 1 ATP. 

c. UTP: Formation of UTP from UMP requires 2 ATP 
equivalents. Net ATP equivalents in UTP biosynthesis = 3. 


d. CTP: CTP biosynthesis from UTP by CTP synthetase 
consumes | ATP equivalent. Overall ATP investment in 
CTP synthesis = 4 ATP equivalents. 


. BEADC 
. “ATP consumption” refers to the hydrolysis of the high-energy 


bonds in ATP. This does not involve any changes in the 
adenine component of ATP. The energy in ATP is consumed, 
but not the adenine content. 


. a. See Figure 26.6. 


b. See Figure 26.17. 
c. See Figure 26.17. 


. a. Azaserine inhibits glutamine-dependent enzymes, as 


in steps 2 and 5 of IMP synthesis (glutamine: PRPP 
amidotransferase and FGAM synthetase), as well 
as GMP synthetase (step 2, Figure 26.5), and CTP 
synthetase (Figure 26.16). 


b. Methotrexate, an analog of folic acid, antagonizes THF- 
dependent processes, such as steps 4 and 10 (GAR 
transformylase and AICAR transformylase) in purine 
biosynthesis (Figure 26.3), and the thymidylate synthase 
reaction (Figure 26.26) of pyrimidine metabolism. 


c. Sulfonamides are analogs of p-aminobenzoic acid (PABA). 
Like methotrexate, sulfonamides antagonize THF formation. 
Thus, sulfonamides affect nucleotide biosynthesis at the same 
sites as methotrexate, but only in organisms such as 
prokaryotes that synthesize their THF from simple precursors 
such as PABA. 


d. Allopurinol is an inhibitor of xanthine oxidase 
(Figure 26.10). 


e. 5-Fluorouracil inhibits the thymidylate synthase reaction 
(Figure 26.26). 
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. UDP, via conversion to (UDP (Figure 26.24), is ultimately a 


precursor to dTTP, which is essential to DNA synthesis. 


8. See Figure 26.23. 


10. 


12; 


. a. Oxidation of water to oxygen in the photosynthesis process 


(see Chapter 21). 


b. Glucose is a source of ribose and NADPH (needed for the 
reductase action) through the PPP (see Chapter 22). 


Ribose, as ribose-5-P, is released during nucleotide catabolism 
(as in Figure 26.8). Ribose-5-P is catabolized via the pentose 


phosphate pathway and glycolysis to form pyruvate, which enters 


the citric acid cycle. From Chapter 22, problem 17, note that 
3 ribose-5-P (in the form of 1 ribose-5-P and 2 xylulose-5-P) 


would give a net consumption of 2 ATPs and a net production of 


8 ATPs and 5 NADH (= 15 ATPs), when converted to 5 
See the following figure: 


11. 


pyruvate. If each pyruvate is worth 15 ATP equivalents (as in a 
prokaryotic cell), the overall yield of ATP from 3 ribose-5-P is 
75 ATP + 23 ATP — 2 ATP = 96 ATP. Net yield per ribose-5-P 
is thus 32 ATP equivalents. 


Comparing Figures 26.3 and 25.40, note that AICAR 
(5-aminoimidazole-4-carboxamide ribonucleotide) is a com- 
mon intermediate in both pathways. It is a product of step 5 
of histidine biosynthesis (Figure 25.40) and step 9 of purine 
biosynthesis (Figure 26.3). Thus, formation of AICAR as 

a by-product of histidine biosynthesis from PRPP and ATP 
bypasses the first nine steps in purine synthesis. However, cells 
require greater quantities of purine than of histidine, and these 
nine reactions of purine synthesis are essential in satisfying 
cellular needs for purines. 


Step 6 H 
l- O -O—P5O—ADP 
A 
\ 
H 
HN i 
O :B 
H Co b_o- q 
Na” | N = -NU 
Fi Sy om = "p NCH —> l \CH 
HN* ~NH HN i HN 
are Bil 
Step 8 O Coo- 
odla 
Oa Oa HC —CH,—COO- 
o- 
NH 
O O 2 coo- oO 
I | oe | I 
“OG. N > -O,P—O7C{ N —> HC—NH—C 
| \ 
L > I » CH, |» 
HN” N HN ON _H,N~ N 
d l coo d 
Step 9 H:B 
ar O i 
a a C N H,N—C 
on GS 
pJ coo- HN | H,N i 
` w as 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


13. Aspartate + GTP*- + H,O —> 
fumarate + NH,* + GDP?- + P- 


14. 1. Uric acid + 2 HO + O, —= urate oxidase —> allantoin + 
H,O, + CO, 
2. Allantoin + HO —> allantoinase —> allantoic acid + 
glyoxylic acid 
3. Allantoic acid + HO —= allantoicase —> 2 urea 
4.2 Urea + 2 HO —> urease —> 2 CO, + 2 HN; 
15, 


[Aspartate] 


16. ATCase (subunit organization œ6ß6) consists of six functional 
units, each composed of one a-subunit and one B-subunit, best 
described as (aB)o. 


17. From these starting materials to UTP in a eukaryotic cell, where 
mitochondrial oxidation of dihydroorotate via a coenzyme 
Q-linked flavoprotein would yield 1.5 ATPs, has a net cost of 
2.5 ATPs. From UTP to dTTP would require an ATP at the CTP 
synthetase step; recovery of an ATP in the nucleoside diphos- 
phate kinase reaction ADP + CTP > CDP + ATP; an NADPH 
(= 4 ATP equivalents) in converting CDP — dCDP, recovery of 
an ATP equivalent in the deoxycytidylate kinase reaction dCDP 
+ ADP — dCMP + ATP, deamination of dCMP to dUMP by 
dCMP deaminase; conversion of (UMP to dTMP by thymi- 
dylate synthase and then 2 ATP equivalents to form dTTP. Net: 
7.5 ATP equivalents versus 10 ATP for ATP synthesis via the 
purine biosynthetic pathway. 

18. 1. UMP + ATP —> UDP + ADP 
2. UDP + NADPH —> dUDP + NADP+ + OH- 

3. (UDP + ATP —> dUTP + ADP 
4. dUTP + HLO —> dUMP + PP; 
5. PP; + H,O — 2 P; 
6. (UMP + N>,N!°-methylene-THF —> dTMP + DHF 
Net: UMP + N°,N'°-methylene-THF + 2 ATP + NADPH + 
2 HO —> dTMP + DHF + 2 ADP + 2P; + OH- + NADP* 


R, R", R!” and R?'8 in thymidylate synthase are hydrogen-bond 
donors to the phosphate group of the substrate, (UMP. See Kawase, 
S., Cho, S. W., Rozell, J., and Stroud, R. M., 2000. Replacement set 
mutagenesis of the four phosphate-binding arginine residues of 
thymidylate synthase. Protein Engineering 13:557—-563. (The other 
substrate, N°,N!°-methylene-THF bears negative charge on its 
glutamate units, and without knowledge of thymidylate synthase 
structure and function, would also be a possibility.) 


A-55 


Abbreviated Answers to Problems 


19. Possibilities include glutamine phosphoribosyl pyrophosphate 
amidotransferase (see Figure 26.3), adenine phosphoribosyl 
transferase and hypoxanthine-guanine phosphoribosy] transfer- 
ase (see Figure 26.7), ribose-5-phosphate pyrophosphokinase (see 
Figure 26.3), and enzymes of the pyrimidine salvage pathway. 


20. Explore pdb id = 7DFR. Note the proximity of the 4-position 
of the nicotinamide ring (the hydride donor) to the 7-position 
of folate (the hydride acceptor). Note that polar groups on the 
substrates are solvent-exposed. 


21. View pdb id = IRAA to see that the CTP-binding sites on the 
R-subunits are dominated by B-strands. View pdb id = IRAA to 
see the ATCase substrate-binding sites (evident from the location 
of the substrate analog N-2-(phosphonoacetyl)-L-asparagine). 
The allosteric (CTP-binding) sites and the active sites are at 
opposite ends of the protein. Thus, CTP binding is commu- 
nicated to the active site via ligand-induced conformational 
changes extending across the entire protein. R-state ATCase is 
represented by pdb id = 2IPO; T-state by pdb id = IRAA. 


22. There is a high demand for DNA replication and thus the 
biosynthesis of nucleotides during embryonic growth. The 
importance of folic acid in the biosynthesis of nucleotides is 
one of the rationales why pregnant women are recommended 
to take in extra amounts of folic acid. 


23. The anticancer agents are not selective, affecting cancer and 
normal cells leading to side effects. However, because cancer 
cells have a much higher demand for nucleotide biosynthesis, the 
anticancer agents affect cancer cells much more than normal cells. 

24. If GMP levels are high (but GDP or AMP is not), ribose-5-P 
pyrophosphokinase will not be inhibited, glutamine-PRPP ami- 
dotransferase will be roughly 50% inhibited due to GMP binding 
at the G nucleotide allosteric site, and IMP dehydrogenase will be 
blocked, so IMP is directed toward AMP synthesis to give a bal- 
anced amount of A versus G nucleotides. 

25. Carbons | and 6. 


26. Consult the PSAFE website entry for CTP synthetase to discover 
the answer. 

27. Consult the PSAFE website entry for HGPRT to compose your 
answer. 


Think-Pair-Share Question: 


Building block: part of the structure of DNA, RNA, NAD+, 
NADP+, FAD, Coenzyme A 


Regulation: as ATP it serves as a donor of phosphoryl groups in 
protein kinase reactions; source of cAMP; source of AMP for 
adenylation; as ATP, ADP, or AMP it can act as a direct allosteric 
regulator; as ATP it can serve to recycle GTP from GDP for 
G-proteins 

Energy source: as ATP it serves as high-energy compound that 
donates phosphoryl, pyrophosphoryl, or adenylyl groups. 
Autacoid: Adenosine is a local regulator of physiological processes 
in mammals. 
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Chapter 27 


1. 


. Energy charge = 0.945; phosphorylation potential = 1111 M~'. 


a. Keg = 360,333. 

b. Assuming [ATP] = [ADP], [pyruvate] must be greater than 
360,333 [PEP] for the reaction to proceed in reverse. 

c. Keg = 0.724. 

d. [PEP]/[pyruvate] = 724. 

e. Yes. Both reactions will be favorable as long as [PEP]/[pyru- 


vate] falls between 0.0000028 and 724. 
1 


. (1) The number of protons flowing across the inner mem- 


brane during the electron transport in complex I, HI and IV is 
not stoichiometric. (2) The number of complexes that couple 
proton transport to electron transport is not stoichiometric. 
Electron transport could have existed with more or less com- 
plexes to transport protons across the inner membrane. (3) The 
number of subunits in the c ring of the ATP complex is not a 
chemical constant across Nature; it determines how many pro- 
tons need to flow through the complex to drive one revolution 
of the Fo part of the F;Fo-ATP synthase. 


. 8 mM ATP will last 1.12 sec. Because the equilibrium constant 


for creatine-P + ADP —> Cr + ATP is 175, [ATP] must be less 
than 1750 [ADP] for the reaction creatine-P + ADP — Cr + 
ATP to proceed to the right when [Cr-P] = 40 mM and [Cr] = 
4mm. 


. From Equation 3.42, €(NADP*t/NADPH) = —0.350 V; 


€(NAD*/NADH) = —0.281V; thus, Ag = 0.069 V, and 

AG = —13,316 J/mol. If an ATP “costs” 50 kJ/mol, this reaction 
can produce about 0.27 ATP equivalent at these concentrations 
of NAD+, NADH, NADP*, and NADPH. 


6. NADH is used in gluconeogenesis. 
7. Assume that the Keq for ATP + AMP > 2 ADP = 1.2 


10. 


11. 


(see legend to Figure 27.2). When [ATP] = 7.2 mM, 

[ADP] = 0.737 mM and [AMP] = 0.063 mM. When [ATP] 
decreases by 10% to 6.48 mM, [ADP] + [AMP] = 1.52 mM and 
thus [ADP] = 1.30 mM and [AMP] = 0.22 mM. A 10% decrease 
in [ATP] has resulted in a 0.22/0.063 = 3.5-fold increase in [AMP]. 


. J (the flux of F-6-P through the substrate cycle) at low 


[AMP] = 0.1; J at high [AMP] = 8.9. Therefore, the flux of F-6-P 
through the cycle has increased 89-fold. 


. Inhibition of acetyl-CoA carboxylase will lower the concentra- 


tion of malonyl-CoA. Because malonyl-CoA inhibits uptake of 
fatty acids (see Figure 24.16), decreases in [malonyl-CoA] favor 
fatty acid synthesis. 


Ethanol oxidation to acetate yields 2 NADH in the cytosol, each 
worth 1.5 ATPs. The acetyl thiokinase reaction consumes 2 ATP 
equivalents in converting acetate to acetyl-CoA (due to ATP > 
AMP + PP;). Combustion of acetyl-CoA — 2 CO) in a liver cell 
yields a net of 10 ATPs. Therefore, the net yield of ATP from 
ethanol in a liver cell is 11 ATPs. Glucose — 6 CO, in a liver cell 
yields 30 ATPs (see Table 20.4) or 5 ATP/C atom. Ethanol > 

2 CO; gives 11 ATPs or 5.5 ATP/C atom. 


Palmitoyl-CoA — 8 acetyl-CoA yields 7 NADH and 7 [FADH)] = 
21 + 14 = 35. Eight acetyl-CoA — palmitoyl-CoA requires 

14 NADPH + 7 ATP = —63 ATP (negative sign denotes ATP 
consumed). The palmitoyl-CoA —— 8 acetyl-CoA conversion 

is favorable in both directions provided the free energy release 

is more than 1750 kJ/mol in the catabolic (acetyl-CoA forming) 


12; 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


direction (the energy necessary to produce 35 ATP equivalents at a 
cost of 50 kJ/mol each) and less than 3150 kJ/mol in the anabolic 
(palmitoyl-CoA forming) direction (the energy released from 63 
ATP equivalents). 


Cellular respiration releases 2870 kJ/mol of glucose under 
standard-state conditions (and about the same amount under 
cellular conditions). Assuming the cell is a bacterial cell where 
cellular respiration produces 38 ATPs, the total cost of ATP 
synthesis is 38(50) = 1900 kJ/mol. Thus, the overall free energy 


change = —2870 + 1900 = —970 kJ/mol. Because AG” = —RT 
In Keg, Keg = €°°!47 = 10!68, This is a very large number! It will 


be even larger for a cell where ATP yields per glucose are less. 


Carbon dioxide fixation leads to glucose synthesis at the cost of 
12 NADPH and 18 ATP = 66 ATP equivalents. At 50 kJ/mol, 
the energy investment from ATP is —3300 kJ/mol and the value 
of a glucose is +2870 kJ/mol. Thus, AG” = —430 kJ/mol = 
—RT In Keg. Keg = e' = 107, which is also a very large 
number! 


ATP is an allosteric regulator directly activating or inhibiting 
enzymes or protein kinases. ATP serves as a source of cAMP and 
provides phosphoryl groups for protein kinases, indirectly affect- 
ing the activity and function of many enzymes or proteins. 


The amino group on the side chain of lysine is positively charged 
making it suitable for interactions with negatively charged functional 
groups such as DNA backbone phosphates. Acetylation of the 
amino group abolishes its positive charge and its capacity to inter- 
act with negatively charged functional groups. Acetylated histone 
proteins interact much less with DNA compared to deacetylated 
histone proteins. 


Glycogen phosphorylase, phosphorylase kinase, glycogen synthase, 
PFK-1, PFK-2, FBPase, and glucose-6-P phosphatase are liver 
enzymes that act specifically in glycogenolysis or gluconeogenesis; 
these enzymes are potential targets to control blood glucose levkels. 


Contracting muscles promote the uptake of glucose from the 
circulation. Exercising lowers blood sugar levels without the need 
for insulin and can help lower high blood sugar levels for both 
types of diabetes. 


From Figure 27.12: The “limit production” list would include 
NPY, AgRP, and ghrelin; the “raise levels” list would include 
cholecystokinin, leptin, insulin, melanocortins, and PY Y3_3.. 


Leptin injection in ob/ob mice decreases food intake, raises fatty 
acid oxidation levels, and lowers body weight. Obese humans are 
not usually defective in leptin production. Indeed, because leptin 
is produced in adipocytes, obese individuals may already have 
high levels of leptin, so leptin injection has limited success. 


No, although NPY is an orexic agent, it is not necessarily an 
obesity-promoting hormone. Mice deficient in melanocortin 
production, melanocortin receptors, or leptin receptors would 
have an obese phenotype because all these agents act in 
appetite-suppressing pathways. 


Alcohol consumption lowers the NAD*/NADH ratio, which 
limits glycolysis because glyceraldehyde-3-P dehydrogenase 
requires NAD*, stimulates gluconeogenesis because NADH 
favors glyceraldehyde-3-P dehydrogenase working in the glu- 
cose synthesis direction, and limits fatty acid oxidation because 
B-oxidation requires NAD+. NADH is also a negative regulator 
of several citric acid cycle enzymes. 


The T172D mutant of AMPK mimics the phosphorylated form 
in having negative charge on the side chain at position 172. 
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22 


23. 


24. 


25. 


. a. Fatty acyl-CoA. 


b. Transgenic mice lacking functional hypothalamic fatty acid 
synthase should have elevated [malonyl-CoA]. If hypothalamic 
levels of malonyl-CoA are high, eating should diminish, as 
should body fat content. Assuming physical activity is 
proportional to signals of nutrient availability, physical activity 
should increase. 


a. Leptin-deficient mice should respond to regular leptin 
injections by losing weight. 

b. The leptin receptor. Assay wild-type and db/db mutant cells for 
leptin binding ability. 

The heart is the principal organ using both glucose and fatty 

acids in the well-fed state; the brain relies mostly on glucose 

(although adipose tissue also relies on glucose for glycerol 

production); muscle relies mostly on fatty acids; the brain never 

uses fatty acids; and muscle produces lactate, which is converted 

in the liver into glucose. 


a. Hexokinase is an R enzyme because it is a glycolytic enzyme. 
Glutamine: PRPP amidotransferase is not an ATP-dependent 
enzyme, so it is neither an R or a U enzyme. It responds to the 
adenine nucleotide pool through feedback inhibition by AMP, 
ADP, and ATP. 


b. If E.C. = 0.5, hexokinase activity is high and ribose-5-P 
pyrophosphokinase activity is likely to be low because R-5-P 
pyrophosphokinase is probably a U enzyme. 

c. At E.C. = 0.95, the situation reverses and hexokinase activity 
is low and R-5-P pyrophosphokinase activity is high. 


PSAFE Problem. Consult the PSAFE website entry for resistin to 


compose your answer. 


Think-Pair-Share Question: 


a. 


1 


Review all chapters related to the metabolic pathways and their 
regulation to review the specifics. 


Insulin promotes synthesis of glycogen and fatty acids and the 
activation of glycolysis and pyruvate dehydrogenase. Insulin inhib- 
its glycogen and fatty acid breakdown, pyruvate dehydrogenase 
and gluconeogenesis; particularly in liver cells. 

Glucagon acts in opposite fashion as insulin; particularly in liver 
cells. 

Leptin activates AMPK and Figure 27.6 illustrates the conse- 
quences of activation or inhibition by AMPK. 

Epinephrine promotes glycogen and fatty acid breakdown, 
glycolysis and pyruvate dehydrogenase activity, particularly in 
skeletal muscle cells. 


. The TCA cycle is an amphibolic pathway. The TCA cycle is useful 


for both anabolic or catabolic purposes. Thus, hormones that typi- 
cally inhibit anabolic pathways and activate catabolic pathways or 
vice versa, do not affect the TCA cycle. The hormones or second- 
ary messengers listed do affect the levels of substrate or product 
available and as such can, indirectly, affect the TCA cycle. 


Chapter 28 
. The density of DNA can be determined by density gradient 


ultracentrifugation. (a) Assuming DNA replication is semicon- 
servative, the density of the DNA will be (1.724 + 1.710)/2 = 
1.717 g/mL. (b) If DNA replication is conservative, two bands 
will be seen upon density gradient ultracentrifugation of the 
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DNA after one generation of growth on !°N-containing media: a 
band at density = 1.724 g/mL representing the progeny dsDNA 
and a parental band at 1.710 g/mL. 


. The 5’-exonuclease activity of DNA polymerase I removes 


mispaired segments of DNA sequence that lie in the path of the 
advancing polymerase. Its biological role is to remove mispaired 
bases during DNA repair. The 3’-exonuclease activity acts as a 
proofreader to see whether the base just added by the polymerase 
activity is properly base-paired with the template. If not (that 

is, if it is an improper base with respect to the template), the 
3’-exonuclease removes it and the polymerase activity can try 
once more to insert the proper base. An E. coli strain lacking 
DNA polymerase I 3’-exonuclease activity would show a high 
rate of spontaneous mutation. 


. Assume that the polymerization rate achieved by each half of the 


DNA polymerase III homodimer is 750 nucleotides per sec. The 
entire E. coli genome consists of 4.64 X 10° bp. At a rate of 

750 bp/sec per DNA polymerase HI homodimer (one at each 
replication fork), DNA replication would take almost 3100 sec 
(51.7 min; 0.86 hr). When Æ. coli is dividing at a rate of once 
every 20 min, E. coli cells must be replicating DNA at the rate 
of 4.64 X 10° bp per 20 min (2.32 10° bp/min or 3867 bp/sec). 
To achieve this rate of replication would require initiation of 
DNA replication once every 20 min at ori, and a minimum of 
3933/1500 = 2.57 replication bubbles per E. coli chromosome, or 
5.14 replication forks. 


a) (2) (3) 


The E. coli chromosome replicating 


(4) once every 20 min (three replication 


bubbles, six replication forks). 


5. If there are 40 molecules of DNA polymerase III (as DNA 


polymerase III homodimers) per E. coli cell and £E. coli 
growing at its maximum rate has about five replication forks 
per chromosome, DNA polymerase II availability is sufficient 
to sustain growth at this rate. 


. The actions of gyrase, helicase (including CMG helicase), repli- 


cation initiator, DnaC, y-complex as clamp loader, Cdc6é, MCM 
proteins, and Orc proteins. 


. Okazaki fragments are 1000 to 2000 nucleotides in length. 


Because DNA replication in E. coli generates Okazaki fragments 
whose total length must be 4.64 x 10° nucleotides, a total of 
2300 to 4700 Okazaki fragments must be synthesized. Consider 
in comparison a human cell carrying out DNA replication. The 
haploid human genome is 3 X 10° bp, but most cells are diploid 
(6 X 10° bp). Collectively, the Okazaki fragments must total 

6 X 10° nucleotides in length, distributed over 3 to 6 million sepa- 
rate fragments (assuming they are | to 2 kb in length). 
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DNA gyrases are ATP-dependent topoisomerases that introduce 
negative supercoils into DNA. DNA gyrases change the linking 
number, L, of double-helical DNA by breaking the sugar— 
phosphate backbone of its DNA strands and then religating 
them (see Figure 11.26). In contrast, helicases are ATP-dependent 
enzymes that disrupt the hydrogen bonds between base pairs that 
hold double-helical DNA together. Helicases move along dsDNA, 
leaving ssDNA in their wake. 


. Gyrases act as topoisomerases that cut and religate DNA 


strands. These cuts must be religated properly. Helicases do not 
cut DNA strands. 


A diploid human cell contains 6 X 10° bp of DNA. One origin 
of replication every 300 kbp gives 20,000 origins of replication 
in 6 X 10° bp. If DNA replication proceeds at a rate of 100 bp/ 
sec at each of 40,000 replication forks (2 per origin), 6 x 10° bp 
could be replicated in 1500 sec (25 min). To provide 2 molecules 
of DNA polymerase per replication fork, a cell would require 
80,000 molecules of this enzyme. 


For purposes of illustration, consider how the heteroduplex bac- 
teriophage chromosome in Figure 28.17 might have arisen: 


X 


+ | 


Strand invasion, ligation, 
branch migration 


l Resolution of Holliday junction 


+ 


12. 


13. 


14. 


The mismatch repair system of E. coli relies on DNA 
methylation to distinguish which DNA strand is “correct” 

and which is “mismatched”; the unmethylated strand (the 

newly synthesized one that has not had sufficient time to 

become methylated is, by definition, the mismatched one). 
Homologous recombination involves DNA duplexes at similar 
stages of methylation, neither of which would be interpreted as 
“mismatched,” and thus the mismatch repair system does not act. 


B-DNA normally has a helical twist of about 10 bp/turn, so 

the rotation per residue (base pair), Ag, is 36°. If the DNA is 
unwound to 18.6 bp/turn, Ad becomes 19.4°. Thus, the change in 
A¢ is 16.6°. 

Replication: SSB; DUE; Tus; S-CDK; Cdc6; MCM proteins; 
SId2 & SId3; Dpb11; Cdc45; GINS complex; geminin 


15. 


16. 


Repair: SSB; PriA; BRCA; Rad54; Ku70/80; DNA ligase 4; 
photolyase; DNA glycosylase; XPA protein 
Recombination & transposition: RecA; RuvC; SSB; Rad51; 
RAGI & RAG?2; Ku70/80; DNA ligase 4 


Consider the two DNA duplexes, WC and we, respectively, dis- 
played as “ladders” in the following figure. The gap at the right 
denotes the initial cleavage, in this case of the W and w strands. 
Cleavage by resolvase at the arrows labeled 1 and ligation of like 
strands (W with w and C with c) will yield patch recombinants; 
cleavage at the arrows labeled 2 and ligation of like strands gives 
splice recombinants. 


1 
TAa 
= : 
1 


l 
-d 


$——o 
ot 
= Es 


The DNA is: GCTA 
CGAT 


a. HNO, causes deamination of C and A. Deamination of C 
yields U, which pairs the way T does, giving: 


GTTA and ACTA and (rarely) ATTA 
CAAT TGAT TAAT 
Deamination of A yields I, which pairs as G does, giving: 
GCTG and GCCA and (rarely) GCCG 
CGAC CGGT CGGC 
b. Bromouracil usually replaces T, and pairs the way C does: 
GCCA and GCTG and (rarely) GCCG 
CGGT CGAC CGGC 
Less often, bromouracil replaces C, and mimics T in its 
base-pairing: 
GTTA and ACAT and (rarely) ATTA 
CAAT TGTA TAAT 


c. 2-Aminopurine replaces A and normally base-pairs with T but 
may also pair with C: 


GCTG and GCCA and (rarely) GCCG 
CGAC CGGT CGGC 


17. a. 


5'-AGTTGCCATAGCATGCATGGATCACGACAGTAC 
AATA-3’ 


3’-TCAACTGTATCGTACGTACCTAATACTGTCATG 
CTAT-S’ 


b. 


5'-AGTTGACATAGCATGGATGGATTACGACAGT 
ACAATA-3' 
3’-TCAAATGTATCGTACGTACCTAATGCTGTCAT 
GCTAT-S’ 

G: 
5'-AGTTGACATAGCATCCATGGATTACGACAGTA 
TGATA-3’ 


3'-TCAATTGTATCGTACGTACCTAATGCTATCAT 
GCTAT-S’ 
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22. 


23. 


24. 


25. 


26. 


21. 


d. 
5'-AGTTGACACAGCATGCATGGATTACGACAGT 
ACGACA-3' 
3'-TCAACTGTATCGTAAGTACCTAATGCTATCATG 
CTAT-S’ 


Transposons are mobile genetic elements that can move from 
place to place in the genome. Insertion of a transposon within or 
near a gene may disrupt the gene or inactivate its expression. 


Proline 


The IgG proteins undergo glycosylation in the ER. Section 
7.5e discusses the effects of glycosylation on the properties of 
proteins. 


Hexameric helicases unwind DNA at a cost of 1 ATP/nucleotide. 
The genome of E. coli K12 consists of 4,686,137 nucleotides, 

so 4,686,137 ATP equivalents would be needed to completely 
unwind the £. coli K12 chromosome. 


Antibodies against specific proteins are commonly used protein 
identification tools. Armed with an antibody specific for DNA 
polymerase III, another antibody specific for DNA polymerase IV, 
and a procedure to separate replication forks (large macromolecular 
complexes) from soluble proteins (gel filtration, centrifugation?), 
addition of DNA polymerase IV should lead to association of IV 
with the replication fork and release of III. TV should now be 

found in the large macromolecular complex and HI among the 
soluble proteins. 


Eukaryotic translesion DNA polymerases with small and stubby 

thumb and finger domains are error-prone because they lack the 

structural elements ensuring high selectivity for the proper dNTP 
substrate. Since 3’-exonuclease activity is a proofreading activity, 
translesion DNA polymerases would not need it. 


a. Cells exit the cell at G; and enter Gp because this stage is prior 
to DNA replication. Once DNA replication takes place, cells 
move on to mitosis. 


b. Progression through the cell cycle is regulated at checkpoints. 
Checkpoints, which control whether the cell continues into the 
next phase, ensure that steps necessary for successful completion 
of each phase of the cycle are satisfactorily accomplished. If 
conditions for advancement to the next phase are not met, the 
cycle is arrested until they are. Checkpoints depend on cyclins 
and cyclin-dependent protein kinases (CDKs). Cyclins are 
synthesized at one phase of the cell cycle and degraded at 
another. CDK interaction with a specific cyclin is essential for the 
protein kinase activity of the CDK. In turn, these CDKs control 
events at each phase of the cycle by targeting specific proteins for 
activation (or inactivation) through phosphorylation. Destruction 
of the phase-specific cyclin at the end of the phase inactivates the 
CDK. 

Recombination in codon 97 suggests that the V, gene provides 

codons 94-96, which are GTT CAT CTT and specify Val-His-Leu. 

The fourth codon is either TTG (Leu), CTG (Leu), or CGG (Arg). 

CGG (Arg) would be the change. However, the desirability of a 

change cannot be anticipated. 

Consult the PSAFE website entry for reverse transcriptase to 

compose your answer. 


Consult the PSAFE website entry for RFC to compose your 
answer. 
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Think-Pair-Share Question: 
1. To detect general mutagenic effects, one could study the growth 


rate of a cell line or bacteria during or after exposure to the 
environmental factor. If significant mutations of the genome have 
occurred, one can assume that growth curve a of a cell line or bac- 
teria would be slower compared to the control experiments. With 
eukaryotic cells one could even look at the chromosomes for any 
aberrations that may be observable. 


. To detect mutations in a specific gene, one could employ PCR 


techniques with the proper primers to create a multitude of copies 
of the gene of interest. The gene then can be sequenced and evalu- 
ate for potential mutations. 


Chapter 29 
1. The first AUG codon is underlined: 


5'-AGAUCCGUAUGGCGAUCUCGACGAAGACUCCUA 
GGGAAUCC... 


Reading from the AUG codon, the amino acid sequence of the 
protein is: 


(N-term)Met-Ala-Ie-Ser-Thr-Lys-Thr-Pro-Arg-Glu-Ser-... 


. See Figure 29.3 for a summary of the events in transcription 


initiation by E. coli RNA polymerase. A gene must have a 
promoter region for proper recognition and transcription 

by RNA polymerase. The promoter region (where RNA 
polymerase binds) typically consists of 40 bp to the 5’-side of 
the transcription start site. The promoter is characterized by two 
consensus sequence elements: a hexameric TTGACA consensus 
element in the —35 region and a Pribnow box (consensus 
sequence TATAAT) in the —10 region (see Figure 29.4). 


. The fact that the initiator site on RNA polymerase for nucleotide 
binding has a higher Km for NTP than the elongation site ensures that 
initiation of mRNA synthesis will not begin unless the available con- 
centration of NTP is sufficient for complete synthesis of an mRNA. 


. First, transcription is compartmentalized within the nucleus of 


eukaryotes. Furthermore, in contrast to bacteria (which have only 
a single kind of RNA polymerase), eukaryotes possess three dis- 
tinct RNA polymerases—I, II, and HI—acting on three distinct 
sets of genes (see Section 29.3). The subunit organization of these 
eukaryotic RNA polymerases is more complex than that of their 
bacterial counterparts. The three sets of genes are recognized by 
their respective RNA polymerases because they possess distinctive 
categories of promoters. In turn, all three polymerases interact 
with their promoters via particular transcription factors that rec- 
ognize promoter elements within their respective genes. Protein- 
coding eukaryotic genes, in analogy with prokaryotic genes, have 
two consensus sequence elements within their promoters, a TATA 
box (consensus sequence TATAAA) located in the —25 region, 
and Jnr, an initiator element that encompasses the transcription 
start site. In addition, eukaryotic promoters often contain addi- 
tional short, conserved sequence modules such as enhancers and 
response elements for appropriate regulation of transcription. 


Transcription termination differs as well as prokaryotes and 
eukaryotes. In prokaryotes, two transcription basic termination 
mechanisms occur: rho protein-dependent termination and 
DNA-encoded termination sites. Transcription in eukaryotes 
tends to be imprecise, occurring downstream from consensus 
AAUAAA sequences known as poly(A) addition sites. An impor- 
tant aspect of eukaryotic transcription is post-transcriptional 
processing of mRNA (see Section 29.5). 
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5. œB’: protein complex of prokaryotic RNA polymerase 


TATAAT: Pribnow box as promoter for RNA transcription 


RPBI through 12: polypeptides (subunits) of human and yeast 
RNA polymerase 


(YSPTSPS)>7 o 52: repeat sequence of C-terminal of RPB1 


CREs: cis-regulatory elements; promoter sites for eukaryotic 
RNA transcription 


MREs: metal response elements 

GREs: glucocorticoid response elements 

TBP: TATA-binding protein 

KMTs: lysine methyltransferases 

HTH: helix-turn-helix motif in DNA-binding proteins 
Cys-X-Cys-x})-His-x3-His: repeat sequence of Zn-finger motif in 
DNA-binding proteins 

AAUAAA: polyadenylylation signal for post-transcription pro- 
cessing of RNA. 


. When DNA-binding proteins “read” specific base sequences in 
DNA, they do so by recognizing a specific matrix of H-bond 
donors and acceptors displayed within the major groove of 
B-DNA by the edges of the bases. When DNA-binding proteins 
recognize a specific DNA region by “shape readout,” the protein 

is discerning local conformational variations in the cylindrical 
surface of the DNA double helix. These conformational variations 
are a consequence of unique sequence information contributed by 
the base pairs that make up this region of the DNA. 


. The metallothionine promoter is about 265 bp long. Because 
each bp of B-DNA contributes 0.34 nm to the length of DNA, 
this promoter is about 90 nm long, covering 26.5 turns of 
B-DNA (10 bp/turn). Each nucleosome spans about 146 bp, 

so about two nucleosomes would be bound to this promoter. 


. If cells contain a variety of proteins that share a leucine zipper 
dimerization motif but differ in their DNA recognition helices, 
heterodimeric bZIP proteins can form that will contain two 
different DNA contact basic regions (consider this possibility in 
light of Figures 29.33 and 29.34). These heterodimers are no lon- 
ger restricted to binding sites on DNA that are dyad-symmetric; 
the repertoire of genes with which they can interact thus will be 
dramatically increased. 


. Exon 17 of the fast skeletal muscle troponin T gene is one of the 
mutually exclusive exons (see Figure 29.45). Deletion of this exon 
would cut by half the alternative splicing possibilities, so only 

32 different mature mRNAs would be available from this gene. 

If exon 7, a combinatorial exon, were duplicated, it would likely 
occur as a tandem duplication. The different mature mRNAs 
could contain 0, 1, or 2 copies of exon 7. The 32 combinatorial 
possibilities shown in the center of Figure 29.45 represent those 


10. 
11. 


12; 


13. 


14. 


15. 


16. 


17; 


with 0 or 1 copy of exon 7. Those with 2 copies are 16 in number: 
456778, 56778, 46778, 6778, 45778, 4778, 5778, 778, 4577, 577, 
5677, 45677, 4677, 477, 677, and 77. Given the mutually exclusive 
exons 17 and 18, 96 different mature mRNAs would be possible. 


32, 63, and 127 


Modifications include acetylation and methylation of Lys 
residues, methylation of Arg residues, and phosphorylation 

of Ser residues. The positively charged Lys residues interact 
electrostatically with phosphate groups on the DNA backbone, 
so these interactions would be diminished through acetylation. 
Methylation of Lys and Arg residues introduces bulky aliphatic 
groups that have the potential to cause steric hindrances between 
the DNA backbone and the histone tails. Such methylations (as 
well as the other modifications discussed here) may also establish 
recognition sites for chromatin-binding proteins that could 
modulate DNA:histone interactions. Phosphorylation of Ser 
residues would lead to electrostatic repulsion between the DNA 
backbone and the histone tails. Collectively, these modifications 
weaken histone: DNA interactions. 


Methylation preserves the positive charge on amino groups but 
provides steric hinderance for other molecules to bind to the 
positive charge. The positive charge is shielded. 


Because the /ac repressor is tetrameric and each subunit has an 
inducer-binding site, it seems likely that inducer binding is coop- 
erative. Cooperative binding of inducer would be advantageous 
because the slope of the lac operon expression versus [inducer] 
would be steeper over a narrower [inducer] range. 


Capping of the 5’-end and polyadenylation of the 3’-end of mRNA 
protect the RNA from both 5’- and 3’-exonucleolytic degradation. 
Methylations at the 5’-end enhance the interaction between cap- 
binding protein eIF4E (see Figure 30.29), thus increasing the likeli- 
hood that the mRNA will be translated. Note also in Figure 30.29 
that polyadenylation provides a protein-binding site for proper 
assembly of the 40S initiation complex essential to translation. 


In the A Deeper Look box “A Mechanism for All Polymerases” 

in Section 28.2, the argument is made that polymerases have 

a common enzymatic mechanism for nucleotide addition to a 
growing polynucleotide chain based on two metal ions coordinated 
to the incoming nucleotide. These metal ions interact with two 
aspartate residues that are highly conserved in both DNA and 
RNA polymerases. The discovery of a second Mg?* ion in the RNA 
polymerase II active site confirms to the universality of this model. 


50 bp of DNA is (50)(0.34 nm/bp) = 17 nm. Because 147 bp of 

DNA make 1.7 turns around the nucleosome (see Figure 11.29) = 
86.4 bp/turn of DNA, 50 bp = 0.58 turns, or slightly more 

than } turn of DNA around the histone core octamer. Promoter 
and response modules are about 20 to 25 bp. 


Branch sites are a, c, and g. 
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i OH 
bE 
O 


a-Helices composed of seven to eight residues would have a 
length of 1.05 to 1.2 nm. The overall diameter of B-DNA is 
about 2.4 nm, but the diameter at the bottom of major groove 
is significantly less (as a reference, the C,’—C,’ distance between 
base-paired nucleotides in separate chains is 1.1 nm). A distance 
along the B-DNA helix axis of 1.1 nm would correspond to 
about three base pairs, so if such an a-helix were laid along the 
major groove, it could not make contacts with more than three 
base pairs (see also Figure 29.31). 


Because of its flat structure and aromatic character, Phe is well 
suited for driving a wedge between stacked bases in DNA or in 
DNA/RNA hybrids. Other aromatic amino acids are trypto- 
phan, tyrosine and histidine. However, these amino acids contain 
hydrophilic features in their structure that could make them less 
efficient. Tryptophan, containing a fused double ring structure, 
may be too large for this purpose. 


Phe“ acts as a wedge to separate the template and nontemplate 
strands from one another; Phe is located at the end of the 
Y-helix. 

Hydroxyproline; asparagine; glycine (twice); valine; 
hydroxytryptophan 

a-Amanitin binds in the large cleft between the two largest RNA 
polymerase II subunits, in particular to an a-helix, the “bridge” 
helix, running across this cleft. Its position in the cleft slows 
inhibits translocation of DNA and RNA and the entry of the 
next template nucleotide into the active site. 


24. 


25. 


26. 


The two protein chains in pdb id = 1GU4 are identical through 
positions 268 (D) to 333 (Q). Leucines are found at positions 297, 
304, 306, 313, 320, 324, 327, and 330. Those at positions 306, 313, 
320, and 327 are seven residues apart, as in a leucine zipper. Basic 
residues (K and R) precede these leucines, occurring at positions 
268, 275, 277, 280, 286, 287, 289, 291, 293, 295, and 302. 


A protein such as CAP that interacts with phosphate groups in 
the DNA backbone does so via electrostatic interactions based 
on positively charged Arg and Lys side chains. 


Chromatin is decompacted by HATs and chromatin remodel- 
ing complexes. Histone acetylation by HATs disrupts chro- 
matin structure (see problem 11) and chromatin remodeling 
complexes then mediate ATP-dependent conformational 
changes in the chromatin that peels about 50 bp of DNA from 
the core octamer, exposing the DNA for access by the tran- 
scriptional machinery. 


Think-Pair-Share Question: 


a. 


Possible 5’ splicing sites are underlined 


5’-ACUUGACAGUAGGUAAGUUAGGCCUAUCAUGAUC 
GAGGUAAGUCGAGUCUCUUCUACAACCUCAGGAGC 
GAUGUCCUCCAGGCGGACUACGACCUGCGAGUG 
CAGCCCUUCCAGGAUGACGUAGCGAUGAC-3’ 
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Possible branch sites are underlined 


5’-ACUUGACAGUAGGUAAGUUAGGCCUAUCAUGAUC 
GAGGUAAGUCGAGUCUCUUCUACAACCUCAGGAGC 
GAUGUCCUCCAGGCGGACUACGACCUGCGAGUG 
CAGCCCUUCCAGGAUGACGUAGCGAUGAC-3’ 


Possible 3’ splicing sites are underlined 
5’-ACUUGACAGUAGGUAAGUUAGGCCUAUCAUGAUC 
GAGGUAAGUCGAGUCUCUUCUACAACCUCAGGAGC 


GAUGUCCUCCAGGCGGACUACGACCUGCGAGUG 
CAGCCCUUCCAGGAUGACGUAGCGAUGAC-3’ 


b. Possible splicing products could be: 


Splicing at first 5’ splicing site and first 3’ splicing site yields: 
5’-ACUUGACAGUAGGCGGACUACGACCUGCGAGUG 
CAGCCCUUCCAGGAUGACGUAGCGAUGAC-3’ 

Splicing at first 5’ splicing site and second 3’ splicing site yields: 
5’-ACUUGACAGUAGGAUGACGUAGCGAUGAC-3’ 
Splicing at second 5’ splicing site and first 3’ splicing site yields: 
5’-ACUUGACAGUAGGUAAGUUAGGCCUAUCAUGAUC 


GAGGCGGACUACGACCUGCGAGUGCAGCCCUUCCAG 
GAUGACGUAGCGAUGAC-3’ 


Splicing at second 5’ splicing site and second 3’ splicing site yields: 


5’-ACUUGACAGUAGGUAAGUUAGGCCUAUCAUGAUC 
GAGGAUGACGUAGCGAUGAC-3’ 


Chapter 30 
1. The cDNA sequence as presented: 
CAATACGAAGCAATCCCGCGACTAGACCTTAAC... 


represents six potential reading frames, the three inherent in the 
sequence as written and the three implicit in the complementary 
DNA strand, which as written 5’—3’, is: 


... GTTAAGGTCTAGTCGCGGGATTGCTTCGTATTG 


Two of the three reading frames of the cDNA sequence given 
contain stop codons. The third reading frame is a so-called open 
reading frame (a stretch of coding sequence devoid of stop 


codons): 
a Q AS 
UN YN a ae 
H Í 4 x 
se, ee ge \—< 
¢ | N (0) p. sA 
2 es \ < S 
Ko | ? 
G:C CIG 


6. The wobble rules state that a first-base anticodon U can 
recognize either an A or a G in the codon third-base posi- 
tion; first-base anticodon G can recognize either U or C in 
the codon third-base position; and first-base anticodon I can 
recognize either U, C, or A in the codon third-base position. 
Thus, codons with third-base A or G, which are degenerate 
for a particular amino acid (Table 30.3 reveals that all codons 
with third-base purines are degenerate, except those for Met 
and Trp), could be served by single tRNA species with first- 
base anticodon U. More emphatically, codons with third-base 


CAATACGAAGCAATCCCGCGACTAGACCTTAAC... 
(alternate codons underlined) 


The amino acid sequence it encodes is: 
Gln-Tyr-Glu-Ala-Ile-Pro-Arg-Leu-Asp-Leu-Asn.... 
Two of the three reading frames of the complementary DNA 
sequence also contain stop codons. The third may be an open 
reading frame: 
...GTTAAGGTCTAGTCGCGGGATTGCTTCGTATTG 
(alternate codons underlined) 


An unambiguous conclusion about the partial amino acid 
sequence of this cDNA cannot be reached. 


. A random (AG) copolymer would contain varying amounts of 


the following codons: 
AAA AAG AGA GAA AGG GAG GGA GGG 
codons for Lys Lys Arg Glu Arg Glu Gly Gly, respectively. 


Therefore, the random (AG) copolymer would direct the synthe- 
sis of a polypeptide consisting of Lys, Arg, Glu, and Gly. The 
relative frequencies of the various codons are a function of the 
probability that a base will occur in a codon. For example, if 
the A/G ratio is 5/1, the ratio of AAA/AAG is (5 X 5 X 5)/ 
(5X 5 X 1) = 5/1. If the 3A codon is assigned a value of 100, 
then the 2A1G codon has a frequency of 20. From this analysis, 
the relative abundances of these amino acids in the polypeptide 
should be: 
Lys = 120; Arg = 24; Glu = 24; Gly = 4.8 

(Normalized to Lys = 100: Arg = 20; Glu = 20; Gly = 4) 


. Leucine and arginine are coded by six different codons that may 


differ in the first nucleotide of the codon. 


. Review the information in Section 30.2, noting in particular the 


two levels of specificity exhibited by aminoacyl-tRNA synthe- 
tases (1: at the level at ATP =—— PP; exchange and aminoacyl 
adenylate synthesis in the presence of amino acid and the absence 
of tRNA; and 2: at the level of loading the aminoacyl group on 
an acceptor tRNA). 


. Base pairs are drawn such that the B-DNA major groove is at the 


top (see following figure). 


BONS r 
$ (0) Oo a x 
4 | L H . A \ 
NWN Li x 
R l \ a7 \ 
H $ kas 
C20 U:G 


pyrimidines (C or U) are always degenerate and could be 
served by single tRNA species with first-base anticodon G. 
Wobble involving first-base anticodon I further minimizes the 
number of tRNAs needed to translate the 61 sense codons. 
Wobble tends to accelerate the rate of translation because the 
noncanonical base pairs formed between bases in the third 
position of codons and bases occupying the first-base wobble 
position of anticodons are less stable. As a consequence, the 
codon:anticodon interaction is more transient. 
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T: 


10. 


11. 


12. 


13. 


The stop codons are UAA, UAG, and UGA. 


Sense codons that are a single-base change from UAA include 
CAA (Gln), AAA (Lys), GAA (Glu), UUA (Leu), UCA (Ser), 
UAU (Tyr), and UAC (Tyr). 
Sense codons that are a single-base change from UAG include 

CAG (Gln), AAG (Lys), GAG (Glu), UCG (Ser), UUG (Leu), 
UGG (Trp), UAU (Tyr), and UAC (Tyr). 
Sense codons that are a single-base change from UGA include 
CGA (Arg), AGA (Arg), GGA (Gly), UUA (Leu), UCA (Ser), 
UGU (Cys), UGC (Cys), and UGG (Trp). 

That is, 19 of the 61 sense codons are just a single base change 
from a nonsense codon. 


. 1. Mutation from AGG (R) or CGG (R): changes the 


hydrophilic, positively charged arginine into the hydrophobic 

tryptophan. 

2. Mutation from GGG (G): changes the smallest amino acid 

glycine into the largest amino acid tryptophan. 

3. Mutation from UUG (L): changes the smaller, hydrophobic 

leucine into the larger, hydrophobic tryptophan. 

4. Mutation from UCG (S): changes the smaller, hydrophilic 

serine into the larger, hydrophobic tryptophan. 

5. Mutation from UAG or UGA (stop): elongates the translation 

process further; making for a longer protein. 

6. Mutation from UGU or UGC (C): changes the smaller, 
hydrophilic cysteine (and capable of making disulfide bridges) 
into the larger, hydrophobic tryptophan. 


. The list of amino acids in problem 7 is a good place to start in 


considering the answer to this question. Amino acid codons in 
which the codon base (the wobble position) is but a single base 
change from a nonsense codon are the more likely among this list 
because pairing is less stringent at this position. These include 
UAU (Tyr) and UAC (Tyr) for nonsense codons UAA and UAG, 
and UGU (Cys), UGC (Cys), and UGG (Trp) for nonsense 
codon UGA. 


RNA editing may change A into inosine (I). I behaves like G as 
it base-pairs with C. The editing of AUG into IUG is similar to 
changing AUG into GUG and if both can serve as start codon it 
would not affect the translation process. 


The more obvious answer to this question is that eukaryotic 
ribosomes are larger, more complex, and hence slower than 
prokaryotic ribosomes. In addition, initiation of translation 
requires a greater number of initiation factors in eukaryotes 
than in prokaryotes. It is also worth noting that eukaryotic cells, 
in contrast to prokaryotic cells, are typically under less selective 
pressure to multiply rapidly. 


The initiation of translation in prokaryotes depends on the close 
proximity of an AUG codon to the Shine—Delgarno sequence. 
Mutation in that first AUG codon could prevent initiation, 

and the synthesis of the protein, as any other AUG codons on 
the mRNA may be too far removed from the Shine—Delgarno 
sequence. The initiation of translation in eukaryotes depends on 
the scanning of the mRNA transcript for an AUG codon close 
to the 5’ end. If the first AUG codon is mutated, a nearby AUG 
codon could serve as the starting point for translation but would 
lead to shorter protein sequence. 

Each amino acid of a protein in an extended B-sheet—like confor- 
mation contributes about 0.35 nm to its length. 10 nm + 0.35 nm 
per residue = 28.6 amino acids. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 
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Larger, more complex ribosomes offer greater advantages in 
terms of their potential to respond to the input of regulatory 
influences, to have greater accuracy in translation, and to enter 
into interactions with subcellular structures. Their larger size may 
slow the rate of translation, which in some instances may be a 
disadvantage. 


The universal organization of ribosomes as two-subunit 
structures in all cells—archaea, eubacteria, and eukaryotes— 
suggests that such an organization is fundamental to ribo- 
some function. Translocation along mRNA, aminoacyl-tRNA 
binding, peptidyl transfer, and deacylated-tRNA release are 
processes that require repetitious uncoupling of physical inter- 
actions between the large and small ribosomal subunits. 


Ribozymes are RNA molecules catalyzing biochemical reactions 
as protein-based enzymes do. 23S rRNA catalyzes the peptidyl 
transferase reaction and thus is a ribozyme. 


Prokaryotic cells rely on N-formyl-Met-tRNA;™" to initiate 
protein synthesis. The tR NA;™* molecule has a number of dis- 
tinctive features, not found in noninitiator tRNAs, that earmark 
it for its role in translation initiation (see Figure 30.16). Fur- 
thermore, N-formyl-Met-tR NA;™" interacts only with initiator 
codons (AUG or, less commonly, GUG). A second tRNA™*, 
designated tRNA,,.™, serves to deliver methiony] residues as 
directed by internal AUG codons. Both tRNA;™* and tRNA „M“ 
are loaded with methionine by the same methionyl-tRNA syn- 
thetase, and AUG is the Met codon, both in initiation and elon- 
gation. AUG initiation codons are distinctive in that they are 
situated about 10 nucleotides downstream from the Shine— 
Dalgarno sequence at the 5'-end of mRNAs; this sequence deter- 
mines the translation start site (see Figure 30.18). Eukaryotic 
cells also have two tRNA™* species, one of which is a unique 
tRNA; that functions only in translation initiation. Eukaryotic 
mRNAs lack a counterpart to the prokaryotic Shine-Dalgarno 
sequence; apparently the eukaryotic small ribosomal subunit 
binds to the 5’-end of a eukaryotic mRNA and scans along it 
until it encounters an AUG codon. This first AUG codon defines 
the eukaryotic translation start site. 


These three nucleotides on the acceptor stem of tRNA molecules 
interact with the peptidyl transfer center of 23S rRNA. These 
three nucleotides on tRNA are unpaired as they need to be 

free to base-pair with specific G’s, C’s or U of 23S rRNA 

(see Section 30.5e). 


The Shine-Dalgarno sequence is a purine-rich sequence element 
near the 5’-end of prokaryotic mRNAs (see Figure 30.18). It 
base-pairs with a complementary pyrimidine-rich region near the 
3’-end of 16S rRNA, the rRNA component of the prokaryotic 
30S ribosomal subunit. Base pairing between the Shine— 
Dalgarno sequence and 16S rRNA brings the translation start 
site of the mRNA into the P site on the prokaryotic ribosome. 
Because the nucleotide sequence of the Shine-Dalgarno element 
varies somewhat from mRNA to mRNA, whereas the pyrimidine- 
rich Shine—Dalgarno-binding sequence of 16S rRNA is invari- 
ant, different mRNAs vary in their affinity for binding to 30S 
ribosomal subunits. Those that bind with highest affinity are 
more likely to be translated. 


The most apt account is b. Polypeptide chains typically contain 
hundreds of amino acid residues. Such chains, attached as a peptidyl 
group to a tRNA in the P site, would show significant inertia to 
movement, compared with an aminoacyl-tRNA in the A site. 
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21. Elongation factors EF-Tu and EF-Ts interact in a manner 
analogous to the GTP-binding G proteins of signal transduction 
pathways (see Figure 15.20, as well as Section 32.4). EF-Tu binds 
GTP and in the GTP-bound form delivers an aminoacyl-tRNA 
to the ribosome, whereupon the GTP is hydrolyzed to yield 
EF-Tu: GDP and P;. EF-Ts mediates an exchange of the bound 
GDP on EF-Tu: GDP with free GTP, regenerating EF-Tu: GTP 
for another cycle of aminoacyl-tRNA delivery. The a-subunit 
of the heterotrimeric GTP-binding G proteins also binds GTP. 
G, has an intrinsic GTPase activity and the GTP is eventually 
hydrolyzed to form G,,: GDP and P;. Guanine nucleotide exchange 
factors facilitate the exchange of bound GDP on G, for GTP, 
in analogy with EF-Ts for EF-Tu. The amino acid sequences of 
EF-Tu and G proteins reveal that they share a common ancestry. 


22. Four: two in the aminoacyl-tRNA synthetase reaction; one by 
EF-Tu; one by EF-G. 
23. a. You can determine this from their relative sizes. 


b. A-site tRNA is orange; the P-site tR NA is blue; the E site- 
tRNA is red. 


c. The peptidyl transferase center; the 50S subunit. 
d. The decoding center; the 30S subunit. 

24. a. In the cleft between the 30S and 50S subunit. 
b. The anticodon ends of the tRNAs. 


c. The diphthamide moiety of eEF-2 helps maintain the correct 
reading frame.during translocation. 
25. Folding, post-translational modification with carbohydrates or 
other biomolecules, phosphorylation, intracellular transport to a 
particular location, anchoring in a membrane. 


26. The QB phage replicase mR NA shows the most consecutive per- 


fect matches (6) complementary to the Shine-Dalgarno sequence. 


27. Chloramphenicol consists of an aromatic ring and several polar 
functions (a nitro group, two —OH groups, and an amine-N). 
The aromatic ring might intercalate between bases in the 23S 
rRNA peptidyl transferase site, the polar functions might form 
H bonds with nitrogenous bases or sugar —OH groups in the 
rRNA. The —NO); group is also rather bulky in addition to 
being polar, so it could disrupt peptidyl transferase in a number 
of ways. 

Think-Pair-Share Question: 


a. 5'-CGUACUAUGCACGAUUAUCUCAGAACCUGAAAC 
GUAUU-3’ 


MHDYLRT 
b. 1. Mutation occurred before the START codon. 
2. Mutation occurred after the STOP codon. 


3. Mutation may lead to the same amino acid. 


4. Mutation may lead to a similar amino acid and not change 
the overall functionality of the protein. Examples would be 
D-E,Q-N,L—-IorT—-S. 


5. Mutation may change an amino acid in a non critical part of 
the protein and not alter the protein’s functionality. As long as 
the protein maintains the required 3D shape, it may still be 
functional. 


6. Mutation may occur in an intron that is cut out before transla- 
tion into the functional protein. 


Chapter 31 


1. Human rhodanese has 296 amino acid residues. Its synthesis 


would require the involvement of 4 ATP equivalents per residue 
or 1184 ATP equivalents. Folding of rhodanese by the Hsp60 
a4 complex consumes another 130 ATP equivalents. The total 
number of ATP equivalents expended in the synthesis and fold- 
ing of rhodanese is approximately 1314. 


. Because a single proteolytic nick in a protein can doom it to 


total degradation, nicked proteins clearly are not tolerated by 
cells and are degraded quickly. Cells are virtually devoid of 
partially degraded protein fragments, which would be the obvi- 
ous intermediates in protein degradation. The absence of such 
intermediates and the rapid disappearance of nicked proteins 
from cells indicate that protein degradation is a rigorously selec- 
tive, efficient cellular process. 


. Hsp70: Hsp70 binds to exposed hydrophobic regions of unfolded 


proteins. The Hsp70 domain involved in this binding is 18 kD 

in size and therefore would have a diameter of roughly 3 nm 

(see problem 5 for the math). Assuming the hydrophobic bind- 
ing site of Hsp70 stretches across its diameter, it would interact 
with about 3 nm/0.35 nm = 8 or 9 amino acid residues. Mul- 
tiple Hsp70 monomers could interact with longer hydrophobic 
stretches. So, for Hsp70 to interact with a polypeptide chain does 
not necessarily depend on the protein’s size, but it does depend 
on its hydrophobicity (and absence of charged groups). 


Hsp70 and Hsp60 chaperonins: Proteins that interact with both of 
these classes of chaperones not only must fit the description for 
Hsp70 targets but must also be small enough to access the Hsp60 
chaperonin chamber, which has a diameter of about 5 nm. This 
restriction means that Hsp60 cannot interact with proteins more 
than roughly 50 kD in mass (see problem 5). 


. Many eukaryotic proteins have a multidomain or modular 


organization, where each module is composed of a contiguous 
sequence of amino acid residues that folds independently into 

a discrete domain of structure (see Figure 6,24 for examples of 
this type of sequence and structure organization). Such proteins 
would be ideal for co-translational folding: As each newly syn- 
thesized contiguous sequence emerges from the ribosome tunnel, 
it begins folding into its characteristic domain structure. The 
final, fully folded state of the complete protein would be achieved 
when the various domains assumed their proper spatial relation- 
ships to one another through hinge motions occurring at intrado- 
main regions of the protein. 


. The maximal diameter for a spherical protein would be 5 nm 


(radius = 2.5 nm). The volume of this protein is given by V = 
far? = 4/3(3.14)(2.5 X 10-9 m}? = 65.4 X 10-2! mL. If its density 
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13. 


14. 


15. 


is 1.25 g/mL, its mass would be (1.25 g)(65.4 X 107?! mL) = 

81.8 X 10-7! g/molecule, so its molecular weight = (6.023 x 1073) 
(81.8 X 107?! g) = 492.5 X 10? g/mol = 49,250 g/mol, or about 
50 kD. 


. There are 27 — 1, or 127 different phosphorylated forms for a 


protein having 7 separate phosphorylation sites. 


. 32; one unphosphorylated form and 31 (2° — 1) phosphorylated 


forms. 


. GrpE catalyzes nucleotide exchange on DnaK, replacing ADP 


with ATP, which converts DnaK back to a conformational form 
having low affinity for its polypeptide substrate. This change 
leads to release of bound polypeptide, giving it the opportunity 
to fold, which is an important step in DnaK function. EF-Ts 
catalyzes nucleotide exchange on EF-Tu, converting it to the 
conformational form competent in aminoacyl-tRNA binding. 
Binding of the aminoacyl-tRNA : EF-Tu complex to the ribosome 
A site triggers the GTPase activity of EF-Tu as codon: anticodon 
recognition takes place and the aminoacyl-tRNA : EF-Tu complex 
conformationally adjusts to the A site. DnaJ delivers an unfolded 
polypeptide chain to DnaK, and its interaction with DnaK 

also triggers the ATPase activity of DnaK, whereupon the 
DnaK: ADP complex forms a stable complex with the unfolded 
polypeptide. In both instances, hydrolysis of bound nucleoside 
triphosphate is triggered and leads to conformational changes 
that stabilize the respective complexes. 


. Protein targeting information resides in more generalized 


features of the leader sequences such as charge distribution, 
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relative polarity, and secondary structure, rather than amino 
acid sequence per se. Proteins destined for secretion have 
N-terminal amino acid sequence with one or more basic amino 
acids followed by a run of 6 to 12 hydrophobic amino acids. 

The sequence MRSLLILVLCFLPAALGK.... has a basic resi- 
due (Arg) at position 2 and a run of 12 nonpolar residues 

(... LLILVLPLAALG...); thus, it appears to be a signal sequence 
for a secretory protein. Cleavage by the signal peptidase would 
occur between the G and K. 


10. Translocation proceeds until a stop transfer signal associated 


with a hydrophobic transmembrane protein segment is recog- 
nized. The stop-transfer signal induces a pause in transloca- 
tion, and the translocon changes its conformation such that the 
wall of this closed cylindrical structure either opens or exposes 
a hydrophobic path, allowing the transmembrane segment to 
diffuse laterally into the hydrophobic phase of the membrane. 


1. The maximal diameter for a spherical protein would be 6 nm 


(radius = 3 nm). The volume of this protein is given by V = 
trr = 4/3(3.14)(3 X 10-?m)? = 113 X 10-7! mL. If its density is 
1.25 g/mL, its mass would be (1.25 g)(113 X 10-7! mL) = 

141 X 10-7! g/molecule, so its molecular weight = (6.023 x 107) 
(141 xX 1077! g) = 851 X 10? g/mol = 85,100 g/mol, or about 

85 kD. 


. The thickness of a phospholipid bilayer is 5 nm, so the overall 


channel formed by a 50S ribosome and the translocon channel is 
15 nm; 15 nm + 0.35 nm per amino acid = about 43 amino acid 
residues. 


A different type of linkage (thioester) joins ubiquitin to El and 
E2 unlike the amide type of linkage between ubiquitin and other 
proteins. This different linkage does not target El and E2 for 
degradation. 

E3 ubiquitin protein ligase selects proteins by the nature of the 
N-terminal amino acid. Proteins with Met, Ser, Ala, Thr, Val, 
Gly, or Cys at the amino terminus are resistant to its action. 
Proteins having Arg, Lys, His, Phe, Tyr, Trp, Leu, Ile, Asn, Gln, 


CH,—CH, 


Asp, or Glu at their N-terminus are susceptible. N-terminal Pro 
residues lack a free a-amino group, so such proteins are not 
susceptible. 


. As shown in the figure, a nucleophilic attack by the alcohol 


functional group of threonine will release the N-terminal portion 
of the targeted peptide and covalently link the C-terminal 
portion of the peptide to threonine. Subsequent hydrolysis is 
needed to release the C-terminal portion of the peptide. 
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HC ~~ HN 
CH 
Target peptide bond Ga / NCH Threonine residue in 


in protein | HO proteasome ß unit 
C 


A No 


Nucleophilic substitution 


HG ~~ HN 
CH / 


Peptide fragment of 
target protein 


NHo / OCH Threonine residue in 
7 / proteasome ß unit 
Ves = 


O 


P HO 


Hydrolysis 
HC HN 
e 
CH_ / 
/ OCH Threonine residue in 
HO / proteasome ß unit 
OH Oe 
/ O 
Oa c 


Peptide fragment of 


17. 


18. 


19. 


20. 


21. 


22; 


target protein 


The cell cycle relies on the cyclic synthesis and destruction of 
cell cycle regulatory proteins. Lactacystin inhibits cell cycle 
progression by interfering with programmed destruction of pro- 
teins by proteasomes. 


The presence of methionine at the N-terminus of a protein pre- 
vents degradation by the ubiquitin/proteasome system. Removal 
of the methionine exposes a new N-terminal amino acid and, 
depending on the nature of this amino acid, the protein may 
become more or less susceptible to degradation. 


The temperature-induced switch could be based on a 
temperature-dependent conformational change in the HtrA 
protein that brings the His-Ser-Asp catalytic triad into the proper 
spatial relationship for protease function. 


a. EQWERTGHGVKQEGTYPARSTW 
c. YQWEFQAASDDFIKNDPLKIYYN 


There are many other post-translational modifications, including 
acylation, alkylation, amidation (at a protein’s C-terminus), 
biotinylation, formylation, glycosylation, isoprenylation, and 
oxidation. See also Table 5.4. 

Disulfide bridge formation, modification with carbohydrates, 
modification with fatty acids or GPI anchor, oxidation of amino 
acids (proline to hydroxyproline), modification with cofactors 
like FAD or lipoamide. 


23. This exercise is left to the student. 


24. A phosphodegron is defined as one or a series of phosphorylated 
residues on a substrate protein that interact directly with a 
protein-protein interaction domain in an E3 ubiquitin ligase, 
thereby linking the substrate to the ubiquitin conjugation machinery. 
Phosphorylation can affect the affinity of target protein binding; 
alternatively, phosphorylation can stimulate E2 activity. 


25. It depends on which amino acid is penultimate. If Arg, Lys, His, 
Phe, Tyr, Trp, Leu, Ile, Asn, Gln, Asp, or Glu follow immedi- 
ately after Met, the protein will have a short half-life after Met 
removal. If Ser, Ala, Thr, Val, Gly, Cys, Pro, or another Met 
follow, the protein should have a long half-life after N-terminal 
Met removal. 


Think-Pair-Share Question: 
Possible answers: 


1. It requires only one type of chemical linkage to create proteins: 
amide bond formation between different amino acids. The 
primordial soup of precursors could have contained a multitude 
of amino acids that could be linked into primordial peptides or 
proteins by a suitable catalytic surface. It takes more (and 
different types) of chemical reactions (add base to sugar, add 
phosphate to sugar, link phosphates together) to generate a 
nucleic acid structure like RNA molecules. 


. The functionality of a protein is determined in large part by its 
tertiary structure. A chain of amino acids could fold into a 
particular 3D structure, either spontaneously or shaped by the 
boundaries of a cavity of the catalyzing mineral. As the study of 
biochemistry shows, there appears to be no limit to what can be 
achieved by (properly folded) proteins. If a diverse range of 3D 
structures can be generated, a diverse range of chemical reactions 
could be catalyzed by the proteins. RNA molecules can adopt 
complex 3D structures and can be catalytically functional. 
Compared to proteins the catalytic functionality of RNA 
molecules appears to be less diverse. 


. The “information” related to a particular functionality is 
determined by the 3D structure of a protein. Thus, “structural 
fidelity” rather than “chemical fidelity” is sufficient to copy the 
“information” within a protein. If chemically different 
polypeptides fold into a similar shape, similar functionality may be 
achieved. A multitude of examples of this principle can be found 
throughout the many chapters in the handbook. 


Within a primordial catalytic system, chemically different 
polypeptides could be generated randomly, but shaped similarly 
(by the boundaries of the cavity of the mineral catalyst) and thus 
be functionally similar. Hence, the “information” within proteins 
could be copied. The randomness of the synthesis could provide 
sufficient opportunities to introduce “errors” or changes in the 
polypeptide structures to allow for an evolutionary force towards 
ever increasing efficiency (and specialization) in a protein’s 
functionality. 


Chapter 32 


1. Polypeptide hormones constitute a larger and structurally more 
diverse group of hormones than either the steroid or amino 
acid—derived hormones, and it thus might be concluded that the 
specificity of polypeptide hormone-receptor interactions, at least 
in certain cases, should be extremely high. The steroid hormones 
may act either by binding to receptors in the plasma mem- 
brane or by entering the cell and acting directly with proteins 
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controlling gene expression, whereas polypeptides and amino 
acid—derived hormones act exclusively at the membrane surface. 
Amino acid—derived hormones can be rapidly interconverted 

in enzyme-catalyzed reactions that provide rapid responses to 
changing environmental stresses and conditions. See The Student 
Solutions Manual, Study Guide and Problems Book for additional 
information. 


. The cyclic nucleotides are highly specific in their action, because 
cyclic nucleotides play no metabolic roles in animals. Ca** ion 
has an advantage over many second messengers because it can be 
very rapidly “produced” by simple diffusion processes, with no 
enzymatic activity required. IP; and DAG, both released by the 
metabolism of phosphatidylinositol, form a novel pair of effec- 
tors that can act either separately or synergistically to produce 

a variety of physiological effects. DAG and phosphatidic acid 
share the unique property that they can be prepared from several 
different lipid precursors. Nitric oxide, a gaseous second messen- 
ger, requires no transport or translocation mechanisms and can 
diffuse rapidly to its target sites. 


. Nitric oxide functions primarily by binding to the heme 
prosthetic group of soluble guanylyl cyclase, activating the 
enzyme. An agent that could bind in place of NO—but that does 
not activate guanylyl cyclase—could reverse the physiological 
effects of nitric oxide. Interestingly, carbon monoxide, which has 
long been known to bind effectively to heme groups, appears to 
function in this way. Solomon Snyder and his colleagues at Johns 
Hopkins University have shown that administration of CO to 
cells that have been stimulated with nitric oxide causes attenua- 
tion of the NO-induced effects. 


. Herbimycin, whose structure is shown in the following figure, 
reverses the transformation of cells by Rous sarcoma virus, pre- 
sumably as a direct result of its inactivation of the viral tyrosine 
kinase. The manifestations of transformation (on rat kidney 
cells, for example) include rounded cell morphology, increased 
glucose uptake and glycolytic activity, and the ability to grow 
without being anchored to a physical support (termed anchorage- 
independent growth). Herbimycin reverses all these phenotypic 
changes. On the basis of these observations, one might predict 
that herbimycin might also inactivate tyrosine kinases that bear 
homology to the viral pp60" ** tyrosine kinase. This inactivation 
has in fact been observed, and herbimycin is used as a diagnostic 
tool for implicating tyrosine kinases in cell-signaling pathways. 


H,CO 


HCO i i 
CH; CH; OCH, 


. The identification of phosphorylated tyrosine residues on cellular 
proteins is difficult. Quantities of phosphorylated proteins are 
generally extremely small, and to distinguish tyrosine phosphory- 
lation from serine/threonine phosphorylation is tedious and labo- 
rious. On the other hand, monoclonal antibodies that recognize 
phosphotyrosine groups on protein provide a sensitive means of 
detecting and characterizing proteins with phosphorylated tyro- 
sines, using, for example, Western blot methodology. 


10. 


11. 


12, 


13, 


14. 
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Abbreviated Answers to Problems 


1. ATP [y-?P] is ATP with the terminal phosphate group radio- 
actively labeled as **P. 


2. Mix the kinase enzyme of interest with its possible substrates 
and add ATP [y-*P]. Using typical protein separation tech- 
niques (see Chapter 5) evaluate which proteins carry a radioac- 
tive label, indicative of their phosphorylation by the kinase of 
interest. 


. Hormones act at extremely low concentrations, but many of the 


metabolic consequences of hormonal activation (release of cyclic 
nucleotides, Ca** ions, DAG, etc., and the subsequent alterations 
of metabolic pathways) occur at and involve higher concentra- 
tions of the affected molecular species. As we have seen in this 
chapter, most of the known hormone receptors mediate hor- 
monal signals by activating enzymes (adenylyl cyclase, phospholi- 
pases, protein kinases, and phosphatases). One activated enzyme 
can produce many thousands of product molecules before it is 
inactivated by cellular regulation pathways. 


. Vesicles with an outside diameter of 40 nm have an inside diam- 


eter of approximately 36 nm and an inside radius of 18 nm. The 
data correspond to a volume of 2.44 X 107% L. Then 10,000 
molecules/6.02 X 1073 molecules/mole = 1.66 x 107% mole. The 
concentration of acetylcholine in the vesicle is thus 1.66/2.44 M 
or 0.68 M. 


. The evidence outlined in this problem points to a role for cAMP 


in fusion of synaptic vesicles with the presynaptic membrane and 
the release of neurotransmitters. GTPyS may activate an inhibi- 
tory G protein, releasing G,(GTPyS), which inhibits adenylyl 
cyclase and prevents the formation of requisite cAMP. 

In both cases, the fraction of receptor that is bound with hor- 
mone is approximately 50%. 


Glucagon signals low blood sugar levels (or low caloric status in 
general) and will promote the breakdown of glycogen to provide 
glucose and will promote the release of fatty acids from adipose 
tissues to raise the caloric status. Epinephrine signals “stress” 

or a “fight-or-flight” situation. Under “fight-or-flight” circum- 
stances muscle cells will need ATP for muscle contraction and the 
release of glucose from glycogen or fatty acids from adipose tis- 
sues would be promoted by epinephrine in that context. 


Statin drugs inhibit the synthesis of new cholesterol in human 
subjects, but they have no effect on dietary intake of cholesterol. 
Dietary cholesterol accounts for approximately two-thirds of all 
body cholesterol. Moreover, synthesis of steroid hormones is not 
generally dependent on ambient levels of cholesterol in the body. 
Thus the taking of statin drugs would not be expected to influ- 
ence the levels and functions of steroid hormones in the body. 


B-Strands indeed provide more genetically economical means of 
traversing a membrane. However, the only significant classes of 
proteins that employ B-strands for membrane traversal are the 
porins and toxins such as hemolysin. These B-strand structures 
provide no easy way to communicate conformational changes 
across the membrane. On the other hand, bundles of a-helices 
are more able to convey and transmit conformational changes 
across a bilayer membrane. 


a-helices are more flexible and thus more suitable for cell 
signaling through conformational changes in proteins. Although 
not a receptor, the discussions of the structure of calmodulin 

in Section 32.4k and Figure 32.33 provide a good example of 
this property of a helices. 
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15. 


16. 


17. 


18. 


19, 


20. 
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Abbreviated Answers to Problems 


The formation and breakdown of cAMP: 


tT tar 
Yop 
| 


BH 
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H 
T 
HO—P—O—CHp, Adenine 
| 
ò- 
H H 
OH OH 


Activation of G-proteins requires and consumes GTP. The GDP 
produced by the activated G-proteins is converted back to GTP 
using ATP. 


Three possible targets could be: inhibition of GTPase activity 
of Gia, inhibition of adenylyl cyclase or inhibition of 
phosphodiesterase. 


The inhibition of GTPase activity in Gia would be more 
selective. The Gi protein is often selectively activated by a specific 
receptor itself activated by its specific ligand. Thus, this type of 
inhibition would affect only one cell signaling pathway and leave 
cAMP production through other receptors/ligands untouched. 


The interventions on adenylyl cyclase or phosphodiesterase 
would be less selective as they may interfere with the signaling 
efforts of many different hormones and receptors. These two 
enzymes are often targeted by multiple different signaling 
pathways. 


The point at which no ion flow occurs is the point of equilibrium 
that balances the chemical and electrical forces across a mem- 
brane. The Nernst equation is obtained by setting AG to zero in 
Equation 9.2. Rearrangement yields 


Using the data in Figure 32.49, one can use this equation to yield 


an equilibrium potential for K* of —80 mV and an equilibrium 
potential for Na* of +55.5 mV. 


Using the Goldman equation, one can calculate Ay = —60 mV, 
in agreement with values measured experimentally in neurons. 
The resting potential in neurons is —60 mV. Local depolarization 
of the membrane causes the potential to rise approximately 


21. 


22, 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


20 mV to about —40 mV. This change causes the Na* channels 
to open and Na* ions begin to flow into the cell. This causes the 
potential to continue increasing to a value of about +30 mV. 
Because this value is close to the equilibrium potential for Na*, 
the Na* channels begin to close and K* channels open. K* 
rushes out of the neuron, returning the membrane potential to a 
very negative value. There is a slight overshoot of the potential, 
and the K* channels close and the resting membrane potential is 
restored. 


The binding of hormones, neurotransmitters or other molecules 
to a specific receptor on the surface of the neuron or the arrival 
of an action potential from a neighboring neuron. 


Pathway steps (in Figure 32.4) that involve amplification include 
the tyrosine kinase of the receptor protein, Raf, MAPKK, and 
MAPK. 


If Ras were mutated so as to have no GTPase activity, it would 
activate Raf indefinitely, and the pathway would run as if hor- 
mones were continually activating the receptor tyrosine kinase. 


There are several possible answers to this problem. One is that 
the original assays of kinase activities that reflected signaling 
effects were carried out on whole-cell preparations, in which the 
G-protein-dependent and G-protein-independent activities would 
appear as components of the overall activation. The availability 
of G-protein-uncoupled receptor mutants and ligands that could 
stimulate arrestin recruitment without G-protein activation made 
it possible to distinguish the G-protein-dependent and G-protein- 
independent effects. (See Ahn, S, et al., 2020. SnapShot: B-Arres- 
tin Functions. Cell 182: 1362-13362.e1. for additional discussion 
of these issues.) 


Conditions like epilepsy, insomnia, anxiety are caused by the 
hyperactivity of certain neurons. Using agonists (activators) of 
an inhibitory neuron makes sense in this context. 


Malathion has been used for years in the eradication of boll 
weevils, but it has not presented a serious toxicity problem in 
this program. Apparently, malathion is very toxic to insects, and 
relatively less toxic to humans, particularly in the low volume 
applications used for this purpose. 


This exercise is left to the student and will presumably be 
different for every student. 


This exercise is left to the student and will presumably be 
different for every student. 


This exercise is left to the student and will presumably be 
different for every student. 


Think-Pair-Share Question: 


1. 
2 


The concentrations of the hormones present in the circulation. 


The number or density of receptors on the surface of the target 
cell. 


3. The number of binding sites per receptor molecule or complex. 


. The affinity (Ky constant) of the receptors for their respective 


ligand. This relates to the “quality” of the receptor (proper fold- 
ing; proper orientation through membrane). 


5. Presence and concentration of the intracellular receptor ligands. 


. Presence and concentration of all the downstream effectors in the 


signaling pathways. 
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drugs, 480 
virus, 496 
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Allantoinase, 935 
Allosteric enzymes, 448 
Allosteric inhibition, 516 
sequential model, 517-519 
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regulation 
behavior of hemoglobin, 533 
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general features, 516-517 
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Alternative splicing, 1074 
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Amino acid, 11 
acid-base properties, 83—89 
acidic, 79, 80 
from acids by transamination, 884-909 
anatomy of, 78 
aromatic, 902-907 
basic, 81, 82 
branched-chain, 897, 912-914 
catabolism, 907-909 
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composition, 126 
definition of, 77 
dietary requirements, 885 
effect of substitutions, 448 
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grouping of, 909 
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ionic forms, 84-85 
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nonpolar, 79, 80 
optical and stereochemical properties, 
89-91 
pK values, 85 
polar, uncharged, 79, 80 
rare occurrence in protein, 83 
reactions, 89, 885 
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Aminotransferase, 460, 886 

Aminotriazole, 906 

Ammonium 
metabolic fate, 877-880 
pathways, 879-880 

Amoxicillin, 489 
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Amyotrophic lateral sclerosis, 133 

Anabolic steroid, 865 

Anabolism 
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triphosphate chain of, 60 
yield from glucose oxidation, 703—704 
ATP-ADP translocase, 698—700 
ATP-citrate lyase, 823 
ATP-dependent kinase, 930-931 
ATP synthase 
catalytic sites of, 693 
protein gradient and, 691 
ATPase activity, 546 
Atrial natriuretic peptide (ANP), 1171, 
1175, 1176 
Attfield, Paul, 205 
Augustus-Wallace, Allison C. Dr., 927 
Autocoid, 320 
Autogenous regulation, 1049 
Autophagy, 705, 1146 
Autophosphorylation, 1173, 1175 
Autotroph, 574 
Axon, 554, 1198-1199 
Axonemal dynein, 553 
Azaserine, 925 
Azide, 697-698 


B 
Bacteria, 17 
cell, 20 
cell wall, 214, 216-218 
flagella, 564-567 
structural organization, 20-21 
transcription regulation in, 1050 
Bacterial artificial chromosome 
(BAC), 399 
Bacteriophage, 22 
Bacteriophage T4 DNA ligase, 396 
Bacteriorhodopsin, 106, 273, 274, 308, 
309, 696 
Baculovirus, 410 
Baldness, 863 
Ball, Alice, 240 
Base, 38, 326-327. See also Acid 
analog induced mutation, 1015 
chemical mutagens, 1016 
editing with CRISPR-Cas9, 420-421 
Hoogsteen pairs, 364 
intrastrand pairs, 380-381 
mutations, 1016 
sequence of a nucleic acid, 325-326 
Watson—Crick, 355 
Base excision repair (BER), 1009 
Basic amino acids, 81, 82 
Benerito, Ruth, 213 
Beta-alpha-beta loop, 175 
Beta-aminopropionitrile, 164 
Beta form, 150-151 
Beta-oxidation, 796-797 
Beta-pleated sheet, 153-155 
Beta-sheet protein, 160 
Beta-strand, 108 
Beta-turns, 156-157 
Bicarbonate buffer system, 44-45 
Bidirectional replication, 982 
Bifunctional enzyme, 762 


Copyright 2024 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Bilayer, 268-270 
Bile acid, 252-253, 861 
Bile salt, 793, 861 
Bimolecular reactions, 439, 455-461 
Bimosiamose, 222 
Binding change mechanism, 693, 694 
Binding isotherm, 136 
Binding site, 136 
BioBricks, 423-424 
Bioinformatics, 346, 353-354 
Biological clock, 591 
Biological function, of protein, 134-137 
BioMagResBank (BMRB), 126 
Biomolecular interaction, 12, 15, 16 
Biomolecule 
bimolecular dimensions, 7 
carbon as, 5 
defined, 1 
high-energy, 57—65 
how properties reflect fitness to living 
condition, 10-17 
and molecules, 4-5 
radii of common atoms of, 13 
structural organization, 5-9 
three-dimensional architecture, 12 
Biosynthesis, 223, 257, 776-786, 822, 
835-836, 844, 906-909 
Biosynthetic processes, 654 
Biotin carboxyl carrier protein (BCCP), 824 
Biotin carboxylase (BC), 824 
Bisphosphoglycerate mutase, 617 
Bitopic proteins, 272 
BLAST (Basic Local Alignment Search 
Tool), 127 
Blood clotting, 514 
Blood glucose, 772 
Blood pH, 46 
Blood plasma, 44—45 
Blood pressure, 480 
Bloodstream, 795 
BLOSUM (Blocks Substitution Matrix), 127 
BLOSUM62 substitution matrix, 128 
Blotting, 404 
Boat conformation structure, 199 
Bohr effect, 533, 534 
Bond order, 488 
Booker, Squire J. Dr., 876 
Bovine pancreatic ribonuclease, 107, 162 
Boyer exchange experiment, 693-694 
Boyer, Herbert, 398 
Boyer, Paul, 693 
BPG effect, 535 
Brain, 883, 963 
Brain natriuretic peptide (BMP), 1175-1176 
Branched-chain amino acid, 897, 912-914 
Branched-chain amino acid 
aminotransferase (BCAT), 897 
Branched-chain fatty acid, 812 
Branching enzyme, 768 
Branson, Herman, 151 
Breast cancer, 1007, 1008 
Brillouin, Léon, 54 
Bromide, 362 
Bromocriptine, 1210 
Bromodomain, 1061 


Bronchodilator, 545 

Brown, Alison Dr., 1091 

Brown fat, 967 

Buffer, 43-45. See also Acid; Base 
physiological pH ranges, 45 

Building blocks, 5, 7, 10 

Burst kinetics, 493 

bZIP, 1065, 1067—1068 


C 
C gene, 1021, 1022 
C-3 family of amino acids, 909-911, 914 
C3 plant, 743, 744 
C-4 family of amino acids, 911 
C4 plant, 743, 744 
C-5 family of amino acids, 911 
C-reactive protein, 230 
C-terminal, 115-116 
C-terminal analysis, 117-119 
C-terminal domain (CTD), 1054, 1055 
Ca?*-ATPase, 303-304 
Cacti, 744 
Caffeine, 320 
Cahn, Robert, 90 
Calcium, 303, 1186 
Calcium-binding protein, 1186 
Calcium carbonate, 575 
Calmodulin-binding domain, 554 
Caloric restriction (CR), 971-972, 973 
Calvin-Benson cycle, 737 
cAMP, 322. See also cyclic AMP 
cAMP-dependent protein kinase, 761, 1181 
cAMP-response element-binding (CREB), 
1065 
Cancer, 167, 182, 623, 785, 884, 944, 1007, 
1154, 1185. See also Breast cancer 
Canola oil, 239-240 
Cantley, Lewis, 623 
Capsid, 22, 558 
Carbamoyl-phosphate synthetase I (CPS-I), 
879, 887 
Carbamoyl-phosphate synthetase IT 
(CPS-II), 935 
Carbohydrate 
definition of, 193 
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Carbon monoxide, 697-698 
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sphingolipid, 844 
Complexin, 292 
Compulsory substrate, 458 
Concentration gradients, 2, 71, 293, 294 
Concentration, protein, 111 
Condensation, 645, 649, 827, 844 
Configuration, 109 
Conformation/conformational, 2 
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Cytosolic ribosome, 1103-1104 


D 

Daly, Marie Maynard, Dr., 580 

Dark reaction, 715 

Deadenylylation, 882 

Debranching system, 765 

Decarboxylation, 647, 648, 827, 828 

Decay-accelerating factor (DAF), 220 

Decoding center, 1102, 1109 

Decorin, 224-225 

Deisenhofer, Johann, 725 

Denaturation, 16, 165-169 

De La Cruz, Enrique, 548 

Denitrifying bacteria, 872 

Deoxyribonucleosides, 319 
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replication, 328, 346 
restriction endonucleases and, 337—339 
restriction mapping of, 339-340 
RNA chemical difference, 334-335 
Sanger’s chain termination reaction, 
346-348 
secondary structure 
and nucleic acid hybridization, 369 
and pH extremes and H-bonding, 367 
rate of denaturation, 368-369 
thermal denaturation and, 367 
single-stranded, 367-368 
Southern hybridization and, 404 
strand, 11 
superhelix density, 372 
synthesis, 940—942 
toroidal supercoiled, 372 
triple-stranded, 365 
Watson and Crick’s postulate of, 
327-328 
Watson-Crick base pairs, 355 
Z, 361 
DNA-binding, 412 
DNA chip, 400 
DNA footprinting, 1035, 1037 
DNA gyrase, 371-372, 982 
DNA helicase, 544 
DNA library 
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saturated and unsaturated, 237-238 
structure and chemistry of, 237—241 
thiokinase, 798 
unsaturated, 809-811 
Fatty acid synthesis 
acetate units in, 822 
acetyl-CoA carboxylase in, 824 
acyl carrier proteins in, 827 
arachidonic acid, 836 
decarboxylation, 827 
elongation and unsaturation, 832-833 
eukaryotes, 827 
hormonal signal regulation, 835-837 
malonyl-CoA in, 822 
in multienzyme complexes, 827 
NADPH in, 822 
polyunsaturated, 836 
reduction of carbonyl group, 829 
unsaturation reactions, 833-834 
Fatty acyl synthase I (FAS I), 827 
Fatty acyl synthase II (FAS II), 827 
Feedback inhibition, 516, 880 
Feedback regulation, 516 
FeMo-cofactor, 875 
Fermentation, 621 
Ferredoxin, 721, 722 
Ferredoxin-NADP reductase, 722 
Fetal hemoglobin, 535, 537 
FGAR amidotransferase, 925 
Fibril, 566, 1133 
Fibroin protein, 160 
Fibronectin, 225 
Fibrous protein, 106, 159, 162 
Filmer, David, 517 
Fingolimod, 258 
First law of thermodynamics, 53 
First-order reaction, 438 
Fischer projection, 195-196 
Fission, 673 
Flagella, 21 
bacterial, 564-567 
flagellar filaments, 566 
flagellar rotor, 565 
motor reversal, 566 
protofilaments, 566 
Flavin coenzyme, 643 
Flavodoxin, 163 
Flexible rod, 220-221 
Flippases, 284 
Floppases, 284 
Fluid mosaic model, 269 
Fluorescence, 718 


Index 1-7 


Fluoro-substituted analog, 944 
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Ganglioside, 246, 847 
Gap penalty, 127 
GAR transformylase, 925 
Gastric H*,K*-ATPase, 303-305 
Gastric zymogens, 514 
GCS-100, 222 
Gel filtration chromatography, 113 
Gel-to-liquid phase transition, 286 
Geminin, 995 
Gene 
ATG, 1146 
definition, 19, 981 
fusion systems, 411 
minimum gene set, 19 
for molecular motors, 553 
regulatory protein, 1064-1068 
transcription 
bacterial RNA polymerase and, 1034 
chain elongation, 1039 
in eukaryotes, 1052-1068 
intrinsic termination, 1039 
of operon, 1042 
overview, 1034 
polymerization initiation, 1036 
process of, 1034-1041 
regulated in bacteria, 1041-1052 
RNA polymerase binding, 1035-1038 
role of Mediator in, 1058-1059 
whimsical names, 1196 
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Gene chip, 407 
Gene expression, 880 
regulation, 1077-1079 
resources, 1084—1088 
response, 1161—1162 
unified theory of, 1080-1081 
Gene knockdown, 413 
Gene reorganization, 1020 
Gene silencing, 333 
Gene therapy, 933 
General acid-base catalysis, 487-488 
General recombination, 999, 1001-1002 
General transcription factor (GTF), 1056 
Genetic code, 1089-1092 
Genetic engineering, 393 
Genetic recombination 
breakage and reunion of DNA strand, 
999 
general, 999, 1001-1002 
homologous, 999-1001, 1007 
nonhomologous, 999 
recombination-dependent replication, 
1006 
somatic, 999 
Genome editing, 417-420 
Genome engineering, 416-417 
Genome nucleotide sequences, 352 
Genomic library. See DNA library 
Genomics, 578 
functional, 183 
structural, 183 
Germiline gene, 1021 
GFP (green fluorescent protein), 91 
Ghrelin, 970 
Gibbs free energy, 52 
Globular protein, 106 
aggregation symmetries of, 183 
cellular function mediation, 162 
folding tendencies and patterns of, 
173-175 
helices and sheets making up core of, 
162-163 
motion in, 172-173 
thermodynamic driving force for folding 
of, 172 
Glucagon, 775 
Glucocorticoid, 253, 864 
Glucocorticoid response element (GRE), 
1057 
Glucokinase, 608, 609 
Gluconeogenesis, 624 
allosteric regulation of, 760 
definition, 752 
glycolysis versus, 753 
inhibitors, 758 
in liver and kidney, 752-753 
reactions, 753 
reactions unique to, 755-759 
regulation of, 759-763 
resources, 789-790 
substrates, 752 
Gluconic acid, 200 
Gluconolactonase, 785 
Glucose, 16 
biosynthesis and, 776-786 
monitoring device, 752 


oxidation, 703-704 
phosphorylation, 607 
Glucose-6-phosphate, 610, 757, 760-761 
Glucose-6-phosphate dehydrogenase 
(G6PDH), 785 
Glutamate-aspartate aminotransferase, 886 
Glutamate dehydrogenase (GDH), 877, 
1205 
Glutamate synthase, 879-880 
Glutaminase, 884 
Glutamine, 878-879 
in brain, 884 
in cancer, 884 
in human body, 880-884 
Glutamine phosphoribosyl pyrophosphate 
amidotransferase, 922 
Glutamine synthetase (GS), 878-879 
Glycan microarray technology, 228 
Glycans, 208 
Glyceraldehyde, 194, 195, 622, 625 
Glyceraldehyde, D vs. L, 90, 93 
Glyceraldehyde-3-phosphate dehydrogenase, 
615 
Glycerolipid, 838 
Glycerol phosphate dehydrogenase, 626 
Glycerophosphate dehydrogenase, 701 
Glycerophosphate shuttle, 701-702 
Glycerophospholipid, 242-246, 256, 838 
Glycinamide ribonucleotide synthetase, 
925 
Glycinate, 85 
Glycine, 91, 899 
titration of, 86 
Glycine hydrochloride, 83 
Glycine oxidase, 899 
Glycocalyx, 220 
Glycoconjugate, 193 
Glycogen, 210 
branches, 768 
catabolism of, 764-766 
hormones regulating, 772-776 
metabolism control, 768—776 
synthesis of, 764-766 
tissue, 765 
Glycogen phosphorylase, 210, 765 
allosteric regulation, 524-526 
AMP as allosteric activator for, 525-526 
covalent modification of, 526-527 
as homodimer, 523—524 
Glycogen synthase, 767, 768 
Glycogen synthase D, 770 
Glycogen synthase I, 770 
Glycogen synthase kinase 3 (GSK3), 773 
Glycogenin, 768 
Glycolysis 
cellular regulation in, 624 
coupled reactions in, 605-606 
features of, 603—605 
first phase of 
cleavage by fructose bisphosphate 
aldolase in, 612-613 
glucose phosphorylation in, 607 
isomerization of glucose-6- 
phosphate, 610 
phosphofructokinase 
phosphorylation, 610-612 


triose phosphate isomerase in, 613-614 
galactose in, 626-628 
gluconeogenesis versus, 753 
glycerol in, 628 
hypoxic stress and, 628-630 
mannose and, 626 
NADH produced in, 621-624 
pyruvate in, 621-624 
reactions and thermodynamics of, 607 
resources, 633-634 
second phase of 
dehydration of enolase in, 618 
glyceraldehyde-3-phosphate 
dehydrogenase in, 615 
phosphoglycerate kinase in, 615-617 
phosphoglycerate mutase in, 617-618 
pyruvate kinase in, 618-621 
steady-state concentrations of 
metabolites, 607 
substrates in, 624-628 
Glycome, 138 
Glycophorin, 272 
Glycoprotein 
antifreeze, 220-221 
and flexible rod, 220-221 
N-linked, 218-220 
O-linked, 220 
Glycosaminoglycan, 214, 224 
Glycoside, 203 
Glycosidic bond, 203, 319-320 
Glycosyl phosphatidylinositol, 282 
Glyoxylate cycle, 663—664 
in plant growth, 665 
Glyoxysome, 663, 665, 811 
Glyphosate, 906 
GMK, 222 
GMP, 321 
Golgi apparatus, 21 
Golgi body, 23 
Gottesman, Susan, 1155 
Gout, 934 
GPCR kinase (GRK), 1192 
G-quadruplexes, 365 
Gram-negative bacteria, 217—218 
Gram-positive bacteria, 218 
Grana, 714 
Granum, 714 
Gratuitous inducer, 1042 
Green fluorescent protein (GFP), 91, 411 
Group transfer potential, 59, 65-66 
GTP-binding protein, 526, 1181 
GTP hydrolysis, 1112 
GTPase-activating protein (GAP), 1182, 1192 
Guanidino group, 889 
Guanine, 318-319 
Guanine-nucleotide dissociation inhibitor 
(GDI), 1180 
Guanine-—nucleotide exchange factor (GEF), 
527, 1109, 1179 
Guanylate kinase, 930 
Guanylin, 1175 
Guide strand, 413 


H 
H1 histone, 373 
H2A histone, 373 
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H2B histone, 373 
H3 histone, 373 
H4 histone, 373 
H bond, 14. See also Hydrogen bond 
DNA double helix stability factors, 355 
fluid network, 31 
in water, 31 
of water, 33 
H-DNA, 365 
H zone, 545 
H*-drug antiporter, 310 
H*,K*-ATPase, 303-305 
Hairpin turns, 380 
Hairpins, 174 
Half-cell, 66-67 
Halophile, 20 
Halorhodopsin, 308 
Hanes—Woolf plot, 447 
Hatch-Slack pathway, 742-744 
Haworth, Sir Norman, 197 
Haworth projections, 198—200 
Heart, 966 
Heat shock protein, 177-178 
Heavy chain, 546 
Heavy isotope, 588 
Helical twist, 357 
Helical wheel representation, 163 
Helicase, 558 
Helix capping, 153 
Helix-sheet composites, 156 
Helix-turn-helix (HTH), 1065 
Helling, Robert, 398 
Hemiacetals, 197 
Hemiketal, 197 
Hemoglobin, 109, 130, 131 
allosteric behavior of, 533 
conformational change in, 532 
cooperative binding of oxygen by, 531 
dissociation of oxygen from, 534 
fetal, 535, 537 
myoglobin and, 528-529 
nitric oxide and, 536 
oxy and deoxy forms, 533 
oxygen-binding curves of, 530-531 
paradigms of protein structure and 
function, 528 
tetrameric structure, 531 
Hemolymph, 205 
Henderson—Hasselbalch equation, 39-40 
Heparin, 214 
Heptad repeat pattern, 158 
Herbicide, 906 
Heredity, 981 
Heteroduplex DNA, 1001 
Heterogeneous nuclear RNA, 330, 1070 
Heterologous probe, 402 
Heteromultimer, 116 
Heteropolysaccharide, 208 
Heterotroph, 574 
Hexokinase, 607—608 
Hexokinase molecule, 463 
Hexosamine biosynthetic pathway (HBP), 
775 
Hexose monophosphate shunt, 776 
High-energy biomolecules, 57—65 


High-performance liquid chromatography 
(HPLC), 114 
High-throughput screening (HTS), 422 
High-throughput technology (HTT) 
description, 421 
protein screening, 422—423 
RNAi screening of mammalian 
genomes, 422 
screening, 422 
Hinton, Shanta D. Dr., 1164 
Histidine, 911 
biosynthesis, 907-909 
and buffering system, 44 
Histone, 329, 373 
covalent modification of, 1061—1062 
properties of, 373 
Histone acetyltransferases (HATs), 661, 815 
Histone code, 1062 
Histone crosswalk, 1062—1063 
Histone deacetylase, 815 
Histone deacetylase complex (HDAC), 1063 
Histone-modifying enzyme, 1060 
HIV-1 protease, 497—499 
HMG-CoA lyase, 814 
HMG-CoA reductase, 851 
HMG-CoA synthase, 851 
Holliday junction, 1001 
Holoenzyme, 437 
Homeobox domain, 1066 
Homeostasis, 17 
Homodimer, 98 
Homodisaccharides, 206 
Homogentisate, 916 
Homoglycan, 208 
Homologous protein, 127 
Homologous recombination, 999-1001, 
1006-1007 
Homolytic cleavage, 808 
Homomultimeric proteins, 98 
Homopolysaccharide, 208 
Homoserine kinase, 897 
Honey, 202 
Hoogsteen base pair, 364 
Hop diffusion, 289 
Hormone, 1161 
peptide, 1163 
polypeptide, 1163 
signal transduction, 1163—1167 
steroid, 1163 
Hormone binding, 528 
Hsp90, 1136-1137 
HtrA protease, 1153-1154 
Huber, Robert, 725 
Human artificial chromosome (HAC), 400 
Human body composition, 4 
Human gene therapy, 416 
Human Genome Project, 352 
Human growth hormone, 1013 
Human Protein Atlas, 139 
Human Proteome Project (HPP), 139 
Human pyruvate dehydrogenase, 640 
Huntington’s disease, 133, 1133 
Hyaluronate, 214 
Hybrid duplexes, 369 
Hybridization, 404, 420 
Hydration shells, 32 
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Hydride ion, 584 
Hydrogen bond 
and biomolecular interaction, 14 
in protein structure, 146 
in water, 30 
weak chemical forces, 12 
Hydrogen ion, 36 
Hydrolysis 
ATP, 64-65 
free energies of, 58, 64-65 
metal ions effect on hydrolysis of ATP, 64 
movement of kinesin, 557—558 
nucleic acid susceptibility, 335-340 
reactions, 63 
stabilization of, by ionization and 
resonance, 60 
Hydronium ion, 36 
Hydropathy index, 274 
Hydropathy plot, 274 
Hydrophobic collapse, 169 
Hydrophobic core, 175 
Hydrophobic effect, 158 
Hydrophobic interaction, 9 
between nonpolar molecules, 34 
protein folding, 146 
solvent properties of water, 33-34 
Van der Waals and, 356 
weak chemical forces, 12 
Hydrophobic interaction chromatography, 114 
Hydrophobic lipid, 237 
Hydroxyacyl-CoA dehydrogenase, 803 
Hydroxycarboxylic acid receptor 2 
(HCAR2), 815 
Hydroxyeicosanoic acid, 847 
Hydroxyethyl-TPP, 639, 641 
Hydroxyl ion, 36 
Hydroxylase factorinhibiting HIF 
(FIH-1), 630 
Hydroxylation, 161 
Hydroxylysine, 83 
Hydroxyproline, 83 
Hyperchromic shift, 367 
Hyperpolarization, 1209 
Hyperuricemia, 934 
Hyperventilation, 46 
Hypoventilation, 46 
Hypoxanthine, 319 
Hypoxia (oxygen limitation), 628 
Hypoxia inducible factor (HIF), 629-630 
Hypoxia responsive elements (HREs), 630 
Hypoxic stress, 628-630 


l 

I band, 545 

Ice, 30-31 

iGEM, 423-424 
Immunoglobulin, 12, 464, 1020-1023 
Immunoglobulin fold, 1019 
Immunoglobulin G, 1019 

IMP cyclohydrolase, 926 

In vitro, defined, 45 

In vitro mutagenesis, 413 

Indel, 127 

Independent of path functions, 53 
Induced dipole, 14 

Induced fit hypothesis, 462 
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Inducer, 1042 
Inducible operon, 1046 
Induction, 512, 1042 
Influx pump, 307 
Infrared light, 92 
Ingold, Christopher, 90 
Inhibition of enzyme activity 
competitive inhibitors, 450-451 
irreversible inhibitors, 449-450 
mixed noncompetitive inhibition, 452 
noncompetitive inhibition, 452 
pure noncompetitive inhibition, 452 
reversible inhibitors, 449-451 
uncompetitive inhibition, 452—453 
Inhibitors, in protein synthesis, 1121-1124 
Initiation factors (IF), 1106 
Initiator aminoacyl-tRNA, 1104 
Initiator tRNA, 1105 
Inner membrane, 672 
Inorganic precursor, 5, 7, 8 
Insertion sequence, 1008 
Insulin, 186, 513, 970-971 
blood glucose and, 772 
secretion, 657 
Insulin hormone, 116 
Insulin receptor, 1170, 1173-1175 
Insulin receptor substrate (IRS), 1174 
Insulin resistance, 758 
Integral proteins, 269 
Integral membrane protein, 294 
Integrative methods (IM), 181 
Intein, 1075 
Intercalating agents, 362 
Intercalation, 1018 
Interconvertible enzyme, 513 
Intermediary metabolism, 952-953 
Intermembrane space, 672 
Intrasteric control, 520 
Intrastrand base pairing, 378, 380-384 
Intrinsic binding energy, 478 
Intrinsic termination, 1039 
Intrinsically disordered protein (IDPs), 178 
Intrinsically unstructured protein (IUP), 
178-180 
Intron, 1068 
Invariant residues, 128 
Invertase, 206 
Jon 
hydride, 584 
metal, 64 
Ion exchange chromatography, 110, 112 
Ion gradients, 310-311, 1198-1199 
Ionic bond, 15 
Ionic interaction, 14 
on protein surface, 147 
weak chemical forces, 12 
Ionization 
stabilization of hydrolysis products by, 60 
of water, 36-37 
Ionization constant, 39 
Ion product of water, 36 
IPTG, 409 
Iron-sulfur cluster, 646 
Irreversible inhibitor, 449—450 
Islets of Langerhans, 772, 773 


Isocitrate dehydrogenase, 647—648, 661—662 
Isocitrate lyase, 664 

Isoelectric focusing, 113 
Isoelectric point, 110 

Isomaltase, 206 

Isopropylmalate dehydratase, 897 
Isopropylmalate synthase, 897 
Isozyme, of hexokinase, 607—608 
Isolated systems, 51—52 
Isoleucine, 897, 912 

Isoprene, 249, 251 

Isopycnic centrifugation, 368 
Isoschizomers, 338 

Isoacceptor tRNAs, 1098 
Isotope, 588-589 

Isotopic tracers, 588-589 


Janus protein kinase (JAK), 1192 

J,, gene, 1021 

Johnson-Winters, Kayunta, Dr., 753 
Junctions, 379 

Juvenile hormone, 481 

Juvenile hormone esterase (JHE), 481 


K 
K* channel sensitivity, 297-298 
Kappa gene family, 1021 
KesA structure, 296 
Keratan sulfate, 214 
Kendrew, John, 157 
Ketal, 203-204 
Keto-enol tautomeric shifts, 319 
Ketogenesis, 813 
Ketogenic amino acid, 596 
Ketone bodies, 597, 813 
Ketoses, 194 
Kidney, 752-753 
Kinesin 1, 553 
Kinesin, 544, 557-558 
Kinetics 
behavior of enzymes catalyzing 
bimolecular reactions, 455-461 
burst, 493 
chemical, 438-439 
enzyme, 438 
of enzyme-catalyzed reactions, 441-449, 
448-449 
Michaelis-Menten equation, 442, 
446-447 
wild-type and mutant sulfite oxidase 
constants, 448 
zero-order, 441 
KNF model, 517-519, 533 
Knoop, Franz, 797 
Knowles, Jeremy, 500 
Koshland, Daniel, 517 
Krebs cycle, 636 
K ion product of water, 36-37 
Kynurenine, 914 


L 

lac operon, 1042, 1046 
Lactase, 206 

Lactate, 752 


Lactate dehydrogenase (LDH), 514-516, 
622, 624 
Lactic acid fermentation, 604 
Lactose, 204 
Lactose intolerance, 629 
Lactose synthase, 629 
Lagging strand, 984 
Lambda gene family, 1021 
Lamellae, 21 
Lanolin, 247 
Lanosterol, 250, 856 
Lateral heterogeneity, 282 
Lathyrism, 164 
Lavoisier, Antoine, 690-691 
L-chain gene, 1021 
LDL receptor, 860 
Leading strand, 984 
Leading substrate, 458—459 
Lectin, 228-229 
Leloir pathway, 626 
Leptin, 970-971 
Lesch—Nyhan syndrome, 932 
Lethal synthesis, 945 
Leucine, 897 
Leucine aminotransferase, 897 
Leukosialin, 220 
Leukotriene, 847 
Levinthal’s paradox, 169 
Levorotatory behavior, 89-90 
Licensing or replication origin, 993 
Ligand, 15 
Ligand-binding site, 136 
Ligand-gated ion channel, 1203 
Ligand-induced conformational change, 
136 
Light activation, 740 
Light chain, 546 
Light-driven ATP synthesis, 732-735 
Light energy, 308-309, 740 
absorbed in photosynthetic pigments, 
718 
accessory light-harvesting pigments, 
717-718 
photosystems that capture, 721-725 
transduction into chemical energy, 
719-720 
Light-harvesting complex (LHC), 721 
Light reaction, 715 
Lignin, 902 
Limit dextrin, 464, 765 
Linear molecular motor, 544 
Lineweaver—Burk double-reciprocal plot, 
446, 447, 450-454 
Linkers, 396 
Linking number, 370-371 
Lipid 
bilayers, 268 
as biological signals, 254-258 
chains, 270 
fatty acids, 237-241 
glycerophospholipids, 242-246 
lipoprotein complex, 856-861 
membrane chemical and physical 
properties, 266 
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motion in membranes, 283-293 
from natural sources, 248 
nutrition and, 597 
resources, 263—264 
sphingolipid, 247 
steroids, 252-253 
terpenes, 249-250 
trioleoylglycerol, 241-242 
waxes, 246-249 
Lipid-anchored protein, 271, 280-282 
Lipid biosynthesis 
bile acid, 861 
cholesterol, 851-856 
complex, 837-847 
eicosanoid, 847-851 
fatty acid synthesis, 821-837 
lipid transport, 856-861 
resources, 868-869 
steroid hormone, 864-865 
Lipid nanoparticles (LNPs), 255 
Lipidome, 138 
Lipidomics, 259-260 
Lipin, 838 
Lipocalin, 797 
Lipoic acid, 644 
Lipopolysaccharide, 217 
Lipoprotein complex 
apoproteins, 857 
composition and properties, 856 
metabolism defects, 860-864 
and triacylglycerol and cholesterol ester, 
857 
Lipoprotein lipase, 857-859 
Liposome, 268-269 
Liquid crystalline, 285 
Liquid-ordered state, 285, 287 
Liver, 752-753, 967-968 
Living system, 1—4 
Long noncoding RNA, 333 
Long-term memory, 1065 
Lou Gehrig’s disease, 133 
Lovastatin, 854 
Low-barrier hydrogen bond (LBHB), 
488-489 
Low-density lipoprotein (LDL) receptor, 
220 
Lysine, 891, 894 
Lysine degradation, 914 
Lysine methyltransferases, 1062 
Lysis, 24 
Lysogeny, 24 
Lysosome, 21 
Lysozyme, 131 


M 

Macroautophagy, 1146 

Macrolide antibiotic, 1122, 1123, 1124 
Macromolecular synthesis and growth, 952 
Macromolecule, 1—2, 7, 8, 10 

Major histocompatibility complex (MHC), 
1020 

Malaria, 944 

Malate dehydrogenase, 651 

Malate synthase reaction, 663 
Malate—aspartate shuttle, 702-703 


Malic enzyme, 656, 809 
Malonyl-CoA, 822 
Malonyl-CoA-acetyl-CoA-ACP 
transacylase (MAT), 827 
Maltase, 206 
Maltose, 204 
Mannitol, 201 
Mannose, 626 
Maple sugar urine disease, 914 
Marginal stability, protein, 172 
Marton, Lilla, 983 
Mass spectrometry, 122—125, 593 
Massively parallel sequencing, 349 
Matrix, 674 
Matrix-assisted laser desorption ionization- 
time of flight mass spectrometry 
(MALDI), 122 
Maurizi, Michael, 1155 
M disc, 545 
Mechanism-based inhibitor, 945 
Mechanosensation, 290 
Mechanotransduction, 290 
Mediator, 1058-1059 
Megasynthase, 830-832 
Melanocortin-4 receptor (MC4R), 971 
Melanocortin, 971 
Melatonin, 914 
Melting temperature, 285, 367 
Melville, Herman, 248 
Membrane 
asymmetric and heterogeneous 
structures, 282—283 
cell, 9 
chemical and physical properties, 
266-271 
composition of, 266-267 
electron micrographs, 266 
facilitated diffusion, 294-300 
fluid mosaic model of, 269-271 
lateral diffusion, 289 
lipid and protein movement in, 283-293 
lipid chains bent and tilted in, 270 
lipids in water and, 266, 267—269 
passive diffusion, 294 
plasma, 265 
protein topology, 274 
proteins, 271-282 
resources, 314-316 
secondary active transport, 310-311 
subunit composition and pore structure 
of channels, 295 
thickness, 270 
transmembrane barrels, 279-280 
transport, 293-294 
transport processes, 301—306 
Membrane-associated allosteric enzyme, 
1170 
Membrane-bound systems, 579 
Membrane curvature, 290 
Membrane protein, 106 
Membrane-skeleton fence model, 289 
Membrane-spanning supramolecular 
complex, 722 
Memory, 1065 
Menten, Maud Leonora, 433 
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Merrifield, Bruce, 132 
Messenger RNA, 326, 330, 381-382 
Metabolic channeling, 939 
Metabolic engineering, 424—425 
Metabolic fate, ammonium, 877-880 
Metabolic response, 1161-1162 
Metabolic syndrome, 804, 971 
Metabolism 
amphibolic intermediates, 583 
anabolism, 580, 585 
anaerobic, 622 
basic overview, 573 
catabolism, 580-581 
cellular energy status, 956-957 
classification, 574 
coenzymes and vitamins, 586 
definition, 17 
diversity of, 574 
energy transduction, 585 
flow of energy and, 574-575 
functional blocks, 951-952 
integrated in multicellular organism, 
962-968 
map of, 575-579 
nutrition, 596-597 
oxygen and, 574 
pathways of, 587-592, 871-877 
PI, 1186 
resources, 600-601, 977-979 
Metabolite, 7, 8, 578 
Metabolome, 592-596 
Metabolomics, 592 
Metabolon, 579, 662 
Metal activated cycle, 490 
Metal ion, 64 
Metal ion catalysis, 490 
Metal response element (MRE), 1057 
Metalloenzyme, 490 
Metamorphic protein, 175 
Metformin, 758, 959 
Methanogen, 20 
Methionine, 894, 912 
Methyl-directed pathway, 1012 
Methylmalonyl-CoA epimerase, 807 
Methylmalonyl-CoA mutase, 807-808 
Mevalonate, 851 
Mevinolin, 854 
Mevinolinic acid, 854 
Mice, 966, 1006 
Micelle, 34, 267—268 
Michael addition, 661 
Michaelis constant, 443 
Michaelis, Leonor, 433 
Michaelis-Menten equation, 442, 446-447, 
450 
Michel, Hartmut, 725 
Microarray, DNA, 407 
Microdomain, 288 
Microtubule, 552 
definition, 552 
filaments of cytoskeleton, 552-553 
polymerization, 55 
Micro RNAs (miRNAs), 333 
Milstein, César, 147 
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Mineralocorticoid, 253, 865 
Minimum gene set, 19 

miRNA synthesis and processing, 1077 
Mismatch repair (MMR), 1009, 
1012, 1014 

Mitchell model, 696 

Mitchell, Peter, 689 

Mitchell’s chemiosmotic hypothesis, 
689-690 

Mitochondria, 9, 21 

apoptosis and, 704-706 
Mitochondrial disease, 655 


673 

Mitochondrial functions, 672 
Mitochondrial matrix, 674 

Mitochondrial outer membrane 
permeabilization (MzOMP), 704 


1189 
Mitophagy, 673 
Mitosis, 21 
Mixed noncompetitive inhibition, 452 
M line, 545 
Modules, 165 
MoFe-protein, 875 
Molecular activity, 445 
Molecular chaperones, 177—178 
Molecular evolution, protein, 129 
Molecular motor 
bacterial flagella and, 564-567 
definition, 543 
genes for, 553 
linear, 543 
microtubules and, 551-558 
motor protein, 553—554, 556 
motor protein processivity, 556 
muscle contraction, 544-551 
resources, 570—571 
rotating, 543-544 
skeletal muscle, 544-545 
thick filaments, 546-547 
thin filaments, 545-546 
Molecular recognition, 14-15 
Molecularity, 438 
Molecule 
amphiphilic, 34 
and biomolecules, 4—5 
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post-transcriptional regulation modes, 
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Protein turnover, 596 
Protein tyrosine kinase, 1190 
Protein tyrosine phosphate, 1191-1192 
Protein—protein interaction, 137, 412 
Proteoglycan, 214, 224-226 
Proteolytic cleavage, 1141-1142 
Proteolytic enzyme, 513-514 
Proteome, 137-138, 354 
Proteomics, 138, 353-354, 578 
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Proton gradient, 689-690 
Proton NMR spectra, 92-93 
Proton symport, 310 
Provirus, 998 
PsaA, 730 
PsaB, 730 
PsaC, 730 
PS decarboxylase, 841 
Pseudoknot formation, 379 
Pseudosubstrate sequence, 520, 958 
Pseudouridine, 331 
Pure noncompetitive inhibition, 452 
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affinity, 110, 115 
protein, 110, 112 
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catabolism, 932 
degradation, 932 
nitrogenous base structure and 
chemistry, 318 
nucleoside cycle, 933 
properties, 319 
salvage of, 932 
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ATP-dependent kinase in, 930-931 
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degradation, 939 
nitrogenous base structure and 
chemistry, 318 
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synthesis, 935-939 
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Pyroglutamic acid, 83 
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Pyruvate decarboxylase, 621 
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Pyruvate dehydrogenase kinase, 660 
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Pyruvate kinase reaction, 69, 618-621 

Pyruvate/malate cycle, 657 


Q cycle, 681 
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